Next page prediction with popularity based page rank, duration based page rank and semantic tagging approach

Download
2012
Yanık, Banu Deniz
Using page rank and semantic information are frequently used techniques in next page prediction systems. In our work, we extend the use of Page Rank algorithm for next page prediction with several navigational attributes, which are size of the page, duration of the page visit and duration of transition (two page visits sequentially), frequency of page and transition. In our model, we define popularity of transitions and pages by using duration information, use it in a relation with page size, and visit frequency factors. By using the popularity value of pages, we bias conventional Page Rank algorithm and model a next page prediction system that produces page recommendations under given top-n value. Moreover, we extract semantic terms from web URLs in order to tag pages semantically. The extracted terms are mapped into web URLs with different level of details in order to find semantically similar pages for next page recommendations. With this tagging, we model another next page prediction method, which uses Semantic Tagging (ST) similarity and exploits PPR values as a supportive method. Moreover, we model a Hybrid Page Rank (HPR) algorithm that uses both Semantic Tagging based approach and Popularity Based Page Rank values of pages together in order to investigate the effect of PPR and ST with equal weights. In addition, we investigate the effect of local (a synopsis of directed web graph) and global (whole directed web graph) modeling on next page prediction accuracy.

Suggestions

Topic-centric querying of web information resources
Altıngövde, İsmail Sengör; Ulusoy, O; Ozsoyoglu, G; Ozsoyoglu, ZM (2001-01-01)
This paper deals with the problem of modeling web information resources using expert knowledge and personalized user information, and querying them in terms of topics and topic relationships. We propose a model for web information resources, and a query language SQL-TC (Topic-Centric SQL) to query the model. The model is composed of web-based information resources (XML or HTML documents on the web), expert advice repositories (domain-expert-specified metadata for information resources), and personalized inf...
Probabilistic matrix factorization based collaborative filtering with implicit trust derived from review ratings information
Ercan, Eda; Taşkaya Temizel, Tuğba; Department of Information Systems (2010)
Recommender systems aim to suggest relevant items that are likely to be of interest to the users using a variety of information resources such as user profiles, trust information and users past predictions. However, typical recommender systems suffer from poor scalability, generating incomprehensible and not useful recommendations and data sparsity problem. In this work, we have proposed a probabilistic matrix factorization based local trust boosted recommendation system which handles data sparsity, scalabil...
Using Google analytics, card sorting and search statistics for getting insights about metu website’s new design: a case study
Dalcı, Mustafa; Taşkaya Temizel, Tuğba; Department of Information Systems (2011)
websites are one of the most popular and quickest way for communicating with users and providing information. Measuring the effectiveness of website, availability of information on website and information architecture on users‟ minds have become key issues. Moreover, using these insights on website‟s new design process will make the process more user-centered. v There is no consensus on how to define web site effectiveness, which dimensions need to be used for the evaluation of these web sites and which pro...
Process based information systems evaluation: Towards the attributes of "pRISE"
Özkan Yıldırım, Sevgi; Bilgen, Semih (2007-10-31)
Purpose The purpose of this paper is to demonstrate the importance of undertaking a systemic view of information systems evaluation that augments the frequently reported prescriptive (cost/benefit) analysis approaches. Design/methodology/approach The paper adopts a qualitative case perspective and derives a framework for substantive information systems evaluation factors (PRISE). Three empirical formulations are considered and a comparison made to determine the content and context of the findings. Finding...
Optimization of an online course with web usage mining
Akman, LE; Akkan, B; Baykal, Nazife (2004-02-18)
The huge amount of information existing in the World Wide Web constitutes an ideal environment to implement data mining techniques. Web mining is the mining of web data. There are different applications of web mining: web content mining, web structure mining and web usage mining. In our study we analyzed an online course by web usage mining techniques in order to optimize the navigation paths, the duration of the time spend on each page and the number of visits throughout the semester of the course. Moreove...
Citation Formats
B. D. Yanık, “Next page prediction with popularity based page rank, duration based page rank and semantic tagging approach,” M.S. - Master of Science, Middle East Technical University, 2012.