Web Pages Ranking Algorithms: A Survey

Authors

  • Ayad Abdulrahman Information Technology Department, Duhok Polytechnic University, Computer Science Department, University of Zakho, Zakho, Iraq

DOI:

https://doi.org/10.48161/qaj.v1n3a79

Keywords:

Search Engines, Ranking Algorithms, WCM, Web Mining, HITS, Crawler

Abstract

Due to the daily expansion of the web, the amount of information has increased significantly. Thus, the need for retrieving relevant information has also increased. In order to explore the internet, users depend on various search engines. Search engines face a significant challenge in returning the most relevant results for a user's query. The search engine's performance is determined by the algorithm used to rank web pages, which prioritizes the pages with the most relevancy to appear at the top of the result page. In this paper, various web page ranking algorithms such as Page Rank, Time Rank, EigenRumor, Distance Rank, SimRank, etc. are analyzed and compared based on some parameters, including the mining technique to which the algorithm belongs (for instance, Web Content Mining, Web Structure Mining, and Web Usage Mining), the methodology used for ranking web pages, time complexity (amount of time to run an algorithm), input parameters (parameters utilized in the ranking process such as InLink, OutLink, Tag name, Keyword, etc.), and the result relevancy to the user query.

Downloads

Download data is not yet available.

References

U. Naik and D. Shivalingaiah, “Comparative Study of Web 1. 0, Web 2. 0 and Web 3. 0,” 6th International CALIBER, pp. 499–507, 2008.

K. Jacksi and S. M. Abass, “Development history of the world wide web,” Int. J. Sci. Technol. Res, vol. 8, no. 9, Art. no. 9, 2019.

R. KumarRana and N. Tyagi, “A Novel Architecture of Ontology-based Semantic Web Crawler,” International Journal of Computer Applications, vol. 44, no. 18, pp. 31–36, 2012, doi: 10. 5120/6365-8724.

M. A. Sadeeq, S. R. Zeebaree, R. Qashi, S. H. Ahmed, and K. Jacksi, “Internet of Things security: a survey,” 2018, pp. 162–166.

K. Jacksi, N. Dimililer, and S. Zeebaree, “State of the art exploration systems for linked data: a review,” Int. J. Adv. Comput. Sci. Appl. IJACSA, vol. 7, no. 11, pp. 155–164, 2016.

K. Jacksi, N. Dimililer, and S. R. Zeebaree, “A survey of exploratory search systems based on LOD resources,” 2015.

K. Jacksi, “Design And Implementation Of Online Submission And Peer Review System: A Case Study Of E-Journal Of University Of Zakho,” International Journal of Scientific & Technology Research, vol. 4, no. 8, pp. 83–85, 2015.

M. P. S. M. E, “Ranking Techniques for Social Networking Sites based on Popularity,” Indian Journal of Computer Science and Engineering (IJCSE) Ranking, vol. 3, no. 3, pp. 522–526, 2012.

J. Cho, S. Roy, and R. E. Adams, “Page quality: In search of an unbiased web ranking,” Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 551–562, 2005, doi: 10. 1145/1066157. 1066220.

K. Jacksi, S. Zeebaree, and N. Dimililer, “Design and Implementation of LOD Explorer: A LOD Exploration and Visualization Model,” Journal of Applied Science and Technology Trends, vol. 1, no. 2, pp. 31–39, 2020.

A. Barbar and A. Ismail, “Search engine optimization (SEO) for websites,” ACM International Conference Proceeding Series, vol. Part F1482, pp. 51–55, 2019, doi: 10. 1145/3323933. 3324072.

K. Jacksi and S. Badiozamany, “General method for data indexing using clustering methods,” Int. J. Sci. Eng, vol. 6, no. 3, pp. 641–644, 2015.

S. R. Z. Karwan Jacksi Nazife Dimililer, “AN IMPROVED APPROACH FOR INFORMATION RETRIEVAL WITH SEMANTIC-WEB CRAWLING,” University of Zakho, 2018.

N. Grover and R. Wason, “Comparative Analysis Of Page Rank And HITS Algorithms,” International Journal of Engineering Research & Technology (IJERT), vol. 1, no. 8, pp. 1–15, 2012.

D. Mukhopadhyay, P. Biswas, and Y. -C. Kim, “A Syntactic Classification based Web Page Ranking Algorithm,” pp. 83–92, 2011.

M. PaulSelvan, A. Chandra Sekar, and A. Priya Dharshini, “Survey on Web Page Ranking Algorithms,” International Journal of Computer Applications, vol. 41, no. 19, pp. 1–7, 2012, doi: 10. 5120/5646-7764.

P. S. Sharma, D. Yadav, and P. Garg, “A systematic review on page ranking algorithms,” International Journal of Information Technology (Singapore), vol. 12, no. 2, pp. 329–337, 2020, doi: 10. 1007/s41870-020-00439-3.

P. Desikan, J. Srivastava, V. Kumar, and P. Tan, “Hyperlink analysis : techniques and applications,” p. 42, 2002.

D. Kumar Sharma and A. K. Sharma, “A Comparative Analysis of Web Page Ranking Algorithms,” IJCSE) International Journal on Computer Science and Engineering, vol. 02, no. 08, pp. 2670–2676, 2010.

et al Srivastava, Jaideep, “Web Usage Mining : Discovery and Applications of Usage Patterns from Web Data INTRODUCTION,” ACM SIGKDD Explorations Newsletter 1. 2, vol. 1, no. 2, pp. 12–23, 2011.

R. Ibrahim, S. Zeebaree, and K. Jacksi, “Survey on Semantic Similarity Based on Document Clustering,” Adv. sci. technol. eng. syst. j, vol. 4, no. 5, pp. 115–122, 2019.

K. Jacksi, R. Kh. Ibrahim, S. R. M. Zeebaree, R. R. Zebari, and M. A. M. Sadeeq, “Clustering Documents based on Semantic Similarity using HAC and K-Mean Algorithms,” in 2020 International Conference on Advanced Science and Engineering (ICOASE), Dec. 2020, pp. 205–210. doi: 10. 1109/ICOASE51841. 2020. 9436570.

S. M. Mohammed, K. Jacksi, and S. Zeebaree, “A state-of-the-art survey on semantic similarity for document clustering using GloVe and density-based algorithms,” Indonesian Journal of Electrical Engineering and Computer Science, vol. 22, no. 1, pp. 552–562, 2021.

S. M. Mohammed, K. Jacksi, and S. R. M. Zeebaree, “Glove Word Embedding and DBSCAN algorithms for Semantic Document Clustering,” in 2020 International Conference on Advanced Science and Engineering (ICOASE), Dec. 2020, pp. 1–6. doi: 10. 1109/ICOASE51841. 2020. 9436540.

N. M. Salih and K. Jacksi, “State of the art document clustering algorithms based on semantic similarity,” JURNAL INFORMATIKA, vol. 14, no. 2, pp. 58–75, 2020.

N. M. Salih and K. Jacksi, “Semantic Document Clustering using K-means algorithm and Ward’s Method,” in 2020 International Conference on Advanced Science and Engineering (ICOASE), Dec. 2020, pp. 1–6. doi: 10. 1109/ICOASE51841. 2020. 9436588.

P. Gupta, S. K. Singh, D. Yadav, and A. K. Sharma, “An improved approach to ranking web documents,” Journal of Information Processing Systems, vol. 9, no. 2, pp. 217–236, 2013, doi: 10. 3745/JIPS. 2013. 9. 2. 217.

L. Jin, L. Feng, G. Liu, and C. Wang, “Personal Web Revisitation by Context and Content Keywords with Relevance Feedback,” IEEE Transactions on Knowledge and Data Engineering, vol. 29, no. 7, pp. 1508–1521, 2017, doi: 10. 1109/TKDE. 2017. 2672747.

P. Gupta, A. K. Sharma, and D. Yadav, “A NOVEL TECHNIQUE FOR BACK-LINK EXTRACTION AND RELEVANCE EVALUATION,” Int. J. Comput. Sci. Inf. Technol., vol. 3, no. 3, pp. 227–238, 2011.

W. Xing and A. Ghorbani, “Weighted PageRank algorithm,” Proceedings - Second Annual Conference on Communication Networks and Services Research, pp. 305–314, 2004, doi: 10. 1109/dnsr. 2004. 1344743.

J. M. Kleinberg, “Authoritative sources in a hyperlinked environment,” Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms, vol. 9781400841, no. May 1997, pp. 668–677, 1998, doi: 10. 1515/9781400841356. 514.

K. Fujimura, T. Inoue, and M. Sugisaki, “The EigenRumor Algorithm for Ranking Blogs,” Proceedings of the Second Annual Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics, Chiba, Japan, pp. 59–74, 2005.

A. M. ZarehBidoki and N. Yazdani, “DistanceRank: An intelligent ranking algorithm for web pages,” Information Processing and Management, vol. 44, no. 2, pp. 877–892, 2008, doi: 10. 1016/j. ipm. 2007. 06. 004.

L. W. Lee, J. Y. Jiang, C. Der Wu, and S. J. Lee, “A query-dependent ranking approach for search engines,” 2nd International Workshop on Computer Science and Engineering, WCSE 2009, vol. 1, pp. 259–263, 2009, doi: 10. 1109/WCSE. 2009. 666.

C. Li et al. , “Fast computation of SimRank for static and dynamic information networks,” Advances in Database Technology - EDBT 2010 - 13th International Conference on Extending Database Technology, Proceedings, pp. 465–476, 2010, doi: 10. 1145/1739041. 1739098.

R. Baeza-Yates and E. Davis, “Web Page Ranking using Link Attributes Categories and Subject Descriptors,” Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters, pp. 328–329, 2004.

F. Lamberti, A. Sanna, and C. Demartini, “A relation-based page rank algorithm for semantic Web search engines,” IEEE Transactions on Knowledge and Data Engineering, vol. 21, no. 1, pp. 123–136, 2009, doi: 10. 1109/TKDE. 2008. 113.

D. H. Maulud, S. R. Zeebaree, K. Jacksi, M. A. M. Sadeeq, and K. H. Sharif, “State of art for semantic analysis of natural language processing,” Qubahan Academic Journal, vol. 1, no. 2, pp. 21–28, 2021.

K. J. A Zeebaree SRM Zeebaree, “Designing an Ontology of E-learning system for Duhok Polytechnic University Using Protégé OWL Tool,” J. Adv. Res. Dyn. Control Syst. , vol, vol. 11, no. 5, pp. 24–37, 2019.

S. R. M. Z. Adel AL-Zebari Karwan Jacksi and Ali Selamat, “ELMS–DPU Ontology Visualization with Protégé VOWL and Web VOWL,” Journal of Advanced Research in Dynamical and Control Systems, vol. 11, no. 1, pp. 478–485, 2019.

A. -Z. Adel, S. Zebari, and K. Jacksi, “Football Ontology Construction using Oriented Programming,” Journal of Applied Science and Technology Trends, vol. 1, no. 1, pp. 24–30, 2020.

K. Jacksi, “Design and Implementation of E-Campus Ontology with a Hybrid Software Engineering Methodology,” Science Journal of University of Zakho, vol. 7, no. 3, pp. 95–100, 2019.

S. Jie, C. Chen, Z. Hui, S. Rong-Shuang, Z. Yan, and H. Kun, “Tag Rank: A new rank algorithm for webpage based on social web,” Proceedings of the International Conference on Computer Science and Information Technology, ICCSIT 2008, pp. 254–258, 2008, doi: 10. 1109/ICCSIT. 2008. 45.

H. Jiang, Y. X. Ge, D. Zuo, and B. Han, “TimeRank: A method of improving ranking scores by visited time,” Proceedings of the 7th International Conference on Machine Learning and Cybernetics, ICMLC, vol. 3, pp. 1654–1657, 2008, doi: 10. 1109/ICMLC. 2008. 4620671.

Published

2021-07-01

How to Cite

Abdulrahman, A. (2021). Web Pages Ranking Algorithms: A Survey. Qubahan Academic Journal, 1(3), 29–34. https://doi.org/10.48161/qaj.v1n3a79

Issue

Section

Articles