Validating Ranking in Web Documents Using Normalized Social Media Information

Document Type : Original Article

Authors

1 Computer Science and Engineering Department, Faculty of Electronic Engineering in Menouf, Menoufia University

2 Dept. of Computer Eng., Arab Academy for Science and Technology & Maritime Transport, Egypt.

Abstract

Social Information Retrieval (SIR) is a relatively new domain which uses social information to enhance information retrieval processes. To find more interesting search results, social behavior can indicate how much these results are interesting. Social interaction over Web 2.0 are used here to enhance ranking of web results in response to a query. A dataset from Open Directory Project (ODP) is used here to show the improvement of ranking. We propose the usage of normalization and social services weights to give better performance. The proposed framework gets data from various types of social info (social bookmarking, social news, social network, discovery engines). Data is parsed into fields and significant values are used in the ranking process. Precision and Mean Average Precision (MAP) are used to evaluate results. Simulation results show better ranking with the proposed model.

xt-size-adjust: auto; -webkit-text-str[1] M. Hilbert, “How much information is there in the ‘information society’?,”
Significance, vol. 9, no. 4, pp. 812, Aug. 2012.
[2] Eb. Inc., “Top 15 Most Popular Search Engines | December 2014.”
[Online]. Available: http://www.ebizmba.com/articles/search-engines.
[Accessed: 03-Dec-2014].
[3] J. Surowiecki, The wisdom of crowds. New York: Anchor Books, 2005.
[4] S. K. M. Yi, M. Steyvers, M. D. Lee, and M. J. Dry, “The Wisdom of the
Crowd in Combinatorial Problems,” Cognitive Science, vol. 36, no. 3, pp.
452470, Apr. 2012.
[5] “Facebook.” [Online]. http://www.facebook.com. [Accessed: 08-Feb-
2016].
[6] “Twitter.” [Online]. http://www.twitter.com. [Accessed: 08-Feb-2016].
[7] “Google+.” [Online]. http://plus.google.com. [Accessed: 08-Feb-2016].
[8] “LinkedIn.” [Online]. http://www.linkedin.com. [Accessed: 08-Feb-2016].
[9] “Delicious.” [Online]. https://delicious.com/. [Accessed: 08-Feb-2016].
[10] “StumbleUpon.” [Online]. http://www.stumbleupon.com. [Accessed: 08-
Feb-2016].
[11] “Reddit.” [Online]. Available: https://www.reddit.com. [Accessed: 08-
Feb-2016].
[12] A. Muralidharan, Z. Gyongyi, and E. Chi, “Social annotations in web
search,” in Proceedings of the 2012 ACM annual conference on Human
Factors in Computing Systems - CHI ’12, 2012, p. 1085.
[13] S. Xu, S. Bao, B. Fei, Z. Su, and Y. Yu, “Exploring folksonomy for
px; -webkit-text-size-adjust: auto; -wpersonalized search,” in Proceedings of the 31st annual international ACM
SIGIR conference on Research and development in information retrieval -
SIGIR ’08, 2008, pp. 155162.
[14] M. R. Bouadjenek, H. Hacid, and M. Bouzeghoub, “Sopra,” in
Proceedings of the 36th international ACM SIGIR conference on Research
and development in information retrieval - SIGIR ’13, 2013, pp. 861864.
[15] Y. Yanbe, A. Jatowt, S. Nakamura, and K. Tanaka, “Can social
bookmarking enhance search in the web?,” in Proceedings of the 7th
ACM/IEEE-CS joint conference on Digital libraries, 2007, pp. 107116.
[16] S. Bao, G. Xue, X. Wu, Y. Yu, B. Fei, and Z. Su, “Optimizing web search
using social annotations,” in Proceedings of the 16th international
conference on World Wide Web - WWW ’07, 2007, pp. 501510.
[17] I. Badache and M. Boughanem, “Harnessing Social Signals to Enhance a
Search,” in 2014 IEEE/WIC/ACM International Joint Conferences on Web
Intelligence (WI) and Intelligent Agent Technologies (IAT), 2014, pp. 303
309.
[18] A. S. Hashwa, N. El-Fishawy, and S. M. Youssef, “Evaluating Social
Media Information Usage in Ranking the Web,” in 25th International
Conference on Computer Theory and Applications, 2015, pp. 8490.
[19] P. Heymann, G. Koutrika, H. Garcia-molina, I. T. Report, L. U.
November, G. Koutrika, and H. Garcia-molina, “Can social bookmarking
improve web search?,” in Proceeding WSDM ’08 Proceedings of the 2008
International Conference on Web Search and Data Mining, 2008, pp. 195
206.
[20] “Similarity (Lucene 3.0.3 API).” [Online]. Available:
http://lucene.apache.org/core/3_0_3/api/core/org/apache/lucene/search/Si
milarity.html. [Accessed: 15-Feb-2015].
[21] G. R. PRICE, “Extension of covariance selection mathematics,” Annals of
Human Genetics, vol. 35, no. 4, pp. 485490, Apr. 1972.
[22] “DMOZ - the Open Directory Project.” [Online]. Available:
http://www.dmoz.org. [Accessed: 08-Feb-2016].
[23] P. A. Chirita, W. Nejdl, R. Paiu, and C. Kohlschütter, “Using ODP
metadata to personalize search,” in Proceedings of the 28th annual
international ACM SIGIR conference on Research and development in
information retrieval - SIGIR ’05, 2005, p. 178.
[24] C. D. Manning, Prabhakar Raghavan, and H. Schütze, An Introduction to
Information Retrieval. Cambridge University Press, 2009.