{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,10]],"date-time":"2026-04-10T10:04:04Z","timestamp":1775815444434,"version":"3.50.1"},"reference-count":61,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2012,5,1]],"date-time":"2012-05-01T00:00:00Z","timestamp":1335830400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000145","name":"Division of Information and Intelligent Systems","doi-asserted-by":"publisher","award":["IIS-0811994"],"award-info":[{"award-number":["IIS-0811994"]}],"id":[{"id":"10.13039\/100000145","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Web"],"published-print":{"date-parts":[[2012,5]]},"abstract":"<jats:p>Social media have attracted considerable attention because their open-ended nature allows users to create lightweight semantic scaffolding to organize and share content. To date, the interplay of the social and topical components of social media has been only partially explored. Here, we study the presence of homophily in three systems that combine tagging social media with online social networks. We find a substantial level of topical similarity among users who are close to each other in the social network. We introduce a null model that preserves user activity while removing local correlations, allowing us to disentangle the actual local similarity between users from statistical effects due to the assortative mixing of user activity and centrality in the social network. This analysis suggests that users with similar interests are more likely to be friends, and therefore topical similarity measures among users based solely on their annotation metadata should be predictive of social links. We test this hypothesis on several datasets, confirming that social networks constructed from topical similarity capture actual friendship accurately. When combined with topological features, topical similarity achieves a link prediction accuracy of about 92%.<\/jats:p>","DOI":"10.1145\/2180861.2180866","type":"journal-article","created":{"date-parts":[[2012,6,1]],"date-time":"2012-06-01T15:51:28Z","timestamp":1338565888000},"page":"1-33","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":281,"title":["Friendship prediction and homophily in social media"],"prefix":"10.1145","volume":"6","author":[{"given":"Luca Maria","family":"Aiello","sequence":"first","affiliation":[{"name":"University of Turin, Italy"}]},{"given":"Alain","family":"Barrat","sequence":"additional","affiliation":[{"name":"Aix-Marseille University and University Sud Toulon, France, ISI Foundation, Italy"}]},{"given":"Rossano","family":"Schifanella","sequence":"additional","affiliation":[{"name":"University of Turin, Italy"}]},{"given":"Ciro","family":"Cattuto","sequence":"additional","affiliation":[{"name":"ISI Foundation, Italy"}]},{"given":"Benjamin","family":"Markines","sequence":"additional","affiliation":[{"name":"Indiana University, Bloomington"}]},{"given":"Filippo","family":"Menczer","sequence":"additional","affiliation":[{"name":"Indiana University, Bloomington"}]}],"member":"320","published-online":{"date-parts":[[2012,6,4]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/SocialCom.2010.42"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0908800106"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/1935826.1935914"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASONAM.2010.87"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the 8th Symposium on Abstraction, Reformulation and Approximation (SARA'09)","author":"Caragea D.","unstructured":"Caragea , D. , Bahirwani , V. , Aljandal , W. , and H. Hsu , W. 2009. Ontology-based link prediction in the live journal social network . In Proceedings of the 8th Symposium on Abstraction, Reformulation and Approximation (SARA'09) . Caragea, D., Bahirwani, V., Aljandal, W., and H. Hsu, W. 2009. Ontology-based link prediction in the live journal social network. In Proceedings of the 8th Symposium on Abstraction, Reformulation and Approximation (SARA'09)."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.71.027103"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-88564-1_39"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1038\/nature06830"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401914"},{"key":"e_1_2_1_10_1","unstructured":"Dunlavy D. M. Kolda T. G. and Acar E. 2010. Temporal link prediction using matrix and tensor factorizations. arXiv:1005.4006 Cornell University Library.  Dunlavy D. M. Kolda T. G. and Acar E. 2010. Temporal link prediction using matrix and tensor factorizations. arXiv:1005.4006 Cornell University Library."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patrec.2005.10.010"},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"Feldman R. and Sanger J. 2006. Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data. Cambridge University Press.   Feldman R. and Sanger J. 2006. Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data. Cambridge University Press.","DOI":"10.1017\/CBO9780511546914"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1117454.1117456"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944950"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1177\/0165551506062337"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1656274.1656278"},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the SDM Workshop on Link Analysis, Counterterrorism and Security.","author":"Hasan M. A.","unstructured":"Hasan , M. A. , Chaoji , V. , Salem , S. , and Zaki , M . 2006. Link prediction using supervised learning . In Proceedings of the SDM Workshop on Link Analysis, Counterterrorism and Security. Hasan, M. A., Chaoji, V., Salem, S., and Zaki, M. 2006. Link prediction using supervised learning. In Proceedings of the SDM Workshop on Link Analysis, Counterterrorism and Security."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2003.1208999"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (LinkKDD'06)","author":"Huan Z.","year":"2006","unstructured":"Huan , Z. 2006 . Link prediction based on graph topology: The predictive value of the generalized clustering coefficient . In Proceedings of 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (LinkKDD'06) . ACM, New York. Huan, Z. 2006. Link prediction based on graph topology: The predictive value of the generalized clustering coefficient. In Proceedings of 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (LinkKDD'06). ACM, New York."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2006.8"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150476"},{"key":"e_1_2_1_22_1","doi-asserted-by":"crossref","unstructured":"Kunegis J. De Luca E. and \n      Albayrak S\n  . \n  2010\n  . The link prediction problem in bipartite networks. In Computational Intelligence for Knowledge-Based Systems Design E. Hllermeier et al. Eds. Lecture Notes in Computer Science vol. \n  6178 Springer Berlin 380--389.   Kunegis J. De Luca E. and Albayrak S. 2010. The link prediction problem in bipartite networks. In Computational Intelligence for Knowledge-Based Systems Design E. Hllermeier et al. Eds. Lecture Notes in Computer Science vol. 6178 Springer Berlin 380--389.","DOI":"10.1007\/978-3-642-14049-5_39"},{"key":"e_1_2_1_23_1","volume-title":"Evolution of Social Networks","volume":"1","author":"Leenders R.","year":"1997","unstructured":"Leenders , R. 1997 . Longitudinal behavior of network structure and actor attributes: Modeling interdependence of contagion and selection . In Evolution of Social Networks , Vol. 1 , P. Doreian and F. Stokman, Eds. Leenders, R. 1997. Longitudinal behavior of network structure and actor attributes: Modeling interdependence of contagion and selection. In Evolution of Social Networks, Vol. 1, P. Doreian and F. Stokman, Eds."},{"key":"e_1_2_1_24_1","volume-title":"Proceedings of International Conference on Weblogs and Social Media (ICWSM'07)","author":"Lerman K.","year":"2047","unstructured":"Lerman , K. and Jones , L . 2007. Social browsing on flickr . In Proceedings of International Conference on Weblogs and Social Media (ICWSM'07) . http:\/\/arxiv.org\/abs\/cs.HC\/061 2047 . Lerman, K. and Jones, L. 2007. Social browsing on flickr. In Proceedings of International Conference on Weblogs and Social Media (ICWSM'07). http:\/\/arxiv.org\/abs\/cs.HC\/0612047."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835804.1835855"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1401890.1401948"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1367497.1367620"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/1772690.1772756"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1367497.1367589"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/956863.956972"},{"key":"e_1_2_1_31_1","volume-title":"Proceedings of the 15th International Conference on Machine Learning (ICML). J. W. Shavlik, Ed., Morgan Kaufmann, 296--304","author":"Lin D.","year":"1998","unstructured":"Lin , D. 1998 . An information-theoretic definition of similarity . In Proceedings of the 15th International Conference on Machine Learning (ICML). J. W. Shavlik, Ed., Morgan Kaufmann, 296--304 . Lin, D. 1998. An information-theoretic definition of similarity. In Proceedings of the 15th International Conference on Machine Learning (ICML). J. W. Shavlik, Ed., Morgan Kaufmann, 296--304."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/1651274.1651285"},{"key":"e_1_2_1_33_1","unstructured":"L\u00fc L. and Zhou T. 2010. Link prediction in complex networks: A survey. Preprint. http:\/\/arxiv.org\/abs\/1010.0725.  L\u00fc L. and Zhou T. 2010. Link prediction in complex networks: A survey. Preprint. http:\/\/arxiv.org\/abs\/1010.0725."},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/1526709.1526796"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1145\/1557914.1557982"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/1379092.1379122"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/1149941.1149949"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.physa.2003.06.002"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1146\/annurev.soc.27.1.415"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/1397735.1397742"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/1298306.1298311"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/1718487.1718519"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.5555\/259573.259582"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/WI.2007.71"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.64.025102"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.89.208701"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.67.026126"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.68.036122"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.87.258701"},{"key":"e_1_2_1_50_1","volume-title":"Proceedings of the 2nd International Workshop on Multirelational Data Mining.","author":"Popescul A.","unstructured":"Popescul , A. , Popescul , R. , and Ungar , L. H . 2003. Structural logistic regression for link analysis . In Proceedings of the 2nd International Workshop on Multirelational Data Mining. Popescul, A., Popescul, R., and Ungar, L. H. 2003. Structural logistic regression for link analysis. In Proceedings of the 2nd International Workshop on Multirelational Data Mining."},{"key":"e_1_2_1_51_1","unstructured":"Prieur C. Cardon D. Beuscart J.-S. Pissard N. and Pons P. 2008. The strength of weak cooperation: A case study on flickr. Tech. rep. arXiv:0802.2317v1 CoRR.  Prieur C. Cardon D. Beuscart J.-S. Pissard N. and Pons P. 2008. The strength of weak cooperation: A case study on flickr. Tech. rep. arXiv:0802.2317v1 CoRR."},{"key":"e_1_2_1_52_1","volume-title":"Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer","author":"Salton G.","unstructured":"Salton , G. 1989. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer . Addison-Wesley , Boston, MA . Salton, G. 1989. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer. Addison-Wesley, Boston, MA."},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/1557914.1557947"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/1718487.1718521"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.72.036133"},{"key":"e_1_2_1_56_1","unstructured":"Shalizi C. and Thomas A. 2010. Homophily and contagion are generically confounded in observational social network studies. Preprint arxiv:1004.4704.  Shalizi C. and Thomas A. 2010. Homophily and contagion are generically confounded in observational social network studies. Preprint arxiv:1004.4704."},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDCSW.2006.36"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.1004008107"},{"key":"e_1_2_1_59_1","volume-title":"Proceedings of the Neural Information Processing Systems Conference (NIPS'03)","author":"Taskar B.","unstructured":"Taskar , B. , Wong , M. F. , Abbeel , P. , and Koller , D . 2003. Link prediction in relational data . In Proceedings of the Neural Information Processing Systems Conference (NIPS'03) . Taskar, B.,Wong, M. F., Abbeel, P., and Koller, D. 2003. Link prediction in relational data. In Proceedings of the Neural Information Processing Systems Conference (NIPS'03)."},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/WI.2007.60"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevE.65.066130"}],"container-title":["ACM Transactions on the Web"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2180861.2180866","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2180861.2180866","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T09:54:21Z","timestamp":1750240461000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2180861.2180866"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2012,5]]},"references-count":61,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2012,5]]}},"alternative-id":["10.1145\/2180861.2180866"],"URL":"https:\/\/doi.org\/10.1145\/2180861.2180866","relation":{},"ISSN":["1559-1131","1559-114X"],"issn-type":[{"value":"1559-1131","type":"print"},{"value":"1559-114X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2012,5]]},"assertion":[{"value":"2010-07-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2011-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2012-06-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}