{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:35:03Z","timestamp":1750307703355,"version":"3.41.0"},"reference-count":47,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2009,7,1]],"date-time":"2009-07-01T00:00:00Z","timestamp":1246406400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Internet Technol."],"published-print":{"date-parts":[[2009,7]]},"abstract":"<jats:p>There is an increasing number of people reading, writing, and commenting on blogs. According to a recent survey made by Technorati, there are about 75,000 new blogs and 1.2 million new posts everyday. However, it is difficult and time consuming for a blog reader to find the most interesting posts in the huge and dynamic blog world. In this article, an online Personalized Blog Reader (PBR) system is proposed, which facilitates blog readers in browsing the coolest and newest blog posts of their interests by automatically clustering the most relevant stories. PBR aims to make a user's potential favorite topics always ranked higher than those nonfavorite ones. This is accomplished in the following steps. First, the system collects and provides a unified incremental index of posts coming from different blogs. Then, an incremental clustering algorithm with a flexible half-bounded window of observation is proposed to satisfy the requirements of online processing. It learns people's personalized reading preferences to present a user with a final reading list. The experimental results show that the proposed incremental clustering algorithm is effective and efficient, and the personalization of the PBR performs well.<\/jats:p>","DOI":"10.1145\/1552291.1552292","type":"journal-article","created":{"date-parts":[[2009,7,28]],"date-time":"2009-07-28T12:43:55Z","timestamp":1248785035000},"page":"1-26","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":5,"title":["An online blog reading system by topic clustering and personalized ranking"],"prefix":"10.1145","volume":"9","author":[{"given":"Xin","family":"Li","sequence":"first","affiliation":[{"name":"Peking University"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jun","family":"Yan","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Weiguo","family":"Fan","sequence":"additional","affiliation":[{"name":"Virginia Tech"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ning","family":"Liu","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shuicheng","family":"Yan","sequence":"additional","affiliation":[{"name":"National University of Singapore"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Zheng","family":"Chen","sequence":"additional","affiliation":[{"name":"Microsoft Research Asia"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2009,7,30]]},"reference":[{"volume-title":"Proceedings of the 13th International World Wide Web Conference Workshop on the Weblogging Ecosystem. 35--39","author":"Adar E.","key":"e_1_2_1_1_1","unstructured":"Adar , E. , Zhang , L. , Adamic , L. A. , and Lukose , R. M . 2004. Implicit structure and the dynamics of blogspace . In Proceedings of the 13th International World Wide Web Conference Workshop on the Weblogging Ecosystem. 35--39 . Adar, E., Zhang, L., Adamic, L. A., and Lukose, R. M. 2004. Implicit structure and the dynamics of blogspace. In Proceedings of the 13th International World Wide Web Conference Workshop on the Weblogging Ecosystem. 35--39."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/290941.290954"},{"volume-title":"Proceedings of the 2nd Annual Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics, E. Adar, N. Glance, and M. Hurst, Eds.","author":"Avesani P.","key":"e_1_2_1_3_1","unstructured":"Avesani , P. , Cova , M. , Hayes , C. , and Massa , P . 2005. Learning contextualised Weblog topics . In Proceedings of the 2nd Annual Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics, E. Adar, N. Glance, and M. Hurst, Eds. Avesani, P., Cova, M., Hayes, C., and Massa, P. 2005. Learning contextualised Weblog topics. In Proceedings of the 2nd Annual Workshop on the Weblogging Ecosystem: Aggregation, Analysis and Dynamics, E. Adar, N. Glance, and M. Hurst, Eds."},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/290941.290970"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/245108.245124"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:MACH.0000033116.57574.95"},{"volume-title":"Proceedings of the 33rd International Conference on Very Large Databases. 806--817","author":"Bansal N.","key":"e_1_2_1_7_1","unstructured":"Bansal , N. , Chiang , F. , Koudas , N. , and Tompa , W. F . 2007. Seeking stable clusters in the blogosphere . In Proceedings of the 33rd International Conference on Very Large Databases. 806--817 . Bansal, N., Chiang, F., Koudas, N., and Tompa, W. F. 2007. Seeking stable clusters in the blogosphere. In Proceedings of the 33rd International Conference on Very Large Databases. 806--817."},{"volume-title":"Proceedings of the 33rd International Conference on Very Large Databases. 1410--1413","author":"Bansal N.","key":"e_1_2_1_8_1","unstructured":"Bansal , N. and Koudas , N . 2007. BLOGSCOPE: A system for online analysis of high volume text streams . In Proceedings of the 33rd International Conference on Very Large Databases. 1410--1413 . Bansal, N. and Koudas, N. 2007. BLOGSCOPE: A system for online analysis of high volume text streams. In Proceedings of the 33rd International Conference on Very Large Databases. 1410--1413."},{"key":"e_1_2_1_9_1","unstructured":"Bern M. and Eppstein D. 1996. Approximation algorithms for geometric problems. In Approximation Algorithms for NP-Hard Problems D. S. Hochbaum Ed. PWS Publishing Company Boston 296--345.   Bern M. and Eppstein D. 1996. Approximation algorithms for geometric problems. In Approximation Algorithms for NP-Hard Problems D. S. Hochbaum Ed. PWS Publishing Company Boston 296--345."},{"key":"e_1_2_1_10_1","unstructured":"Bonett M. 2001. Personalization of Web services: Opportunities and challenges. Ariadne 28.  Bonett M. 2001. Personalization of Web services: Opportunities and challenges. Ariadne 28."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/860435.860495"},{"key":"e_1_2_1_12_1","volume-title":"AAAI Spring Symposium on Computational Approaches to Analyzing Weblogs","volume":"4737","author":"Brooks C. H.","unstructured":"Brooks , C. H. and Andmontanez , N . 2005. An analysis of the effectiveness of tagging in blogs . In AAAI Spring Symposium on Computational Approaches to Analyzing Weblogs , vol. 4737 , 1--20. Brooks, C. H. and Andmontanez, N. 2005. An analysis of the effectiveness of tagging in blogs. In AAAI Spring Symposium on Computational Approaches to Analyzing Weblogs, vol. 4737, 1--20."},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1035134.1035164"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2006.61"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.5210\/fm.v10i12.1300"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2004.1269663"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1080\/07421222.2005.11045828"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/1062745.1062760"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/276627.276652"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.5555\/1109557.1109686"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/MIS.2006.86"},{"key":"e_1_2_1_23_1","unstructured":"Hayes C. Avesani P. and Veeramachaneni S. 2006a. An analysis of the use of tags in a blog recommender system. ITC-IRST Tech. rep. IJCAI: 2772--2777.   Hayes C. Avesani P. and Veeramachaneni S. 2006a. An analysis of the use of tags in a blog recommender system. ITC-IRST Tech. rep. IJCAI: 2772--2777."},{"volume-title":"Proceedings of the 7th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (ECML\/PKDD) Workshop on Web Mining.","author":"Hayes C.","key":"e_1_2_1_24_1","unstructured":"Hayes , C. , Avesani , P. , and Veeramachaneni , S . 2006b. An analysis of bloggers and topics for a blog recommender system . In Proceedings of the 7th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (ECML\/PKDD) Workshop on Web Mining. Hayes, C., Avesani, P., and Veeramachaneni, S. 2006b. An analysis of bloggers and topics for a blog recommender system. In Proceedings of the 7th European Conference on Machine Learning and the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (ECML\/PKDD) Workshop on Web Mining."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/HICSS.2005.167"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/775152.775191"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/345508.345650"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.websem.2005.06.002"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1006\/jpdc.1997.1404"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:AIRE.0000036255.53433.26"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/324133.324140"},{"volume-title":"Proceedings of the 14th International Conference on Machine Learning. 170--178","author":"Koller D.","key":"e_1_2_1_32_1","unstructured":"Koller , D. and Sahami , M . 1997. Hierarchically classifying documents using very few words . In Proceedings of the 14th International Conference on Machine Learning. 170--178 . Koller, D. and Sahami, M. 1997. Hierarchically classifying documents using very few words. In Proceedings of the 14th International Conference on Machine Learning. 170--178."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1145\/775152.775233"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2004.1264820"},{"volume-title":"Proceedings of the 6th International Workshop on Program Comprehension. 45--53","author":"Mancoridis S.","key":"e_1_2_1_35_1","unstructured":"Mancoridis , S. , Mitchell , B. , Rorres , C. , Chen , Y. , and Gansner , E . 1998. Using automatic clustering to produce high-level system organizations of source code . In Proceedings of the 6th International Workshop on Program Comprehension. 45--53 . Mancoridis, S., Mitchell, B., Rorres, C., Chen, Y., and Gansner, E. 1998. Using automatic clustering to produce high-level system organizations of source code. In Proceedings of the 6th International Workshop on Program Comprehension. 45--53."},{"key":"e_1_2_1_36_1","volume-title":"Proceedings of the International Communication Association Conference.","author":"Marlow C.","year":"2004","unstructured":"Marlow , C. 2004 . Audience, structure and authority in the Weblog community . In Proceedings of the International Communication Association Conference. Marlow, C. 2004. Audience, structure and authority in the Weblog community. In Proceedings of the International Communication Association Conference."},{"key":"e_1_2_1_37_1","unstructured":"Page L. Brin S. Motwani R. and Winograd T. 1998. The PageRank citation ranking: Bringing order to the Web. Tech. rep. Stanford University.  Page L. Brin S. Motwani R. and Winograd T. 1998. The PageRank citation ranking: Bringing order to the Web. Tech. rep. Stanford University."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/1135777.1135883"},{"key":"e_1_2_1_39_1","volume-title":"ISKO Italy-UniMIB Meeting.","author":"Quintarelli E.","year":"2005","unstructured":"Quintarelli , E. 2005 . Folksonomies: Power to the people . ISKO Italy-UniMIB Meeting. Quintarelli, E. 2005. Folksonomies: Power to the people. ISKO Italy-UniMIB Meeting."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1080\/01621459.1971.10482356"},{"volume-title":"An Introduction to Modern Information Retrieval","author":"Salton G.","key":"e_1_2_1_41_1","unstructured":"Salton , G. , and McGill , M. J. 1983. An Introduction to Modern Information Retrieval . McGraw-Hill, Inc. , New York . Salton, G., and McGill, M. J. 1983. An Introduction to Modern Information Retrieval. McGraw-Hill, Inc., New York."},{"volume-title":"Proceedings of the 5th International Conference on Computer and Information Technology.","author":"Sarwar B. M.","key":"e_1_2_1_42_1","unstructured":"Sarwar , B. M. , Karypis , G. , Konstan , J. , and Riedl , J . 2002. Recommender systems for large-scale e-commerce: Scalable neighborhood formation using clustering . In Proceedings of the 5th International Conference on Computer and Information Technology. Sarwar, B. M., Karypis, G., Konstan, J., and Riedl, J. 2002. Recommender systems for large-scale e-commerce: Scalable neighborhood formation using clustering. In Proceedings of the 5th International Conference on Computer and Information Technology."},{"volume-title":"Proceedings of the 5th Dual-Use Technologies and Applications Conference. 318--324","author":"Singhal A.","key":"e_1_2_1_43_1","unstructured":"Singhal , A. and Salton , G . 1995. Automatic text browsing using vector space model . In Proceedings of the 5th Dual-Use Technologies and Applications Conference. 318--324 . Singhal, A. and Salton, G. 1995. Automatic text browsing using vector space model. In Proceedings of the 5th Dual-Use Technologies and Applications Conference. 318--324."},{"volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. 757--760","author":"Solomonoff A.","key":"e_1_2_1_44_1","unstructured":"Solomonoff , A. , Mielke , A. , Schmidt , M. , and Gish , H . 1998. Clustering speakers by their voices . In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. 757--760 . Solomonoff, A., Mielke, A., Schmidt, M., and Gish, H. 1998. Clustering speakers by their voices. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. 757--760."},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.244673"},{"key":"e_1_2_1_46_1","volume-title":"-C. T","author":"Tsai T.-M.","year":"2006","unstructured":"Tsai , T.-M. , Shih , C.-C. , and Chou , S . -C. T . 2006 . Personalized blog recommendation using the value, semantic, and social model. In Innovations in Information Technology . 1--5. Tsai, T.-M., Shih, C.-C., and Chou, S.-C. T. 2006. Personalized blog recommendation using the value, semantic, and social model. In Innovations in Information Technology. 1--5."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1145\/290941.290953"}],"container-title":["ACM Transactions on Internet Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1552291.1552292","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1552291.1552292","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T13:30:04Z","timestamp":1750253404000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1552291.1552292"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2009,7]]},"references-count":47,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2009,7]]}},"alternative-id":["10.1145\/1552291.1552292"],"URL":"https:\/\/doi.org\/10.1145\/1552291.1552292","relation":{},"ISSN":["1533-5399","1557-6051"],"issn-type":[{"type":"print","value":"1533-5399"},{"type":"electronic","value":"1557-6051"}],"subject":[],"published":{"date-parts":[[2009,7]]},"assertion":[{"value":"2006-10-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2008-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2009-07-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}