{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,23]],"date-time":"2026-03-23T17:51:28Z","timestamp":1774288288174,"version":"3.50.1"},"reference-count":73,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2010,5,1]],"date-time":"2010-05-01T00:00:00Z","timestamp":1272672000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Knowl. Discov. Data"],"published-print":{"date-parts":[[2010,5]]},"abstract":"<jats:p>Insight into the growth (or shrinkage) of \u201cknowledge communities\u201d of authors that build on each other's work can be gained by studying the evolution over time of clusters of documents. We cluster documents based on the documents they cite in common using the Streemer clustering method, which finds cohesive foreground clusters (the knowledge communities) embedded in a diffuse background. We build predictive models with features based on the citation structure, the vocabulary of the papers, and the affiliations and prestige of the authors and use these models to study the drivers of community growth and the predictors of how widely a paper will be cited. 
We find that scientific knowledge communities tend to grow more rapidly if their publications build on diverse information and use narrow vocabulary and that papers that lie on the periphery of a community have the highest impact, while those not in any community have the lowest impact.<\/jats:p>","DOI":"10.1145\/1754428.1754430","type":"journal-article","created":{"date-parts":[[2010,6,1]],"date-time":"2010-06-01T12:21:35Z","timestamp":1275394895000},"page":"1-35","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":18,"title":["Analyzing knowledge communities using foreground and background clusters"],"prefix":"10.1145","volume":"4","author":[{"given":"Vasileios","family":"Kandylas","sequence":"first","affiliation":[{"name":"University of Pennsylvania, Pennsylvania"}]},{"given":"S. Phineas","family":"Upham","sequence":"additional","affiliation":[{"name":"University of Pennsylvania, Pennsylvania"}]},{"given":"Lyle H.","family":"Ungar","sequence":"additional","affiliation":[{"name":"University of Pennsylvania, Pennsylvania"}]}],"member":"320","published-online":{"date-parts":[[2010,5,28]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1111\/1467-9531.00117"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/1014052.1014111"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.5555\/944919.944937"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1143844.1143859"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.2307\/270805"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0378-8733(96)00301-2"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199105)42:4<252::AID-ASI2>3.0.CO;2-G"},{"key":"e_1_2_1_8_1","volume-title":"Invisible Colleges: Diffusion of Knowledge in Scientific Communities","author":"Crane D.","year":"1972"},{"key":"e_1_2_1_9_1","volume-title":"Proceedings of the 
16th Conference on Uncertainty in Artificial Intelligence (UAI'00)","author":"Dasgupta S."},{"key":"e_1_2_1_10_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1111\/j.2517-6161.1977.tb01600.x","article-title":"Maximum likelihood from incomplete data via the EM algorithm","volume":"39","author":"Dempster A.","year":"1977","journal-title":"J. Royal Statist. Soc."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007612920971"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/502512.502550"},{"key":"e_1_2_1_13_1","volume-title":"Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM'03)","author":"Dhillon I. S."},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1014052.1014118"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/956750.956764"},{"key":"e_1_2_1_16_1","unstructured":"Doreian P. 1979. On the delineation of small group structure. In Classifying Social Data H. C. Hudson Ed. Jossey-Bass San Francisco CA 215--230."},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining. AAAI Press","author":"Ester M."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/0378-8733(93)90007-8"},{"key":"e_1_2_1_19_1","volume-title":"Proceedings of the 20th International Conference of Machine Learning. AAAI Press","author":"Fern X. Z."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/347090.347121"},{"key":"e_1_2_1_21_1","volume-title":"Dynamic Social Network Modeling and Analysis: Workshop Summary and Papers. 
National Academy Press","author":"Freeman L.","year":"2003"},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jengtecman.2004.11.002"},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the 18th International Conference on Machine Learning (ICML'01)","author":"Getoor L."},{"key":"e_1_2_1_24_1","doi-asserted-by":"crossref","unstructured":"Gibson D. Kleinberg J. and Raghavan P. 1998. Inferring Web Communities from Link Topology. ACM New York NY.","DOI":"10.1145\/276627.276652"},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/276675.276685"},{"key":"e_1_2_1_26_1","first-page":"94","article-title":"Accounting for excess zeros and sample selection in Poisson and negative binomial regression models","author":"Greene W. H.","year":"1994","journal-title":"Working Papers"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1177\/030631277400400402"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.0307752101"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2003.1198387"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4379(01)00008-4"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1146\/annurev.soc.25.1.597"},{"key":"e_1_2_1_32_1","doi-asserted-by":"crossref","unstructured":"Hage J. and Meeus M. 2006. Innovation Science and Institutional Change. 
Oxford University Press Oxford UK.","DOI":"10.1093\/oso\/9780199299195.001.0001"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1086\/226424"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.2307\/1911191"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0306-4573(01)00046-2"},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/956750.956816"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.5555\/839282.840965"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/331499.331504"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/1281192.1281233"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDM.2007.22"},{"key":"e_1_2_1_41_1","volume-title":"Proceedings of the Conference on Uncertainty in Artificial Intelligence. Morgan Kaufmann, 282--293","author":"Kearns M. J."},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.1181"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/312129.312186"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1021967111530"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1145\/1081870.1081893"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.5555\/646491.694954"},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.5555\/2869931.2869936"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.5555\/645960.673929"},{"key":"e_1_2_1_49_1","doi-asserted-by":"crossref","unstructured":"McCullagh P. and Nelder J. 1989. Generalized Linear Models. 
Chapman & Hall\/CRC Boca Raton FL.","DOI":"10.1007\/978-1-4899-3242-6"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1177\/095169280201400104"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/980972.980999"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0378-8733(01)00042-9"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1007692713085"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/564376.564412"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/980972.980994"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.5465\/amr.1993.9402210152"},{"key":"e_1_2_1_57_1","volume-title":"Proceedings of the IEEE Conference on Advances in Digital Libraries. IEEE Computer Society","author":"Popescul A."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.1998.999064"},{"key":"e_1_2_1_59_1","volume-title":"Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR'97)","author":"Shi J."},{"key":"e_1_2_1_60_1","unstructured":"Slonim N. Friedman N. and Tishby N. 2001. Agglomerative multivariate information bottleneck. Advanc. Neur. Inform. Process. 
Syst."},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.10225"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02016661"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02017157"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02018057"},{"key":"e_1_2_1_65_1","first-page":"35","article-title":"A comparison of document clustering techniques","volume":"34","author":"Steinbach M.","year":"2000","journal-title":"Proceedings of the KDD Workshop on Text Mining"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1162\/153244303321897735"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1177\/030631277700700205"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1145\/1281192.1281266"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/1281192.1281269"},{"key":"e_1_2_1_70_1","unstructured":"Upham S. P. 2006. Communities of innovation. Ph.D. 
thesis University of Pennsylvania."},{"key":"e_1_2_1_71_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0165-1765(02)00262-8"},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/1150402.1150450"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1145\/233269.233324"}],"container-title":["ACM Transactions on Knowledge Discovery from Data"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1754428.1754430","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/1754428.1754430","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T11:22:50Z","timestamp":1750245770000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/1754428.1754430"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2010,5]]},"references-count":73,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2010,5]]}},"alternative-id":["10.1145\/1754428.1754430"],"URL":"https:\/\/doi.org\/10.1145\/1754428.1754430","relation":{},"ISSN":["1556-4681","1556-472X"],"issn-type":[{"value":"1556-4681","type":"print"},{"value":"1556-472X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2010,5]]},"assertion":[{"value":"2008-02-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2009-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2010-05-28","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}