{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T00:44:51Z","timestamp":1761007491536,"version":"build-2065373602"},"reference-count":6,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2012,1,11]],"date-time":"2012-01-11T00:00:00Z","timestamp":1326240000000},"content-version":"vor","delay-in-days":375,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc of Assoc for Info"],"published-print":{"date-parts":[[2011,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Library and museum digital collections are increasingly aggregated at various levels. Large\u2010scale aggregations, often characterized by heterogeneous or messy metadata, pose unique and growing challenges to aggregation administrators \u2013 not only in facilitating end\u2010user discovery and access, but in performing basic administrative and curatorial tasks in a scalable way, such as finding messy data and determining the overall topical landscape of the aggregation. This poster describes early findings on using statistical text analysis techniques to improve the scalability of an aggregation development workflow for a large\u2010scale aggregation. These techniques hold great promise for automating historically labor\u2010intensive evaluative aspects of aggregation development and form the basis for the development of an aggregator's dashboard. The aggregator's dashboard is planned as a statistical text\u2010analysis\u2010driven tool for supporting large\u2010scale aggregation development and maintenance, through multifaceted, automatic visualization of an aggregation's metadata quality and topical coverage. The administrator's dashboard will support principled yet scalable aggregation development.<\/jats:p>","DOI":"10.1002\/meet.2011.14504801319","type":"journal-article","created":{"date-parts":[[2012,1,11]],"date-time":"2012-01-11T12:23:03Z","timestamp":1326284583000},"page":"1-3","source":"Crossref","is-referenced-by-count":1,"title":["Semi\u2010automated collection evaluation for large\u2010scale aggregations"],"prefix":"10.1002","volume":"48","author":[{"given":"Katrina","family":"Fenlon","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Peter","family":"Organisciak","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jacob","family":"Jett","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Miles","family":"Efron","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","published-online":{"date-parts":[[2012,1,11]]},"reference":[{"key":"e_1_2_6_2_1","doi-asserted-by":"crossref","unstructured":"Blei D. M. &Jordan M. I.(2003).Modeling Annotated Data.Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval. New York NY USA: ACM.","DOI":"10.1145\/860458.860460"},{"key":"e_1_2_6_3_1","doi-asserted-by":"publisher","DOI":"10.1108\/10650750810847251"},{"key":"e_1_2_6_4_1","doi-asserted-by":"crossref","unstructured":"Efron M. Organisciak P. Efron M.(2011).Building Topic Models in a Federated Digital Library Through Selective Document Exclusion. InProceedings of the ASIS&T Annual Meeting. (New Orleans LA Oct. 9\u201313).","DOI":"10.1002\/meet.2011.14504801048"},{"key":"e_1_2_6_5_1","unstructured":"Hillmann D. Dushay N. Phipps J.(2004) \u201cImproving metadata quality: Augmentation and recombination.\u201d InProceedings of the 2004 international conference on Dublin Core and metadata applications: metadata across languages and cultures 11\u201314 October 2004 Shanghai China."},{"volume-title":"Preliminary Analysis of Item\u2010level Metadata Harvested (White paper)","year":"2006","author":"Jackson A.","key":"e_1_2_6_6_1"},{"key":"e_1_2_6_7_1","doi-asserted-by":"crossref","unstructured":"Palmer C. L. Zavalina O. Fenlon K.(2010).Beyond size and search: Building contextual mass in digital aggregations for scholarly use. InProceedings of the ASIS&T Annual Meeting. (Pittsburgh PA Oct. 22\u201327).","DOI":"10.1002\/meet.14504701213"}],"container-title":["Proceedings of the American Society for Information Science and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fmeet.2011.14504801319","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/asistdl.onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/meet.2011.14504801319","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,20]],"date-time":"2025-10-20T13:47:45Z","timestamp":1760968065000},"score":1,"resource":{"primary":{"URL":"https:\/\/asistdl.onlinelibrary.wiley.com\/doi\/10.1002\/meet.2011.14504801319"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,1]]},"references-count":6,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2011,1]]}},"alternative-id":["10.1002\/meet.2011.14504801319"],"URL":"https:\/\/doi.org\/10.1002\/meet.2011.14504801319","archive":["Portico"],"relation":{},"ISSN":["0044-7870","1550-8390"],"issn-type":[{"type":"print","value":"0044-7870"},{"type":"electronic","value":"1550-8390"}],"subject":[],"published":{"date-parts":[[2011,1]]}}}