{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,8]],"date-time":"2026-05-08T04:31:47Z","timestamp":1778214707011,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":21,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,1,16]],"date-time":"2021-01-16T00:00:00Z","timestamp":1610755200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,1,16]]},"DOI":"10.1145\/3451471.3451489","type":"proceedings-article","created":{"date-parts":[[2021,7,13]],"date-time":"2021-07-13T22:20:59Z","timestamp":1626214859000},"page":"106-112","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Analysis of Clustering Algorithms to Clean and Normalize Early Modern European Book Titles"],"prefix":"10.1145","author":[{"given":"Evan","family":"Bryer","sequence":"first","affiliation":[{"name":"University of South Carolina, United States"}]},{"given":"Theppatorn","family":"Rhujittawiwat","sequence":"additional","affiliation":[{"name":"University of South Carolina, United States"}]},{"given":"Samyu","family":"Comandur","sequence":"additional","affiliation":[{"name":"University of South Carolina, United States"}]},{"given":"Vasco","family":"Madrid","sequence":"additional","affiliation":[{"name":"University of South Carolina, United States"}]},{"given":"Stephanie","family":"Riley","sequence":"additional","affiliation":[{"name":"University of South Carolina, United States"}]},{"given":"John","family":"Rose","sequence":"additional","affiliation":[{"name":"University of South Carolina, United States"}]},{"given":"Colin","family":"Wilder","sequence":"additional","affiliation":[{"name":"University of South Carolina, United States"}]}],"member":"320","published-online":{"date-parts":[[2021,7,13]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"Changing the tasks of cataloging. Journal of library adminis-tration25, 2-3","author":"Leighton Lee","year":"1998","unstructured":"Lee Leighton . 1998. Changing the tasks of cataloging. Journal of library adminis-tration25, 2-3 ( 1998 ), 45\u201354 Lee Leighton. 1998. Changing the tasks of cataloging. Journal of library adminis-tration25, 2-3 (1998), 45\u201354"},{"key":"e_1_3_2_1_2_1","volume-title":"Congress. Network Development and MARC Standards Office, Frequently Asked Questions (FAQ). Retrieved","author":"Library","year":"2020","unstructured":"Library of Congress. Network Development and MARC Standards Office, Frequently Asked Questions (FAQ). Retrieved November 2, 2020 from https:\/\/www.loc.gov\/marc\/faq.html#marc21vsuscan Library of Congress. Network Development and MARC Standards Office, Frequently Asked Questions (FAQ). Retrieved November 2, 2020 from https:\/\/www.loc.gov\/marc\/faq.html#marc21vsuscan"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1080\/07317131.2011.574519"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1300\/J111v25n04_05"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1080\/01930820903260648"},{"key":"e_1_3_2_1_6_1","volume-title":"Chronology: Noteworthy Achievements of the Cooperative1967\u20132008","author":"Schieber Phil","year":"2009","unstructured":"Phil Schieber . 2009 . Chronology: Noteworthy Achievements of the Cooperative1967\u20132008 . Journal of Library Administration 49, 7 (2009), 763\u2013775 Phil Schieber. 2009. Chronology: Noteworthy Achievements of the Cooperative1967\u20132008.Journal of Library Administration49, 7 (2009), 763\u2013775"},{"key":"e_1_3_2_1_7_1","volume-title":"OCLC Technology. Retrieved","author":"OCLC.","year":"2020","unstructured":"OCLC. OCLC Technology. Retrieved November 2, 2020 from https:\/\/www.oclc.org\/en\/technology.html OCLC. OCLC Technology. Retrieved November 2, 2020 from https:\/\/www.oclc.org\/en\/technology.html"},{"key":"e_1_3_2_1_8_1","unstructured":"OCLC. Inside WorldCat. Retrieved November 2 2020 from https:\/\/www.oclc.org\/en\/worldcat\/inside-worldcat.html OCLC. Inside WorldCat. Retrieved November 2 2020 from https:\/\/www.oclc.org\/en\/worldcat\/inside-worldcat.html"},{"key":"e_1_3_2_1_9_1","volume-title":"OCLC Delivers Quality. Retrieved","author":"OCLC.","year":"2020","unstructured":"OCLC. OCLC Delivers Quality. Retrieved November 2, 2020 from https:\/\/www.oclc.org\/en\/worldcat\/cooperative-quality.html OCLC. OCLC Delivers Quality. Retrieved November 2, 2020 from https:\/\/www.oclc.org\/en\/worldcat\/cooperative-quality.html"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"ARIF MARDI WALUYO EKO PRASETYO and ARIF ARIZAL. 2018. CLASIFICA-TION SYSTEM OF LIBRARY BOOK BASED ON SIMILARITY OF THE BOOK TI-TLE USING K-MEANS METHOD (CASE STUDY LIBRARY OF BHAYANGKARASURABAYA).JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCI-ENCES VOL 3 NUMBER 1 JUNE 20183 1 (2018) ARIF MARDI WALUYO EKO PRASETYO and ARIF ARIZAL. 2018. CLASIFICA-TION SYSTEM OF LIBRARY BOOK BASED ON SIMILARITY OF THE BOOK TI-TLE USING K-MEANS METHOD (CASE STUDY LIBRARY OF BHAYANGKARASURABAYA).JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCI-ENCES VOL 3 NUMBER 1 JUNE 20183 1 (2018)","DOI":"10.54732\/jeecs.v3i1.146"},{"key":"e_1_3_2_1_11_1","volume-title":"A quantitative approach to book-printing in Sweden and Finland, 1640\u20131828.HistoricalMethods: A Journal of Quantitative and Interdisciplinary History52, 1","author":"Tolonen Mikko","year":"2019","unstructured":"Mikko Tolonen , Leo Lahti , Hege Roivainen , and Jani Marjanen . 2019. A quantitative approach to book-printing in Sweden and Finland, 1640\u20131828.HistoricalMethods: A Journal of Quantitative and Interdisciplinary History52, 1 ( 2019 ),57\u201378 Mikko Tolonen, Leo Lahti, Hege Roivainen, and Jani Marjanen. 2019. A quantitative approach to book-printing in Sweden and Finland, 1640\u20131828.HistoricalMethods: A Journal of Quantitative and Interdisciplinary History52, 1 (2019),57\u201378"},{"key":"e_1_3_2_1_12_1","unstructured":"Mikko Tolonen Jani Marjanen Hege Roivainen and Leo Lahti. 2019. Scaling Up Bibliographic Data Science. In DHN. 450\u2013456 Mikko Tolonen Jani Marjanen Hege Roivainen and Leo Lahti. 2019. Scaling Up Bibliographic Data Science. In DHN. 450\u2013456"},{"key":"e_1_3_2_1_13_1","volume-title":"A National Public Sphere? Analyzing the Language, Location, and Form of Newspapers in Finland, 1771\u20131917.Journal of European Periodical Studies4, 1","author":"Marjanen Jani","year":"2019","unstructured":"Jani Marjanen , Ville Vaara , Antti Kanner , Hege Roivainen , Eetu M\u00e4kel\u00e4 , Leo Lahti ,and Mikko Tolonen . 2019. A National Public Sphere? Analyzing the Language, Location, and Form of Newspapers in Finland, 1771\u20131917.Journal of European Periodical Studies4, 1 ( 2019 ), 54\u201377 Jani Marjanen, Ville Vaara, Antti Kanner, Hege Roivainen, Eetu M\u00e4kel\u00e4, Leo Lahti,and Mikko Tolonen. 2019. A National Public Sphere? Analyzing the Language, Location, and Form of Newspapers in Finland, 1771\u20131917.Journal of European Periodical Studies4, 1 (2019), 54\u201377"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/1835449.1835653"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/3322905.3322929"},{"key":"e_1_3_2_1_16_1","volume-title":"Book Genre Classification Based on Titles with Comparative Machine Learning Algorithms. In2019 IEEE 4th International Conference on Computer and Communication Systems (ICCCS). IEEE, 14\u201320","author":"Ozsarfati Eran","year":"2019","unstructured":"Eran Ozsarfati , Egemen Sahin , Can Jozef Saul , and Alper Yilmaz . 2019 . Book Genre Classification Based on Titles with Comparative Machine Learning Algorithms. In2019 IEEE 4th International Conference on Computer and Communication Systems (ICCCS). IEEE, 14\u201320 Eran Ozsarfati, Egemen Sahin, Can Jozef Saul, and Alper Yilmaz. 2019. Book Genre Classification Based on Titles with Comparative Machine Learning Algorithms. In2019 IEEE 4th International Conference on Computer and Communication Systems (ICCCS). IEEE, 14\u201320"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Avi Bleiweiss. 2017. A Hierarchical Book Representation of Word Embeddings for Effective Semantic Clustering and Search. In ICAART (2). 154\u2013163 Avi Bleiweiss. 2017. A Hierarchical Book Representation of Word Embeddings for Effective Semantic Clustering and Search. In ICAART (2). 154\u2013163","DOI":"10.5220\/0006192701540163"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.5555\/644108.644253"},{"key":"#cr-split#-e_1_3_2_1_19_1.1","doi-asserted-by":"crossref","unstructured":"Senthil Shanmugasundaram and L. Robert. 2011. A Comparative Study of Text Compression Algorithms. ICTACT Journal on Communication Technology2 (122011). https:\/\/doi.org\/10.21917\/ijct.2011.0062 10.21917\/ijct.2011.0062","DOI":"10.21917\/ijct.2011.0062"},{"key":"#cr-split#-e_1_3_2_1_19_1.2","doi-asserted-by":"crossref","unstructured":"Senthil Shanmugasundaram and L. Robert. 2011. A Comparative Study of Text Compression Algorithms. ICTACT Journal on Communication Technology2 (122011). https:\/\/doi.org\/10.21917\/ijct.2011.0062","DOI":"10.21917\/ijct.2011.0062"},{"key":"e_1_3_2_1_20_1","volume-title":"Characteristics of duplicate records in oclc's online union catalog","author":"Rogers Edward","year":"1993","unstructured":"O'Neill, Edward T and Rogers , Sally A and Oskins , W Michael , \u201c Characteristics of duplicate records in oclc's online union catalog ,\u201d 1993 . O'Neill, Edward T and Rogers, Sally A and Oskins, W Michael, \u201cCharacteristics of duplicate records in oclc's online union catalog,\u201d 1993."}],"event":{"name":"ICSIM 2021: 2021 The 4th International Conference on Software Engineering and Information Management","location":"Yokohama Japan","acronym":"ICSIM 2021"},"container-title":["2021 The 4th International Conference on Software Engineering and Information Management"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3451471.3451489","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3451471.3451489","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:02:59Z","timestamp":1750197779000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3451471.3451489"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,16]]},"references-count":21,"alternative-id":["10.1145\/3451471.3451489","10.1145\/3451471"],"URL":"https:\/\/doi.org\/10.1145\/3451471.3451489","relation":{},"subject":[],"published":{"date-parts":[[2021,1,16]]},"assertion":[{"value":"2021-07-13","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}