{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T21:25:09Z","timestamp":1740173109444,"version":"3.37.3"},"reference-count":30,"publisher":"Springer Fachmedien Wiesbaden GmbH","issue":"3","license":[{"start":{"date-parts":[[2018,1,11]],"date-time":"2018-01-11T00:00:00Z","timestamp":1515628800000},"content-version":"unspecified","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["HMD"],"published-print":{"date-parts":[[2018,6]]},"DOI":"10.1365\/s40702-017-0387-1","type":"journal-article","created":{"date-parts":[[2018,1,11]],"date-time":"2018-01-11T10:06:16Z","timestamp":1515665176000},"page":"581-600","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Erkennung von Duplikaten in Big Data am Fallbeispiel der digitalen Musiknutzung","Detection of Duplicates in Big Data in the Use Case of Digital Music Usage"],"prefix":"10.1365","volume":"55","author":[{"given":"Tobias","family":"Lindner","sequence":"first","affiliation":[]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4508-7667","authenticated-orcid":false,"given":"Peter","family":"Mandl","sequence":"additional","affiliation":[]},{"given":"Nikolai","family":"Bauer","sequence":"additional","affiliation":[]},{"given":"Markus","family":"Grimm","sequence":"additional","affiliation":[]}],"member":"93","published-online":{"date-parts":[[2018,1,11]]},"reference":[{"key":"387_CR2","volume-title":"Amazon Elastic MapReduce (EMR)","author":"Amazon","year":"2016","unstructured":"Amazon (2016) Amazon Elastic MapReduce (EMR). https:\/\/aws.amazon.com\/de\/elasticmapreduce\/. Zugegriffen: 14. M\u00e4rz 2016"},{"key":"387_CR1","doi-asserted-by":"crossref","DOI":"10.3139\/9783446426535","volume-title":"Datenqualit\u00e4t erfolgreich steuern \u2013 Praxisl\u00f6sungen f\u00fcr Business-Intelligence-Projekte. 2., vollst\u00e4ndig \u00fcberarbeitete und erweiterte Auflage","author":"D Apel","year":"2010","unstructured":"Apel D, Behme W, Eberlei R, Merighi C (2010) Datenqualit\u00e4t erfolgreich steuern \u2013 Praxisl\u00f6sungen f\u00fcr Business-Intelligence-Projekte. 2., vollst\u00e4ndig \u00fcberarbeitete und erweiterte Auflage. Carl Hanser, M\u00fcnchen"},{"key":"387_CR3","volume-title":"Modern information retrieval","author":"R Baeza-Yates","year":"1999","unstructured":"Baeza-Yates R, Ribeiro-Neto B (1999) Modern information retrieval. Addison-Wesley, Harlow"},{"key":"387_CR4","first-page":"39","volume-title":"SPIRE (String Processing and Information Retrieval)","author":"L Bergroth","year":"2000","unstructured":"Bergroth L, Hakonen H, Raita T (2000) A survey of longest common subsequence algorithms. In: SPIRE (String Processing and Information Retrieval), S\u00a039\u201348"},{"key":"387_CR5","volume-title":"Handbook of exact string matching algorithms","author":"C Charras","year":"2004","unstructured":"Charras C, Lecroq T (2004) Handbook of exact string matching algorithms. King\u2019s College Publications, London"},{"issue":"3","key":"387_CR7","doi-asserted-by":"crossref","first-page":"171","DOI":"10.1145\/363958.363994","volume":"7","author":"FJ Damerau","year":"1964","unstructured":"Damerau FJ (1964) A technique for computer detection and correction of spelling errors. Commun ACM 7(3):171\u2013176","journal-title":"Commun ACM"},{"key":"387_CR8","volume-title":"MapReduce: simplified data processing on large clusters. Google labs","author":"J Dean","year":"2004","unstructured":"Dean J, Ghemawat, Sanjay (2004) MapReduce: simplified data processing on large clusters. Google labs. OSDI\u201904: Sixth Symposium on Operating System Design and Implementation, San Francisco"},{"issue":"3","key":"387_CR9","doi-asserted-by":"crossref","first-page":"297","DOI":"10.2307\/1932409","volume":"26","author":"LR Dice","year":"1945","unstructured":"Dice LR (1945) Measures of the amount of ecologic association between species. Ecology 26(3):297\u2013302","journal-title":"Ecology"},{"key":"387_CR10","volume-title":"Datenbank und Marktplatz f\u00fcr Musik auf Schallplatte, CD, Kassette und anderen Formaten","author":"Discogs","year":"2015","unstructured":"Discogs (2015) Datenbank und Marktplatz f\u00fcr Musik auf Schallplatte, CD, Kassette und anderen Formaten. http:\/\/www.discogs.com. Zugegriffen: 2. Nov. 2015"},{"key":"387_CR12","doi-asserted-by":"crossref","first-page":"705","DOI":"10.1016\/0022-2836(82)90398-9","volume":"162","author":"O Gotoh","year":"1982","unstructured":"Gotoh O (1982) An improved algorithm for matching biological sequences. J\u00a0Mol Biol 162:705\u2013708","journal-title":"J Mol Biol"},{"issue":"2","key":"387_CR13","doi-asserted-by":"crossref","first-page":"147","DOI":"10.1002\/j.1538-7305.1950.tb00463.x","volume":"29","author":"RW Hamming","year":"1950","unstructured":"Hamming RW (1950) Error-detecting and error-correcting codes. Bell Syst Tech\u00a0J 29(2):147\u2013160","journal-title":"Bell Syst Tech J"},{"key":"387_CR14","first-page":"547","volume":"37","author":"P Jaccard","year":"1901","unstructured":"Jaccard P (1901) \u00c9tude comparative de la distribution florale dans une portion des Alpes et des Jura. Bull Soc Vaudoise Des Sci Nat 37:547\u2013579","journal-title":"Bull Soc Vaudoise Des Sci Nat"},{"issue":"406","key":"387_CR15","doi-asserted-by":"crossref","first-page":"414","DOI":"10.1080\/01621459.1989.10478785","volume":"84","author":"MA Jaro","year":"1989","unstructured":"Jaro MA (1989) Advances in record-linkage methodology as applied to matching the 1985 census of Tampa, Florida. J\u00a0Am Stat Assoc 84(406):414\u2013420","journal-title":"J Am Stat Assoc"},{"issue":"5\u20137","key":"387_CR16","doi-asserted-by":"crossref","first-page":"491","DOI":"10.1002\/sim.4780140510","volume":"14","author":"MA Jaro","year":"1995","unstructured":"Jaro MA (1995) Probabilistic linkage of large public health data files. Stat Med 14(5\u20137):491\u2013498","journal-title":"Stat Med"},{"issue":"4","key":"387_CR17","first-page":"845","volume":"163","author":"VI Levenshtein","year":"1965","unstructured":"Levenshtein VI (1965) Binary codes capable of correcting deletions, insertions, and reversals. Dokl Akad Nauk SSSR 163(4):845\u2013848 (Russisch, Englische \u00dcbersetzung in: Soviet Physics Doklady, 10(8) pp.\u00a0707\u2013710, 1966)","journal-title":"Dokl Akad Nauk SSSR"},{"key":"387_CR18","volume-title":"Apache mahout","author":"Mahout","year":"2016","unstructured":"Mahout (2016) Apache mahout. https:\/\/mahout.apache.org. Zugegriffen: 15. Febr. 2016"},{"issue":"1","key":"387_CR19","doi-asserted-by":"publisher","first-page":"126","DOI":"10.1365\/s40702-015-0191-8","volume":"53","author":"P Mandl","year":"2015","unstructured":"Mandl P, Bauer N, D\u00f6schl A, Grimm M, Wickertsheim L (2015) Die Verwertung von Online-Musiknutzungen \u2013 Herausforderungen f\u00fcr die IT. HMD Prax Wirtschaftsinform 53(1):126\u2013138. https:\/\/doi.org\/10.1365\/s40702-015-0191-8","journal-title":"HMD Prax Wirtschaftsinform"},{"key":"387_CR20","first-page":"267","volume-title":"The field matching problem: algorithms and applications","author":"AE Monge","year":"1996","unstructured":"Monge AE, Elkan CP (1996) The field matching problem: algorithms and applications. Proc. 2nd Int. Conf. on Knowledge Discovery and Data Mining, S\u00a0267\u2013270"},{"key":"387_CR21","volume-title":"MusicBrainz","author":"MusicBrainz","year":"2015","unstructured":"MusicBrainz (2015) MusicBrainz. https:\/\/www.musicbrainz.org. Zugegriffen: 2. Nov. 2015"},{"key":"387_CR22","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-031-01835-0","volume-title":"An introduction to duplicate detection","author":"F Naumann","year":"2010","unstructured":"Naumann F, Herschel M (2010) An introduction to duplicate detection. Morgan and Claypool, San Rafael"},{"key":"387_CR23","doi-asserted-by":"publisher","DOI":"10.1145\/375360.375365","author":"G Navarro","year":"1999","unstructured":"Navarro G (1999) A guided tour to approximate string matching. ACM Comput Surv. https:\/\/doi.org\/10.1145\/375360.375365","journal-title":"ACM Comput Surv"},{"key":"387_CR25","first-page":"531","volume-title":"Building on progress: expanding the research infrastructure for the social, economic, and behavioral sciences","author":"R Schnell","year":"2010","unstructured":"Schnell R (2010) Record linkage from a\u00a0technical point of view. In: German Data Forum (RatSWD) (Hrsg) Building on progress: expanding the research infrastructure for the social, economic, and behavioral sciences, Bd. 1. Budrich UniPress, Opladen, S\u00a0531\u2013545"},{"key":"387_CR24","volume-title":"Algorithmik","author":"U Sch\u00f6ning","year":"2001","unstructured":"Sch\u00f6ning U (2001) Algorithmik, 13.\u00a0Aufl. Spektrum Akademischer Verlag, Heidelberg","edition":"13"},{"key":"387_CR26","volume-title":"Die verwendete SimMetrics-Bibliothek","author":"SimMetrics","year":"2016","unstructured":"SimMetrics (2016) Die verwendete SimMetrics-Bibliothek. https:\/\/github.com\/Simmetrics\/simmetrics. Zugegriffen: 8. Febr. 2016"},{"issue":"4","key":"387_CR27","first-page":"35","volume":"24","author":"A Singhal","year":"2001","unstructured":"Singhal A (2001) Modern information retrieval: a brief overview. Bull IEEE Comput Soc Tech Comm Data Eng 24(4):35\u201343","journal-title":"Bull IEEE Comput Soc Tech Comm Data Eng"},{"key":"387_CR28","doi-asserted-by":"crossref","first-page":"195","DOI":"10.1016\/0022-2836(81)90087-5","volume":"147","author":"TF Smith","year":"1981","unstructured":"Smith TF, Waterman MS (1981) Identification of common molecular subsequences. J\u00a0Mol Biol 147:195\u2013197","journal-title":"J Mol Biol"},{"issue":"4","key":"387_CR29","first-page":"1","volume":"5","author":"T S\u00f8rensen","year":"1948","unstructured":"S\u00f8rensen T (1948) A method of establishing groups of equal amplitude in plant sociology based on similarity of species and its application to analyses of the vegetation on Danish commons. K Dan Videnskab Selsk 5(4):1\u201334","journal-title":"K. Dan. Videnskab. Selsk."},{"key":"387_CR30","volume-title":"Acceleration of the Smith-Waterman algorithm for DNA sequence alignment using an FPGA platform","author":"B Strengholt","year":"2013","unstructured":"Strengholt B, Brobbel M (2013) Acceleration of the Smith-Waterman algorithm for DNA sequence alignment using an FPGA platform. Delft University of Technology, Delft"},{"key":"387_CR31","first-page":"354","volume-title":"String comparator metrics and enhanced decision rules in the Fellegi-Sunter model of record linkage","author":"WE Winkler","year":"1990","unstructured":"Winkler WE (1990) String comparator metrics and enhanced decision rules in the Fellegi-Sunter model of record linkage. Proceedings of the Section on Survey Research Methods (American Statistical Association), S 354\u2013359"},{"key":"387_CR32","volume-title":"n application of the Fellegi-Sunter model of record linkage to the 1990\u202fU.S. Census. Technical report, US bureau of the census","author":"WE Winkler","year":"1991","unstructured":"Winkler WE, Thibaudeau Y (1991) n application of the Fellegi-Sunter model of record linkage to the 1990\u202fU.S. Census. Technical report, US bureau of the census"}],"container-title":["HMD Praxis der Wirtschaftsinformatik"],"original-title":[],"language":"de","link":[{"URL":"http:\/\/link.springer.com\/article\/10.1365\/s40702-017-0387-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1365\/s40702-017-0387-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.1365\/s40702-017-0387-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,8,12]],"date-time":"2022-08-12T05:54:13Z","timestamp":1660283653000},"score":1,"resource":{"primary":{"URL":"http:\/\/link.springer.com\/10.1365\/s40702-017-0387-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,1,11]]},"references-count":30,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2018,6]]}},"alternative-id":["387"],"URL":"https:\/\/doi.org\/10.1365\/s40702-017-0387-1","relation":{},"ISSN":["1436-3011","2198-2775"],"issn-type":[{"type":"print","value":"1436-3011"},{"type":"electronic","value":"2198-2775"}],"subject":[],"published":{"date-parts":[[2018,1,11]]}}}