{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,25]],"date-time":"2026-03-25T14:27:31Z","timestamp":1774448851036,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":21,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,12,7]],"date-time":"2017-12-07T00:00:00Z","timestamp":1512604800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,12,7]]},"DOI":"10.1145\/3166072.3166076","type":"proceedings-article","created":{"date-parts":[[2017,12,13]],"date-time":"2017-12-13T14:50:46Z","timestamp":1513176646000},"page":"1-4","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":12,"title":["K-Means Clustering of Biological Sequences"],"prefix":"10.1145","author":[{"given":"Timothy","family":"Chappell","sequence":"first","affiliation":[{"name":"School of Electrical Engineering and Computer Science, QUT, Brisbane, Queensland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shlomo","family":"Geva","sequence":"additional","affiliation":[{"name":"School of Electrical Engineering and Computer Science, QUT, Brisbane, Queensland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"James","family":"Hogan","sequence":"additional","affiliation":[{"name":"School of Electrical Engineering and Computer Science, QUT, Brisbane, Queensland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,12,7]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/509907.509965"},{"key":"e_1_3_2_1_2_1","volume-title":"The Sanger FASTQ file format for sequences with quality scores, and the Solexa\/Illumina FASTQ variants. Nucleic acids research 38, 6","author":"Cock Peter JA","year":"2009","unstructured":"Peter JA Cock , Christopher J Fields , Naohisa Goto , Michael L Heuer , and Peter M Rice . 2009. The Sanger FASTQ file format for sequences with quality scores, and the Solexa\/Illumina FASTQ variants. Nucleic acids research 38, 6 ( 2009 ), 1767--1771. Peter JA Cock, Christopher J Fields, Naohisa Goto, Michael L Heuer, and Peter M Rice. 2009. The Sanger FASTQ file format for sequences with quality scores, and the Solexa\/Illumina FASTQ variants. Nucleic acids research 38, 6 (2009), 1767--1771."},{"key":"e_1_3_2_1_3_1","volume-title":"Robert D Finn, Guy Cochrane, Ewan Birney, and Rolf Apweiler.","author":"Cook Charles E","year":"2015","unstructured":"Charles E Cook , Mary Todd Bergman , Robert D Finn, Guy Cochrane, Ewan Birney, and Rolf Apweiler. 2015 . The European Bioinformatics Institute in 2016: data growth and integration. Nucleic acids research 44, D1 (2015), D20--D26. Charles E Cook, Mary Todd Bergman, Robert D Finn, Guy Cochrane, Ewan Birney, and Rolf Apweiler. 2015. The European Bioinformatics Institute in 2016: data growth and integration. Nucleic acids research 44, D1 (2015), D20--D26."},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btq461"},{"key":"e_1_3_2_1_5_1","unstructured":"R. C Edgar. 2012. Local clustering. (2012). https:\/\/www.drive5.com\/usearch\/manual\/local_clustering.html  R. C Edgar. 2012. Local clustering. (2012). https:\/\/www.drive5.com\/usearch\/manual\/local_clustering.html"},{"key":"e_1_3_2_1_6_1","unstructured":"R. C Edgar. 2012. UCLUST algorithm. (2012). https:\/\/www.drive5.com\/usearch\/manual\/uclust_algo.html  R. C Edgar. 2012. UCLUST algorithm. (2012). https:\/\/www.drive5.com\/usearch\/manual\/uclust_algo.html"},{"key":"e_1_3_2_1_7_1","volume-title":"Information retrieval: data structures and algorithms, William B Frakes and Ricardo Baeza-Yates (Eds.)","author":"Faloutsos C","unstructured":"C Faloutsos . 1992. Signature files . In Information retrieval: data structures and algorithms, William B Frakes and Ricardo Baeza-Yates (Eds.) . Prentice Hall , Chapter 4, 44--65. C Faloutsos. 1992. Signature files. In Information retrieval: data structures and algorithms, William B Frakes and Ricardo Baeza-Yates (Eds.). Prentice Hall, Chapter 4, 44--65."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/2063576.2063629"},{"key":"e_1_3_2_1_9_1","volume-title":"DNACLUST: accurate and efficient clustering of phylogenetic marker genes. BMC bioinformatics 12, 1","author":"Ghodsi Mohammadreza","year":"2011","unstructured":"Mohammadreza Ghodsi , Bo Liu , and Mihai Pop . 2011. DNACLUST: accurate and efficient clustering of phylogenetic marker genes. BMC bioinformatics 12, 1 ( 2011 ), 271. Mohammadreza Ghodsi, Bo Liu, and Mihai Pop. 2011. DNACLUST: accurate and efficient clustering of phylogenetic marker genes. BMC bioinformatics 12, 1 (2011), 271."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1016\/0022-2836(82)90398-9"},{"key":"e_1_3_2_1_11_1","volume-title":"Error detecting and error correcting codes. Bell System technical journal 29, 2","author":"Hamming Richard W","year":"1950","unstructured":"Richard W Hamming . 1950. Error detecting and error correcting codes. Bell System technical journal 29, 2 ( 1950 ), 147--160. Richard W Hamming. 1950. Error detecting and error correcting codes. Bell System technical journal 29, 2 (1950), 147--160."},{"key":"e_1_3_2_1_12_1","volume-title":"https:\/\/www.ebi.ac.uk\/ena\/data\/view\/ERR000001","author":"The Wellcome Trust Sanger Institute","year":"2007","unstructured":"The Wellcome Trust Sanger Institute . 2007. ERR000001. ( 2007 ). https:\/\/www.ebi.ac.uk\/ena\/data\/view\/ERR000001 The Wellcome Trust Sanger Institute. 2007. ERR000001. (2007). https:\/\/www.ebi.ac.uk\/ena\/data\/view\/ERR000001"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"crossref","unstructured":"Rasko Leinonen Ruth Akhtar Ewan Birney Lawrence Bower Ana Cerdeno-T\u00e1rraga Ying Cheng Iain Cleland Nadeem Faruque Neil Goodgame Richard Gibson etal 2010. The European nucleotide archive. Nucleic acids research 39 suppl_1 (2010) D28--D31.  Rasko Leinonen Ruth Akhtar Ewan Birney Lawrence Bower Ana Cerdeno-T\u00e1rraga Ying Cheng Iain Cleland Nadeem Faruque Neil Goodgame Richard Gibson et al. 2010. The European nucleotide archive. Nucleic acids research 39 suppl_1 (2010) D28--D31.","DOI":"10.1093\/nar\/gkq967"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btl158"},{"key":"e_1_3_2_1_15_1","volume-title":"Proceedings of the fifth Berkeley symposium on mathematical statistics and probability","volume":"1","author":"MacQueen James","year":"1967","unstructured":"James MacQueen . 1967 . Some methods for classification and analysis of multivariate observations . In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability , Vol. 1 . Oakland, CA, USA., 281--297. James MacQueen. 1967. Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, Vol. 1. Oakland, CA, USA., 281--297."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1242572.1242592"},{"key":"e_1_3_2_1_17_1","volume-title":"Comparison of the PAM and BLOSUM amino acid substitution matrices. Cold Spring Harbor Protocols","author":"Mount David W","year":"2008","unstructured":"David W Mount . 2008. Comparison of the PAM and BLOSUM amino acid substitution matrices. Cold Spring Harbor Protocols 2008 , 6 (2008), pdb--ip59. David W Mount. 2008. Comparison of the PAM and BLOSUM amino acid substitution matrices. Cold Spring Harbor Protocols 2008, 6 (2008), pdb--ip59."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1016\/0022-2836(70)90057-4"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1073\/pnas.85.8.2444"},{"key":"e_1_3_2_1_20_1","volume-title":"Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering, TKE","volume":"5","author":"Sahlgren M.","year":"2005","unstructured":"M. Sahlgren . 2005 . An introduction to random indexing . In Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering, TKE , Vol. 5 . M. Sahlgren. 2005. An introduction to random indexing. In Methods and Applications of Semantic Indexing Workshop at the 7th International Conference on Terminology and Knowledge Engineering, TKE, Vol. 5."},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1016\/0022-2836(81)90087-5"}],"event":{"name":"ADCS 2017: The 22nd Australasian Document Computing Symposium","location":"Brisbane QLD Australia","acronym":"ADCS 2017","sponsor":["Queensland University of Technology","CSIRO Commonwealth Scientific and Industrial Research Organisation"]},"container-title":["Proceedings of the 22nd Australasian Document Computing Symposium"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3166072.3166076","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3166072.3166076","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:26:55Z","timestamp":1750213615000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3166072.3166076"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,12,7]]},"references-count":21,"alternative-id":["10.1145\/3166072.3166076","10.1145\/3166072"],"URL":"https:\/\/doi.org\/10.1145\/3166072.3166076","relation":{},"subject":[],"published":{"date-parts":[[2017,12,7]]},"assertion":[{"value":"2017-12-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}