{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T08:23:15Z","timestamp":1759134195925,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":27,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,9,21]],"date-time":"2020-09-21T00:00:00Z","timestamp":1600646400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Nebraska NSF EPSCoR","award":["OIA-1557417"],"award-info":[{"award-number":["OIA-1557417"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,9,21]]},"DOI":"10.1145\/3388440.3412424","type":"proceedings-article","created":{"date-parts":[[2020,11,10]],"date-time":"2020-11-10T12:43:43Z","timestamp":1605012223000},"page":"1-7","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["MinIsoClust"],"prefix":"10.1145","author":[{"given":"Sairam","family":"Behera","sequence":"first","affiliation":[{"name":"Dept. of Computer Sc. & Engineering, University of Nebraska-Lincoln, Lincoln, NE, USA"}]},{"given":"Jitender S.","family":"Deogun","sequence":"additional","affiliation":[{"name":"Dept. of Computer Sc. 
& Engineering, University of Nebraska-Lincoln, Lincoln, NE, USA"}]},{"given":"Etsuko N.","family":"Moriyama","sequence":"additional","affiliation":[{"name":"School of Biological Sciences, Center for Plant Science Innovation, University of Nebraska-Lincoln, Lincoln, NE, USA"}]}],"member":"320","published-online":{"date-parts":[[2020,11,10]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1145\/362686.362692"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1006\/jcss.1999.1690"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3055399.3055443"},{"key":"e_1_3_2_1_4_1","first-page":"7","article-title":"Corset: enabling differential gene expression analysis for de novo assembled transcriptomes","volume":"15","author":"Davidson Nadia M.","year":"2014","unstructured":"Nadia M. Davidson and Alicia Oshlack . 2014 . Corset: enabling differential gene expression analysis for de novo assembled transcriptomes . Genome Biology 15 , 7 (Jul 2014), 410--423. https:\/\/doi.org\/10.1186\/s13059-014-0410-6 10.1186\/s13059-014-0410-6 Nadia M. Davidson and Alicia Oshlack. 2014. Corset: enabling differential gene expression analysis for de novo assembled transcriptomes. Genome Biology 15, 7 (Jul 2014), 410--423. https:\/\/doi.org\/10.1186\/s13059-014-0410-6","journal-title":"Genome Biology"},{"key":"#cr-split#-e_1_3_2_1_5_1.1","doi-asserted-by":"crossref","unstructured":"David Swarbreck et al. 2008. The Arabidopsis Information Resource (TAIR): gene structure and function annotation. Nucleic Acids Research 36 (Database issue) (Jan 2008) D1009-D1014. https:\/\/doi.org\/10.1093\/nar\/gkm965 10.1093\/nar","DOI":"10.1093\/nar\/gkm965"},{"key":"#cr-split#-e_1_3_2_1_5_1.2","doi-asserted-by":"crossref","unstructured":"David Swarbreck et al. 2008. The Arabidopsis Information Resource (TAIR): gene structure and function annotation. Nucleic Acids Research 36 (Database issue) (Jan 2008) D1009-D1014. 
https:\/\/doi.org\/10.1093\/nar\/gkm965","DOI":"10.1093\/nar\/gkm965"},{"key":"e_1_3_2_1_6_1","first-page":"1","article-title":"AtRTD - a comprehensive reference transcript dataset resource for accurate quantification of transcript-specific expression in Arabidopsis thaliana","volume":"201","author":"Runxuan Zhang","year":"2018","unstructured":"Runxuan Zhang et al. 2018 . AtRTD - a comprehensive reference transcript dataset resource for accurate quantification of transcript-specific expression in Arabidopsis thaliana . New Phytol 201 , 1 (Oct 2018), 96--101. https:\/\/doi.org\/10.1111\/nph.13545 10.1111\/nph.13545 Runxuan Zhang et al. 2018. AtRTD - a comprehensive reference transcript dataset resource for accurate quantification of transcript-specific expression in Arabidopsis thaliana. New Phytol 201, 1 (Oct 2018), 96--101. https:\/\/doi.org\/10.1111\/nph.13545","journal-title":"New Phytol"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.5555\/645925.671516"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF01908075"},{"key":"e_1_3_2_1_9_1","volume-title":"Improving MinHash via the containment index with applications to metagenomic analysis. Appl. Math. Comput. 354 (Aug","author":"Koslicki David","year":"2019","unstructured":"David Koslicki and Hooman Zabeti . 2019. Improving MinHash via the containment index with applications to metagenomic analysis. Appl. Math. Comput. 354 (Aug 2019 ), 206--215. https:\/\/doi.org\/10.1016\/j.amc.2019.02.018 10.1016\/j.amc.2019.02.018 David Koslicki and Hooman Zabeti. 2019. Improving MinHash via the containment index with applications to metagenomic analysis. Appl. Math. Comput. 354 (Aug 2019), 206--215. 
https:\/\/doi.org\/10.1016\/j.amc.2019.02.018"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1186\/s13059-019-1910-1"},{"key":"e_1_3_2_1_11_1","volume-title":"Mining of Massive Datasets","author":"Leskovec Jure","unstructured":"Jure Leskovec , Anand Rajaraman , and Jeffrey David Ullman . 2014. Mining of Massive Datasets ( 2 nd ed.). Cambridge University Press , USA. Jure Leskovec, Anand Rajaraman, and Jeffrey David Ullman. 2014. Mining of Massive Datasets (2nd ed.). Cambridge University Press, USA.","edition":"2"},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btl158"},{"key":"e_1_3_2_1_13_1","volume-title":"Are splicing mutations the most frequent cause of hereditary disease? FEBS letters 579, 9 (Mar","author":"L\u00f3pez-Bigas N\u00faria","year":"2005","unstructured":"N\u00faria L\u00f3pez-Bigas , Benjamin Audit , Christos A. Ouzounis , G Luna Parra , and Roderic Guig\u00f3 . 2005. Are splicing mutations the most frequent cause of hereditary disease? FEBS letters 579, 9 (Mar 2005 ), 1900--1903. https:\/\/doi.org\/10.1016\/j.febslet.2005.02.047 10.1016\/j.febslet.2005.02.047 N\u00faria L\u00f3pez-Bigas, Benjamin Audit, Christos A. Ouzounis, G Luna Parra, and Roderic Guig\u00f3. 2005. Are splicing mutations the most frequent cause of hereditary disease? FEBS letters 579, 9 (Mar 2005), 1900--1903. https:\/\/doi.org\/10.1016\/j.febslet.2005.02.047"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1186\/s13059-016-0997-x"},{"key":"e_1_3_2_1_15_1","volume-title":"Blencowe","author":"Pan Qun","year":"2008","unstructured":"Qun Pan , Ofer Shai , Leo J. Lee , Brendan J. Frey , and Benjamin J . Blencowe . 2008 . Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nature Genetics 40 (Nov 2008), 1413--1415. https:\/\/doi.org\/10.1038\/ng.259 10.1038\/ng.259 Qun Pan, Ofer Shai, Leo J. Lee, Brendan J. Frey, and Benjamin J. Blencowe. 2008. 
Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nature Genetics 40 (Nov 2008), 1413--1415. https:\/\/doi.org\/10.1038\/ng.259"},{"key":"e_1_3_2_1_16_1","article-title":"Scikit-learn: Machine learning in Python","author":"Pedregosa Fabian","year":"2011","unstructured":"Fabian Pedregosa , Ga\u00ebl Varoquaux , Alexandre Gramfort , Vincent Michel , Bertrand Thirion , Olivier Grisel , Mathieu Blondel , Peter Prettenhofer , Ron Weiss , Vincent Dubourg , 2011 . Scikit-learn: Machine learning in Python . Journal of Machine Learning Research 12 (Nov 2011), 2825--2830. Fabian Pedregosa, Ga\u00ebl Varoquaux, Alexandre Gramfort, Vincent Michel, Bertrand Thirion, Olivier Grisel, Mathieu Blondel, Peter Prettenhofer, Ron Weiss, Vincent Dubourg, et al. 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12 (Nov 2011), 2825--2830.","journal-title":"Journal of Machine Learning Research 12"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1038\/nbt.3122"},{"key":"e_1_3_2_1_18_1","volume-title":"Brown","author":"Pierce Tessa N.","year":"2019","unstructured":"Tessa N. Pierce , Luiz Irber , Taylor Reiter , Phillip Brooks , and Titus C . Brown . 2019 . Large-scale sequence comparisons with sourmash. F1000Research 8, 1006 (Jul 2019). https:\/\/doi.org\/10.12688\/f1000research.19675.1 10.12688\/f1000research.19675.1 Tessa N. Pierce, Luiz Irber, Taylor Reiter, Phillip Brooks, and Titus C. Brown. 2019. Large-scale sequence comparisons with sourmash. F1000Research 8, 1006 (Jul 2019). https:\/\/doi.org\/10.12688\/f1000research.19675.1"},{"volume-title":"Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)","author":"Rosenberg Andrew","key":"e_1_3_2_1_19_1","unstructured":"Andrew Rosenberg and Julia Hirschberg . 2007. 
V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure . In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL) . Association for Computational Linguistics , Prague, Czech Republic, 410--420. https:\/\/doi.org\/10.7916\/D80V8N84 10.7916\/D80V8N84 Andrew Rosenberg and Julia Hirschberg. 2007. V-Measure: A Conditional Entropy-Based External Cluster Evaluation Measure. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL). Association for Computational Linguistics, Prague, Czech Republic, 410--420. https:\/\/doi.org\/10.7916\/D80V8N84"},{"volume-title":"Quality-Value Based Algorithm. In Research in Computational Molecular Biology (RECOMB) (Lecture Notes in Computer Science), Lenore J","author":"Sahlin Kristoffer","key":"e_1_3_2_1_20_1","unstructured":"Kristoffer Sahlin and Paul Medvedev . 2019. De Novo Clustering of Long-Read Transcriptome Data Using a Greedy , Quality-Value Based Algorithm. In Research in Computational Molecular Biology (RECOMB) (Lecture Notes in Computer Science), Lenore J . Cowen (Ed.). Springer International Publishing , Cham , 227--242. https:\/\/doi.org\/10.1007\/978-3-030-17083-7_14 10.1007\/978-3-030-17083-7_14 Kristoffer Sahlin and Paul Medvedev. 2019. De Novo Clustering of Long-Read Transcriptome Data Using a Greedy, Quality-Value Based Algorithm. In Research in Computational Molecular Biology (RECOMB) (Lecture Notes in Computer Science), Lenore J. Cowen (Ed.). Springer International Publishing, Cham, 227--242. https:\/\/doi.org\/10.1007\/978-3-030-17083-7_14"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2736277.2741285"},{"key":"e_1_3_2_1_22_1","volume-title":"MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. 
Nature Biotechnology 35 (Oct","author":"Steinegger Martin","year":"2017","unstructured":"Martin Steinegger and Johannes S\u00f6ding . 2017. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nature Biotechnology 35 (Oct 2017 ), 1026--1028. https:\/\/doi.org\/10.1038\/nbt.3988 10.1038\/nbt.3988 Martin Steinegger and Johannes S\u00f6ding. 2017. MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets. Nature Biotechnology 35 (Oct 2017), 1026--1028. https:\/\/doi.org\/10.1038\/nbt.3988"},{"key":"e_1_3_2_1_23_1","volume-title":"Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nature Protocols 7 (Mar","author":"Trapnell Cole","year":"2012","unstructured":"Cole Trapnell , Adam Roberts , Loyal A. Goff , Geo Pertea , Daehwan Kim , David R. Kelley , Harold Pimentel , Steven L. Salzberg , John L. Rinn , and Lior Pachter . 2012. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nature Protocols 7 (Mar 2012 ), 562--578. https:\/\/doi.org\/10.1038\/nprot.2012.016 10.1038\/nprot.2012.016 Cole Trapnell, Adam Roberts, Loyal A. Goff, Geo Pertea, Daehwan Kim, David R. Kelley, Harold Pimentel, Steven L. Salzberg, John L. Rinn, and Lior Pachter. 2012. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nature Protocols 7 (Mar 2012), 562--578. https:\/\/doi.org\/10.1038\/nprot.2012.016"},{"key":"e_1_3_2_1_24_1","volume-title":"Moriyama","author":"Voshall Adam","year":"2020","unstructured":"Adam Voshall , Sairam Behera , Xiangjun Li , Xiao-Hong Yu , Kushagra Kapil , Jitender S. Deogun , John Shanklin , Edgar B. Cahoon , and Etsuko N . Moriyama . 2020 . A consensus-based ensemble approach to improve de novo transcriptome assembly. bioRxiv (Jun 2020). 
https:\/\/doi.org\/10.1101\/2020.06.08.139964 10.1101\/2020.06.08.139964 Adam Voshall, Sairam Behera, Xiangjun Li, Xiao-Hong Yu, Kushagra Kapil, Jitender S. Deogun, John Shanklin, Edgar B. Cahoon, and Etsuko N. Moriyama. 2020. A consensus-based ensemble approach to improve de novo transcriptome assembly. bioRxiv (Jun 2020). https:\/\/doi.org\/10.1101\/2020.06.08.139964"},{"key":"e_1_3_2_1_25_1","volume-title":"Moriyama","author":"Voshall Adam","year":"2018","unstructured":"Adam Voshall and Etsuko N . Moriyama . 2018 . Next-Generation Transcriptome Assembly: Strategies and Performance Analysis. In Bioinformatics in the Era of Post Genomics and Big Data, Ibrokhim Y. Abdurakhmonov (Ed.). IntechOpen, Rijeka . https:\/\/doi.org\/10.5772\/intechopen.73497 10.5772\/intechopen.73497 Adam Voshall and Etsuko N. Moriyama. 2018. Next-Generation Transcriptome Assembly: Strategies and Performance Analysis. In Bioinformatics in the Era of Post Genomics and Big Data, Ibrokhim Y. Abdurakhmonov (Ed.). IntechOpen, Rijeka. 
https:\/\/doi.org\/10.5772\/intechopen.73497"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1093\/bioinformatics\/btw753"}],"event":{"name":"BCB '20: 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics","sponsor":["SIGBio ACM Special Interest Group on Bioinformatics"],"location":"Virtual Event USA","acronym":"BCB '20"},"container-title":["Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3388440.3412424","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3388440.3412424","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:33:29Z","timestamp":1750199609000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3388440.3412424"}},"subtitle":["Isoform clustering using minhash and locality sensitive hashing"],"short-title":[],"issued":{"date-parts":[[2020,9,21]]},"references-count":27,"alternative-id":["10.1145\/3388440.3412424","10.1145\/3388440"],"URL":"https:\/\/doi.org\/10.1145\/3388440.3412424","relation":{},"subject":[],"published":{"date-parts":[[2020,9,21]]},"assertion":[{"value":"2020-11-10","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}