{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,31]],"date-time":"2026-01-31T07:02:38Z","timestamp":1769842958100,"version":"3.49.0"},"reference-count":131,"publisher":"MIT Press","issue":"4","license":[{"start":{"date-parts":[[2021,3,6]],"date-time":"2021-03-06T00:00:00Z","timestamp":1614988800000},"content-version":"vor","delay-in-days":33,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/legalcode"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,2,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The transcription bottleneck is often cited as a major obstacle for efforts to document the world\u2019s endangered languages and supply them with language technologies. One solution is to extend methods from automatic speech recognition and machine translation, and recruit linguists to provide narrow phonetic transcriptions and sentence-aligned translations. However, I believe that these approaches are not a good fit with the available data and skills, or with long-established practices that are essentially word-based. In seeking a more effective approach, I consider a century of transcription practice and a wide range of computational approaches, before proposing a computational model based on spoken term detection that I call \u201csparse transcription.\u201d This represents a shift away from current assumptions that we transcribe phones, transcribe fully, and transcribe first. 
Instead, sparse transcription combines the older practice of word-level transcription with interpretive, iterative, and interactive processes that are amenable to wider participation and that open the way to new methods for processing oral languages.<\/jats:p>","DOI":"10.1162\/coli_a_00387","type":"journal-article","created":{"date-parts":[[2020,10,20]],"date-time":"2020-10-20T15:53:45Z","timestamp":1603209225000},"page":"713-744","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":9,"title":["Sparse Transcription"],"prefix":"10.1162","volume":"46","author":[{"given":"Steven","family":"Bird","sequence":"first","affiliation":[{"name":"Northern Institute, Charles Darwin University. steven.bird@cdu.edu.au"}]}],"member":"281","published-online":{"date-parts":[[2021,2,1]]},"reference":[{"key":"2022022220190576800_bib1","unstructured":"Abiteboul, Serge, PeterBuneman, and DanSuciu. 2000. Data on the Web: From Relations to Semistructured Data and XML. Morgan Kaufmann."},{"key":"2022022220190576800_bib2","unstructured":"Abney, Steven and StevenBird. 2010. The Human Language Project: Building a universal corpus of the world\u2019s languages. In Proceedings of the 48th Meeting of the Association for Computational Linguistics, pages 88\u201397, Uppsala."},{"key":"2022022220190576800_bib3","unstructured":"Adams, Oliver . 2017. Automatic Understanding of Unwritten Languages. Ph.D. thesis, University of Melbourne."},{"key":"2022022220190576800_bib4","unstructured":"Adams, Oliver, TrevorCohn, GrahamNeubig, StevenBird, and AlexisMichaud. 2018. Evaluating phonemic transcription of low-resource tonal languages for language documentation. In Proceedings of the 11th International Conference on Language Resources and Evaluation, pages 3356\u20133365, Miyazaki."},{"key":"2022022220190576800_bib5","unstructured":"Adams, Oliver, TrevorCohn, GrahamNeubig, and AlexisMichaud. 2017. 
Phonemic transcription of low-resource tonal languages. In Proceedings of the Australasian Language Technology Association Workshop, pages 53\u201360, Brisbane."},{"key":"2022022220190576800_bib6","unstructured":"Adams, Oliver, GrahamNeubig, TrevorCohn, and StevenBird. 2015. Inducing bilingual lexicons from small quantities of sentence-aligned phonemic transcriptions. In Proceedings of the International Workshop on Spoken Language Translation, pages 246\u2013255, Da Nang."},{"key":"2022022220190576800_bib7","doi-asserted-by":"crossref","unstructured":"Adams, Oliver, GrahamNeubig, TrevorCohn, and StevenBird. 2016a. Learning a translation model from word lattices. In Proceedings of the 17th Annual Conference of the International Speech Communication Association, pages 2518\u20132522, San Francisco, CA. DOI:\u00a0https:\/\/doi.org\/10.21437\/Interspeech.2016-862","DOI":"10.21437\/Interspeech.2016-862"},{"key":"2022022220190576800_bib8","doi-asserted-by":"crossref","unstructured":"Adams, Oliver, GrahamNeubig, TrevorCohn, StevenBird, Quoc TruongDo, and SatoshiNakamura. 2016b. Learning a lexicon and translation model from phoneme lattices. In Proceedings of the Conference on Empirical Methods on Natural Language Processing, pages 2377\u20132382, Austin, TX. DOI:\u00a0https:\/\/doi.org\/10.18653\/v1\/D16-1263","DOI":"10.18653\/v1\/D16-1263"},{"key":"2022022220190576800_bib9","unstructured":"Albright, Eric and JohnHatton. 2008. WeSay, a tool for engaging communities in dictionary building. In Victoria D.Rau and MargaretFlorey, editors, Documenting and Revitalizing Austronesian Languages, number 1 in Language Documentation and Conservation Special Publication. University of Hawai\u2018i Press, pages 189\u2013201."},{"key":"2022022220190576800_bib10","doi-asserted-by":"crossref","unstructured":"Anastasopoulos, Antonios, SameerBansal, DavidChiang, SharonGoldwater, and AdamLopez. 2017. Spoken term discovery for language documentation using translations. 
In Proceedings of the Workshop on Speech-Centric NLP, pages 53\u201358, Copenhagen. DOI:\u00a0https:\/\/doi.org\/10.18653\/v1\/W17-4607","DOI":"10.18653\/v1\/W17-4607"},{"key":"2022022220190576800_bib11","doi-asserted-by":"crossref","unstructured":"Anastasopoulos, Antonios and DavidChiang. 2017. A case study on using speech-to-translation alignments for language documentation. In Proceedings of the Workshop on the Use of Computational Methods in Study of Endangered Languages, pages 170\u2013178, Honolulu, HI. DOI:\u00a0https:\/\/doi.org\/10.18653\/v1\/W17-0123","DOI":"10.18653\/v1\/W17-0123"},{"key":"2022022220190576800_bib12","doi-asserted-by":"crossref","unstructured":"Anastasopoulos, Antonios, DavidChiang, and LongDuong. 2016. An unsupervised probability model for speech-to-translation alignment of low-resource languages. In Proceedings of the Conference on Empirical Methods on Natural Language Processing, pages 1255\u20131263, Austin, TX. DOI:\u00a0https:\/\/doi.org\/10.18653\/v1\/D16-1133","DOI":"10.18653\/v1\/D16-1133"},{"key":"2022022220190576800_bib13","doi-asserted-by":"crossref","unstructured":"Anastasopoulos, Antonis and DavidChiang. 2018. Leveraging translations for speech transcription in low-resource settings. In Proceedings of the 19th Annual Conference of the International Speech Communication Association, pages 1279\u20131283, Hyderabad. DOI:\u00a0https:\/\/doi.org\/10.21437\/Interspeech.2018-2162","DOI":"10.21437\/Interspeech.2018-2162"},{"key":"2022022220190576800_bib14","unstructured":"Austin, Peter K . 2007. Training for language documentation: Experiences at the School of Oriental and African Studies. 
In D.Victoria Rau and MargaretFlorey, editors, Documenting and Revitalizing Austronesian Languages, number 1 in Language Documentation and Conservation Special Issue, University of Hawai\u2018i Press, pages 25\u201341."},{"key":"2022022220190576800_bib15","doi-asserted-by":"crossref","unstructured":"Bansal, Sameer, HermanKamper, AdamLopez, and SharonGoldwater. 2017. Towards speech-to-text translation without speech recognition. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, pages 474\u2013479, Valencia. DOI:\u00a0https:\/\/doi.org\/10.18653\/v1\/E17-2076, PMID:\u00a028345436","DOI":"10.18653\/v1\/E17-2076"},{"key":"2022022220190576800_bib16","doi-asserted-by":"crossref","unstructured":"Besacier, Laurent, BowenZhou, and YuqingGao. 2006. Towards speech translation of non written languages. In Spoken Language Technology Workshop, pages 222\u2013225, IEEE. DOI:\u00a0https:\/\/doi.org\/10.1109\/SLT.2006.326795","DOI":"10.1109\/SLT.2006.326795"},{"key":"2022022220190576800_bib17","unstructured":"Bettinson, Mat . 2013. The effect of respeaking on transcription accuracy. Honours Thesis, Department of Linguistics, University of Melbourne."},{"key":"2022022220190576800_bib18","doi-asserted-by":"crossref","unstructured":"Bird, Steven . 2010. A scalable method for preserving oral literature from small languages. In Proceedings of the 12th International Conference on Asia-Pacific Digital Libraries, pages 5\u201314, Gold Coast. DOI:\u00a0https:\/\/doi.org\/10.1007\/978-3-642-13654-2_2","DOI":"10.1007\/978-3-642-13654-2_2"},{"key":"2022022220190576800_bib19","doi-asserted-by":"crossref","unstructured":"Bird, Steven . 2020. Decolonising speech and language technology. In Proceedings of the 28th International Conference on Computational Linguistics. Barcelona, Spain. To appear.","DOI":"10.18653\/v1\/2020.coling-main.313"},{"key":"2022022220190576800_bib20","unstructured":"Bird, Steven and DavidChiang. 2012. 
Machine translation for language preservation. In Proceedings of the 24th International Conference on Computational Linguistics, pages 125\u2013134, Mumbai."},{"key":"2022022220190576800_bib21","doi-asserted-by":"crossref","unstructured":"Bird, Steven, FlorianHanke, OliverAdams, and HaejoongLee. 2014. Aikuma: A mobile app for collaborative language documentation. In Proceedings of the Workshop on the Use of Computational Methods in the Study of Endangered Languages, pages 1\u20135, Baltimore, MD. DOI:\u00a0https:\/\/doi.org\/10.3115\/v1\/W14-2201","DOI":"10.3115\/v1\/W14-2201"},{"key":"2022022220190576800_bib22","doi-asserted-by":"crossref","unstructured":"Bird, Steven and JonathanHarrington, editors. 2001. Speech Communication: Special Issue on Speech Annotation and Corpus Tools, 33 (1\u20132). Elsevier. DOI:\u00a0https:\/\/doi.org\/10.1016\/S0167-6393(00)00066-2","DOI":"10.1016\/S0167-6393(00)00066-2"},{"key":"2022022220190576800_bib23","doi-asserted-by":"crossref","unstructured":"Bird, Steven, EwanKlein, and EdwardLoper. 2009. Natural Language Processing with Python. O\u2019Reilly Media. DOI:\u00a0https:\/\/doi.org\/10.1016\/S0167-6393(00)00068-6","DOI":"10.1016\/S0167-6393(00)00068-6"},{"key":"2022022220190576800_bib24","doi-asserted-by":"crossref","unstructured":"Bird, Steven and MarkLiberman. 2001. A formal framework for linguistic annotation. Speech Communication, 33:23\u201360.","DOI":"10.1016\/S0167-6393(00)00068-6"},{"key":"2022022220190576800_bib25","unstructured":"Black, H. Andrew and Gary F.Simons. 2008. The SIL FieldWorks Language Explorer approach to morphological parsing. In NicholasGaylord, StephenHilderbrand, HeeyoungLyu, AlexisPalmer, and EliasPonvert, editors, Computational Linguistics for Less-studied Languages: Proceedings of Texas Linguistics Society 10, CSLI, pages 37\u201355."},{"key":"2022022220190576800_bib26","unstructured":"Boas, Franz , editor. 1911. 
Handbook of American Indian Languages, volume 40 of Smithsonian Institution Bureau of American Ethnology Bulletin. Washington, DC: Government Printing Office."},{"key":"2022022220190576800_bib27","doi-asserted-by":"crossref","unstructured":"Boito, Marcely Zanon, AntoniosAnastasopoulos, AlineVillavicencio, LaurentBesacier, and MarikaLekakou. 2018. A small Griko-Italian speech translation corpus. In 6th International Workshop on Spoken Language Technologies for Under-Resourced Languages, pages 36\u201341, Gurugram. DOI:\u00a0https:\/\/doi.org\/10.21437\/SLTU.2018-8","DOI":"10.21437\/SLTU.2018-8"},{"key":"2022022220190576800_bib28","unstructured":"Bouquiaux, Luc and Jacqueline M. C.Thomas. 1992. Studying and describing unwritten languages. Dallas, TX: Summer Institute of Linguistics."},{"key":"2022022220190576800_bib29","doi-asserted-by":"crossref","unstructured":"Bowern, Claire . 2008. Linguistic Fieldwork: A Practical Guide. Palgrave Macmillan. DOI:\u00a0https:\/\/doi.org\/10.1017\/S0952675700001019","DOI":"10.1017\/S0952675700001019"},{"key":"2022022220190576800_bib30","doi-asserted-by":"crossref","unstructured":"Browman, Catherine and LouisGoldstein. 1989. Articulatory gestures as phonological units. Phonology, 6:201\u2013251.","DOI":"10.1017\/S0952675700001019"},{"key":"2022022220190576800_bib31","unstructured":"Brown, P. F., S. A.Della Pietra, V. J.Della Pietra, and R. L.Mercer. 1993. The mathematics of statistical machine translation: Parameter estimation. Computational Linguistics, 19:263\u2013311."},{"key":"2022022220190576800_bib32","doi-asserted-by":"crossref","unstructured":"Bucholtz, Mary . 2007. Variability in transcription. Discourse Studies, 9:784\u2013808. DOI:\u00a0https:\/\/doi.org\/10.1177\/1461445607082580","DOI":"10.1177\/1461445607082580"},{"key":"2022022220190576800_bib33","unstructured":"Buseman, Alan, KarenBuseman, and RodEarly. 1996. The Linguist\u2019s Shoebox: Integrated Data Management and Analysis for the Field Linguist. 
Waxhaw, NC: SIL."},{"key":"2022022220190576800_bib34","unstructured":"Butler, Lynnika and Heather Van Volkinburg. 2007. Fieldworks Language Explorer (FLEx). Language Documentation and Conservation, 1:100\u2013106."},{"key":"2022022220190576800_bib35","unstructured":"Cahill, Michael and Keren Rice, editors. 2014. Developing orthographies for unwritten languages. SIL International."},{"key":"2022022220190576800_bib36","doi-asserted-by":"crossref","unstructured":"Cairns, Paul, Richard Shillcock, Nick Chater, and Joe Levy. 1997. Bootstrapping word boundaries: A bottom-up corpus-based approach to speech segmentation. Cognitive Psychology, 33:111\u2013153. DOI:\u00a0https:\/\/doi.org\/10.1006\/cogp.1997.0649, PMID:\u00a09245468","DOI":"10.1006\/cogp.1997.0649"},{"key":"2022022220190576800_bib37","unstructured":"Cartwright, Timothy A. and Michael R. Brent. 1994. Segmenting speech without a lexicon: the roles of phonotactics and speech source. In Proceedings of the First Meeting of the ACL Special Interest Group in Computational Phonology, pages 83\u201390, Las Cruces, NM."},{"key":"2022022220190576800_bib38","doi-asserted-by":"crossref","unstructured":"Chelliah, Shobhana. 2018. The design and implementation of documentation projects for spoken languages. In Oxford Handbook of Endangered Languages. Oxford University Press. DOI:\u00a0https:\/\/doi.org\/10.1093\/oxfordhb\/9780190610029.013.9","DOI":"10.1093\/oxfordhb\/9780190610029.013.9"},{"key":"2022022220190576800_bib39","doi-asserted-by":"crossref","unstructured":"Chen, Nancy F., Chongjia Ni, I-Fan Chen, Sunil Sivadas, Haihua Xu, Xiong Xiao, Tze Siong Lau, Su Jun Leow, Boon Pang Lim, Cheung-Chi Leung, et al. 2015. Low-resource keyword search strategies for Tamil. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, pages 5366\u20135370, IEEE. 
DOI:\u00a0https:\/\/doi.org\/10.1109\/ICASSP.2015.7178996 PMID:\u00a026244568","DOI":"10.1109\/ICASSP.2015.7178996"},{"key":"2022022220190576800_bib40","doi-asserted-by":"crossref","unstructured":"Chung, Yu An, Wei-HungWeng, SchrasingTong, and JamesGlass. 2019. Towards unsupervised speech-to-text translation. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pages 7170\u20137174, Brisbane. DOI:\u00a0https:\/\/doi.org\/10.1109\/ICASSP.2019.8683550","DOI":"10.1109\/ICASSP.2019.8683550"},{"key":"2022022220190576800_bib41","doi-asserted-by":"crossref","unstructured":"Clifford, James . 1990. Notes on (field) notes. In RogerSanjek, editor, Fieldnotes: The Makings of Anthropology. Cornell University Press, pages 47\u201370. DOI:\u00a0https:\/\/doi.org\/10.7591\/9781501711954-004, PMID:\u00a029994340","DOI":"10.7591\/9781501711954-004"},{"key":"2022022220190576800_bib42","unstructured":"Cox, Christopher, GillesBoulianne, and JahangirAlam. 2019. Taking aim at the \u2018transcription bottleneck\u2019: Integrating speech technology into language documentation and conservation. Paper presented at the 6th International Conference on Language Documentation and Conservation, Honolulu, HI, https:\/\/instagram.com\/p\/Buho4Z0B7xT\/"},{"key":"2022022220190576800_bib43","doi-asserted-by":"crossref","unstructured":"Crowley, Terry . 2007. Field Linguistics: A Beginner\u2019s Guide. Oxford University Press.","DOI":"10.1093\/oso\/9780199284344.001.0001"},{"key":"2022022220190576800_bib44","unstructured":"Cucchiarini, Catia . 1993. Phonetic Transcription: A Methodological and Empirical Study. Ph.D. thesis, Radboud University."},{"key":"2022022220190576800_bib45","doi-asserted-by":"crossref","unstructured":"Do, Van Hai, Nancy F.Chen, Boon PangLim, and MarkHasegawa-Johnson. 2016. Analysis of mismatched transcriptions generated by humans and machines for under-resourced languages. 
In Proceedings of the 17th Annual Conference of the International Speech Communication Association, pages 3863\u20133867, San Francisco, CA. DOI:\u00a0https:\/\/doi.org\/10.21437\/Interspeech.2016-736","DOI":"10.21437\/Interspeech.2016-736"},{"key":"2022022220190576800_bib46","doi-asserted-by":"crossref","unstructured":"Dobrin, Lise M. 2008. From linguistic elicitation to eliciting the linguist: Lessons in community empowerment from Melanesia. Language, 84:300\u2013324. DOI:\u00a0https:\/\/doi.org\/10.1353\/lan.0.0009","DOI":"10.1353\/lan.0.0009"},{"key":"2022022220190576800_bib47","doi-asserted-by":"crossref","unstructured":"Dunbar, Ewan, Xuan NgaCao, JuanBenjumea, JulienKaradayi, MathieuBernard, LaurentBesacier, XavierAnguera, and EmmanuelDupoux. 2017. The Zero Resource Speech Challenge 2017. In Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, pages 323\u2013330, Okinawa. DOI:\u00a0https:\/\/doi.org\/10.1109\/ASRU.2017.8268953","DOI":"10.1109\/ASRU.2017.8268953"},{"key":"2022022220190576800_bib48","doi-asserted-by":"crossref","unstructured":"Duong, Long, AntoniosAnastasopoulos, DavidChiang, StevenBird, and TrevorCohn. 2016. An attentional model for speech translation without transcription. In Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics, pages 949\u2013959, San Diego, CA. DOI:\u00a0https:\/\/doi.org\/10.18653\/v1\/N16-1109","DOI":"10.18653\/v1\/N16-1109"},{"key":"2022022220190576800_bib49","unstructured":"Elsner, Micha, SharonGoldwater, NaomiFeldman, and FrankWood. 2013. A joint learning model of word segmentation, lexical acquisition, and phonetic variability. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pages 42\u201354, Seattle, WA."},{"key":"2022022220190576800_bib50","unstructured":"Evans, Nicholas . 2003. Bininj Gun-wok: A Pan-dialectal Grammar of Mayali, Kunwinjku and Kune. Pacific Linguistics. 
Australian National University."},{"key":"2022022220190576800_bib51","unstructured":"Evans, Nicholas and Hans-J\u00fcrgenSasse. 2007. Searching for meaning in the library of babel: field semantics and problems of digital archiving. Archives and Social Studies: A Journal of Interdisciplinary Research, 1:63\u2013123."},{"key":"2022022220190576800_bib52","unstructured":"Fiscus, Jonathan G., JeromeAjot, John S.Garofolo, and GeorgeDoddingtion. 2007. Results of the 2006 spoken term detection evaluation. In Proceedings of the Workshop on Searching Spontaneous Conversational Speech, pages 51\u201357, Amsterdam."},{"key":"2022022220190576800_bib53","doi-asserted-by":"crossref","unstructured":"Foley, Ben, JoshArnold, RolandoCoto-Solano, GautierDurantin, T.Mark Ellison, Daan vanEsch, ScottHeath, Franti\u0161ekKratochv\u00ed, ZaraMaxwell-Smith, DavidNash, OlaOlsson, MarkRichards, NaySan, HywelStoakes, NickThieberger, and JanetWiles. 2018. Building speech recognition systems for language documentation: The CoEDL Endangered Language Pipeline and Inference System. In Proceedings of the 6th International Workshop on Spoken Language Technologies for Under-Resourced Languages, pages 205\u2013209, Gurugram. DOI:\u00a0https:\/\/doi.org\/10.21437\/SLTU.2018-42","DOI":"10.21437\/SLTU.2018-43"},{"key":"2022022220190576800_bib54","unstructured":"Gales, Mark, KateKnill, AntonRagni, and ShaktiRath. 2014. Speech recognition and keyword spotting for low-resource languages: BABEL project research at CUED. In Workshop on Spoken Language Technologies for Under-Resourced Languages, pages 16\u201323, St. Petersburg."},{"key":"2022022220190576800_bib55","unstructured":"Garofolo, John S., Lori F.Lamel, William M.Fisher, Jonathon G.Fiscus, David S.Pallett, and Nancy L.Dahlgren. 1986. The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus CDROM. NIST."},{"key":"2022022220190576800_bib56","unstructured":"Gaved, Tim and SophieSalffner. 2014. Working with ELAN and FLEx together. 
https:\/\/www.soas.ac.uk\/elar\/helpsheets\/file122785.pdf, accessed 21 March 2019."},{"key":"2022022220190576800_bib57","doi-asserted-by":"crossref","unstructured":"Godard, Pierre, GillesAdda, MartineAdda-Decker, AlexandreAllauzen, LaurentBesacier, HeleneBonneau-Maynard, Guy-No\u00eblKouarata, KevinL\u00f6ser, AnnieRialland, and Fran\u00e7oisYvon. 2016. Preliminary experiments on unsupervised word discovery in Mboshi. In Proceedings of the 17th Annual Conference of the International Speech Communication Association, pages 3539\u20133543, San Francisco, CA. DOI:\u00a0https:\/\/doi.org\/10.21437\/Interspeech.2016-886","DOI":"10.21437\/Interspeech.2016-886"},{"key":"2022022220190576800_bib58","unstructured":"Godard, Pierre, GillesAdda, MartineAdda-Decker, JuanBenjumea, LaurentBesacier, JamisonCooper-Leavitt, Guy-NoelKouarata, LoriLamel, H\u00e9l\u00e8neMaynard, MarkusMueller, AnnieRialland, SebastianStueker, Fran\u00e7oisYvon, and MarcelyZanon-Boito. 2018a. A very low resource language speech corpus for computational language documentation experiments. In Proceedings of the 11th Language Resources and Evaluation Conference, pages 3366\u20133370, Miyazaki."},{"key":"2022022220190576800_bib59","doi-asserted-by":"crossref","unstructured":"Godard, Pierre, Marcely ZanonBoito, LucasOndel, AlexandreBerard, Fran\u00e7oisYvon, AlineVillavicencio, and LaurentBesacier. 2018b. Unsupervised word segmentation from speech with attention. In Proceedings of the 19th Annual Conference of the International Speech Communication Association, pages 2678\u20132682, Hyderabad. DOI:\u00a0https:\/\/doi.org\/10.21437\/Interspeech.2018-1308","DOI":"10.21437\/Interspeech.2018-1308"},{"key":"2022022220190576800_bib60","doi-asserted-by":"crossref","unstructured":"Goldwater, Sharon, ThomasGriffiths, and MarkJohnson. 2009. A Bayesian framework for word segmentation: Exploring the effects of context. Cognition, 112:21\u201354. 
DOI:\u00a0https:\/\/doi.org\/10.1016\/j.cognition.2009.03.008, PMID:\u00a019409539","DOI":"10.1016\/j.cognition.2009.03.008"},{"key":"2022022220190576800_bib61","doi-asserted-by":"crossref","unstructured":"Goldwater, Sharon, Thomas L.Griffiths, and MarkJohnson. 2006. Contextual dependencies in unsupervised word segmentation. In Proceedings of the 21st International Conference on Computational Linguistics and the 44th Annual Meeting of the Association for Computational Linguistics, pages 673\u2013680, Sydney. DOI:\u00a0https:\/\/doi.org\/10.3115\/1220175.1220260","DOI":"10.3115\/1220175.1220260"},{"key":"2022022220190576800_bib62","doi-asserted-by":"crossref","unstructured":"Gudschinsky, Sarah . 1967. How to Learn an Unwritten Language. Holt, Rinehart and Winston. DOI:\u00a0https:\/\/doi.org\/10.1017\/CBO9780511810206.005","DOI":"10.1017\/CBO9780511810206.005"},{"key":"2022022220190576800_bib63","doi-asserted-by":"crossref","unstructured":"Hale, Ken . 2001. Ulwa (Southern Sumu): The beginnings of a language research project. In PaulNewman and MarthaRatliff, editors, Linguistic Fieldwork. Cambridge University Press, pages 76\u2013101.","DOI":"10.1017\/CBO9780511810206.005"},{"key":"2022022220190576800_bib64","unstructured":"Hanke, Florian . 2017. Computer-Supported Cooperative Language Documentation. Ph.D. thesis, University of Melbourne."},{"key":"2022022220190576800_bib65","unstructured":"Hanke, Florian and StevenBird. 2013. Large-scale text collection for unwritten languages. In Proceedings of the 6th International Joint Conference on Natural Language Processing, pages 1134\u20131138, Nagoya."},{"key":"2022022220190576800_bib66","doi-asserted-by":"crossref","unstructured":"Hasegawa-Johnson, Mark A., PreethiJyothi, DanielMcCloy, MajidMirbagheri, Giovanni M diLiberto, AmitDas, BradleyEkin, ChunxiLiu, VimalManohar, HaoTang, and others. 2016. ASR for under-resourced languages from probabilistic transcription. 
IEEE\/ACM Transactions on Audio, Speech and Language Processing, 25:50\u201363. DOI:\u00a0https:\/\/doi.org\/10.1109\/TASLP.2016.2621659","DOI":"10.1109\/TASLP.2016.2621659"},{"key":"2022022220190576800_bib67","unstructured":"Hatton, John . 2013. SayMore: Language documentation productivity. Paper presented at the Third International Conference on Language Documentation and Conservation, http:\/\/hdl.handle.net\/10125\/26153"},{"key":"2022022220190576800_bib68","unstructured":"Hermes, Mary and MelEngman. 2017. Resounding the clarion call: Indigenous language learners and documentation. Language Documentation and Description, 14:59\u201387."},{"key":"2022022220190576800_bib69","doi-asserted-by":"crossref","unstructured":"Himmelmann, Nikolaus . 1998. Documentary and descriptive linguistics. Linguistics, 36:161\u2013195. DOI:\u00a0https:\/\/doi.org\/10.1515\/ling.1998.36.1.161","DOI":"10.1515\/ling.1998.36.1.161"},{"key":"2022022220190576800_bib70","unstructured":"Himmelmann, Nikolaus . 2006a. The challenges of segmenting spoken language. In JostGippert, NikolausHimmelmann, and UlrikeMosel, editors, Essentials of Language Documentation. Mouton de Gruyter, pages 253\u2013274."},{"key":"2022022220190576800_bib71","unstructured":"Himmelmann, Nikolaus . 2006b. Language documentation: What is it and what is it good for? In JostGippert, NikolausHimmelmann, and UlrikeMosel, editors, Essentials of Language Documentation. Mouton de Gruyter, pages 1\u201330."},{"key":"2022022220190576800_bib72","unstructured":"Himmelmann, Nikolaus . 2018. Meeting the transcription challenge. In Reflections on Language Documentation 20 Years after Himmelmann 1998, number 15 in Language Documentation and Conservation Special Publication, University of Hawai\u2019i Press, pages 33\u201340."},{"key":"2022022220190576800_bib73","doi-asserted-by":"crossref","unstructured":"Jacobson, Michel, BoydMichailovsky, and John B.Lowe. 2001. Linguistic documents synchronizing sound and text. 
Speech Communication, 33:79\u201396. DOI:\u00a0https:\/\/doi.org\/10.1016\/S0167-6393(00)00070-4","DOI":"10.1016\/S0167-6393(00)00070-4"},{"key":"2022022220190576800_bib74","doi-asserted-by":"crossref","unstructured":"Jansen, Aren, KennethChurch, and HynekHermansky. 2010. Towards spoken term discovery at scale with zero resources. In Proceedings of the 11th Annual Conference of the International Speech Communication Association, pages 1676\u20131679, Chiba.","DOI":"10.21437\/Interspeech.2010-483"},{"key":"2022022220190576800_bib75","doi-asserted-by":"crossref","unstructured":"Johnson, Mark and SharonGoldwater. 2009. Improving nonparameteric Bayesian inference: Experiments on unsupervised word segmentation with adaptor grammars. In Proceedings of the North American Chapter of the Association for Computational Linguistics, pages 317\u2013325, Boulder, CO. DOI:\u00a0https:\/\/doi.org\/10.3115\/1620754.1620800","DOI":"10.3115\/1620754.1620800"},{"key":"2022022220190576800_bib76","doi-asserted-by":"crossref","unstructured":"Jukes, Anthony . 2011. Researcher training and capacity development in language documentation. In The Cambridge Handbook of Endangered Languages. Cambridge University Press, pages 423\u2013445. DOI:\u00a0https:\/\/doi.org\/10.1017\/CBO9780511975981.021","DOI":"10.1017\/CBO9780511975981.021"},{"key":"2022022220190576800_bib77","doi-asserted-by":"crossref","unstructured":"Kaufman, Daniel and RossPerlin. 2018. Language documentation in diaspora communities. In Oxford Handbook of Endangered Languages. Oxford University Press, pages 399\u2013418. DOI:\u00a0https:\/\/doi.org\/10.1093\/oxfordhb\/9780190610029.013.20","DOI":"10.1093\/oxfordhb\/9780190610029.013.20"},{"key":"2022022220190576800_bib78","doi-asserted-by":"crossref","unstructured":"King, Alexander D . 2015. Add language documentation to any ethnographic project in six steps. Anthropology Today, 31:8\u201312. 
DOI:\u00a0https:\/\/doi.org\/10.1111\/1467-8322.12187","DOI":"10.1111\/1467-8322.12187"},{"key":"2022022220190576800_bib79","doi-asserted-by":"crossref","unstructured":"Liu, Chunxi, Aren Jansen, Guoguo Chen, Keith Kintzley, Jan Trmal, and Sanjeev Khudanpur. 2014. Low-resource open vocabulary keyword search using point process models. In Proceedings of the 15th Annual Conference of the International Speech Communication Association, pages 2789\u20132793, Singapore.","DOI":"10.21437\/Interspeech.2014-533"},{"key":"2022022220190576800_bib80","doi-asserted-by":"crossref","unstructured":"Maddieson, Ian. 2001. Phonetic fieldwork. In Paul Newman and Martha Ratliff, editors, Linguistic Fieldwork. Cambridge University Press, pages 211\u2013229. DOI:\u00a0https:\/\/doi.org\/10.1017\/CBO9780511810206.011","DOI":"10.1017\/CBO9780511810206.011"},{"key":"2022022220190576800_bib81","unstructured":"McCrae, John P., Julia Bosque-Gil, Jorge Gracia, Paul Buitelaar, and Philipp Cimiano. 2017. The Ontolex-Lemon model: Development and applications. In Proceedings of the eLex Conference, pages 19\u201321, Leiden."},{"key":"2022022220190576800_bib82","doi-asserted-by":"crossref","unstructured":"Meakins, Felicity, Jenny Green, and Myfany Turpin. 2018. Understanding Linguistic Fieldwork. Routledge.","DOI":"10.4324\/9780203701294"},{"key":"2022022220190576800_bib83","doi-asserted-by":"crossref","unstructured":"Metze, Florian, Ankur Gandhe, Yajie Miao, Zaid Sheikh, Yun Wang, Di Xu, Hao Zhang, Jungsuk Kim, Ian Lane, Won Kyum Lee, et al. 2015. Semi-supervised training in low-resource ASR and KWS. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, pages 4699\u20134703, Brisbane.","DOI":"10.1109\/ICASSP.2015.7178862"},{"key":"2022022220190576800_bib84","unstructured":"Michaud, Alexis, Oliver Adams, Trevor Cohn, Graham Neubig, and S\u00e9verine Guillaume. 2018. 
Integrating automatic transcription into the language documentation workflow: Experiments with Na data and the Persephone Toolkit. Language Documentation and Conservation, 12:481\u2013513."},{"key":"2022022220190576800_bib85","unstructured":"Moe, Ron . 2008. FieldWorks Language Explorer 1.0. Number 2008\u2013011 in SIL Forum for Language Fieldwork. SIL International."},{"key":"2022022220190576800_bib86","unstructured":"Mosel, Ulrike . 2006. Fieldwork and community language work. In JostGippert, NikolausHimmelmann, and UlrikeMosel, editors, Essentials of Language Documentation. Mouton de Gruyter, pages 67\u201385."},{"key":"2022022220190576800_bib87","doi-asserted-by":"crossref","unstructured":"Myers, Cory, LawrenceRabiner, and AndrewRosenberg. 1980. An investigation of the use of dynamic time warping for word spotting and connected speech recognition. In Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, pages 173\u2013177, Denver, CO.","DOI":"10.1109\/ICASSP.1980.1171067"},{"key":"2022022220190576800_bib88","unstructured":"Nathan, David and MeiliFang. 2009. Language documentation and pedagogy for endangered languages: A mutual revitalisation. Language Documentation and Description, 6:132\u2013160."},{"key":"2022022220190576800_bib89","doi-asserted-by":"crossref","unstructured":"Neubig, Graham, MasatoMimura, ShinsukeMori, and TatsuyaKawahara. 2010. Learning a language model from continuous speech. In Proceedings of the 11th Annual Conference of the International Speech Communication Association, pages 1053\u20131056, Chiba.","DOI":"10.21437\/Interspeech.2010-345"},{"key":"2022022220190576800_bib90","unstructured":"Neubig, Graham, TaroWatanabe, ShinsukeMori, and TatsuyaKawahara. 2012. Machine translation without words through substring alignment. 
In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, pages 165\u2013174, Jeju Island."},{"key":"2022022220190576800_bib91","unstructured":"Newman, Paul and Martha Ratliff. 2001. Introduction. In Paul Newman and Martha Ratliff, editors, Linguistic Fieldwork. Cambridge University Press."},{"key":"2022022220190576800_bib92","unstructured":"Norman, Don. 2013. The Design of Everyday Things. Basic Books."},{"key":"2022022220190576800_bib93","unstructured":"Ochs, Elinor. 1979. Transcription as theory. Developmental Pragmatics, 10:43\u201372."},{"key":"2022022220190576800_bib94","unstructured":"Ostendorf, Mari. 1999. Moving beyond the \u2018beads-on-a-string\u2019 model of speech. In Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, pages 79\u201384, Keystone, USA."},{"key":"2022022220190576800_bib95","doi-asserted-by":"crossref","unstructured":"Papineni, Kishore, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: A method for automatic evaluation of machine translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, pages 311\u2013318, Philadelphia, PA. DOI:\u00a0https:\/\/doi.org\/10.3115\/1073083.1073135","DOI":"10.3115\/1073083.1073135"},{"key":"2022022220190576800_bib96","doi-asserted-by":"crossref","unstructured":"Park, Alex and James Glass. 2008. Unsupervised pattern discovery in speech. IEEE Transactions on Audio, Speech, and Language Processing, 16:186\u2013197. DOI:\u00a0https:\/\/doi.org\/10.1109\/TASL.2007.909282","DOI":"10.1109\/TASL.2007.909282"},{"key":"2022022220190576800_bib97","unstructured":"Pike, Kenneth L. 1947. Phonemics: A Technique for Reducing Language to Writing. Ann Arbor: University of Michigan Press."},{"key":"2022022220190576800_bib98","unstructured":"Rapidwords. 2019. Rapid Word Collection. 
rapidwords.net, accessed 26 June 2019."},{"key":"2022022220190576800_bib99","doi-asserted-by":"crossref","unstructured":"Rath, Shakti, Kate Knill, Anton Ragni, and Mark Gales. 2014. Combining tandem and hybrid systems for improved speech recognition and keyword spotting on low resource languages. In Proceedings of the 15th Annual Conference of the International Speech Communication Association, pages 14\u201318, Singapore.","DOI":"10.21437\/Interspeech.2014-212"},{"key":"2022022220190576800_bib100","unstructured":"Reiman, Will. 2010. Basic oral language documentation. Language Documentation and Conservation, 4:254\u2013268."},{"key":"2022022220190576800_bib101","unstructured":"Rialland, Annie, Martine Adda-Decker, Guy-No\u00ebl Kouarata, Gilles Adda, Laurent Besacier, Lori Lamel, Elodie Gauthier, Pierre Godard, and Jamison Cooper-Leavitt. 2018. Parallel corpora in Mboshi (Bantu C25, Congo-Brazzaville). In Proceedings of the 11th Language Resources and Evaluation Conference, pages 4272\u20134276, Miyazaki."},{"key":"2022022220190576800_bib102","doi-asserted-by":"crossref","unstructured":"Rice, Keren. 2001. Learning as one goes. In Paul Newman and Martha Ratliff, editors, Linguistic Fieldwork. Cambridge University Press, pages 230\u2013249. DOI:\u00a0https:\/\/doi.org\/10.1017\/CBO9780511810206.012","DOI":"10.1017\/CBO9780511810206.012"},{"key":"2022022220190576800_bib103","unstructured":"Rice, Keren. 2009. Must there be two solitudes? Language activists and linguists working together. In Jon Reyhner and Louise Lockhard, editors, Indigenous language revitalization: Encouragement, guidance, and lessons learned. Northern Arizona University, pages 37\u201359."},{"key":"2022022220190576800_bib104","unstructured":"Rice, Keren. 2011. Documentary linguistics and community relations. Language Documentation and Conservation, 5:187\u2013207."},{"key":"2022022220190576800_bib105","unstructured":"Robinson, Stuart, Greg Aumann, and Steven Bird. 2007. 
Managing fieldwork data with Toolbox and the Natural Language Toolkit. Language Documentation and Conservation, 1:44\u201357."},{"key":"2022022220190576800_bib106","unstructured":"Rogers, Chris. 2010. Fieldworks Language Explorer (FLEx) 3.0. Language Documentation and Conservation, 4:78\u201384."},{"key":"2022022220190576800_bib107","doi-asserted-by":"crossref","unstructured":"Rohlicek, Jan Robin. 1995. Word spotting. In Ravi P. Ramachandran and Richard J. Mammone, editors, Modern Methods of Speech Processing. Springer, pages 123\u2013157. DOI:\u00a0https:\/\/doi.org\/10.1007\/978-1-4615-2281-2_6","DOI":"10.1007\/978-1-4615-2281-2_6"},{"key":"2022022220190576800_bib108","unstructured":"Samarin, William. 1967. Field Linguistics: A Guide to Linguistic Field Work. Holt, Rinehart and Winston."},{"key":"2022022220190576800_bib109","doi-asserted-by":"crossref","unstructured":"Sanjek, Roger. 1990. The secret life of fieldnotes. In Roger Sanjek, editor, Fieldnotes: The Makings of Anthropology. Cornell University Press, pages 187\u2013272. DOI:\u00a0https:\/\/doi.org\/10.7591\/9781501711954","DOI":"10.7591\/9781501711954"},{"key":"2022022220190576800_bib110","doi-asserted-by":"crossref","unstructured":"Sapi\u00e9n, Racquel Mar\u00eda. 2018. Design and implementation of collaborative language documentation projects. In Oxford Handbook of Endangered Languages. Oxford University Press, pages 203\u2013224. DOI:\u00a0https:\/\/doi.org\/10.1093\/oxfordhb\/9780190610029.013.12","DOI":"10.1093\/oxfordhb\/9780190610029.013.12"},{"key":"2022022220190576800_bib111","unstructured":"Schultze-Berndt, Eva. 2006. Linguistic annotation. In Jost Gippert, Nikolaus Himmelmann, and Ulrike Mosel, editors, Essentials of Language Documentation. Mouton de Gruyter, pages 213\u2013251."},{"key":"2022022220190576800_bib112","doi-asserted-by":"crossref","unstructured":"Seifart, Frank, Harald Hammarstr\u00f6m, Nicholas Evans, and Stephen C. Levinson. 2018. Language documentation twenty-five years on. 
Language, 94:e324\u2013e345. DOI:\u00a0https:\/\/doi.org\/10.1353\/lan.2018.0070","DOI":"10.1353\/lan.2018.0070"},{"key":"2022022220190576800_bib113","unstructured":"Shillcock, Richard. 1990. Lexical hypotheses in continuous speech. In Gerry Altmann, editor, Cognitive Models of Speech Processing. MIT Press, pages 24\u201349."},{"key":"2022022220190576800_bib114","unstructured":"SIL Language Technology. 2000. Shoebox. https:\/\/software.sil.org\/shoebox\/, accessed 26 April 2020."},{"key":"2022022220190576800_bib115","doi-asserted-by":"crossref","unstructured":"Sloetjes, Han, Herman Stehouwer, and Sebastian Drude. 2013. Novel developments in Elan. Paper presented at the Third International Conference on Language Documentation and Conservation, Honolulu, HI, http:\/\/hdl.handle.net\/10125\/26154. DOI:\u00a0https:\/\/doi.org\/10.1093\/oxfordhb\/9780199571932.013.019","DOI":"10.1093\/oxfordhb\/9780199571932.013.019"},{"key":"2022022220190576800_bib116","doi-asserted-by":"crossref","unstructured":"Sperber, Matthias, Graham Neubig, Christian F\u00fcgen, Satoshi Nakamura, and Alex Waibel. 2013. Efficient speech transcription through respeaking. In Proceedings of the 14th Annual Conference of the International Speech Communication Association, pages 1087\u20131091, Lyon.","DOI":"10.21437\/Interspeech.2013-294"},{"key":"2022022220190576800_bib117","doi-asserted-by":"crossref","unstructured":"Stahlberg, Felix, Tim Schlippe, Stephan Vogel, and Tanja Schultz. 2015. Cross-lingual lexical language discovery from audio data using multiple translations. In Proceedings of the International Conference on Acoustics, Speech and Signal Processing, pages 5823\u20135827, Brisbane. DOI:\u00a0https:\/\/doi.org\/10.1109\/ICASSP.2015.7179088","DOI":"10.1109\/ICASSP.2015.7179088"},{"key":"2022022220190576800_bib118","doi-asserted-by":"crossref","unstructured":"Stahlberg, Felix, Tim Schlippe, Stephan Vogel, and Tanja Schultz. 2016. 
Word segmentation and pronunciation extraction from phoneme sequences through cross-lingual word-to-phoneme alignment. Computer Speech and Language, 35:234\u2013261. DOI:\u00a0https:\/\/doi.org\/10.1016\/j.csl.2014.10.001","DOI":"10.1016\/j.csl.2014.10.001"},{"key":"2022022220190576800_bib119","doi-asserted-by":"crossref","unstructured":"Tedlock, Dennis. 1983. The Spoken Word and the Work of Interpretation. University of Pennsylvania Press. DOI:\u00a0https:\/\/doi.org\/10.9783\/9780812205305","DOI":"10.9783\/9780812205305"},{"key":"2022022220190576800_bib120","doi-asserted-by":"crossref","unstructured":"Twaddell, William F. 1954. A linguistic archive as an indexed depot. International Journal of American Linguistics, 20:108\u2013110. DOI:\u00a0https:\/\/doi.org\/10.1086\/464261","DOI":"10.1086\/464261"},{"key":"2022022220190576800_bib121","doi-asserted-by":"crossref","unstructured":"Valenta, Tom\u00e1\u0161, Lubo\u0161 \u0160m\u00eddl, Jan \u0160vec, and Daniel Soutner. 2014. Inter-annotator agreement on spontaneous Czech language. In Proceedings of the International Conference on Text, Speech, and Dialogue, pages 390\u2013397, Brno. DOI:\u00a0https:\/\/doi.org\/10.1007\/978-3-319-10816-2_47","DOI":"10.1007\/978-3-319-10816-2_47"},{"key":"2022022220190576800_bib122","unstructured":"Voegelin, Charles Frederick and Florence Marie Voegelin. 1959. Guide for transcribing unwritten languages in field work. Anthropological Linguistics, pages 1\u201328."},{"key":"2022022220190576800_bib123","doi-asserted-by":"crossref","unstructured":"Weiss, Ron, Jan Chorowski, Navdeep Jaitly, Yonghui Wu, and Zhifeng Chen. 2017. Sequence-to-sequence models can directly translate foreign speech. In Proceedings of the 18th Annual Conference of the International Speech Communication Association, pages 2625\u20132629, Stockholm. 
DOI:\u00a0https:\/\/doi.org\/10.21437\/Interspeech.2017-503","DOI":"10.21437\/Interspeech.2017-503"},{"key":"2022022220190576800_bib124","unstructured":"Winkelmann, Raphael and Georg Raess. 2014. Introducing a web application for labeling, visualizing speech and correcting derived speech signals. In Proceedings of the 9th International Conference on Language Resources and Evaluation, pages 4129\u20134133, Reykjavik."},{"key":"2022022220190576800_bib125","doi-asserted-by":"crossref","unstructured":"Woodbury, Anthony C. 1998. Documenting rhetorical, aesthetic, and expressive loss in language shift. In Lenore Grenoble and Lindsay Whaley, editors, Endangered Languages: Language Loss and Community Response. Cambridge University Press, pages 234\u2013258. DOI:\u00a0https:\/\/doi.org\/10.1017\/CBO9781139166959.011","DOI":"10.1017\/CBO9781139166959.011"},{"key":"2022022220190576800_bib126","unstructured":"Woodbury, Anthony C. 2003. Defining documentary linguistics. Language Documentation and Description, 1:35\u201351."},{"key":"2022022220190576800_bib127","unstructured":"Woodbury, Anthony C. 2007. On thick translation in linguistic documentation. Language Documentation and Description, 4:120\u2013135."},{"key":"2022022220190576800_bib128","unstructured":"Wu, Dekai. 1997. Stochastic inversion transduction grammars and bilingual parsing of parallel corpora. Computational Linguistics, pages 377\u2013403."},{"key":"2022022220190576800_bib129","unstructured":"Xia, Fei and William D. Lewis. 2007. Multilingual structural projection across interlinearized text. In Proceedings of the North American Chapter of the Association for Computational Linguistics. ACL, pages 452\u2013459, Rochester, NY."},{"key":"2022022220190576800_bib130","unstructured":"Yamada, Racquel Mar\u00eda. 2014. Training in the community-collaborative context: A case study. 
Language Documentation and Conservation, 8:326\u2013344."},{"key":"2022022220190576800_bib131","doi-asserted-by":"crossref","unstructured":"Zanon Boito, Marcely, Alexandre B\u00e9rard, Aline Villavicencio, and Laurent Besacier. 2017. Unwritten languages demand attention too! Word discovery with encoder-decoder models. In IEEE Workshop on Automatic Speech Recognition and Understanding, pages 458\u2013465, Okinawa. DOI:\u00a0https:\/\/doi.org\/10.1109\/ASRU.2017.8268972","DOI":"10.1109\/ASRU.2017.8268972"}],"container-title":["Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/direct.mit.edu\/coli\/article-pdf\/46\/4\/713\/1992567\/coli_a_00387.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/direct.mit.edu\/coli\/article-pdf\/46\/4\/713\/1992567\/coli_a_00387.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,8,16]],"date-time":"2024-08-16T04:34:46Z","timestamp":1723782886000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/coli\/article\/46\/4\/713\/97329\/Sparse-Transcription"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,2,1]]},"references-count":131,"journal-issue":{"issue":"4","published-online":{"date-parts":[[2021,2,1]]},"published-print":{"date-parts":[[2021,2,1]]}},"URL":"https:\/\/doi.org\/10.1162\/coli_a_00387","relation":{},"ISSN":["0891-2017","1530-9312"],"issn-type":[{"value":"0891-2017","type":"print"},{"value":"1530-9312","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,12]]},"published":{"date-parts":[[2021,2,1]]}}}