{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,11]],"date-time":"2025-09-11T19:16:23Z","timestamp":1757618183654,"version":"3.44.0"},"reference-count":47,"publisher":"Springer Science and Business Media LLC","issue":"3","license":[{"start":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:00:00Z","timestamp":1750291200000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T00:00:00Z","timestamp":1750291200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100001652","name":"Friedrich-Alexander-Universit\u00e4t Erlangen-N\u00fcrnberg","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100001652","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Lang Resources &amp; Evaluation"],"published-print":{"date-parts":[[2025,9]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>Narrative skills are crucial for young children as they not only indicate literacy and academic performance but also serve as effective tools to foster children\u2019s relationships with the world. However, the linguistic resources for narratives produced by bilingual children are often limited, posing major challenges to the fields of child language development and language resource studies. Moreover, with the increasing prevalence of remote data collection, there are few guidelines on how to collect such data remotely. In this context, we present kidsNARRATE. KidsNARRATE is a non-native speech corpus designed to study the narrative comprehension of Chinese-English bilingual children in their L2 English. KidsNARRATE comprises 6 hours of audio recordings of children taking the narrative test Multilingual Instrument for Narratives (MAIN), along with transcriptions, human-rated scores, and annotations of grammatical and pronunciation errors at the word level. The audio recordings of the English section have been processed to meet the requirements of certain machine learning applications. Additionally, for cognitive baseline comparison, kidsNARRATE contains the audio and video data of the same group of children taking the parallel MAIN test in L1 Chinese. In the course of this study, we developed a remote recording method using accessible recording tools and an easy-to-use setup. Despite its simplicity, the data collected using this method meets the rigorous requirements for machine learning studies and is also suitable for linguistic research. This method can serve as a specific template for researchers and educators seeking to remotely record audio and\/or video data for linguistic studies. Overall, the rich linguistic content and compatibility with machine learning processes make kidsNARRATE a valuable resource for studies of early child L2 acquisition and the development of children\u2019s speech patterns in the field of automatic speech recognition. Finally, we propose future work regarding data collection methods and second language teaching.<\/jats:p>","DOI":"10.1007\/s10579-025-09851-2","type":"journal-article","created":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T12:39:55Z","timestamp":1750336795000},"page":"3117-3138","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["kidsNARRATE: a versatile corpus for studying Chinese-english bilingual L2 narrative skills in preschoolers"],"prefix":"10.1007","volume":"59","author":[{"given":"Hiu Ching","family":"Hung","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thorsten","family":"Piske","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Paula Andrea","family":"P\u00e9rez-Toro","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Tom\u00e1s","family":"Arias-Vergara","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Andreas","family":"Maier","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,6,19]]},"reference":[{"issue":"1","key":"9851_CR1","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1017\/S0142716415000466","volume":"37","author":"C Altman","year":"2016","unstructured":"Altman, C., Armon-Lotem, S., Fichman, S., & Walters, J. (2016). Macrostructure, microstructure, and mental state terms in the narratives of english-hebrew bilingual preschool children with and without specific language impairment. Applied Psycholinguistics, 37(1), 165\u2013193.","journal-title":"Applied Psycholinguistics"},{"key":"9851_CR2","doi-asserted-by":"publisher","first-page":"160940691987459","DOI":"10.1177\/1609406919874596","volume":"18","author":"MM Archibald","year":"2019","unstructured":"Archibald, M. M., Ambagtsheer, R. C., Casey, M. G., & Lawless, M. (2019). Using zoom videoconferencing for qualitative data collection: perceptions and experiences of researchers and participants. International journal of qualitative methods, 18, 1609406919874596.","journal-title":"International journal of qualitative methods"},{"key":"9851_CR3","unstructured":"Baker, C. (2011). Foundations of bilingual education and bilingualism. Multilingual matters."},{"key":"9851_CR4","unstructured":"Batliner, A. , Hacker, C. , Steidl, S. , N\u00f6th, E. , D\u2019Arcy, S. , Russell, M.J. , & Wong, M. (2004). \u201cyou stupid tin box\u201d-children interacting with the aibo robot: A cross-linguistic emotional speech corpus."},{"key":"9851_CR5","doi-asserted-by":"publisher","first-page":"461","DOI":"10.1007\/s10212-015-0273-6","volume":"31","author":"L Bigozzi","year":"2016","unstructured":"Bigozzi, L., & Vettori, G. (2016). To tell a story, to write it: developmental patterns of narrative skills from preschool to first grade. European Journal of Psychology of Education, 31, 461\u2013477.","journal-title":"European Journal of Psychology of Education"},{"key":"9851_CR6","doi-asserted-by":"crossref","unstructured":"Bohnacker, U., & Gagarina, N. (2020). Introduction to main\u2013revised, how to use the instrument and adapt it to further languages. ZAS papers in linguistics64:xiii\u2013xxi","DOI":"10.21248\/zaspil.64.2020.549"},{"key":"9851_CR7","doi-asserted-by":"crossref","unstructured":"Calder, J. , Wheeler, R. , Adams, S. , Amarelo, D. , Arnold-Murray, K. , Bai, J., others (2022). Is zoom viable for sociophonetic research? a comparison of in-person and online recordings for vocalic analysis. Linguistics Vanguard0:20200148","DOI":"10.1515\/lingvan-2020-0148"},{"key":"9851_CR8","doi-asserted-by":"crossref","unstructured":"Chan, M.P.Y. , Choe, J. , Li, A. , Chen, Y. , Gao, X. , & Holliday, N.R. (2022). Training and typological bias in asr performance for world englishes. Interspeech (pp. 1273\u20131277).","DOI":"10.21437\/Interspeech.2022-10869"},{"key":"9851_CR9","doi-asserted-by":"crossref","unstructured":"Chen, N.F. , Tong, R. , Wee, D. , Lee, P.X. , Ma, B. , & Li, H. (2016). Singakids-mandarin: Speech corpus of singaporean children speaking mandarin chinese. Interspeech (pp. 1545\u20131549).","DOI":"10.21437\/Interspeech.2016-139"},{"key":"9851_CR10","doi-asserted-by":"crossref","unstructured":"Chua, V.Y.H. , Liu, H. , Garcia, L.P. , Woon, F.T. , Wong, J. , Zhang, X., & Styles, S.J. (2023). MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization. Proc. interspeech 2023 (pp. 4109\u20134113).","DOI":"10.21437\/Interspeech.2023-1446"},{"key":"9851_CR11","doi-asserted-by":"crossref","unstructured":"Connelly, F.M., & Clandinin, D.J. (1990). Stories of experience and narrative inquiry. Educational researcher19(5):2\u201314","DOI":"10.3102\/0013189X019005002"},{"issue":"3","key":"9851_CR12","doi-asserted-by":"publisher","first-page":"299","DOI":"10.1007\/s10993-021-09597-x","volume":"20","author":"MG Delavan","year":"2021","unstructured":"Delavan, M. G., Freire, J. A., & Menken, K. (2021). Editorial introduction: A historical overview of the expanding critique (s) of the gentrification of dual language bilingual education. Language Policy, 20(3), 299\u2013321.","journal-title":"Language Policy"},{"key":"9851_CR13","doi-asserted-by":"publisher","first-page":"20","DOI":"10.21248\/zaspil.63.2019.516","volume":"63","author":"N Gagarina","year":"2019","unstructured":"Gagarina, N., Klop, D., Kunnari, S., Tantele, K., V\u00e4limaa, T., Bohnacker, U., & Walters, J. (2019). Main: Multilingual assessment instrument for narratives-revised. ZAS Papers in Linguistics, 63, 20\u201320.","journal-title":"ZAS Papers in Linguistics"},{"key":"9851_CR14","doi-asserted-by":"crossref","unstructured":"Ge, C. , Xiong, Y. , & Mok, P. (2021). How reliable are phonetic data collected remotely? comparison of recording devices and environments on acoustic measurements. Interspeech (pp. 3984\u20133988).","DOI":"10.21437\/Interspeech.2021-1122"},{"key":"9851_CR15","unstructured":"Gesemann, S. (2024). crosstalkpy. https:\/\/github.com\/sellibitze\/crosstalkpy. Accessed: 2024-07-25."},{"key":"9851_CR16","doi-asserted-by":"crossref","unstructured":"Golonka, E.M. , Bowles, A.R. , Frank, V.M. , Richardson, D.L. , & Freynik, S. (2014). Technologies for foreign language learning: A review of technology types and their effectiveness. Computer assisted language learning. 27(1):70\u2013105","DOI":"10.1080\/09588221.2012.700315"},{"key":"9851_CR17","unstructured":"Gretter, R. , Matassoni, M. , Bann\u00f2, S. , & Daniele, F. (2020). TLT-school: a corpus of non native children speech. N.\u00a0Calzolari et al. (Eds.), Proceedings of the twelfth language resources and evaluation conference (pp. 378\u2013385). European Language Resources Association. https:\/\/aclanthology.org\/2020.lrec-1.47"},{"key":"9851_CR18","doi-asserted-by":"crossref","unstructured":"Hardt, O. , Nader, K. , & Nadel, L. (2013). Decay happens: the role of active forgetting in memory. Trends in cognitive sciences17(3):111\u2013120","DOI":"10.1016\/j.tics.2013.01.001"},{"key":"9851_CR19","doi-asserted-by":"crossref","unstructured":"Hynninen, N. , Pietik\u00e4inen, K.S. , & Vetchinnikova, S. (2017). Multilingualism in english as a lingua franca: Flagging as an indicator of perceived acceptability and intelligibility. Challenging the myth of monolingual corpora (pp. 95\u2013126). Brill.","DOI":"10.1163\/9789004276697_007"},{"issue":"1","key":"9851_CR20","doi-asserted-by":"publisher","first-page":"6","DOI":"10.1080\/19313152.2016.1118667","volume":"10","author":"E.J.d. Jong","year":"2016","unstructured":"Jong, E. .J. .d. (2016). Two-way immersion for the next generation: Models, policies, and principles. International Multilingual Research Journal, 10(1), 6\u201316.","journal-title":"International Multilingual Research Journal"},{"key":"9851_CR21","doi-asserted-by":"crossref","unstructured":"Kannan, J., & Munday, P. (2018). New trends in second language learning and teaching through the lens of ict, networked learning, and artificial intelligence.","DOI":"10.5209\/CLAC.62495"},{"issue":"6","key":"9851_CR22","doi-asserted-by":"publisher","first-page":"703","DOI":"10.1177\/01427237211066405","volume":"42","author":"E Kidd","year":"2022","unstructured":"Kidd, E., & Garcia, R. (2022). How diverse is child language acquisition research? First Language, 42(6), 703\u2013735.","journal-title":"First Language"},{"issue":"2","key":"9851_CR23","doi-asserted-by":"publisher","first-page":"236","DOI":"10.1080\/00131911.2013.865593","volume":"67","author":"YK Kim","year":"2015","unstructured":"Kim, Y. K., Hutchison, L. A., & Winsler, A. (2015). Bilingual education in the united states: An historical overview and examination of two-way immersion. Educational Review, 67(2), 236\u2013252.","journal-title":"Educational Review"},{"issue":"5","key":"9851_CR24","doi-asserted-by":"publisher","first-page":"767","DOI":"10.1080\/00313831.2020.1754903","volume":"65","author":"HBS Knudsen","year":"2021","unstructured":"Knudsen, H. B. S., Donau, P. S., Mifsud, L. C., Papadopoulos, T. C., & Dockrell, J. E. (2021). Multilingual classrooms\u2013danish teachers\u2019 practices, beliefs and attitudes. Scandinavian Journal of Educational Research, 65(5), 767\u2013782.","journal-title":"Scandinavian Journal of Educational Research"},{"issue":"14","key":"9851_CR25","doi-asserted-by":"publisher","first-page":"7684","DOI":"10.1073\/pnas.1915768117","volume":"117","author":"A Koenecke","year":"2020","unstructured":"Koenecke, A., Nam, A., Lake, E., Nudell, J., Quartey, M., Mengesha, Z., & Goel, S. (2020). Racial disparities in automated speech recognition. Proceedings of the national academy of sciences., 117(14), 7684\u20137689.","journal-title":"Proceedings of the national academy of sciences."},{"key":"9851_CR26","doi-asserted-by":"crossref","unstructured":"Kukk, K., & Alum\u00e4e, T. (2022). Improving language identification of accented speech. arXiv preprint arXiv:2203.16972","DOI":"10.21437\/Interspeech.2022-10455"},{"key":"9851_CR27","doi-asserted-by":"crossref","unstructured":"Leemann, A. , Jeszenszky, P. , Steiner, C. , Studerus, M. , & Messerli, J. (2020). Linguistic fieldwork in a pandemic: Supervised data collection combining smartphone recordings and videoconferencing. Linguistics Vanguard6s3","DOI":"10.1515\/lingvan-2020-0061"},{"key":"9851_CR28","unstructured":"MacWhinney, B. (2000). The childes project: Tools for analyzing talk. transcription format and programs (VOL\u00a01). Psychology Press."},{"key":"9851_CR29","doi-asserted-by":"crossref","unstructured":"Maier, A. , Haderlein, T. , & N\u00f6th, E. (2006). Environmental adaptation with a small data set of the target domain. Text, speech and dialogue: 9th international conference, tsd 2006, brno, czech republic, september 11-15, 2006. proceedings 9 (pp. 431\u2013437).","DOI":"10.1007\/11846406_54"},{"issue":"1","key":"9851_CR30","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1186\/s13636-020-00180-6","volume":"2020","author":"P Meyer","year":"2020","unstructured":"Meyer, P., Elshamy, S., & Fingscheidt, T. (2020). Multichannel speaker interference reduction using frequency domain adaptive filtering. EURASIP Journal on Audio, Speech, and Music Processing, 2020(1), 1\u201317.","journal-title":"EURASIP Journal on Audio, Speech, and Music Processing"},{"key":"9851_CR31","unstructured":"Neri, A. , Cucchiarini, C. , & Strik, H. (2003). Automatic speech recognition for second language learning: how and why it actually works. Proc. icphs (pp. 1157\u20131160)."},{"issue":"5","key":"9851_CR32","doi-asserted-by":"publisher","first-page":"571","DOI":"10.1080\/17549507.2020.1712476","volume":"22","author":"J Newbury","year":"2020","unstructured":"Newbury, J., Bartoszewicz Poole, A., & Theys, C. (2020). Current practices of new zealand speech-language pathologists working with multilingual children. International Journal of Speech-Language Pathology, 22(5), 571\u2013582.","journal-title":"International Journal of Speech-Language Pathology"},{"key":"9851_CR33","doi-asserted-by":"crossref","unstructured":"Oliver, D.G. , Serovich, J.M. , & Mason, T.L. (2005). Constraints and opportunities with interview transcription: Towards reflection in qualitative research. Social forces84(2):1273\u20131289","DOI":"10.1353\/sof.2006.0023"},{"issue":"1","key":"9851_CR34","doi-asserted-by":"publisher","first-page":"36","DOI":"10.1598\/RRQ.38.1.3","volume":"38","author":"AH Paris","year":"2003","unstructured":"Paris, A. H., & Paris, S. G. (2003). Assessing narrative comprehension in young children. Reading Research Quarterly, 38(1), 36\u201376.","journal-title":"Reading Research Quarterly"},{"key":"9851_CR35","first-page":"22","volume":"9","author":"T Piske","year":"2010","unstructured":"Piske, T. (2010). A small mouse? a small mouth! zur entwicklung einer guten aussprache im fr\u00fchen englischunterricht. Grundschule, 9, 22\u201324.","journal-title":"Grundschule"},{"key":"9851_CR36","doi-asserted-by":"publisher","DOI":"10.21832\/9781847691118","volume-title":"Input matters in sla (VOL 35)","author":"T Piske","year":"2008","unstructured":"Piske, T., & Young-Scholten, M. (2008). Input matters in sla (VOL 35). Multilingual Matters."},{"issue":"10","key":"9851_CR37","doi-asserted-by":"publisher","first-page":"1490","DOI":"10.3390\/e24101490","volume":"24","author":"K Radha","year":"2022","unstructured":"Radha, K., & Bansal, M. (2022). Audio augmentation for non-native children\u2019s speech recognition through discriminative learning. Entropy, 24(10), 1490.","journal-title":"Entropy"},{"key":"9851_CR38","doi-asserted-by":"publisher","first-page":"627","DOI":"10.1007\/s11145-009-9175-9","volume":"23","author":"E Reese","year":"2010","unstructured":"Reese, E., Suggate, S., Long, J., & Schaughency, E. (2010). Children\u2019s oral narrative and reading skills in the first 3 years of reading instruction. Reading and Writing, 23, 627\u2013644.","journal-title":"Reading and Writing"},{"key":"9851_CR39","doi-asserted-by":"crossref","unstructured":"Rumberg, L. , Gebauer, C. , Ehlert, H. , Wallbaum, M. , Bornholt, L. , Ostermann, J. , & L\u00fcdtke, U. (2022). kidstalc: A corpus of 3-to 11-year-old german children\u2019s connected natural speech. Proceedings interspeech.","DOI":"10.21437\/Interspeech.2022-330"},{"issue":"1","key":"9851_CR40","doi-asserted-by":"publisher","first-page":"9053","DOI":"10.1149\/10701.9053ecst","volume":"107","author":"R Sobti","year":"2022","unstructured":"Sobti, R., Kadyan, V., & Guleria, K. (2022). Challenges for designing of children speech corpora: A state-of-the-art review. ECS Transactions, 107(1), 9053.","journal-title":"ECS Transactions"},{"key":"9851_CR41","unstructured":"Stein, N. , Albro, E. , & Bamberg, M. (1997). Of goal-structured knowledge in telling stories. Narrative development: Six approaches:5\u201344"},{"key":"9851_CR42","doi-asserted-by":"publisher","first-page":"82","DOI":"10.1016\/j.cogdev.2018.04.005","volume":"47","author":"S Suggate","year":"2018","unstructured":"Suggate, S., Schaughency, E., McAnally, H., & Reese, E. (2018). From infancy to adolescence: The longitudinal links between vocabulary, early literacy skills, oral narrative, and reading comprehension. Cognitive Development, 47, 82\u201395.","journal-title":"Cognitive Development"},{"key":"9851_CR43","doi-asserted-by":"crossref","unstructured":"Vogel, A.P. , Rosen, K.M. , Morgan, A.T. , & Reilly, S. (2015). Comparability of modern recording devices for speech analysis: smartphone, landline, laptop, and hard disc recorder. Folia phoniatrica et logopaedica66(6):244\u2013250","DOI":"10.1159\/000368227"},{"issue":"1","key":"9851_CR44","doi-asserted-by":"publisher","first-page":"235","DOI":"10.1146\/annurev.psych.55.090902.141555","volume":"55","author":"JT Wixted","year":"2004","unstructured":"Wixted, J. T. (2004). The psychology and neuroscience of forgetting. Annual Review of Psychology, 55(1), 235\u2013269.","journal-title":"Annual Review of Psychology"},{"key":"9851_CR45","doi-asserted-by":"crossref","unstructured":"Wrigley, S.N. , Brown, G.J. , Wan, V. , & Renals, S. (2004). Speech and crosstalk detection in multichannel audio. IEEE Transactions on speech and audio processing13(1):84\u201391","DOI":"10.1109\/TSA.2004.838531"},{"key":"9851_CR46","doi-asserted-by":"crossref","unstructured":"Yeung, G., & Alwan, A. (2018). 2018. Interspeech: On the difficulties of automatic speech recognition for kindergarten-aged children.","DOI":"10.21437\/Interspeech.2018-2297"},{"key":"9851_CR47","doi-asserted-by":"crossref","unstructured":"Zhang, J. , Zhang, Z. , Wang, Y. , Yan, Z. , Song, Q. , Huang, Y., & Wang, Y. (2021). speechocean762: An Open-Source Non-Native English Speech Corpus for Pronunciation Assessment. Proc. interspeech 2021 (pp. 3710\u20133714).","DOI":"10.21437\/Interspeech.2021-1259"}],"container-title":["Language Resources and Evaluation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-025-09851-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s10579-025-09851-2\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s10579-025-09851-2.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,6]],"date-time":"2025-09-06T20:30:54Z","timestamp":1757190654000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s10579-025-09851-2"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,19]]},"references-count":47,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2025,9]]}},"alternative-id":["9851"],"URL":"https:\/\/doi.org\/10.1007\/s10579-025-09851-2","relation":{},"ISSN":["1574-020X","1574-0218"],"issn-type":[{"type":"print","value":"1574-020X"},{"type":"electronic","value":"1574-0218"}],"subject":[],"published":{"date-parts":[[2025,6,19]]},"assertion":[{"value":"28 May 2025","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"19 June 2025","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}]}}