{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2024,6,5]],"date-time":"2024-06-05T11:38:52Z","timestamp":1717587532666},"reference-count":72,"publisher":"MIT Press","issue":"1","license":[{"start":{"date-parts":[[2021,3,6]],"date-time":"2021-03-06T00:00:00Z","timestamp":1614988800000},"content-version":"vor","delay-in-days":5,"URL":"https:\/\/creativecommons.org\/licenses\/by-nc-nd\/4.0\/"}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2021,4,21]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>This article describes a simple PCFG induction model with a fixed category domain that predicts a large majority of attested constituent boundaries, and predicts labels consistent with nearly half of attested constituent labels on a standard evaluation data set of child-directed speech. The article then explores the idea that the difference between simple grammars exhibited by child learners and fully recursive grammars exhibited by adult learners may be an effect of increasing working memory capacity, where the shallow grammars are constrained images of the recursive grammars. An implementation of these memory bounds as limits on center embedding in a depth-specific transform of a recursive grammar yields a significant improvement over an equivalent but unbounded baseline, suggesting that this arrangement may indeed confer a learning advantage.<\/jats:p>","DOI":"10.1162\/coli_a_00399","type":"journal-article","created":{"date-parts":[[2021,3,5]],"date-time":"2021-03-05T18:59:47Z","timestamp":1614970787000},"page":"181-216","update-policy":"http:\/\/dx.doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":4,"title":["Depth-Bounded Statistical PCFG Induction as a Model of Human Grammar Acquisition"],"prefix":"10.1162","volume":"47","author":[{"given":"Lifeng","family":"Jin","sequence":"first","affiliation":[{"name":"The Ohio State University, Department of Linguistics. jin.544@osu.edu"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lane","family":"Schwartz","sequence":"additional","affiliation":[{"name":"University of Illinois at Urbana-Champaign, Department of Linguistics. lanes@illinois.edu"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Finale","family":"Doshi-Velez","sequence":"additional","affiliation":[{"name":"Harvard University, Department of Computer Science. finale@seas.harvard.edu"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Timothy","family":"Miller","sequence":"additional","affiliation":[{"name":"Boston Children\u2019s Hospital & Harvard Medical School, Computational Health Informatics Program. timothy.miller@childrens.harvard.edu"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"William","family":"Schuler","sequence":"additional","affiliation":[{"name":"The Ohio State University, Department of Linguistics. schuler@ling.osu.edu"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"281","published-online":{"date-parts":[[2021,4,21]]},"reference":[{"issue":"3","key":"2021042218045001200_bib1","doi-asserted-by":"crossref","first-page":"233","DOI":"10.1007\/BF01067217","article-title":"Memory Requirements and local ambiguities of parsing strategies","volume":"20","author":"Abney","year":"1991","journal-title":"Journal of Psycholinguistic Research"},{"issue":"41","key":"2021042218045001200_bib2","doi-asserted-by":"crossref","first-page":"17284","DOI":"10.1073\/pnas.0905638106","article-title":"Modeling children\u2019s early grammatical knowledge","volume":"106","author":"Bannard","year":"2009","journal-title":"Proceedings of the National Academy of Sciences of the United States of America"},{"issue":"1\u20133","key":"2021042218045001200_bib3","doi-asserted-by":"crossref","first-page":"2","DOI":"10.1080\/01690960400001721","article-title":"The input-output relationship in first language acquisition","volume":"21","author":"Behrens","year":"2006","journal-title":"Language and Cognitive Processes"},{"key":"2021042218045001200_bib4","first-page":"582","article-title":"Painless unsupervised learning with features","volume-title":"Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics","author":"Berg-Kirkpatrick","year":"2010"},{"key":"2021042218045001200_bib5","first-page":"870","article-title":"Labeled grammar induction with minimal supervision","volume-title":"Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers)","author":"Bisk","year":"2015"},{"issue":"1","key":"2021042218045001200_bib6","doi-asserted-by":"crossref","first-page":"1643","DOI":"10.1609\/aaai.v26i1.8355","article-title":"Simple robust grammar induction with combinatory categorial grammars","volume":"26","author":"Bisk","year":"2012","journal-title":"Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence"},{"key":"2021042218045001200_bib7","doi-asserted-by":"crossref","DOI":"10.4159\/harvard.9780674732469","volume-title":"A First Language: The Early Stages","author":"Brown","year":"1973"},{"issue":"March","key":"2021042218045001200_bib8","first-page":"1","article-title":"Two experiments on learning probabilistic dependency grammars from corpora","author":"Carroll","year":"1992","journal-title":"Working Notes of the Workshop on Statistically-Based NLP Techniques"},{"key":"2021042218045001200_bib9","first-page":"173","article-title":"Coarse-to-fine n-best parsing and MaxEnt discriminative reranking","volume-title":"Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL\u201905)","author":"Charniak","year":"2005"},{"key":"2021042218045001200_bib10","volume-title":"Aspects of the Theory of Syntax","author":"Chomsky","year":"1965"},{"key":"2021042218045001200_bib11","first-page":"751","article-title":"On cognitive structures and their development: A reply to Piaget","volume-title":"Language and Learning: The Debate Between Jean Piaget and Noam Chomsky","author":"Chomsky","year":"1980"},{"key":"2021042218045001200_bib12","volume-title":"Knowledge of Language: Its Nature, Origin, and Use","author":"Chomsky","year":"1986"},{"key":"2021042218045001200_bib13","first-page":"269","article-title":"Introduction to the formal analysis of natural languages","volume-title":"Handbook of Mathematical Psychology","author":"Chomsky","year":"1963"},{"key":"2021042218045001200_bib14","first-page":"43","article-title":"Limitations of current grammar induction algorithms","volume-title":"Proceedings of the ACL 2007 Student Research Workshop","author":"Cramer","year":"2007"},{"key":"2021042218045001200_bib15","volume-title":"Cours de linguistique g\u00e9n\u00e9rale","author":"de Saussure","year":"1916"},{"key":"2021042218045001200_bib16","first-page":"69","volume-title":"A multimedia corpus of child Mandarin: The Tong corpus","author":"Deng","year":"2018"},{"issue":"1","key":"2021042218045001200_bib17","first-page":"102","article-title":"Semantic change versus categorical change: A study of the development Of BA in Mandarin","volume":"29","author":"Ding","year":"2001","journal-title":"Journal of Chinese Linguistics"},{"key":"2021042218045001200_bib18","first-page":"1129","article-title":"Unsupervised latent tree induction with deep inside-outside recursive auto-encoders","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Drozdov","year":"2019"},{"key":"2021042218045001200_bib19","doi-asserted-by":"crossref","first-page":"25","DOI":"10.18653\/v1\/W15-3304","article-title":"Parsing Chinese with a generalized categorial grammar","volume-title":"Proceedings of the Grammar Engineering Across Frameworks (GEAF) 2015 Workshop","author":"Duan","year":"2015"},{"issue":"2","key":"2021042218045001200_bib20","doi-asserted-by":"crossref","first-page":"311","DOI":"10.1080\/15326900701221454","article-title":"Modeling the developmental patterning of finiteness marking in English, Dutch, German, and Spanish using MOSAIC","volume":"31","author":"Freudenthal","year":"2007","journal-title":"Cognitive Science"},{"issue":"1,4","key":"2021042218045001200_bib21","doi-asserted-by":"crossref","first-page":"95","DOI":"10.1109\/TSMC.1975.5409159","article-title":"Grammatical inference: Introduction and survey","volume":"SMC-5","author":"Fu","year":"1975","journal-title":"IEEE Transactions on Systems, Man, and Cybernetics"},{"issue":"10","key":"2021042218045001200_bib22","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1016\/S0019-9958(67)91165-5","article-title":"Language identification in the limit","author":"Gold","year":"1967","journal-title":"Information and Control"},{"key":"2021042218045001200_bib23","first-page":"744","article-title":"A fully Bayesian approach to unsupervised part-of-speech tagging","volume-title":"Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics","author":"Goldwater","year":"2007"},{"key":"2021042218045001200_bib24","article-title":"Parsing Inside-Out","author":"Goodman","year":"1998"},{"key":"2021042218045001200_bib25","doi-asserted-by":"crossref","first-page":"763","DOI":"10.18653\/v1\/D16-1073","article-title":"Unsupervised neural dependency parsing","volume-title":"Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing","author":"Jiang","year":"2016"},{"key":"2021042218045001200_bib26","doi-asserted-by":"crossref","first-page":"2721","DOI":"10.18653\/v1\/D18-1292","article-title":"Depth-bounding is effective: Improvements and evaluation of unsupervised PCFG induction","volume-title":"Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing","author":"Jin","year":"2018"},{"key":"2021042218045001200_bib27","doi-asserted-by":"crossref","first-page":"211","DOI":"10.1162\/tacl_a_00016","article-title":"Unsupervised grammar induction with depth-bounded PCFG","volume":"6","author":"Jin","year":"2018","journal-title":"Transactions of the Association for Computational Linguistics"},{"key":"2021042218045001200_bib28","doi-asserted-by":"crossref","first-page":"2442","DOI":"10.18653\/v1\/P19-1234","article-title":"Unsupervised learning of PCFGs with normalizing flow","volume-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics","author":"Jin","year":"2019"},{"key":"2021042218045001200_bib29","first-page":"139","article-title":"Bayesian inference for PCFGs via Markov chain Monte Carlo","volume-title":"Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference","author":"Johnson","year":"2007"},{"key":"2021042218045001200_bib30","volume-title":"Mental Models: Towards a Cognitive Science of Language, Inference, and Consciousness","author":"Johnson-Laird","year":"1983"},{"key":"2021042218045001200_bib31","doi-asserted-by":"crossref","first-page":"365","DOI":"10.1017\/S0022226707004616","article-title":"Constraints on multiple center-embedding of clauses","volume":"43","author":"Karlsson","year":"2007","journal-title":"Journal of Linguistics"},{"key":"2021042218045001200_bib32","first-page":"2045","article-title":"Working memory constraints on multiple center-embedding","volume-title":"Proceedings from the 32nd Annual Meeting of the Cognitive Science Society","author":"Karlsson","year":"2010"},{"issue":"(1)","key":"2021042218045001200_bib33","first-page":"15","article-title":"A critique of Chomsky\u2019s theory of grammatical competence","volume":"1","author":"Kates","year":"1976","journal-title":"Forum Linguisticum"},{"key":"2021042218045001200_bib34","doi-asserted-by":"crossref","first-page":"2369","DOI":"10.18653\/v1\/P19-1228","article-title":"Compound probabilistic context-free grammars for grammar induction","volume-title":"Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics","author":"Kim","year":"2019"},{"key":"2021042218045001200_bib35","first-page":"1105","article-title":"Unsupervised recurrent neural network grammars","volume-title":"Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers)","author":"Kim","year":"2019"},{"key":"2021042218045001200_bib36","doi-asserted-by":"crossref","first-page":"2676","DOI":"10.18653\/v1\/P18-1249","article-title":"Constituency parsing with a self-attentive encoder","volume-title":"Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)","author":"Kitaev","year":"2018"},{"key":"2021042218045001200_bib37","doi-asserted-by":"crossref","first-page":"478","DOI":"10.3115\/1218955.1219016","article-title":"Corpus-based induction of syntactic structure: Models of dependency and constituency","volume-title":"Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL-04)","author":"Klein","year":"2004"},{"key":"2021042218045001200_bib38","first-page":"128","article-title":"A generative constituent-context model for improved grammar induction","volume-title":"Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics","author":"Klein","year":"2002"},{"key":"2021042218045001200_bib39","first-page":"688","article-title":"The infinite PCFG using hierarchical Dirichlet processes","volume-title":"Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)","author":"Liang","year":"2007"},{"issue":"1","key":"2021042218045001200_bib40","doi-asserted-by":"crossref","first-page":"187","DOI":"10.1017\/S0305000996002930","article-title":"Lexically-based learning and early grammatical development","volume":"24","author":"Lieven","year":"1997","journal-title":"Journal of Child Language"},{"key":"2021042218045001200_bib41","volume-title":"The CHILDES Project: Tools for Analyzing Talk","author":"Macwhinney","year":"1992","edition":"third edition"},{"issue":"2","key":"2021042218045001200_bib42","first-page":"313","article-title":"Building a large annotated corpus of English: The Penn Treebank","volume":"19","author":"Marcus","year":"1993","journal-title":"Computational Linguistics"},{"key":"2021042218045001200_bib43","first-page":"201","article-title":"Some comments on competence and performance","volume-title":"Developmental Psycholinguistics and Communication Disorders","author":"Miller","year":"1975"},{"issue":"1","key":"2021042218045001200_bib44","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1016\/S0010-0277(03)00140-9","article-title":"Frequent frames as a cue for grammatical categories in child directed speech","volume":"90","author":"Mintz","year":"2003","journal-title":"Cognition"},{"key":"2021042218045001200_bib45","first-page":"1234","article-title":"Using universal linguistic knowledge to guide grammar induction","volume-title":"Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing","author":"Naseem","year":"2010"},{"key":"2021042218045001200_bib46","first-page":"682","article-title":"Grammar is grammar and usage is usage","volume-title":"Language","author":"Newmeyer","year":"2010"},{"key":"2021042218045001200_bib47","doi-asserted-by":"crossref","first-page":"33","DOI":"10.18653\/v1\/D16-1004","article-title":"Using left-corner parsing to encode universal structural constraints in grammar induction","volume-title":"Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing","author":"Noji","year":"2016"},{"key":"2021042218045001200_bib48","doi-asserted-by":"crossref","first-page":"63","DOI":"10.1162\/tacl_a_00210","article-title":"Unsupervised dependency parsing with acoustic cues","volume":"1","author":"Pate","year":"2013","journal-title":"Transactions of the Association for Computational Linguistics"},{"issue":"1","key":"2021042218045001200_bib49","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1080\/10489223.2012.738742","article-title":"Syntactic islands and learning biases: Combining experimental syntax and computational modeling to investigate the language acquisition problem","volume":"20","author":"Pearl","year":"2013","journal-title":"Language Acquisition"},{"key":"2021042218045001200_bib50","doi-asserted-by":"crossref","first-page":"128","DOI":"10.3115\/981967.981984","article-title":"Inside-outside reestimation from partially bracketed corpora","volume-title":"30th Annual Meeting of the Association for Computational Linguistics","author":"Pereira","year":"1992"},{"key":"2021042218045001200_bib51","first-page":"663","article-title":"Poverty of the stimulus? A rational approach","volume-title":"Proceedings of the 28th Annual Conference of the Cognitive Science Society","author":"Perfors","year":"2006"},{"key":"2021042218045001200_bib52","first-page":"1077","article-title":"Simple unsupervised grammar induction from raw text with cascaded finite state models","volume-title":"Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies","author":"Ponvert","year":"2011"},{"key":"2021042218045001200_bib53","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1515\/tlir.19.1-2.9","article-title":"Empirical assessment of stimulus poverty arguments","volume":"18","author":"Pullum","year":"2002","journal-title":"Linguistic Review"},{"issue":"1","key":"2021042218045001200_bib54","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1007\/BF01067110","article-title":"The role of competence theories in cognitive psychology","volume":"2","author":"Pylyshyn","year":"1973","journal-title":"Journal of Psycholinguistic Research"},{"issue":"4","key":"2021042218045001200_bib55","doi-asserted-by":"crossref","first-page":"425","DOI":"10.1207\/s15516709cog2204_2","article-title":"Distributional information: A powerful cue for acquiring syntactic categories","volume":"22","author":"Redington","year":"1998","journal-title":"Cognitive Science"},{"key":"2021042218045001200_bib56","first-page":"139","article-title":"Deterministic left corner parsing","volume-title":"11th Annual Symposium on Switching and Automata Theory","author":"Rosenkrantz","year":"1970"},{"issue":"1","key":"2021042218045001200_bib57","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1162\/coli.2010.36.1.36100","article-title":"Broad-coverage parsing using human-like memory constraints","volume":"36","author":"Schuler","year":"2010","journal-title":"Computational Linguistics"},{"key":"2021042218045001200_bib58","first-page":"384","article-title":"Fast unsupervised incremental parsing","volume-title":"Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics","author":"Seginer","year":"2007"},{"key":"2021042218045001200_bib59","unstructured":"Seginer, Yoav . 2007b. Learning Syntactic Structure. Ph.D. thesis, University of Amsterdam."},{"key":"2021042218045001200_bib60","first-page":"964","article-title":"Memory-bounded left-corner unsupervised grammar induction on child-directed input","volume-title":"Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers","author":"Shain","year":"2016"},{"key":"2021042218045001200_bib61","article-title":"Neural language modeling by jointly learning syntax and lexicon","volume-title":"6th International Conference on Learning Representations, ICLR 2018, Conference Track Proceedings","author":"Shen","year":"2018"},{"key":"2021042218045001200_bib62","article-title":"Ordered neurons: Integrating tree structures into recurrent neural networks","volume-title":"7th International Conference on Learning Representations, ICLR 2019","author":"Shen","year":"2019"},{"key":"2021042218045001200_bib63","first-page":"7","article-title":"A linguistically interpreted corpus of German newspaper text","volume-title":"Proceedings of the ESSLLI Workshop on Recent Advances in Corpus Annotation","author":"Skut","year":"1998"},{"key":"2021042218045001200_bib64","unstructured":"Smith, Noah Ashton . 2006. Novel Estimation Methods for Unsupervised Discovery of Latent Structure in Natural Language Text. PhD Thesis, Johns Hopkins University."},{"issue":"1\u20132","key":"2021042218045001200_bib65","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1016\/S0019-9958(64)90223-2","article-title":"A formal theory of inductive inference","volume":"7","author":"Solomonoff","year":"1964","journal-title":"Information and Control"},{"key":"2021042218045001200_bib66","first-page":"834","article-title":"Formalizing affordance","volume-title":"Proceedings of the Annual Meeting of the Cognitive Science Society","author":"Steedman","year":"2002"},{"issue":"1","key":"2021042218045001200_bib67","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1080\/15475440709336999","article-title":"Statistical learning of syntax: The role of transitional probability","volume":"3","author":"Thompson","year":"2007","journal-title":"Language Learning and Development"},{"key":"2021042218045001200_bib68","volume-title":"Constructing a Language: A Usage-Based Theory of Language Acquisition","author":"Tomasello","year":"2003"},{"key":"2021042218045001200_bib69","unstructured":"Tu, Kewei . 2012. Unsupervised Learning of Probabilistic Grammars. Ph.D. thesis, Iowa State University."},{"issue":"3","key":"2021042218045001200_bib70","doi-asserted-by":"crossref","first-page":"522","DOI":"10.1111\/tops.12034","article-title":"A model of language processing as hierarchic sequential prediction","volume":"5","author":"van Schijndel","year":"2013","journal-title":"Topics in Cognitive Science"},{"key":"2021042218045001200_bib71","article-title":"Developing guidelines and ensuring consistency for Chinese text annotation","volume-title":"Proceedings of the Second International Conference on Language Resources and Evaluation (LREC\u201900)","author":"Xia","year":"2000"},{"issue":"1","key":"2021042218045001200_bib72","doi-asserted-by":"crossref","first-page":"135","DOI":"10.1016\/j.brainres.2007.01.030","article-title":"The semantic processing of syntactic structure in sentence comprehension: An ERP study","volume":"1142","author":"Ye","year":"2007","journal-title":"Brain Research"}],"container-title":["Computational Linguistics"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/direct.mit.edu\/coli\/article-pdf\/47\/1\/181\/1911441\/coli_a_00399.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"http:\/\/direct.mit.edu\/coli\/article-pdf\/47\/1\/181\/1911441\/coli_a_00399.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,20]],"date-time":"2022-12-20T07:13:09Z","timestamp":1671520389000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/coli\/article\/47\/1\/181\/97336\/Depth-Bounded-Statistical-PCFG-Induction-as-a"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,3]]},"references-count":72,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2021,4,21]]},"published-print":{"date-parts":[[2021,4,21]]}},"URL":"https:\/\/doi.org\/10.1162\/coli_a_00399","relation":{},"ISSN":["0891-2017","1530-9312"],"issn-type":[{"value":"0891-2017","type":"print"},{"value":"1530-9312","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2021,3]]},"published":{"date-parts":[[2021,3]]}}}