{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:00:51Z","timestamp":1772164851191,"version":"3.50.1"},"reference-count":69,"publisher":"MIT Press","issue":"5","funder":[{"DOI":"10.13039\/100010663","name":"European Research Council","doi-asserted-by":"publisher","award":["787307"],"award-info":[{"award-number":["787307"]}],"id":[{"id":"10.13039\/100010663","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["direct.mit.edu"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,5,1]]},"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>In natural and artificial neural networks, modularity and distributed structure afford complementary but competing benefits. The former allows for hierarchical representations that can flexibly recombine modules to address novel problems, whereas the latter can benefit from less constrained training, potentially uncovering fruitful statistical regularities. Here, we investigate these competing demands in the context of human sequential behavior. First, we explore this setting by comparing the properties of several recurrent neural network models. We find that explicit hierarchical structure by itself fails to provide a critical performance advantage when compared with a \u201cflat\u201d model that does not incorporate hierarchical structure. However, hierarchy appears to facilitate cognitive control processes that support nonroutine behaviors and behaviors that are carried out under computational stress. Second, we compare these models against fMRI data using representational similarity analysis. We find that a model that incorporates so-called wiring costs in the cost function, which produces a hierarchically organized gradient of representational structure across the hidden layer of the neural network, best accounts for fMRI data collected from human participants in a previous study [Holroyd, C. B., Ribas-Fernandes, J. J. F., Shahnazian, D., Silvetti, M., &amp; Verguts, T., Human midcingulate cortex encodes distributed representations of task progress. Proceedings of the National Academy of Sciences, U.S.A., 115, 6398\u20136403, 2018]. The results reveal that the ACC encodes distributed representations of sequential task context along a rostro-caudal gradient of abstraction: Rostral ACC encodes relatively abstract and temporally extended patterns of activity compared with those encoded by caudal ACC. These results provide insight into the role of ACC in motivation and cognitive control.<\/jats:p>","DOI":"10.1162\/jocn_a_02285","type":"journal-article","created":{"date-parts":[[2025,1,9]],"date-time":"2025-01-09T09:51:34Z","timestamp":1736416294000},"page":"941-969","update-policy":"https:\/\/doi.org\/10.1162\/mitpressjournals.corrections.policy","source":"Crossref","is-referenced-by-count":5,"title":["Distributed Representations for Cognitive Control in Frontal Medial Cortex"],"prefix":"10.1162","volume":"37","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-6244-326X","authenticated-orcid":true,"given":"Thomas R.","family":"Colin","sequence":"first","affiliation":[{"name":"Ghent University"}]},{"given":"Iris","family":"Ikink","sequence":"additional","affiliation":[{"name":"Ghent University"}]},{"given":"Clay B.","family":"Holroyd","sequence":"additional","affiliation":[{"name":"Ghent University"}]}],"member":"281","published-online":{"date-parts":[[2025,5,1]]},"reference":[{"key":"2025040708505360500_bib1","doi-asserted-by":"publisher","first-page":"4671","DOI":"10.1523\/JNEUROSCI.4400-07.2008","article-title":"Anticipatory activity in anterior cingulate cortex can be independent of conflict and error likelihood","volume":"28","author":"Aarts","year":"2008","journal-title":"Journal of Neuroscience"},{"key":"2025040708505360500_bib2","doi-asserted-by":"publisher","first-page":"149","DOI":"10.1016\/j.neuron.2020.10.013","article-title":"The anterior cingulate cortex predicts future states to mediate model-based action selection","volume":"109","author":"Akam","year":"2021","journal-title":"Neuron"},{"key":"2025040708505360500_bib4","doi-asserted-by":"publisher","first-page":"605271","DOI":"10.3389\/fncom.2021.605271","article-title":"Interactions of medial and lateral prefrontal cortex in hierarchical predictive coding","volume":"15","author":"Alexander","year":"2021","journal-title":"Frontiers in Computational Neuroscience"},{"key":"2025040708505360500_bib5","doi-asserted-by":"publisher","DOI":"10.1609\/aaai.v31i1.10916","article-title":"The option-critic architecture","volume":"31","author":"Bacon","year":"2017","journal-title":"Proceedings of the AAAI Conference on Artificial Intelligence"},{"key":"2025040708505360500_bib6","doi-asserted-by":"publisher","first-page":"20","DOI":"10.1016\/j.cobeha.2020.07.002","article-title":"The dimensionality of neural representations for control","volume":"38","author":"Badre","year":"2021","journal-title":"Current Opinion in Behavioral Sciences"},{"key":"2025040708505360500_bib7","doi-asserted-by":"publisher","first-page":"165","DOI":"10.1016\/B978-0-12-804281-6.00009-4","article-title":"Hierarchical cognitive control and the frontal lobes","volume":"163","author":"Badre","year":"2019","journal-title":"Handbook of Clinical Neurology"},{"key":"2025040708505360500_bib8","doi-asserted-by":"publisher","first-page":"170","DOI":"10.1016\/j.tics.2017.11.005","article-title":"Frontal cortex and the hierarchical control of behavior","volume":"22","author":"Badre","year":"2018","journal-title":"Trends in Cognitive Sciences"},{"key":"2025040708505360500_bib9","first-page":"25164","article-title":"The functional specialization of visual cortex emerges from training parallel pathways with self-supervised predictive learning","volume":"34","author":"Bakhtiari","year":"2021","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2025040708505360500_bib10","doi-asserted-by":"publisher","first-page":"1011","DOI":"10.3758\/bf03206436","article-title":"Distraction and action slips in an everyday task: Evidence for a dynamic representation of task context","volume":"12","author":"Botvinick","year":"2005","journal-title":"Psychonomic Bulletin & Review"},{"key":"2025040708505360500_bib11","doi-asserted-by":"publisher","first-page":"395","DOI":"10.1037\/0033-295X.111.2.395","article-title":"Doing without schema hierarchies: A recurrent connectionist approach to normal and impaired routine sequential action","volume":"111","author":"Botvinick","year":"2004","journal-title":"Psychological Review"},{"key":"2025040708505360500_bib12","doi-asserted-by":"publisher","first-page":"950","DOI":"10.1016\/j.tics.2017.09.013","article-title":"Parallel distributed processing theory in the age of deep networks","volume":"21","author":"Bowers","year":"2017","journal-title":"Trends in Cognitive Sciences"},{"key":"2025040708505360500_bib13","first-page":"1877","article-title":"Language models are few-shot learners","volume":"33","author":"Brown","year":"2020","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2025040708505360500_bib14","doi-asserted-by":"publisher","first-page":"336","DOI":"10.1038\/nrn3214","article-title":"The economy of brain network organization","volume":"13","author":"Bullmore","year":"2012","journal-title":"Nature Reviews Neuroscience"},{"key":"2025040708505360500_bib15","doi-asserted-by":"publisher","first-page":"4723","DOI":"10.1073\/pnas.0506806103","article-title":"Wiring optimization can relate neuronal structure and function","volume":"103","author":"Chen","year":"2006","journal-title":"Proceedings of the National Academy of Sciences, U.S.A."},{"key":"2025040708505360500_bib16","doi-asserted-by":"publisher","first-page":"2418","DOI":"10.1523\/JNEUROSCI.14-04-02418.1994","article-title":"Component placement optimization in the brain","volume":"14","author":"Cherniak","year":"1994","journal-title":"Journal of Neuroscience"},{"key":"2025040708505360500_bib17","doi-asserted-by":"publisher","first-page":"369","DOI":"10.1146\/annurev.neuro.27.070203.144226","article-title":"Maps in the brain: What can we learn from them?","volume":"27","author":"Chklovskii","year":"2004","journal-title":"Annual Review of Neuroscience"},{"key":"2025040708505360500_bib18","doi-asserted-by":"publisher","first-page":"284","DOI":"10.1002\/asi.5090080406","article-title":"Logical structures in language","volume":"8","author":"Chomsky","year":"1957","journal-title":"American Documentation"},{"key":"2025040708505360500_bib19","doi-asserted-by":"publisher","first-page":"94","DOI":"10.1016\/j.brainres.2009.05.096","article-title":"Prefrontal organization of cognitive control according to levels of abstraction","volume":"1286","author":"Christoff","year":"2009","journal-title":"Brain Research"},{"key":"2025040708505360500_bib20","doi-asserted-by":"publisher","first-page":"27755","DOI":"10.1038\/srep27755","article-title":"Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence","volume":"6","author":"Cichy","year":"2016","journal-title":"Scientific Reports"},{"key":"2025040708505360500_bib21","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1002\/9781118920497.ch1","article-title":"Cognitive control: Core constructs and current considerations","volume-title":"The Wiley handbook of cognitive control","author":"Cohen","year":"2017"},{"key":"2025040708505360500_bib22","doi-asserted-by":"publisher","first-page":"332","DOI":"10.1037\/0033-295x.97.3.332","article-title":"On the control of automatic processes: A parallel distributed processing account of the Stroop effect","volume":"97","author":"Cohen","year":"1990","journal-title":"Psychological Review"},{"key":"2025040708505360500_bib23","first-page":"71","article-title":"Neural network models of anterior cingulate cortex in the Atari domain","volume-title":"Proceedings of the 7th bi-annual ESCAN meeting","author":"Colin","year":"2024"},{"key":"2025040708505360500_bib24","doi-asserted-by":"publisher","first-page":"244","DOI":"10.1111\/cogs.12067","article-title":"The goal circuit model: A hierarchical multi-route model of the acquisition and control of routine sequential action in humans","volume":"38","author":"Cooper","year":"2014","journal-title":"Cognitive Science"},{"key":"2025040708505360500_bib3","doi-asserted-by":"publisher","first-page":"724","DOI":"10.1016\/j.neuron.2020.11.021","article-title":"Using deep reinforcement learning to reveal how the brain encodes abstract state-space representations in high-dimensional environments","volume":"109","author":"Cross","year":"2021","journal-title":"Neuron"},{"key":"2025040708505360500_bib204","doi-asserted-by":"publisher","first-page":"179","DOI":"10.1207\/s15516709cog1402_1","article-title":"Finding structure in time","volume":"14","author":"Elman","year":"1990","journal-title":"Cognitive Science"},{"key":"2025040708505360500_bib25","doi-asserted-by":"publisher","first-page":"1258","DOI":"10.1016\/j.neuron.2022.01.005","article-title":"Orthogonal representations for robust context-dependent task performance in brains and neural networks","volume":"110","author":"Flesch","year":"2022","journal-title":"Neuron"},{"key":"2025040708505360500_bib202","doi-asserted-by":"publisher","first-page":"622","DOI":"10.1016\/j.tics.2021.03.011","article-title":"Neural coding of cognitive control: The representational similarity analysis approach","volume":"25","author":"Freund","year":"2021","journal-title":"Trends in Cognitive Sciences"},{"key":"2025040708505360500_bib26","volume-title":"The mind's new science: A history of the cognitive revolution","author":"Gardner","year":"1985"},{"key":"2025040708505360500_bib27","doi-asserted-by":"publisher","first-page":"17","DOI":"10.1016\/j.cobeha.2018.12.010","article-title":"Reconciling deep learning with symbolic artificial intelligence: Representing objects and relations","volume":"29","author":"Garnelo","year":"2019","journal-title":"Current Opinion in Behavioral Sciences"},{"key":"2025040708505360500_bib28","doi-asserted-by":"publisher","first-page":"337","DOI":"10.1111\/j.1749-6632.1995.tb38149.x","article-title":"Similarities and distinctions among current models of prefrontal cortical functions","volume":"769","author":"Grafman","year":"1995","journal-title":"Annals of the New York Academy of Sciences"},{"key":"2025040708505360500_bib29","first-page":"1851","article-title":"Latent space policies for hierarchical reinforcement learning","volume-title":"Proceedings of the 35th international conference on machine learning","author":"Haarnoja","year":"2018"},{"key":"2025040708505360500_bib30","doi-asserted-by":"publisher","first-page":"471","DOI":"10.1017\/S0140525X00079760","article-title":"What connectionist models learn: Learning and representation in connectionist networks","volume":"13","author":"Hanson","year":"1990","journal-title":"Behavioral and Brain Sciences"},{"key":"2025040708505360500_bib31","doi-asserted-by":"publisher","first-page":"1026","DOI":"10.1109\/ICCV.2015.123","article-title":"Delving deep into rectifiers: Surpassing human-level performance on ImageNet classification","volume-title":"IEEE International Conference on Computer Vision","author":"He","year":"2015"},{"key":"2025040708505360500_bib32","doi-asserted-by":"publisher","DOI":"10.1037\/rev0000467","article-title":"The controllosphere: The neural origin of cognitive effort","author":"Holroyd","year":"2024","journal-title":"Psychological Review"},{"key":"2025040708505360500_bib33","doi-asserted-by":"publisher","first-page":"54","DOI":"10.1037\/a0038339","article-title":"Hierarchical control over effortful behavior by rodent medial frontal cortex: A computational model","volume":"122","author":"Holroyd","year":"2015","journal-title":"Psychological Review"},{"key":"2025040708505360500_bib34","doi-asserted-by":"publisher","first-page":"6398","DOI":"10.1073\/pnas.1803650115","article-title":"Human midcingulate cortex encodes distributed representations of task progress","volume":"115","author":"Holroyd","year":"2018","journal-title":"Proceedings of the National Academy of Sciences, U.S.A."},{"key":"2025040708505360500_bib35","doi-asserted-by":"publisher","first-page":"316","DOI":"10.1016\/j.tics.2021.01.008","article-title":"The best laid plans: Computational principles of anterior cingulate cortex","volume":"25","author":"Holroyd","year":"2021","journal-title":"Trends in Cognitive Sciences"},{"key":"2025040708505360500_bib36","doi-asserted-by":"publisher","first-page":"402","DOI":"10.1016\/S0166-2236(03)00175-9","article-title":"Alcohol and error processing","volume":"26","author":"Holroyd","year":"2003","journal-title":"Trends in Neurosciences"},{"key":"2025040708505360500_bib37","doi-asserted-by":"publisher","first-page":"122","DOI":"10.1016\/j.tics.2011.12.008","article-title":"Motivation of extended behaviors by anterior cingulate cortex","volume":"16","author":"Holroyd","year":"2012","journal-title":"Trends in Cognitive Sciences"},{"key":"2025040708505360500_bib38","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1412.6980","article-title":"ADAM: A method for stochastic optimization","author":"Kingma","year":"2014","journal-title":"arXiv"},{"key":"2025040708505360500_bib39","doi-asserted-by":"publisher","first-page":"1181","DOI":"10.1126\/science.1088545","article-title":"The architecture of cognitive control in the human prefrontal cortex","volume":"302","author":"Koechlin","year":"2003","journal-title":"Science"},{"key":"2025040708505360500_bib40","doi-asserted-by":"publisher","first-page":"4","DOI":"10.3389\/neuro.06.004.2008","article-title":"Representational similarity analysis\u2014Connecting the branches of systems neuroscience","volume":"2","author":"Kriegeskorte","year":"2008","journal-title":"Frontiers in Systems Neuroscience"},{"key":"2025040708505360500_bib41","doi-asserted-by":"publisher","first-page":"e253","DOI":"10.1017\/S0140525X16001837","article-title":"Building machines that learn and think like people","volume":"40","author":"Lake","year":"2017","journal-title":"Behavioral and Brain Sciences"},{"key":"2025040708505360500_bib42","first-page":"112","article-title":"The problem of serial order in behavior","volume-title":"Cerebral mechanisms in behavior","author":"Lashley","year":"1951"},{"key":"2025040708505360500_bib43","doi-asserted-by":"publisher","first-page":"335","DOI":"10.1038\/s41583-020-0277-3","article-title":"Backpropagation and the brain","volume":"21","author":"Lillicrap","year":"2020","journal-title":"Nature Reviews Neuroscience"},{"key":"2025040708505360500_bib44","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1811.11684","article-title":"Shared representational geometry across neural networks","author":"Lu","year":"2018","journal-title":"Integration of deep learning theories workshop, NeurIPS 2018"},{"key":"2025040708505360500_bib205","doi-asserted-by":"publisher","DOI":"10.48550\/arXiv.1907.08549","article-title":"Universality and individuality in neural dynamics across large populations of recurrent networks","volume-title":"Advances in Neural Information Processing Systems","author":"Maheswaranathan","year":"2019"},{"key":"2025040708505360500_bib45","doi-asserted-by":"publisher","first-page":"94","DOI":"10.3389\/fncom.2016.00094","article-title":"Toward an integration of deep learning and neuroscience","volume":"10","author":"Marblestone","year":"2016","journal-title":"Frontiers in Computational Neuroscience"},{"key":"2025040708505360500_bib46","volume-title":"Macro-actions in reinforcement learning: An empirical analysis","author":"McGovern","year":"1998"},{"key":"2025040708505360500_bib47","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1146\/annurev.neuro.24.1.167","article-title":"An integrative theory of prefrontal cortex function","volume":"24","author":"Miller","year":"2001","journal-title":"Annual Review of Neuroscience"},{"key":"2025040708505360500_bib48","doi-asserted-by":"publisher","DOI":"10.1037\/10039-000","volume-title":"Plans and the structure of behavior","author":"Miller","year":"1960"},{"key":"2025040708505360500_bib49","doi-asserted-by":"publisher","first-page":"757","DOI":"10.1016\/j.tics.2021.06.001","article-title":"Rationalizing constraints on the capacity for cognitive control","volume":"25","author":"Musslick","year":"2021","journal-title":"Trends in Cognitive Sciences"},{"key":"2025040708505360500_bib50","doi-asserted-by":"publisher","first-page":"1761","DOI":"10.1038\/s41593-019-0520-2","article-title":"A deep learning framework for neuroscience","volume":"22","author":"Richards","year":"2019","journal-title":"Nature Neuroscience"},{"key":"2025040708505360500_bib51","doi-asserted-by":"publisher","first-page":"988","DOI":"10.1038\/s41593-024-01607-5","article-title":"Natural language instructions induce compositional generalization in networks of neurons","volume":"27","author":"Riveland","year":"2024","journal-title":"Nature Neuroscience"},{"key":"2025040708505360500_bib52","doi-asserted-by":"publisher","first-page":"1024","DOI":"10.1111\/cogs.12148","article-title":"Parallel distributed processing at 25: Further explorations in the microstructure of cognition","volume":"38","author":"Rogers","year":"2014","journal-title":"Cognitive Science"},{"key":"2025040708505360500_bib53","volume-title":"Scripts, plans, goals and understanding: An inquiry into human knowledge structures","author":"Schank","year":"1977"},{"key":"2025040708505360500_bib54","doi-asserted-by":"publisher","first-page":"85","DOI":"10.1016\/j.neunet.2014.09.003","article-title":"Deep learning in neural networks: An overview","volume":"61","author":"Schmidhuber","year":"2015","journal-title":"Neural Networks"},{"key":"2025040708505360500_bib200","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/11474.001.0001","volume-title":"The deep learning revolution","author":"Sejnowski","year":"2018"},{"key":"2025040708505360500_bib55","doi-asserted-by":"publisher","first-page":"302","DOI":"10.3758\/s13423-017-1280-1","article-title":"Distributed representations of action sequences in anterior cingulate cortex: A recurrent neural network approach","volume":"25","author":"Shahnazian","year":"2018","journal-title":"Psychonomic Bulletin & Review"},{"key":"2025040708505360500_bib56","doi-asserted-by":"publisher","first-page":"223","DOI":"10.1111\/tops.12533","article-title":"Neural representations of task context and temporal order during action sequence execution","volume":"14","author":"Shahnazian","year":"2022","journal-title":"Topics in Cognitive Science"},{"key":"2025040708505360500_bib57","doi-asserted-by":"publisher","first-page":"967","DOI":"10.1016\/j.neuron.2019.08.034","article-title":"Engineering a less artificial intelligence","volume":"103","author":"Sinz","year":"2019","journal-title":"Neuron"},{"key":"2025040708505360500_bib58","doi-asserted-by":"publisher","first-page":"2044","DOI":"10.1162\/jocn_a_01755","article-title":"Diverse deep neural networks all predict human inferior temporal cortex well, after training and fitting","volume":"33","author":"Storrs","year":"2021","journal-title":"Journal of Cognitive Neuroscience"},{"key":"2025040708505360500_bib59","article-title":"The bitter lesson","author":"Sutton","year":"2019"},{"key":"2025040708505360500_bib60","doi-asserted-by":"publisher","first-page":"181","DOI":"10.1016\/s0004-3702(99)00052-1","article-title":"Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning","volume":"112","author":"Sutton","year":"1999","journal-title":"Artificial Intelligence"},{"key":"2025040708505360500_bib61","first-page":"2579","article-title":"Visualizing data using t-SNE","volume":"9","author":"Van der Maaten","year":"2008","journal-title":"Journal of Machine Learning Research"},{"key":"2025040708505360500_bib62","doi-asserted-by":"publisher","first-page":"316","DOI":"10.3389\/fnins.2017.00316","article-title":"Computational models of anterior cingulate cortex: At the crossroads between prediction and effort","volume":"11","author":"Vassena","year":"2017","journal-title":"Frontiers in Neuroscience"},{"key":"2025040708505360500_bib201","doi-asserted-by":"publisher","first-page":"3540","DOI":"10.48550\/arXiv.1703.01161","article-title":"FeUdal networks for hierarchical reinforcement learning","volume":"70","author":"Vezhnevets","year":"2017","journal-title":"Proceedings of the 34th International Conference on Machine Learning"},{"key":"2025040708505360500_bib63","article-title":"Towards understanding learning representations: To what extent do different neural networks learn the same representation","volume":"31","author":"Wang","year":"2018","journal-title":"Advances in Neural Information Processing Systems"},{"key":"2025040708505360500_bib64","article-title":"How neural networks extrapolate: From feedforward to graph neural networks","volume-title":"Ninth International conference on Learning Representations","author":"Xu","year":"2021"}],"container-title":["Journal of Cognitive Neuroscience"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/direct.mit.edu\/jocn\/article-pdf\/37\/5\/941\/2486689\/jocn_a_02285.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/direct.mit.edu\/jocn\/article-pdf\/37\/5\/941\/2486689\/jocn_a_02285.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,4,7]],"date-time":"2025-04-07T09:18:33Z","timestamp":1744017513000},"score":1,"resource":{"primary":{"URL":"https:\/\/direct.mit.edu\/jocn\/article\/37\/5\/941\/125886\/Distributed-Representations-for-Cognitive-Control"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025]]},"references-count":69,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2025,5,1]]},"published-print":{"date-parts":[[2025,5,1]]}},"URL":"https:\/\/doi.org\/10.1162\/jocn_a_02285","relation":{"has-preprint":[{"id-type":"doi","id":"10.1101\/2023.12.12.571242","asserted-by":"object"}]},"ISSN":["0898-929X","1530-8898"],"issn-type":[{"value":"0898-929X","type":"print"},{"value":"1530-8898","type":"electronic"}],"subject":[],"published-other":{"date-parts":[[2025]]},"published":{"date-parts":[[2025]]}}}