{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T01:02:07Z","timestamp":1773795727696,"version":"3.50.1"},"update-to":[{"DOI":"10.1371\/journal.pcbi.1012968","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2025,12,16]],"date-time":"2025-12-16T00:00:00Z","timestamp":1765843200000}}],"reference-count":32,"publisher":"Public Library of Science (PLoS)","issue":"12","license":[{"start":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T00:00:00Z","timestamp":1765238400000},"content-version":"vor","delay-in-days":0,"URL":"http:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100002301","name":"Estonian Research Council","doi-asserted-by":"crossref","award":["PSG728"],"award-info":[{"award-number":["PSG728"]}],"id":[{"id":"10.13039\/501100002301","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100002301","name":"Estonian Research Council","doi-asserted-by":"crossref","award":["PSG728"],"award-info":[{"award-number":["PSG728"]}],"id":[{"id":"10.13039\/501100002301","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100002301","name":"Estonian Research Council","doi-asserted-by":"crossref","award":["PSG728"],"award-info":[{"award-number":["PSG728"]}],"id":[{"id":"10.13039\/501100002301","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100002301","name":"Eesti Teadusagentuur","doi-asserted-by":"publisher","award":["Tem-TA 120"],"award-info":[{"award-number":["Tem-TA 120"]}],"id":[{"id":"10.13039\/501100002301","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100003510","name":"Haridus- ja Teadusministeerium","doi-asserted-by":"publisher","award":["Estonian Center of Excellence in AI"],"award-info":[{"award-number":["Estonian Center of Excellence in AI"]}],"id":[{"id":"10.13039\/501100003510","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100002301","name":"Eesti Teadusagentuur","doi-asserted-by":"publisher","award":["PUTJD1252"],"award-info":[{"award-number":["PUTJD1252"]}],"id":[{"id":"10.13039\/501100002301","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["www.ploscompbiol.org"],"crossmark-restriction":false},"short-container-title":["PLoS Comput Biol"],"abstract":"<jats:p>Human vision is not merely a passive process of interpreting sensory input but can also function as a problem-solving process incorporating generative mechanisms to interpret ambiguous or noisy data. This synergy between the generative and discriminative components, often described as analysis-by-synthesis, enables robust perception and rapid adaptation to out-of-distribution inputs. In this work, we investigate a computational implementation of the analysis-by-synthesis paradigm using genetic search in a generative model, applied to a visual problem-solving task inspired by star constellations. The search is guided by low-level cues based on the structural fitness of candidate solutions compared to the test images. This dataset serves as a testbed for exploring how inferred signals can guide the synthesis of suitable solutions in ambiguous conditions, framing visual inference as an instance of complex problem solving. Drawing on insights from human experiments, we develop a generative search algorithm and compare its performance to humans, examining factors such as accuracy, reaction time, and overlap in drawings. Our results shed light on possible mechanisms of human visual problem solving and highlight the potential of generative search models to emulate aspects of this process.<\/jats:p>","DOI":"10.1371\/journal.pcbi.1012968","type":"journal-article","created":{"date-parts":[[2025,12,9]],"date-time":"2025-12-09T18:37:23Z","timestamp":1765305443000},"page":"e1012968","update-policy":"https:\/\/doi.org\/10.1371\/journal.pcbi.corrections_policy","source":"Crossref","is-referenced-by-count":1,"title":["Comparing a computational model of visual problem solving with human vision on a difficult vision task"],"prefix":"10.1371","volume":"21","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7089-659X","authenticated-orcid":true,"given":"Tarun","family":"Khajuria","sequence":"first","affiliation":[]},{"given":"Kadi","family":"Tulver","sequence":"additional","affiliation":[]},{"given":"Jaan","family":"Aru","sequence":"additional","affiliation":[]}],"member":"340","published-online":{"date-parts":[[2025,12,9]]},"reference":[{"issue":"8","key":"pcbi.1012968.ref001","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1038\/nrn1476","article-title":"Visual objects in context","volume":"5","author":"M Bar","year":"2004","journal-title":"Nat Rev Neurosci."},{"issue":"12","key":"pcbi.1012968.ref002","doi-asserted-by":"crossref","first-page":"520","DOI":"10.1016\/j.tics.2007.09.009","article-title":"The role of context in object recognition","volume":"11","author":"A Oliva","year":"2007","journal-title":"Trends Cogn Sci."},{"issue":"7","key":"pcbi.1012968.ref003","doi-asserted-by":"crossref","first-page":"301","DOI":"10.1016\/j.tics.2006.05.002","article-title":"Vision as Bayesian inference: analysis by synthesis?","volume":"10","author":"A Yuille","year":"2006","journal-title":"Trends Cogn Sci."},{"key":"pcbi.1012968.ref004","unstructured":"Yilmaz H, Singh G, Egger B, Tenenbaum J, Yildirim I. Seeing in the dark: Testing deep neural network and analysis-by-synthesis accounts of 3D shape perception with highly degraded images. In: Proceedings of the Annual Meeting of the Cognitive Science Society. 2021."},{"issue":"10","key":"pcbi.1012968.ref005","doi-asserted-by":"crossref","DOI":"10.1126\/sciadv.aax5979","article-title":"Efficient inverse graphics in biological face processing","volume":"6","author":"I Yildirim","year":"2020","journal-title":"Sci Adv."},{"key":"pcbi.1012968.ref006","doi-asserted-by":"crossref","unstructured":"Xu D, Zhu Y, Choy CB, Fei-Fei L. Scene graph generation by iterative message passing. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017. p. 5410\u20139.","DOI":"10.1109\/CVPR.2017.330"},{"issue":"40","key":"pcbi.1012968.ref007","doi-asserted-by":"crossref","DOI":"10.1073\/pnas.2211179120","article-title":"Human-like scene interpretation by a guided counterstream processing","volume":"120","author":"S Ullman","year":"2023","journal-title":"Proc Natl Acad Sci U S A."},{"key":"pcbi.1012968.ref008","doi-asserted-by":"crossref","unstructured":"Shi B, Darrell T, Wang X. Top-down visual attention from analysis by synthesis. In: 2023 IEEE\/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2023. p. 2102\u201312. https:\/\/doi.org\/10.1109\/cvpr52729.2023.00209","DOI":"10.1109\/CVPR52729.2023.00209"},{"key":"pcbi.1012968.ref009","doi-asserted-by":"crossref","unstructured":"Khajuria T, Tulver K, Luik T, Aru J. Constellations: a novel dataset for studying iterative inference in humans and AI. In: Proceedings of the IEEE\/CVF Conference on Computer Vision and Pattern Recognition. 2022. p. 5142\u201352.","DOI":"10.1109\/CVPRW56347.2022.00562"},{"key":"pcbi.1012968.ref010","article-title":"Generative adversarial nets","volume":"27","author":"IJ Goodfellow","year":"2014","journal-title":"Advances in Neural Information Processing Systems."},{"issue":"11","key":"pcbi.1012968.ref011","doi-asserted-by":"crossref","first-page":"2278","DOI":"10.1109\/5.726791","article-title":"Gradient-based learning applied to document recognition","volume":"86","author":"Y Lecun","year":"1998","journal-title":"Proc IEEE."},{"key":"pcbi.1012968.ref012","unstructured":"Xiao H, Rasul K, Vollgraf R. Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. arXiv preprint 2017. https:\/\/arxiv.org\/abs\/1708.07747"},{"issue":"23","key":"pcbi.1012968.ref013","doi-asserted-by":"crossref","first-page":"8619","DOI":"10.1073\/pnas.1403112111","article-title":"Performance-optimized hierarchical models predict neural responses in higher visual cortex","volume":"111","author":"DLK Yamins","year":"2014","journal-title":"Proc Natl Acad Sci U S A."},{"key":"pcbi.1012968.ref014","unstructured":"Radford A, Kim JW, Hallacy C, Ramesh A, Goh G, Agarwal S, et al. Learning transferable visual models from natural language supervision. In: International conference on machine learning. PmLR; 2021. p. 8748\u201363."},{"key":"pcbi.1012968.ref015","doi-asserted-by":"crossref","unstructured":"Isola P, Zhu JY, Zhou T, Efros AA. Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition. 2017. p. 1125\u201334.","DOI":"10.1109\/CVPR.2017.632"},{"key":"pcbi.1012968.ref016","doi-asserted-by":"crossref","first-page":"336","DOI":"10.1007\/s11263-019-01228-7","article-title":"Grad-CAM: visual explanations from deep networks via gradient-based localization","volume":"128","author":"RR Selvaraju","year":"2020","journal-title":"International Journal of Computer Vision."},{"key":"pcbi.1012968.ref017","unstructured":"Ullman S. High-level vision: object recognition and visual cognition. MIT Press; 2000."},{"issue":"5","key":"pcbi.1012968.ref018","doi-asserted-by":"crossref","first-page":"1087","DOI":"10.1177\/0956797614522816","article-title":"Beyond gist: strategic and incremental information accumulation for scene categorization","volume":"25","author":"GL Malcolm","year":"2014","journal-title":"Psychol Sci."},{"key":"pcbi.1012968.ref019","doi-asserted-by":"crossref","first-page":"104","DOI":"10.1016\/j.cognition.2015.08.008","article-title":"The importance of iteration in creative conceptual combination","volume":"145","author":"J Chan","year":"2015","journal-title":"Cognition."},{"issue":"43","key":"pcbi.1012968.ref020","doi-asserted-by":"crossref","first-page":"21854","DOI":"10.1073\/pnas.1905544116","article-title":"Recurrence is required to capture the representational dynamics of the human visual system","volume":"116","author":"TC Kietzmann","year":"2019","journal-title":"Proc Natl Acad Sci U S A."},{"issue":"5","key":"pcbi.1012968.ref021","doi-asserted-by":"crossref","first-page":"748","DOI":"10.1037\/0033-2909.130.5.748","article-title":"Enduring interest in perceptual ambiguity: alternating views of reversible figures","volume":"130","author":"GM Long","year":"2004","journal-title":"Psychol Bull."},{"key":"pcbi.1012968.ref022","doi-asserted-by":"crossref","first-page":"71","DOI":"10.1146\/annurev-psych-010213-115154","article-title":"The cognitive neuroscience of insight","volume":"65","author":"J Kounios","year":"2014","journal-title":"Annu Rev Psychol."},{"key":"pcbi.1012968.ref023","doi-asserted-by":"crossref","first-page":"103494","DOI":"10.1016\/j.concog.2023.103494","article-title":"Restructuring insight: an integrative review of insight in problem-solving, meditation, psychotherapy, delusions and psychedelics","volume":"110","author":"K Tulver","year":"2023","journal-title":"Conscious Cogn."},{"key":"pcbi.1012968.ref024","doi-asserted-by":"crossref","first-page":"106081","DOI":"10.1016\/j.cognition.2025.106081","article-title":"The road to Aha: a recipe for mental breakthroughs","volume":"257","author":"K Tulver","year":"2025","journal-title":"Cognition."},{"key":"pcbi.1012968.ref025","unstructured":"Yildirim I, Kulkarni TD, Freiwald WA, Tenenbaum JB. Efficient and robust analysis-by-synthesis in vision: a computational framework, behavioral tests, and modeling neuronal representations. In: Annual conference of the cognitive science society. 2015."},{"key":"pcbi.1012968.ref026","doi-asserted-by":"crossref","unstructured":"Zhang L, Rao A, Agrawala M. Adding conditional control to text-to-image diffusion models. In: Proceedings of the IEEE\/CVF International Conference on Computer Vision. 2023. p. 3836\u201347.","DOI":"10.1109\/ICCV51070.2023.00355"},{"key":"pcbi.1012968.ref027","doi-asserted-by":"crossref","unstructured":"Linsley D, Eberhardt S, Sharma T, Gupta P, Serre T. What are the visual features underlying human versus machine vision? In: 2017 IEEE International Conference on Computer Vision Workshops (ICCVW). 2017. p. 2706\u201314. https:\/\/doi.org\/10.1109\/iccvw.2017.331","DOI":"10.1109\/ICCVW.2017.331"},{"issue":"10","key":"pcbi.1012968.ref028","doi-asserted-by":"crossref","DOI":"10.1371\/journal.pcbi.1008215","article-title":"Recurrent neural networks can explain flexible trading of speed and accuracy in biological vision","volume":"16","author":"CJ Spoerer","year":"2020","journal-title":"PLoS Comput Biol."},{"key":"pcbi.1012968.ref029","doi-asserted-by":"crossref","first-page":"1551","DOI":"10.3389\/fpsyg.2017.01551","article-title":"Recurrent convolutional neural networks: a better model of biological object recognition","volume":"8","author":"CJ Spoerer","year":"2017","journal-title":"Front Psychol."},{"key":"pcbi.1012968.ref030","unstructured":"Thorat S, Aldegheri G, Kietzmann TC. Category-orthogonal object features guide information processing in recurrent neural networks trained for object categorization. arXiv preprint 2021. https:\/\/arxiv.org\/abs\/2111.07898"},{"key":"pcbi.1012968.ref031","unstructured":"Radford A, Metz L, Chintala S. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint 2015. https:\/\/arxiv.org\/abs\/1511.06434"},{"issue":"20","key":"pcbi.1012968.ref032","doi-asserted-by":"crossref","first-page":"58029","DOI":"10.1007\/s11042-023-17167-y","article-title":"PyGAD: an intuitive genetic algorithm Python library","volume":"83","author":"AF Gad","year":"2023","journal-title":"Multimed Tools Appl."}],"updated-by":[{"DOI":"10.1371\/journal.pcbi.1012968","type":"new_version","label":"New version","source":"publisher","updated":{"date-parts":[[2025,12,16]],"date-time":"2025-12-16T00:00:00Z","timestamp":1765843200000}}],"container-title":["PLOS Computational Biology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1012968","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,12,16]],"date-time":"2025-12-16T18:38:06Z","timestamp":1765910286000},"score":1,"resource":{"primary":{"URL":"https:\/\/dx.plos.org\/10.1371\/journal.pcbi.1012968"}},"subtitle":[],"editor":[{"given":"Tim Christian","family":"Kietzmann","sequence":"first","affiliation":[]}],"short-title":[],"issued":{"date-parts":[[2025,12,9]]},"references-count":32,"journal-issue":{"issue":"12","published-online":{"date-parts":[[2025,12,9]]}},"URL":"https:\/\/doi.org\/10.1371\/journal.pcbi.1012968","relation":{},"ISSN":["1553-7358"],"issn-type":[{"value":"1553-7358","type":"electronic"}],"subject":[],"published":{"date-parts":[[2025,12,9]]}}}