{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T06:40:13Z","timestamp":1759128013310,"version":"3.44.0"},"reference-count":30,"publisher":"Springer Science and Business Media LLC","issue":"28","license":[{"start":{"date-parts":[[2025,8,5]],"date-time":"2025-08-05T00:00:00Z","timestamp":1754352000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2025,8,5]],"date-time":"2025-08-05T00:00:00Z","timestamp":1754352000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["Neural Comput &amp; Applic"],"published-print":{"date-parts":[[2025,10]]},"abstract":"<jats:title>Abstract<\/jats:title>\n          <jats:p>This study explores the integration of information-rich prior knowledge, specifically human gaze data, to enhance representation learning through contrastive methods. We propose gaze-guided contrastive unsupervised representation learning, a novel framework harnessing human gaze data to guide the selection of positive and negative samples for contrastive learning. By leveraging human gaze information, we capture meaningful patterns in visual task dynamics, enabling the agent to acquire effective strategies from demonstrations and achieve superior performance. Our findings demonstrate significant improvements over baseline algorithms, highlighting the value of gaze-guided representation learning in reducing data requirements and accelerating learning. This approach offers broad applicability to vision-based tasks, emphasizing the critical role of human gaze in improving task efficiency and generalization.<\/jats:p>","DOI":"10.1007\/s00521-025-11526-6","type":"journal-article","created":{"date-parts":[[2025,8,5]],"date-time":"2025-08-05T17:06:08Z","timestamp":1754413568000},"page":"23381-23393","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":0,"title":["Gaze-guided contrastive unsupervised representations learning"],"prefix":"10.1007","volume":"37","author":[{"given":"Joseph P.","family":"Distefano","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hemanth","family":"Manjunatha","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Chaithanya","family":"Thammineni","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kristian","family":"Dalland","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5893-4664","authenticated-orcid":false,"given":"Ehsan T.","family":"Esfahani","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2025,8,5]]},"reference":[{"key":"11526_CR1","unstructured":"Chen T, Kornblith S, Norouzi M, Hinton G (2020) A simple framework for contrastive learning of visual representations, Proceedings of the 37th International Conference on Machine Learning, 1597\u20131607"},{"key":"11526_CR2","doi-asserted-by":"crossref","unstructured":"He K, Fan H, Wu Y, Xie S, Girshick R (2020) Momentum contrast for unsupervised visual representation learning, Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, 9729\u20139738","DOI":"10.1109\/CVPR42600.2020.00975"},{"key":"11526_CR3","first-page":"21271","volume":"33","author":"J-B Grill","year":"2020","unstructured":"Grill J-B et al (2020) Bootstrap your own latent-a new approach to self-supervised learning. Adv Neural Inf Process Syst 33:21271\u201321284","journal-title":"Adv Neural Inf Process Syst"},{"key":"11526_CR4","doi-asserted-by":"crossref","unstructured":"Chen X, He K (2021) Exploring simple siamese representation learning, Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, 15750\u201315758","DOI":"10.1109\/CVPR46437.2021.01549"},{"key":"11526_CR5","doi-asserted-by":"crossref","unstructured":"Caron M, et\u00a0al. (2021) Emerging properties in self-supervised vision transformers, Proceedings of the IEEE\/CVF international conference on computer vision, 9650\u20139660","DOI":"10.1109\/ICCV48922.2021.00951"},{"key":"11526_CR6","doi-asserted-by":"crossref","unstructured":"Zhong Z, Zheng L, Kang G, Li S, Yang Y (2020) Random erasing data augmentation, Vol.\u00a034 of Proceedings of the AAAI conference on artificial intelligence, 13001\u201313008","DOI":"10.1609\/aaai.v34i07.7000"},{"key":"11526_CR7","unstructured":"Chen X, Fan H, Girshick R, He K (2020) Improved baselines with momentum contrastive learning. arXiv preprint arXiv:2003.04297"},{"key":"11526_CR8","doi-asserted-by":"publisher","first-page":"23401","DOI":"10.1007\/s00521-021-06367-y","volume":"35","author":"C Thammineni","year":"2023","unstructured":"Thammineni C, Manjunatha H, Esfahani ET (2023) Selective eye-gaze augmentation to enhance imitation learning in Atari games. Neural Comput Appl 35:23401\u201323410","journal-title":"Neural Comput Appl"},{"key":"11526_CR9","doi-asserted-by":"crossref","unstructured":"Zhang R, et\u00a0al. (2020) Atari-head: Atari human eye-tracking and demonstration dataset, Vol.\u00a034 of Proceedings of the AAAI conference on artificial intelligence, 6811\u20136820","DOI":"10.1609\/aaai.v34i04.6161"},{"key":"11526_CR10","doi-asserted-by":"crossref","unstructured":"Zhang R, et\u00a0al. (2020) Human gaze assisted artificial intelligence: A review, Vol. 2020 of IJCAI: Proceed Conf, 4951","DOI":"10.24963\/ijcai.2020\/689"},{"key":"11526_CR11","doi-asserted-by":"crossref","unstructured":"Chen Y, Liu C, Tai L, Liu M, Shi BE Gaze training by modulated dropout improves imitation learning, 2019 IEEE\/RSJ International Conference on Intelligent Robots and Systems (IROS), 7756\u20137761 (IEEE, 2019)","DOI":"10.1109\/IROS40897.2019.8967843"},{"key":"11526_CR12","doi-asserted-by":"crossref","unstructured":"Zhang R, et\u00a0al. (2018)Agil: Learning attention from human for visuomotor tasks, 663\u2013679","DOI":"10.1007\/978-3-030-01252-6_41"},{"key":"11526_CR13","unstructured":"Saran A, Zhang R, Short ES, Niekum S (2020) Efficiently guiding imitation learning algorithms with human gaze. arXiv preprint arXiv:2002.12500"},{"key":"11526_CR14","doi-asserted-by":"crossref","unstructured":"Yang Z, et\u00a0al. (2020) Predicting goal-directed human attention using inverse reinforcement learning, Proceedings of the IEEE\/CVF conference on computer vision and pattern recognition, 193\u2013202","DOI":"10.1109\/CVPR42600.2020.00027"},{"key":"11526_CR15","unstructured":"Oord AVD, Li Y, Vinyals (2018) O. Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748"},{"key":"11526_CR16","doi-asserted-by":"publisher","first-page":"2","DOI":"10.3390\/technologies9010002","volume":"9","author":"A Jaiswal","year":"2020","unstructured":"Jaiswal A, Babu AR, Zadeh MZ, Banerjee D, Makedon F (2020) A survey on contrastive self-supervised learning. Technologies 9:2","journal-title":"Technologies"},{"key":"11526_CR17","first-page":"18661","volume":"33","author":"P Khosla","year":"2020","unstructured":"Khosla P et al (2020) Supervised contrastive learning. Adv Neural Inf Process Syst 33:18661\u201318673","journal-title":"Adv Neural Inf Process Syst"},{"key":"11526_CR18","first-page":"857","volume":"35","author":"X Liu","year":"2021","unstructured":"Liu X et al (2021) Self-supervised learning: generative or contrastive. IEEE Trans Knowl Data Eng 35:857\u2013876","journal-title":"IEEE Trans Knowl Data Eng"},{"key":"11526_CR19","unstructured":"Jaderberg M, et\u00a0al. (2016) Reinforcement learning with unsupervised auxiliary tasks. arXiv preprint arXiv:1611.05397"},{"key":"11526_CR20","unstructured":"Hafner D, et\u00a0al. (2019) Learning latent dynamics for planning from pixels, Int Conf Mach Learn, 2555\u20132565"},{"key":"11526_CR21","unstructured":"Kaiser \u0141, et\u00a0al. (2019)Model based reinforcement learning for atari, International Conference on Learning Representations"},{"key":"11526_CR22","unstructured":"Higgins I, et\u00a0al. (2017) Darla: Improving zero-shot transfer in reinforcement learning, 1480\u20131490"},{"key":"11526_CR23","first-page":"253","volume":"47","author":"MG Bellemare","year":"2013","unstructured":"Bellemare MG, Naddaf Y, Veness J, Bowling M (2013) The arcade learning environment:an evaluation platform for general agents. J Artif Int Res 47:253\u2013279","journal-title":"J Artif Int Res"},{"key":"11526_CR24","doi-asserted-by":"crossref","unstructured":"Hessel M, et\u00a0al. (2018) Rainbow: Combining improvements in deep reinforcement learning, Vol.\u00a032 of Proceedings of the AAAI conference on artificial intelligence","DOI":"10.1609\/aaai.v32i1.11796"},{"key":"11526_CR25","unstructured":"Henaff O, (2020) Data-efficient image recognition with contrastive predictive coding, Int Conf Mach Learn, 4182\u20134192"},{"key":"11526_CR26","doi-asserted-by":"crossref","unstructured":"Wu Z, Xiong Y, Yu SX, Lin D (2018) Unsupervised feature learning via non-parametric instance discrimination, Proceed IEEE Conf. Comput. Vision Pattern Recognition, 3733\u20133742 (2018)","DOI":"10.1109\/CVPR.2018.00393"},{"key":"11526_CR27","unstructured":"Laskin M, Srinivas A, Abbeel P (2020) Curl: Contrastive unsupervised representations for reinforcement learning, Int Conf Mach Learn, 5639\u20135650"},{"key":"11526_CR28","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3656376","volume":"13","author":"A Belardinelli","year":"2024","unstructured":"Belardinelli A (2024) Gaze-based intention estimation: principles, methodologies, and applications in HRI. ACM Trans Human-Robot Int 13:1\u201330","journal-title":"ACM Trans Human-Robot Int"},{"key":"11526_CR29","doi-asserted-by":"publisher","first-page":"5540","DOI":"10.3390\/s19245540","volume":"19","author":"MQ Khan","year":"2019","unstructured":"Khan MQ, Lee S (2019) Gaze and eye tracking: Techniques and applications in ADAS. Sensors 19:5540","journal-title":"Sensors"},{"key":"11526_CR30","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1186\/s41235-018-0149-9","volume":"4","author":"TT Bruny\u00e9","year":"2019","unstructured":"Bruny\u00e9 TT, Drew T, Weaver DL, Elmore JG (2019) A review of eye tracking for understanding and improving diagnostic interpretation. Cognitive Res Principles implicat 4:1\u201316","journal-title":"Cognitive Res Principles implicat"}],"container-title":["Neural Computing and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-025-11526-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s00521-025-11526-6\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s00521-025-11526-6.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,29]],"date-time":"2025-09-29T06:04:23Z","timestamp":1759125863000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s00521-025-11526-6"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,8,5]]},"references-count":30,"journal-issue":{"issue":"28","published-print":{"date-parts":[[2025,10]]}},"alternative-id":["11526"],"URL":"https:\/\/doi.org\/10.1007\/s00521-025-11526-6","relation":{},"ISSN":["0941-0643","1433-3058"],"issn-type":[{"type":"print","value":"0941-0643"},{"type":"electronic","value":"1433-3058"}],"subject":[],"published":{"date-parts":[[2025,8,5]]},"assertion":[{"value":"11 May 2024","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"10 June 2025","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 August 2025","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors have no relevant financial or non-financial interests to disclose.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}]}}