{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,9]],"date-time":"2026-05-09T08:05:18Z","timestamp":1778313918507,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":17,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,4,20]],"date-time":"2020-04-20T00:00:00Z","timestamp":1587340800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,4,20]]},"DOI":"10.1145\/3366424.3383542","type":"proceedings-article","created":{"date-parts":[[2020,5,4]],"date-time":"2020-05-04T08:10:56Z","timestamp":1588579856000},"page":"207-211","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":23,"title":["VisBERT: Hidden-State Visualizations for Transformers"],"prefix":"10.1145","author":[{"given":"Betty van","family":"Aken","sequence":"first","affiliation":[{"name":"Beuth University of Applied Sciences Berlin"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Benjamin","family":"Winter","sequence":"additional","affiliation":[{"name":"Beuth University of Applied Sciences Berlin"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Alexander","family":"L\u00f6ser","sequence":"additional","affiliation":[{"name":"Beuth University of Applied Sciences Berlin"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Felix A.","family":"Gers","sequence":"additional","affiliation":[{"name":"Beuth University of Applied Sciences Berlin"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,4,20]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"crossref","unstructured":"Pierre Comon. 1994. Independent component analysis A new concept?Signal Processing 36(1994).  Pierre Comon. 1994. Independent component analysis A new concept?Signal Processing 36(1994).","DOI":"10.1016\/0165-1684(94)90029-9"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"crossref","unstructured":"Karl\u00a0Pearson F.R.S.1901. LIII. On lines and planes of closest fit to systems of points in space. The London Edinburgh and Dublin Philosophical Magazine and Journal of Science 2(1901).  Karl\u00a0Pearson F.R.S.1901. LIII. On lines and planes of closest fit to systems of points in space. The London Edinburgh and Dublin Philosophical Magazine and Journal of Science 2(1901).","DOI":"10.1080\/14786440109462720"},{"key":"e_1_3_2_1_3_1","unstructured":"Sarthak Jain and Byron\u00a0C. Wallace. 2019. Attention is not Explanation. In NAACL \u201919.  Sarthak Jain and Byron\u00a0C. Wallace. 2019. Attention is not Explanation. In NAACL \u201919."},{"key":"e_1_3_2_1_4_1","volume-title":"Adversarial Examples for Evaluating Reading Comprehension Systems. EMNLP \u201917","author":"Jia Robin","year":"2017"},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TIT.1982.1056489"},{"key":"e_1_3_2_1_6_1","volume":"201","author":"McInnes L.","journal-title":"J. Melville."},{"key":"e_1_3_2_1_7_1","volume-title":"Workshop Track.","author":"Mikolov Tomas","year":"2013"},{"key":"e_1_3_2_1_8_1","unstructured":"Alec Radford Jeffrey Wu Rewon Child David Luan Dario Amodei and Ilya Sutskever. 2019. Language Models are Unsupervised Multitask Learners.  Alec Radford Jeffrey Wu Rewon Child David Luan Dario Amodei and Ilya Sutskever. 2019. Language Models are Unsupervised Multitask Learners."},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Pranav Rajpurkar Jian Zhang Konstantin Lopyrev and Percy Liang. 2016. SQuAD: 100 000+ Questions for Machine Comprehension of Text. In EMNLP \u201916.  Pranav Rajpurkar Jian Zhang Konstantin Lopyrev and Percy Liang. 2016. SQuAD: 100 000+ Questions for Machine Comprehension of Text. In EMNLP \u201916.","DOI":"10.18653\/v1\/D16-1264"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Ian Tenney Dipanjan Das and Ellie Pavlick. 2019. BERT Rediscovers the Classical NLP Pipeline. In ACL \u201919.  Ian Tenney Dipanjan Das and Ellie Pavlick. 2019. BERT Rediscovers the Classical NLP Pipeline. In ACL \u201919.","DOI":"10.18653\/v1\/P19-1452"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"crossref","unstructured":"Betty van Aken Benjamin Winter Alexander L\u00f6ser and Felix\u00a0A Gers. 2019. How Does BERT Answer Questions?: A Layer-Wise Analysis of Transformer Representations. In CIKM \u201919.  Betty van Aken Benjamin Winter Alexander L\u00f6ser and Felix\u00a0A Gers. 2019. How Does BERT Answer Questions?: A Layer-Wise Analysis of Transformer Representations. In CIKM \u201919.","DOI":"10.1145\/3357384.3358028"},{"key":"e_1_3_2_1_12_1","unstructured":"Laurens van\u00a0der Maaten. 2009. Learning a Parametric Embedding by Preserving Local Structure. In AISTATS \u201909.  Laurens van\u00a0der Maaten. 2009. Learning a Parametric Embedding by Preserving Local Structure. In AISTATS \u201909."},{"key":"e_1_3_2_1_13_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N. Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention Is All You Need. In NIPS \u201917.  Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan\u00a0N. Gomez Lukasz Kaiser and Illia Polosukhin. 2017. Attention Is All You Need. In NIPS \u201917."},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"crossref","unstructured":"Jesse Vig. 2019. A Multiscale Visualization of Attention in the Transformer Model. ACL \u201919 System Demonstrations(2019).  Jesse Vig. 2019. A Multiscale Visualization of Attention in the Transformer Model. ACL \u201919 System Demonstrations(2019).","DOI":"10.18653\/v1\/P19-3007"},{"key":"e_1_3_2_1_15_1","unstructured":"Jason Weston Antoine Bordes Sumit Chopra and Tomas Mikolov. 2016. Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks. In ICLR \u201916.  Jason Weston Antoine Bordes Sumit Chopra and Tomas Mikolov. 2016. Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks. In ICLR \u201916."},{"key":"e_1_3_2_1_16_1","volume-title":"Adversarial Examples: Attacks and Defenses for Deep Learning. arXiv preprint arXiv:1712.07107(2017).","author":"Xiaolin\u00a0Li Xiaoyong\u00a0Yuan Qile Zhu","year":"2017"},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"crossref","unstructured":"Zhilin Yang Peng Qi Saizheng Zhang Yoshua Bengio William\u00a0W. Cohen Ruslan Salakhutdinov and Christopher\u00a0D. Manning. 2018. HotpotQA: A Dataset for Diverse Explainable Multi-hop Question Answering. In EMNLP \u201918.  Zhilin Yang Peng Qi Saizheng Zhang Yoshua Bengio William\u00a0W. Cohen Ruslan Salakhutdinov and Christopher\u00a0D. Manning. 2018. HotpotQA: A Dataset for Diverse Explainable Multi-hop Question Answering. In EMNLP \u201918.","DOI":"10.18653\/v1\/D18-1259"}],"event":{"name":"WWW '20: The Web Conference 2020","location":"Taipei Taiwan","acronym":"WWW '20","sponsor":["SIGWEB ACM Special Interest Group on Hypertext, Hypermedia, and Web"]},"container-title":["Companion Proceedings of the Web Conference 2020"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366424.3383542","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3366424.3383542","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:33:06Z","timestamp":1750199586000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366424.3383542"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,20]]},"references-count":17,"alternative-id":["10.1145\/3366424.3383542","10.1145\/3366424"],"URL":"https:\/\/doi.org\/10.1145\/3366424.3383542","relation":{},"subject":[],"published":{"date-parts":[[2020,4,20]]},"assertion":[{"value":"2020-04-20","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}