{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T16:02:28Z","timestamp":1776096148266,"version":"3.50.1"},"publisher-location":"New York, NY, USA","reference-count":43,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,10,19]],"date-time":"2017-10-19T00:00:00Z","timestamp":1508371200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,10,19]]},"DOI":"10.1145\/3132525.3132542","type":"proceedings-article","created":{"date-parts":[[2017,10,20]],"date-time":"2017-10-20T13:04:26Z","timestamp":1508504666000},"page":"165-174","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":49,"title":["Evaluating the Usability of Automatically Generated Captions for People who are Deaf or Hard of Hearing"],"prefix":"10.1145","author":[{"given":"Sushant","family":"Kafle","sequence":"first","affiliation":[{"name":"Rochester Institute of Technology, Rochester, NY, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Matt","family":"Huenerfauth","sequence":"additional","affiliation":[{"name":"Rochester Institute of Technology, Rochester, NY, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2017,10,19]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"T. Apone B. Botkin M. Brooks and L. Goldberg. 2011. Caption Accuracy Metrics Project. Research into Automated Error Ranking of Real-time Captions in Live Television News Programs WGBH. Retrieved from http:\/\/ ncam.wgbh.org\/file_download\/136 T. Apone B. Botkin M. Brooks and L. Goldberg. 2011. Caption Accuracy Metrics Project. Research into Automated Error Ranking of Real-time Captions in Live Television News Programs WGBH. Retrieved from http:\/\/ ncam.wgbh.org\/file_download\/136"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/638249.638284"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"crossref","unstructured":"N. N. Belanger and K. Rayner. 2013. Frequency and predictability effects in eye fixations for skilled and less-skilled deaf readers. Visual cognition 21(4):477-497 N. N. Belanger and K. Rayner. 2013. Frequency and predictability effects in eye fixations for skilled and less-skilled deaf readers. Visual cognition 21(4):477-497","DOI":"10.1080\/13506285.2013.804016"},{"key":"e_1_3_2_1_4_1","volume-title":"Summary health statistics for us adults: national health interview survey","author":"Blackwell D. L.","year":"2012","unstructured":"D. L. Blackwell , J. W. Lucas , T. C. Clarke . 2014. Summary health statistics for us adults: national health interview survey , 2012 . Vital and health statistics. Series 10, Data from the National Health Survey , (260):1-161 D. L. Blackwell, J. W. Lucas, T. C. Clarke. 2014. Summary health statistics for us adults: national health interview survey, 2012. Vital and health statistics. Series 10, Data from the National Health Survey, (260):1-161"},{"key":"e_1_3_2_1_5_1","volume-title":"Proc. EMNLP-CoNLL'07","author":"Brants T.","year":"2007","unstructured":"T. Brants , A. C. Popat , P. Xu , F. J. Och , J. Dean . 2007 . Large language models in machine translation . In Proc. EMNLP-CoNLL'07 , Prague, Czech Republic, 858-867, Association for Computational Linguistics. T. Brants, A. C. Popat, P. Xu, F. J. Och, J. Dean. 2007. Large language models in machine translation. In Proc. EMNLP-CoNLL'07, Prague, Czech Republic, 858-867, Association for Computational Linguistics."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1093\/deafed\/enp033"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"crossref","unstructured":"A.-B. Dominguez M.-S. Carrillo M. del Mar Perez J. Alegr\u00eda. 2014. Analysis of reading strategies in deaf adults as a function of their language and meta-phonological skills. Research in developmental disabilities 35(7):1439-1456. A.-B. Dominguez M.-S. Carrillo M. del Mar Perez J. Alegr\u00eda. 2014. Analysis of reading strategies in deaf adults as a function of their language and meta-phonological skills. Research in developmental disabilities 35(7):1439-1456.","DOI":"10.1016\/j.ridd.2014.03.039"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"crossref","unstructured":"J. Duffy T. Giolas. 1971. The effect of word predictability on sentence intelligibility. Technical report Submarine Medical Research Laboratory. J. Duffy T. Giolas. 1971. The effect of word predictability on sentence intelligibility. Technical report Submarine Medical Research Laboratory.","DOI":"10.21236\/AD0746118"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/2982142.2982198"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"crossref","first-page":"3463","DOI":"10.21437\/Interspeech.2013-610","volume-title":"Proc. Interspeech'13","author":"Favre B.","year":"2013","unstructured":"B. Favre , K. Cheung , S. Kazemian , A. Lee , Y. Liu , C. Munteanu , A. Nenkova , D. Ochei , G. Penn , S. Tratz , 2013 . Automatic human utility evaluation of ASR systems: does WER really predict performance? In Proc. Interspeech'13 , 3463 - 3467 . B. Favre, K. Cheung, S. Kazemian, A. Lee, Y. Liu, C. Munteanu, A. Nenkova, D. Ochei, G. Penn, S. Tratz, et al. 2013. Automatic human utility evaluation of ASR systems: does WER really predict performance? In Proc. Interspeech'13, 3463-3467."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1145\/2207016.2207053"},{"key":"e_1_3_2_1_12_1","volume-title":"Sclite scoring package version 1.5","author":"Fiscus J.","year":"2017","unstructured":"J. Fiscus . 1998. Sclite scoring package version 1.5 . US National Institute of Standard Technology , Retrieved on May 1, 2017 from http:\/\/www. itl. nist. gov\/iaui\/894.01\/tools. J. Fiscus. 1998. Sclite scoring package version 1.5. US National Institute of Standard Technology, Retrieved on May 1, 2017 from http:\/\/www. itl. nist. gov\/iaui\/894.01\/tools."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/2384916.2384966"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10209-005-0005-9"},{"key":"e_1_3_2_1_15_1","volume-title":"Content-Based Multimedia Information Access -","author":"Garofolo J.S.","unstructured":"J.S. Garofolo , C.G.P. Auzanne , E.M. Voorhees . 2000. The TREC spoken document retrieval track: A success story . In Content-Based Multimedia Information Access - Volume 1 , RIAO ?00, Paris, France , 1-20. J.S. Garofolo, C.G.P. Auzanne, E.M. Voorhees. 2000. The TREC spoken document retrieval track: A success story. In Content-Based Multimedia Information Access - Volume 1, RIAO ?00, Paris, France, 1-20."},{"key":"e_1_3_2_1_16_1","volume-title":"In Proc. W4A'16","author":"Gaur Y.","year":"2016","unstructured":"Y. Gaur , W. S. Lasecki , F. Metze , J. P. Bigham . 2016 . In Proc. W4A'16 . ACM, New York, NY, USA, Article 23, 8 pages. Y. Gaur, W. S. Lasecki, F. Metze, J. P. Bigham. 2016. In Proc. W4A'16. ACM, New York, NY, USA, Article 23, 8 pages."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.5555\/1895550.1895693"},{"key":"e_1_3_2_1_18_1","unstructured":"D. Grangier A. Vinciarelli H. Bourlard. 2003. Information retrieval on noisy text. Technical report IDIAP. D. Grangier A. Vinciarelli H. Bourlard. 2003. Information retrieval on noisy text. Technical report IDIAP."},{"key":"e_1_3_2_1_19_1","volume-title":"Proc. WOCCI, 21-26","author":"Gray S.S.","year":"2014","unstructured":"S.S. Gray , D. Willett , J. Lu , J. Pinto , P. Maergner , N. Bodenstab . 2014 . Child automatic speech recognition for US English: child interaction with living-room-electronic-devices . In Proc. WOCCI, 21-26 . S.S. Gray, D. Willett, J. Lu, J. Pinto, P. Maergner, N. Bodenstab. 2014. Child automatic speech recognition for US English: child interaction with living-room-electronic-devices. In Proc. WOCCI, 21-26."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2513383.2513413"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1093\/oxfordjournals.deafed.a014323"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.21437\/SLPAT.2016-4"},{"key":"e_1_3_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/2982142.2982164"},{"key":"e_1_3_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.1080\/09541440340000213"},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2543578"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/2594459"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"crossref","first-page":"662","DOI":"10.21437\/Interspeech.2013-189","volume-title":"Proc. Interspeech'13","volume":"1","author":"Lei X.","year":"2013","unstructured":"X. Lei , A.W. Senior , A. Gruenstein , J. Sorensen . 2013 . Accurate and compact large vocabulary speech recognition on mobile devices . In Proc. Interspeech'13 , vol. 1 , 662 - 665 . X. Lei, A.W. Senior, A. Gruenstein, J. Sorensen. 2013. Accurate and compact large vocabulary speech recognition on mobile devices. In Proc. Interspeech'13, vol. 1, 662-665."},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2304637"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1353\/aad.0.0006"},{"key":"e_1_3_2_1_30_1","unstructured":"I.A. McCowan D. Moore J. Dines D. Gatica-Perez M. Flynn P. Wellner H. Bourlard. 2004. On the use of information retrieval measures for speech recognition evaluation. Technical report IDIAP. I.A. McCowan D. Moore J. Dines D. Gatica-Perez M. Flynn P. Wellner H. Bourlard. 2004. On the use of information retrieval measures for speech recognition evaluation. Technical report IDIAP."},{"key":"e_1_3_2_1_31_1","volume-title":"Proc. Interspeech'11","author":"Mishra T.","year":"2011","unstructured":"T. Mishra , A. Ljolje , M. Gilbert . 2011 . Predicting human perceived accuracy of ASR systems . In Proc. Interspeech'11 , 1945-1948 T. Mishra, A. Ljolje, M. Gilbert. 2011. Predicting human perceived accuracy of ASR systems. In Proc. Interspeech'11, 1945-1948"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"crossref","unstructured":"A.C. Morris V. Maier P.D. Green. 2004. From WER and RIL to MER and WIL: improved evaluation measures for connected speech recognition. In Interspeech'04 2765-2768. A.C. Morris V. Maier P.D. Green. 2004. From WER and RIL to MER and WIL: improved evaluation measures for connected speech recognition. In Interspeech'04 2765-2768.","DOI":"10.21437\/Interspeech.2004-668"},{"key":"e_1_3_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2005.1415298"},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1037\/0033-2909.124.3.372"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.3758\/BF03206448"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1037\/0882-7974.21.3.448"},{"key":"e_1_3_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1037\/a0020990"},{"key":"e_1_3_2_1_38_1","volume-title":"Proc. of LREC,'12 125-129","author":"Rousseau A.","year":"2012","unstructured":"A. Rousseau , P. Deleglise , Y. Esteve . 2012 . Ted-lium: an automatic speech recognition dedicated corpus . In Proc. of LREC,'12 125-129 . ELRA, Paris, France. A. Rousseau, P. Deleglise, Y. Esteve. 2012. Ted-lium: an automatic speech recognition dedicated corpus. In Proc. of LREC,'12 125-129. ELRA, Paris, France."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/2661334.2661337"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2745555.2746648"},{"key":"e_1_3_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/1969289.1969318"},{"key":"e_1_3_2_1_42_1","volume-title":"Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, 577-582","author":"Wang Y.-Y.","year":"2003","unstructured":"Y.-Y. Wang , A. Acero , C. Chelba . 2003 . Is word error rate a good indicator for spoken language understanding accuracy . In Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, 577-582 . IEEE. Y.-Y. Wang, A. Acero, C. Chelba. 2003. Is word error rate a good indicator for spoken language understanding accuracy. In Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, 577-582. IEEE."},{"key":"e_1_3_2_1_43_1","doi-asserted-by":"crossref","unstructured":"W. Xiong J. Droppo X. Huang F. Seide . Seltzer A. Stolcke D. Yu G. Zweig. 2016. Achieving human parity in conversational speech recognition. Computing Research Repository (CoRR) http:\/\/arxiv.org\/abs\/1610.05256 W. Xiong J. Droppo X. Huang F. Seide . Seltzer A. Stolcke D. Yu G. Zweig. 2016. Achieving human parity in conversational speech recognition. Computing Research Repository (CoRR) http:\/\/arxiv.org\/abs\/1610.05256","DOI":"10.1109\/ICASSP.2017.7953159"}],"event":{"name":"ASSETS '17: The 19th International ACM SIGACCESS Conference on Computers and Accessibility","location":"Baltimore Maryland USA","acronym":"ASSETS '17","sponsor":["SIGACCESS ACM Special Interest Group on Accessible Computing"]},"container-title":["Proceedings of the 19th International ACM SIGACCESS Conference on Computers and Accessibility"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3132525.3132542","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3132525.3132542","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T03:30:20Z","timestamp":1750217420000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3132525.3132542"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,10,19]]},"references-count":43,"alternative-id":["10.1145\/3132525.3132542","10.1145\/3132525"],"URL":"https:\/\/doi.org\/10.1145\/3132525.3132542","relation":{},"subject":[],"published":{"date-parts":[[2017,10,19]]},"assertion":[{"value":"2017-10-19","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}