{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T17:41:20Z","timestamp":1776102080034,"version":"3.50.1"},"reference-count":50,"publisher":"MDPI AG","issue":"1","license":[{"start":{"date-parts":[[2021,1,11]],"date-time":"2021-01-11T00:00:00Z","timestamp":1610323200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["MTI"],"abstract":"<jats:p>Speech technology has matured so that voice-based reporting utilizing speech-to-text can be applied in various domains. Speech has two major benefits: it enables efficient reporting and speech input improves the quality of the reports since reporting can be done as a part of the workflow without delays between work and reporting. However, designing reporting voice user interfaces (VUIs) for professional use is challenging, as there are numerous aspects from technology to organization and language that need to be considered. Based on our experience in developing professional reporting VUIs with different stakeholders representing both commercial and public sector, we define a design space for voice-based reporting systems. The design space consists of 28 dimensions grouped into five categories: Language Processing, Structure of Reporting, Technical Limitations in the Work Domain, Interaction Related Aspects in the Work Domain, and Organization. We illustrate the design space by discussing four voice-based reporting systems, designed and implemented by us, and describing a design process that utilizes it. The design space enables designers to identify critical aspects of professional reporting VUIs and optimize those for their target domain. The design space can be used as a practical tool especially by designers with limited experience on speech technologies.<\/jats:p>","DOI":"10.3390\/mti5010003","type":"journal-article","created":{"date-parts":[[2021,1,11]],"date-time":"2021-01-11T20:32:47Z","timestamp":1610397167000},"page":"3","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Design Space for Voice-Based Professional Reporting"],"prefix":"10.3390","volume":"5","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8054-7265","authenticated-orcid":false,"given":"Jaakko","family":"Hakulinen","sequence":"first","affiliation":[{"name":"Faculty of Information Technology and Communication Sciences, Tampere University, P.O. Box 1001, 33014 Tampere, Finland"}]},{"given":"Tuuli","family":"Keskinen","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology and Communication Sciences, Tampere University, P.O. Box 1001, 33014 Tampere, Finland"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7395-0769","authenticated-orcid":false,"given":"Markku","family":"Turunen","sequence":"additional","affiliation":[{"name":"Faculty of Information Technology and Communication Sciences, Tampere University, P.O. Box 1001, 33014 Tampere, Finland"}]},{"given":"Sanni","family":"Siltanen","sequence":"additional","affiliation":[{"name":"KONE Corporation, KONE Technology and Innovation, Myllykatu 3, 05800 Hyvink\u00e4\u00e4, Finland"}]}],"member":"1968","published-online":{"date-parts":[[2021,1,11]]},"reference":[{"key":"ref_1","unstructured":"Benyon, D. (2019). Designing User Experience: A Guide to HCI, UX and Interaction Design, Pearson. [4th ed.]."},{"key":"ref_2","doi-asserted-by":"crossref","unstructured":"Munteanu, C., and Penn, G. (2017, January 6\u201311). Speech-based Interaction: Myths, Challenges, and Opportunities. Proceedings of the 2017 CHI Conference Extended Abstracts on Human Factors in Computing Systems (CHI EA \u201817), Denver, CO, USA.","DOI":"10.1145\/3027063.3027117"},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Bijani, C., White, B.K., and Vilrokx, M. (2013, January 27\u201330). Giving voice to enterprise mobile applications. Proceedings of the 15th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI \u201813), Munich Germany.","DOI":"10.1145\/2493190.2494086"},{"key":"ref_4","doi-asserted-by":"crossref","first-page":"585","DOI":"10.1148\/radiology.138.3.7465833","article-title":"Computerized radiologic reporting with voice data-entry","volume":"138","author":"Leeming","year":"1981","journal-title":"Radiology"},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"541","DOI":"10.1055\/s-0038-1666844","article-title":"Electronic Health Record Interactions through Voice: A Review","volume":"9","author":"Pirtle","year":"2018","journal-title":"Appl. Clin. Inform."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"462","DOI":"10.1136\/jamia.2000.0070462","article-title":"Comparative Evaluation of Three Continuous Speech Recognition Software Packages in the Generation of Medical Reports","volume":"7","author":"Devine","year":"2000","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Lai, J., and Vergo, J. (1997, January 5\u201310). MedSpeak: Report creation with continuous speech recognition. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI \u201897), Ft. Lauderdale, FL, USA.","DOI":"10.1145\/258549.258829"},{"key":"ref_8","doi-asserted-by":"crossref","first-page":"156","DOI":"10.1309\/AJCPOI5F1LPSLZKP","article-title":"Experience with Voice Recognition in Surgical Pathology at a Large Academic Multi-Institutional Center","volume":"133","author":"Kang","year":"2010","journal-title":"Am. J. Clin. Pathol."},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"79","DOI":"10.1097\/NNA.0000000000000030","article-title":"Enhancing Nursing Practice by Utilizing Voice Recognition for Direct Documentation","volume":"44","author":"Fratzke","year":"2014","journal-title":"J. Nurs. Adm."},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Awan, S.K., Dunoyer, E.J., Genuario, K.E., Levy, A.C., and O\u2019Connor, K.P. (2018, January 27). Using voice recognition enabled smartwatches to improve nurse documentation. Proceedings of the 2018 Systems and Information Engineering Design Symposium, Charlottesville, VA, USA.","DOI":"10.1109\/SIEDS.2018.8374728"},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"157","DOI":"10.14219\/jada.archive.2005.0135","article-title":"The role of voice-activated technology in today\u2019s dental practice","volume":"136","author":"Drevenstedt","year":"2005","journal-title":"J. Am. Dent. Assoc."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"85","DOI":"10.1197\/jamia.M1130","article-title":"Speech Recognition as a Transcription Aid: A Randomized Comparison with Standard Transcription","volume":"10","author":"Mohr","year":"2003","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"L\u00f6\u00f6f, J., Falavigna, D., Schl\u00fcter, R., Giuliani, D., Gretter, R., and Ney, H. (2010, January 12\u201315). Evaluation of automatic transcription systems for the judicial domain. Proceedings of the 2010 IEEE Spoken Language Technology Workshop, Berkeley, CA, USA.","DOI":"10.1109\/SLT.2010.5700852"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"356","DOI":"10.1016\/0360-8352(90)90138-C","article-title":"Voice Data Entry (VDE) training and Implementation Strategies","volume":"19","author":"Hosni","year":"1990","journal-title":"Comput. Ind. Eng."},{"key":"ref_15","first-page":"5","article-title":"An analysis of the implementation and impact of speech-recognition technology in the healthcare sector","volume":"1","author":"Parente","year":"2004","journal-title":"Perspect. Health Inf. Manag."},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"101","DOI":"10.1136\/jamia.2001.0080101","article-title":"Computer-based Speech Recognition as an Alternative to Medical Transcription","volume":"8","author":"Borowitz","year":"2001","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"24","DOI":"10.4018\/joeuc.2007010102","article-title":"User Acceptance of Voice Recognition Technology: An Empirical Extension of the Technology Acceptance Model","volume":"19","author":"Simon","year":"2007","journal-title":"J. Organ. End User Comput."},{"key":"ref_18","unstructured":"Grasso, M.A. (2003, January 26\u201327). The long-term adoption of speech recognition in medical applications. Proceedings of the IEEE Symposium Computer-Based Medical Systems, New York, NY, USA."},{"key":"ref_19","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1016\/j.ijhcs.2008.08.004","article-title":"Acceptance of speech recognition by physicians: A survey of expectations, experiences, and social influence","volume":"67","author":"Alapetite","year":"2009","journal-title":"Int. J. Hum. Comput. Stud."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Abd Ghani, M.K., and Dewi, I.N. (2012, January 17\u201319). Comparing speech recognition and text writing in recording patient health records. Proceedings of the IEEE-EMBS Conference on Biomedical Engineering and Sciences, Langkawi, Malaysia.","DOI":"10.1109\/IECBES.2012.6498100"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Teel, M.M., Sokolowski, R., Rosenthal, D., and Belge, M. (1998, January 18\u201323). Voice-enabled structured medical reporting. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems (CHI \u201898), Los Angeles, CA, USA.","DOI":"10.1145\/274644.274724"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"103938","DOI":"10.1016\/j.ijmedinf.2019.07.017","article-title":"A clinician survey of using speech recognition for clinical documentation in the electronic health record","volume":"130","author":"Goss","year":"2019","journal-title":"Int. J. Med. Inform."},{"key":"ref_23","first-page":"1e","article-title":"Lessons Learned from Implementation of Voice Recognition for Documentation in the Military Electronic Health Record System","volume":"7","author":"Hoyt","year":"2010","journal-title":"Perspect. Health Inf. Manag."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"445","DOI":"10.14423\/SMJ.0000000000000302","article-title":"Toward Successful Implementation of Speech Recognition Technology: A Survey of SRT Utilization Issues in Healthcare Settings","volume":"108","author":"Clarke","year":"2015","journal-title":"South. Med. J."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"282","DOI":"10.1080\/20009666.2017.1379852","article-title":"Use of dictation as a tool to decrease documentation errors in electronic health records","volume":"7","author":"Hadidi","year":"2017","journal-title":"J. Community Hosp. Intern. Med. Perspect."},{"key":"ref_26","first-page":"2","article-title":"Documentation quality and time costs: A randomized controlled trial of structured entry versus dictation","volume":"3","author":"Brown","year":"2012","journal-title":"J. Data Inf. Qual."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"e169","DOI":"10.1093\/jamia\/ocv152","article-title":"Risks and benefits of speech recognition for clinical documentation: A systematic review","volume":"23","author":"Hodgson","year":"2016","journal-title":"J. Am. Med. Inform. Assoc."},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"176","DOI":"10.1007\/s10278-005-5167-8","article-title":"Conceptual Approach for the Design of Radiology Reporting Interfaces: The Talking Template","volume":"18","author":"Sistrom","year":"2005","journal-title":"J. Digit. Imaging"},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"98","DOI":"10.1007\/s10278-005-8734-0","article-title":"Six Characteristics of Effective Structured Reporting and the Inevitable Integration with Speech Recognition","volume":"19","author":"Liu","year":"2006","journal-title":"J. Digit. Imaging"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"887","DOI":"10.1016\/S0531-5131(03)00391-1","article-title":"A grammar-based speech user interface generator for structured reporting","volume":"1256","year":"2003","journal-title":"Int. Congr. Ser."},{"key":"ref_31","unstructured":"Nugues, P., ElGuedj, P.O., Cazenave, F., and de Ferri\u00e8re, B. (November, January 29). Issues in the design of a voice man machine dialogue system generating written medical reports. Proceedings of the 14th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, Paris, France."},{"key":"ref_32","doi-asserted-by":"crossref","unstructured":"Jancsary, J., Matiasek, J., and Trost, H. (2008, January 25\u201327). Revealing the structure of medical dictations with conditional random fields. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP \u201808), Honolulu, HI, USA.","DOI":"10.3115\/1613715.1613717"},{"key":"ref_33","doi-asserted-by":"crossref","unstructured":"Kondratova, I.L. (2005, January 11\u201315). Speech-Enabled Handheld Computing for Fieldwork. Proceedings of the International Conference on Computing in Civil Engineering, Cancun, Mexico.","DOI":"10.1061\/40794(179)102"},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Matiasek, J., Jancsary, J., Klein, A., and Trost, H. (2009, January 30). Identifying segment topics in medical dictations. Proceedings of the 2nd Workshop on Semantic Representation of Spoken Language (SRSL \u201809), Athens, Greece.","DOI":"10.3115\/1626296.1626299"},{"key":"ref_35","doi-asserted-by":"crossref","first-page":"771","DOI":"10.1016\/j.ijhcs.2003.07.004","article-title":"Human\u2013computer interaction issues for mobile computing in a variable work context","volume":"60","author":"York","year":"2004","journal-title":"Int. J. Hum. Comput. Stud."},{"key":"ref_36","unstructured":"Westerlun, B. (2009). Design Space Exploration: Co-Operative Creation of Proposals for Desired Interactions with Future Artefacts. [Ph.D. Thesis, Kungliga Tekniska H\u00f6gskolan]."},{"key":"ref_37","doi-asserted-by":"crossref","unstructured":"Nigay, L., and Coutaz, J. (1993, January 24\u201329). A design space for multimodal systems: Concurrent processing and data fusion. Proceedings of the INTERACT \u201893 and CHI \u201893 Conference on Human Factors in Computing Systems (CHI \u201893), Amsterdam, The Netherlands.","DOI":"10.1145\/169059.169143"},{"key":"ref_38","doi-asserted-by":"crossref","unstructured":"Bernhaupt, R., Dalvi, G., Joshi, A.K., Balkrishan, D., O\u2019Neill, J., and Winckler, M. (2017). Coping with Design Complexity: A Conceptual Framework for Design Alternatives and Variants. Human-Computer Interaction\u2014INTERACT 2017, Springer. Lecture Notes in Computer Science.","DOI":"10.1007\/978-3-319-67744-6"},{"key":"ref_39","doi-asserted-by":"crossref","unstructured":"Bowen, J., and Dittmar, A. (2016, January 21\u201324). A semi-formal framework for describing interaction design spaces. Proceedings of the 8th ACM SIGCHI Symposium on Engineering Interactive Computing Systems (EICS \u201916), Brussels, Belgium.","DOI":"10.1145\/2933242.2933247"},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Bowen, J., and Dittmar, A. (2017, January 4\u20138). Formal Definitions for Design Spaces and Traces. Proceedings of the 2017 24th Asia-Pacific Software Engineering Conference (APSEC), Nanjing, China.","DOI":"10.1109\/APSEC.2017.72"},{"key":"ref_41","doi-asserted-by":"crossref","unstructured":"Dove, G., Hansen, N.B., and Halskov, K. (2016, January 23\u201327). An Argument for Design Space Reflection. Proceedings of the 9th Nordic Conference on Human-Computer Interaction (NordiCHI \u201916), Gothenburg, Sweden.","DOI":"10.1145\/2971485.2971528"},{"key":"ref_42","doi-asserted-by":"crossref","unstructured":"Braun, M., Broy, N., Pfleging, B., and Alt, F. (2017, January 4\u20137). A design space for conversational in-vehicle information systems. Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services (MobileHCI \u201917), Vienna, Austria.","DOI":"10.1145\/3098279.3122122"},{"key":"ref_43","unstructured":"Card, S.K., and Mackinlay, J. (1997, January 21). The structure of the information visualization design space. Proceedings of the 1997 IEEE Symposium on Information Visualization (InfoVis \u201997) (INFOVIS \u201997), Phoenix, AZ, USA."},{"key":"ref_44","doi-asserted-by":"crossref","unstructured":"M\u00fcller, J., Alt, F., Michelis, D., and Schmidt, A. (2010, January 25\u201329). Requirements and design space for interactive public displays. Proceedings of the 18th ACM International Conference on Multimedia (MM \u201910), Firenze, Italy.","DOI":"10.1145\/1873951.1874203"},{"key":"ref_45","doi-asserted-by":"crossref","unstructured":"Haeuslschmid, R., Pfleging, B., and Alt, F. (2016, January 7\u201312). A Design Space to Support the Development of Windshield Applications for the Car. Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI \u201916), San Jose, CA, USA.","DOI":"10.1145\/2858036.2858336"},{"key":"ref_46","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1207\/s15327051hci0603&4_2","article-title":"Questions, Options, and Criteria: Elements of Design Space Analysis","volume":"6","author":"MacLean","year":"1991","journal-title":"Hum. Comput. Interact."},{"key":"ref_47","doi-asserted-by":"crossref","unstructured":"Turunen, M., Melto, A., Kainulainen, A., and Hakulinen, J. (2008, January 22\u201326). MobiDic\u2014A Mobile Dictation and Notetaking Application. Proceedings of the Ninth Annual Conference of the International Speech Communication Association, Brisbane, Australia.","DOI":"10.21437\/Interspeech.2008-85"},{"key":"ref_48","doi-asserted-by":"crossref","unstructured":"Keskinen, T., Melto, A., Hakulinen, J., Turunen, M., Saarinen, S., Pallos, T., Kallioniemi, P., Danielsson-Ojala, R., and Salanter\u00e4, S. (2013, January 2\u20135). Mobile dictation for healthcare professionals. Proceedings of the 12th International Conference on Mobile and Ubiquitous Multimedia (MUM \u201813), Lule\u00e5, Sweden.","DOI":"10.1145\/2541831.2541880"},{"key":"ref_49","unstructured":"DIMECC (2017). S-STEP\u2013Smart Technologies for Lifecycle Performance, DIMECC. Available online: https:\/\/www.dimecc.com\/wp-content\/uploads\/2019\/06\/DIMECC_FINAL_REPORT_11_S-Step.pdf."},{"key":"ref_50","unstructured":"Kaasinen, E., and Turunen, M. (2021, January 09). DYNAVIS Dynamic Visualization in Project\/Service Lifecycle, Available online: https:\/\/www.dimecc.com\/wp-content\/uploads\/2019\/06\/DIMECC_DYNAVIS-final-report.pdf."}],"container-title":["Multimodal Technologies and Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2414-4088\/5\/1\/3\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T05:09:53Z","timestamp":1760159393000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2414-4088\/5\/1\/3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,1,11]]},"references-count":50,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2021,1]]}},"alternative-id":["mti5010003"],"URL":"https:\/\/doi.org\/10.3390\/mti5010003","relation":{},"ISSN":["2414-4088"],"issn-type":[{"value":"2414-4088","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,1,11]]}}}