{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,7,16]],"date-time":"2025-07-16T13:55:42Z","timestamp":1752674142834,"version":"3.41.0"},"reference-count":15,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2020,9,29]],"date-time":"2020-09-29T00:00:00Z","timestamp":1601337600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["GetMobile: Mobile Comp. and Comm."],"published-print":{"date-parts":[[2020,9,29]]},"abstract":"<jats:p>Smart glasses are often used in noisy public spaces or industrial settings. Voice commands and automatic speech recognition (ASR) are good user interfaces for such a form factor, but the background noise and interfering speakers pose important challenges. Typical signal processing techniques have limitations in performance and\/or hardware resources. V-Speech is a novel solution that captures the voice signal with a vibration sensor located in the nasal pads of smart glasses. Although signal-to-noise ratio (SNR) is much higher with vibration sensor capture, it introduces a \"nasal distortion,\" which must be dealt with. The second part of our proposed solution involves a voice transformation of the vibration signal using a neural network to produce an output that mimics the characteristics of a conventional microphone. We evaluated V-Speech in noise-free and very noisy conditions with 30 volunteer speakers uttering 145 phrases each, and validated its performance on ASR engines, with assessments of voice quality using the Perceptual Evaluation of Speech Quality (PESQ) metric, and with subjective listeners to determine intelligibility, naturalness and overall quality. The results show, in extreme noise conditions, a mean improvement of 50% for Word Error Rate (WER), 1.0 on a scale of 5.0 for PESQ, and speech regarded intelligible, with naturalness rated as fair to good. The output of V-Speech has low noise, sounds natural, and enables clear voice communication in challenging environments.<\/jats:p>","DOI":"10.1145\/3427384.3427392","type":"journal-article","created":{"date-parts":[[2020,9,30]],"date-time":"2020-09-30T04:14:05Z","timestamp":1601439245000},"page":"18-24","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["V-Speech"],"prefix":"10.1145","volume":"24","author":[{"given":"H\u00e9ctor A.","family":"Cordourier Maruri","sequence":"first","affiliation":[{"name":"Intel Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Paulo","family":"Lopez-Meyer","sequence":"additional","affiliation":[{"name":"Intel Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jonathan","family":"Huang","sequence":"additional","affiliation":[{"name":"Intel Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Willem","family":"Beltman","sequence":"additional","affiliation":[{"name":"Intel Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lama","family":"Nachman","sequence":"additional","affiliation":[{"name":"Intel Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hong","family":"Lu","sequence":"additional","affiliation":[{"name":"Intel Labs"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,9,29]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"\"Technology for a Quieter America \" National Academy of Engineering ISBN 978-0--309--15632--5 USA 2010.  \"Technology for a Quieter America \" National Academy of Engineering ISBN 978-0--309--15632--5 USA 2010."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2016.2647702"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.408535"},{"key":"e_1_2_1_4_1","unstructured":"Sabine Reinfeldt Bo H\u00e5kansson Hamidreza Taghavi and M\u00e5ns Eeg-Olofsson. 2015. New developments in bone-conduction hearing implants: a review. In Medical Devices (Auckland NZ) vol. 8 79--93. DOI: http:\/\/doi.org\/10.2147\/ MDER.S39691  Sabine Reinfeldt Bo H\u00e5kansson Hamidreza Taghavi and M\u00e5ns Eeg-Olofsson. 2015. New developments in bone-conduction hearing implants: a review. In Medical Devices (Auckland NZ) vol. 8 79--93. DOI: http:\/\/doi.org\/10.2147\/ MDER.S39691"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/TSA.2005.855838"},{"volume-title":"Proceedings of New Directions for Improving Audio Effectiveness, Neuilly-sur-Seine, France.","year":"2005","author":"Acker-Mills Barbara","key":"e_1_2_1_6_1"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1177\/154193120805200505"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.apergo.2010.09.004"},{"key":"e_1_2_1_9_1","unstructured":"NeoVictory. 2012. https:\/\/usermanual.wiki\/ NeoVictory-Technology\/S102\/html  NeoVictory. 2012. https:\/\/usermanual.wiki\/ NeoVictory-Technology\/S102\/html"},{"key":"e_1_2_1_10_1","unstructured":"Buhel Soundglasses. 2015. https:\/\/newatlas.com\/ buhel-soundglasses\/35904\/  Buhel Soundglasses. 2015. https:\/\/newatlas.com\/ buhel-soundglasses\/35904\/"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1121\/1.3654911"},{"key":"e_1_2_1_12_1","unstructured":"ITU-T Recommendation P.862 2001. PESQ an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs.  ITU-T Recommendation P.862 2001. PESQ an objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs."},{"key":"e_1_2_1_13_1","unstructured":"Speech API -- Speech Recognition Google Cloud Platform 2018. Retrieved January 15 2018 from: https:\/\/cloud.google.com\/speech\/?hl=en  Speech API -- Speech Recognition Google Cloud Platform 2018. Retrieved January 15 2018 from: https:\/\/cloud.google.com\/speech\/?hl=en"},{"key":"e_1_2_1_14_1","unstructured":"Standard ECMA-74. 2017 Measurement of Airborne Noise emitted by Information Technology and Telecommunications Equipment ECMA international 14th edition December 2017 https:\/\/www.ecma-international.org\/publications\/ standards\/Ecma-074.htm  Standard ECMA-74. 2017 Measurement of Airborne Noise emitted by Information Technology and Telecommunications Equipment ECMA international 14th edition December 2017 https:\/\/www.ecma-international.org\/publications\/ standards\/Ecma-074.htm"},{"key":"e_1_2_1_15_1","unstructured":"ITU-T Recommendation P.835 2003. Subjective test methodology for evaluating speech communication systems that include noise suppression algorithm.  ITU-T Recommendation P.835 2003. Subjective test methodology for evaluating speech communication systems that include noise suppression algorithm."}],"container-title":["GetMobile: Mobile Computing and Communications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3427384.3427392","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3427384.3427392","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:02:26Z","timestamp":1750197746000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3427384.3427392"}},"subtitle":["Noise-Robust Speech Capturing Glasses Using Vibration Sensors"],"short-title":[],"issued":{"date-parts":[[2020,9,29]]},"references-count":15,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2020,9,29]]}},"alternative-id":["10.1145\/3427384.3427392"],"URL":"https:\/\/doi.org\/10.1145\/3427384.3427392","relation":{},"ISSN":["2375-0529","2375-0537"],"issn-type":[{"type":"print","value":"2375-0529"},{"type":"electronic","value":"2375-0537"}],"subject":[],"published":{"date-parts":[[2020,9,29]]},"assertion":[{"value":"2020-09-29","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}