{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,21]],"date-time":"2026-05-21T03:43:41Z","timestamp":1779335021083,"version":"3.51.4"},"reference-count":40,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2026,2,1]],"date-time":"2026-02-01T00:00:00Z","timestamp":1769904000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["MTI"],"abstract":"<jats:p>Industry 5.0 is composed of a variety of complex tasks and challenging processes requiring specialized labor and multidisciplinary coordination. Specifically, when it comes to shipbuilding, shipyards leverage advanced technologies, seeking to replace operations that continue to rely on traditional methods, such as 2D blueprints and paper-based documentation, which can lead to inefficiencies and alignment errors in precision-dependent tasks. For this reason, this article focuses on embracing Mixed Reality (MR) technologies to address these challenges in the context of electrical outfitting tasks. The design, development and evaluation of a MR application tailored for HoloLens 2 smart glasses aims to streamline the workflow for operators, reducing reliance on paper-based documentation and enhancing the precision of assembly processes. The proposed system allows for the precise positioning of 3D models in the real environment, ensuring accurate alignment during assembly. Additionally, it incorporates automatic dimension generation between objects in the scene. To further enhance usability, the application integrates a Galician on-device Automatic Speech Recognition (ASR) system, allowing operators to interact seamlessly with the MR interface using voice commands. The whole system has been exhaustively tested, both through usability and functionality evaluations, which validate MR as a viable tool for shipyard assembly and inspection tasks.<\/jats:p>","DOI":"10.3390\/mti10020013","type":"journal-article","created":{"date-parts":[[2026,2,2]],"date-time":"2026-02-02T09:00:33Z","timestamp":1770022833000},"page":"13","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["A Mixed Reality Tool with Automatic Speech Recognition for 3D CAD Based Visualization and Automatic Dimension Generation in the Industry 5.0 Shipyard"],"prefix":"10.3390","volume":"10","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-7633-7131","authenticated-orcid":false,"given":"Aida","family":"Vidal-Balea","sequence":"first","affiliation":[{"name":"Department of Computer Engineering, Faculty of Computer Science, Universidade da Coru\u00f1a, 15071 A Coru\u00f1a, Spain"},{"name":"Centro de Investigaci\u00f3n CITIC, Universidade da Coru\u00f1a, 15071 A Coru\u00f1a, Spain"},{"name":"Centro Mixto de Investigaci\u00f3n UDC-Navantia, Universidade da Coru\u00f1a, Edificio de Batallones, s\/n, 15403 Ferrol, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0007-6998-5919","authenticated-orcid":false,"given":"Ant\u00f3n","family":"Valladares-Poncela","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Faculty of Computer Science, Universidade da Coru\u00f1a, 15071 A Coru\u00f1a, Spain"},{"name":"Centro de Investigaci\u00f3n CITIC, Universidade da Coru\u00f1a, 15071 A Coru\u00f1a, Spain"},{"name":"Centro Mixto de Investigaci\u00f3n UDC-Navantia, Universidade da Coru\u00f1a, Edificio de Batallones, s\/n, 15403 Ferrol, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0009-0008-4488-3008","authenticated-orcid":false,"given":"Javier","family":"Vilar-Mart\u00ednez","sequence":"additional","affiliation":[{"name":"Navantia S. A., Astillero de Ferrol, 15403 Ferrol, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2179-5917","authenticated-orcid":false,"given":"Tiago M.","family":"Fern\u00e1ndez-Caram\u00e9s","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Faculty of Computer Science, Universidade da Coru\u00f1a, 15071 A Coru\u00f1a, Spain"},{"name":"Centro de Investigaci\u00f3n CITIC, Universidade da Coru\u00f1a, 15071 A Coru\u00f1a, Spain"},{"name":"Centro Mixto de Investigaci\u00f3n UDC-Navantia, Universidade da Coru\u00f1a, Edificio de Batallones, s\/n, 15403 Ferrol, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4991-6808","authenticated-orcid":false,"given":"Paula","family":"Fraga-Lamas","sequence":"additional","affiliation":[{"name":"Department of Computer Engineering, Faculty of Computer Science, Universidade da Coru\u00f1a, 15071 A Coru\u00f1a, Spain"},{"name":"Centro de Investigaci\u00f3n CITIC, Universidade da Coru\u00f1a, 15071 A Coru\u00f1a, Spain"},{"name":"Centro Mixto de Investigaci\u00f3n UDC-Navantia, Universidade da Coru\u00f1a, Edificio de Batallones, s\/n, 15403 Ferrol, Spain"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2026,2,1]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1016\/bs.adcom.2019.10.010","article-title":"Industry 4.0: Industrial internet of things (IIoT)","volume":"Volume 117","author":"Munirathinam","year":"2020","journal-title":"Advances in Computers"},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"106159","DOI":"10.1016\/j.cie.2019.106159","article-title":"A survey of industrial augmented reality","volume":"139","author":"Mariano","year":"2020","journal-title":"Comput. Ind. Eng."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"428","DOI":"10.1108\/IR-09-2021-0204","article-title":"Augmented reality\u2014An important aspect of Industry 4.0","volume":"49","author":"Sharma","year":"2022","journal-title":"Ind. Robot. Int. J. Robot. Res. Appl."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Grazi, L., Feijoo Alonso, A., G\u0105siorek, A., Pertusa Llopis, A.M., Grajeda, A., Kanakis, A., Rodriguez Vidal, A., Parri, A., Vidal, F., and Ergas, I. (2025). Methodology and Challenges of Implementing Advanced Technological Solutions in Small and Medium Shipyards: The Case Study of the Mari4_YARD Project. Electronics, 14.","DOI":"10.3390\/electronics14081597"},{"key":"ref_5","unstructured":"Navantia (2025, August 26). Shipyard 5.0 Strategic Network, Products Catalog. Available online: https:\/\/www.navantia.es\/en\/catalog\/."},{"key":"ref_6","unstructured":"Hound, N. (2025, December 30). W\u00e4rtsil\u00e4 Moves Towards Remote Guidance for Vessel Repair and Maintenance. Available online: https:\/\/www.iims.org.uk\/wartsila-moves-towards-remote-guidance-for-vessel-repair-and-maintenance."},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"280","DOI":"10.1007\/978-3-030-85607-6_22","article-title":"S.A.M.I.R.: Supporting Tele-Maintenance with Integrated Interaction Using Natural Language and Augmented Reality","volume":"Volume 12936","author":"Cannito","year":"2021","journal-title":"Human-Computer Interaction\u2014INTERACT 2021: 18th IFIP TC 13 International Conference, Bari, Italy, 30 August\u20133 September 2021, Proceedings, Part V"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Rosilius, M., Spiertz, M., Wirsing, B., Geuen, M., Br\u00e4utigam, V., and Ludwig, B. (2024). Impact of Industrial Noise on Speech Interaction Performance and User Acceptance when Using the MS HoloLens 2. Multimodal Technol. Interact., 8.","DOI":"10.3390\/mti8020008"},{"key":"ref_9","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1007\/s10772-023-10036-x","article-title":"Voice user interfaces in manufacturing logistics: A literature review","volume":"26","author":"Ludwig","year":"2023","journal-title":"Int. J. Speech Technol."},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"03119009","DOI":"10.1061\/(ASCE)CO.1943-7862.0001749","article-title":"State-of-the-art review on Mixed Reality applications in the AECO industry","volume":"146","author":"Cheng","year":"2020","journal-title":"J. Constr. Eng. Manag."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"102071","DOI":"10.1016\/j.rcim.2020.102071","article-title":"AR\/MR remote collaboration on physical tasks: A review","volume":"72","author":"Wang","year":"2021","journal-title":"Robot. Comput.-Integr. Manuf."},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"150","DOI":"10.1016\/j.autcon.2017.11.003","article-title":"A critical review of Virtual and Augmented Reality (VR\/AR) applications in construction safety","volume":"86","author":"Li","year":"2018","journal-title":"Autom. Constr."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"104054","DOI":"10.1016\/j.autcon.2021.104054","article-title":"BIM data flow architecture with AR\/VR technologies: Use cases in architecture, engineering and construction","volume":"134","author":"Schiavi","year":"2022","journal-title":"Autom. Constr."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"102726","DOI":"10.1016\/j.jobe.2021.102726","article-title":"Digital twin application in the construction industry: A literature review","volume":"40","author":"Opoku","year":"2021","journal-title":"J. Build. Eng."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"189","DOI":"10.1016\/j.ifacol.2021.04.098","article-title":"Multi-channel augmented reality interactive framework design for ship outfitting guidance","volume":"53","author":"Wang","year":"2020","journal-title":"IFAC-PapersOnLine"},{"key":"ref_16","unstructured":"Polycam (2025, August 26). Cross-Platform 3D Scanning Floor Plans & Drone Mapping. Available online: https:\/\/poly.cam\/."},{"key":"ref_17","unstructured":"Graphisoft (2025, August 26). BIMx. Available online: https:\/\/graphisoft.com\/solutions\/bimx\/."},{"key":"ref_18","unstructured":"Graphisoft (2025, August 26). Archicad. Available online: https:\/\/graphisoft.com\/solutions\/archicad\/."},{"key":"ref_19","unstructured":"eDrawings (2025, August 26). View CAD Files in AR\/VR. Available online: https:\/\/www.edrawingsviewer.com\/view-cad-files-arvr."},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1109\/MCG.2021.3114955","article-title":"Measurement and inspection of photo-realistic 3-D VR models","volume":"41","author":"Tadeja","year":"2021","journal-title":"IEEE Comput. Graph. Appl."},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Po\u0142ap, D. (2018). Voice Control in Mixed Reality. Proceedings of the 2018 Federated Conference on Computer Science and Information Systems (FedCSIS), ACSIS.","DOI":"10.15439\/2018F13"},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Vidal-Balea, A., Blanco-Novoa, O., Fraga-Lamas, P., Vilar-Montesinos, M., and Fern\u00e1ndez-Caram\u00e9s, T.M. (2020). Creating collaborative Augmented Reality experiences for industry 4.0 training and assistance applications: Performance evaluation in the shipyard of the future. Appl. Sci., 10.","DOI":"10.3390\/app10249073"},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Sailor, H., Patil, A., and Patil, H. (2018, January 29\u201331). Advances in Low Resource ASR: A Deep Learning Perspective. Proceedings of the 2018 Speech and Language Technology in Under-Resourced Languages (SLTU), Gurugram, India.","DOI":"10.21437\/SLTU.2018-4"},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"179798","DOI":"10.1109\/ACCESS.2020.3027619","article-title":"Hierarchical Transfer Learning for Multilingual, Multi-Speaker, and Style Transfer DNN-Based TTS on Low-Resource Languages","volume":"8","author":"Azizah","year":"2020","journal-title":"IEEE Access"},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Conneau, A., Khandelwal, K., Goyal, N., Chaudhary, V., Wenzek, G., Guzm\u00e1n, F., Grave, E., Ott, M., Zettlemoyer, L., and Stoyanov, V. (2019). Unsupervised Cross-lingual Representation Learning at Scale. arXiv.","DOI":"10.18653\/v1\/2020.acl-main.747"},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Pratap, V., Sriram, A., Tomasello, P., Hannun, A., Liptchinsky, V., Synnaeve, G., and Collobert, R. (2020). Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters. arXiv.","DOI":"10.21437\/Interspeech.2020-2831"},{"key":"ref_27","first-page":"12449","article-title":"wav2vec 2.0: A framework for self-supervised learning of speech representations","volume":"33","author":"Baevski","year":"2020","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Kim, C., Gowda, D., Lee, D., Kim, J., Kumar, A., Kim, S., Garg, A., and Han, C. (2020, January 1\u20134). A Review of On-Device Fully Neural End-to-End Automatic Speech Recognition Algorithms. Proceedings of the 2020 54th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA.","DOI":"10.1109\/IEEECONF51394.2020.9443456"},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Marchisio, A., Hanif, M.A., Khalid, F., Plastiras, G., Kyrkou, C., Theocharides, T., and Shafique, M. (2019, January 15\u201317). Deep Learning for Edge Computing: Current Trends, Cross-Layer Optimizations, and Open Research Challenges. Proceedings of the 2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), Miami, FL, USA.","DOI":"10.1109\/ISVLSI.2019.00105"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"2716","DOI":"10.1109\/TASLP.2022.3198548","article-title":"Low Latency Speech Enhancement for Hearing Aids Using Deep Filtering","volume":"30","author":"Rosenkranz","year":"2022","journal-title":"IEEE\/ACM Trans. Audio Speech Lang. Proc."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"4209","DOI":"10.1109\/ACCESS.2021.3140175","article-title":"A Metaverse: Taxonomy, Components, Applications, and Open Challenges","volume":"10","author":"Park","year":"2022","journal-title":"IEEE Access"},{"key":"ref_32","unstructured":"Chute, D.O. (2026, January 27). Noise Control Methods for Shipbuilding. Available online: https:\/\/www.nsrp.org\/wp-content\/uploads\/2015\/09\/Deliverable-2012-424-Noise_Control_Methods_Final_Report-Atrium.pdf."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"764","DOI":"10.1007\/s40436-023-00479-5","article-title":"Holorailway: An augmented reality system to support assembly operations in the railway industry","volume":"12","author":"Garcia","year":"2024","journal-title":"Adv. Manuf."},{"key":"ref_34","unstructured":"Unity (2025, August 26). Unity Real-Time Development Platform|3D, 2D, VR & AR Engine. Available online: https:\/\/unity.com\/."},{"key":"ref_35","unstructured":"Microsoft (2025, August 26). Mixed Reality Toolkit 3. Available online: https:\/\/learn.microsoft.com\/en-us\/windows\/mixed-reality\/mrtk-unity\/mrtk3-overview\/."},{"key":"ref_36","doi-asserted-by":"crossref","first-page":"77017","DOI":"10.1109\/ACCESS.2025.3564137","article-title":"On-Device Automatic Speech Recognition for Low-Resource Languages in Mixed Reality Industrial Metaverse Applications: Practical Guidelines and Evaluation of a Shipbuilding Application in Galician","volume":"13","year":"2025","journal-title":"IEEE Access"},{"key":"ref_37","doi-asserted-by":"crossref","first-page":"63623","DOI":"10.1109\/ACCESS.2023.3286391","article-title":"Design, Implementation, and Practical Evaluation of a Voice Recognition Based IoT Home Automation System for Low-Resource Languages and Resource-Constrained Edge IoT Devices: A System for Galician and Mobile Opportunistic Scenarios","volume":"11","year":"2023","journal-title":"IEEE Access"},{"key":"ref_38","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1016\/S0166-4115(08)62386-9","article-title":"Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research","volume":"Volume 52","author":"Hart","year":"1988","journal-title":"Advances in Psychology"},{"key":"ref_39","doi-asserted-by":"crossref","first-page":"557","DOI":"10.1007\/s10055-019-00422-9","article-title":"Development and validation of a simulation workload measure: The simulation task load index (SIM-TLX)","volume":"24","author":"Harris","year":"2020","journal-title":"Virtual Real."},{"key":"ref_40","doi-asserted-by":"crossref","unstructured":"Vidal-Balea, A., Fraga-Lamas, P., and Fern\u00e1ndez-Caram\u00e9s, T.M. (2024). Advancing NASA-TLX: Automatic User Interaction Analysis for Workload Evaluation in XR Scenarios. Proceedings of the 2024 IEEE Gaming, Entertainment, and Media Conference (GEM), Turin, Italy, 5\u20137 June 2024, IEEE.","DOI":"10.1109\/GEM61861.2024.10585425"}],"container-title":["Multimodal Technologies and Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2414-4088\/10\/2\/13\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2026,2,13]],"date-time":"2026-02-13T05:16:07Z","timestamp":1770959767000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2414-4088\/10\/2\/13"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2026,2,1]]},"references-count":40,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2026,2]]}},"alternative-id":["mti10020013"],"URL":"https:\/\/doi.org\/10.3390\/mti10020013","relation":{},"ISSN":["2414-4088"],"issn-type":[{"value":"2414-4088","type":"electronic"}],"subject":[],"published":{"date-parts":[[2026,2,1]]}}}