{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,5]],"date-time":"2026-06-05T15:34:08Z","timestamp":1780673648551,"version":"3.54.1"},"reference-count":21,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2024,12,27]],"date-time":"2024-12-27T00:00:00Z","timestamp":1735257600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2024,12,27]],"date-time":"2024-12-27T00:00:00Z","timestamp":1735257600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["PID2022-137048OA-C43"],"award-info":[{"award-number":["PID2022-137048OA-C43"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["PID2020-113656RB-C21"],"award-info":[{"award-number":["PID2020-113656RB-C21"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["PID2022-137048OA-C43"],"award-info":[{"award-number":["PID2022-137048OA-C43"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["TED2021-131401B-C21"],"award-info":[{"award-number":["TED2021-131401B-C21"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["PID2022-137048OA-C43"],"award-info":[{"award-number":["PID2022-137048OA-C43"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["PID2022-137048OA-C43"],"award-info":[{"award-number":["PID2022-137048OA-C43"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100004834","name":"Universitat Jaume I","doi-asserted-by":"crossref","id":[{"id":"10.13039\/501100004834","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Supercomput"],"published-print":{"date-parts":[[2025,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>Many current embedded systems comprise heterogeneous computing components including quite powerful GPUs, which enables their application across diverse sectors. This study demonstrates the efficient execution of a medium-sized self-supervised audio spectrogram transformer (SSAST) model on a low-power system-on-chip (SoC). Through comprehensive evaluation, including real time inference scenarios, we show that GPUs outperform multi-core CPUs in inference processes. Optimization techniques such as adjusting batch size, model compilation with TensorRT, and reducing data precision significantly enhance inference time, energy consumption, and memory usage. In particular, negligible accuracy degradation is observed, with post-training quantization to 8-bit integers showing less than 1% loss. This research underscores the feasibility of deploying transformer neural networks on low-power embedded devices, ensuring efficiency in time, energy, and memory, while maintaining the accuracy of the results.<\/jats:p>","DOI":"10.1007\/s11227-024-06807-1","type":"journal-article","created":{"date-parts":[[2024,12,27]],"date-time":"2024-12-27T16:44:09Z","timestamp":1735317849000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":9,"title":["Evaluating and accelerating vision transformers on GPU-based embedded edge AI systems"],"prefix":"10.1007","volume":"81","author":[{"given":"Ignacio","family":"Martin-Salinas","sequence":"first","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jose M.","family":"Badia","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Oscar","family":"Valls","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"German","family":"Leon","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Rocio","family":"del Amor","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jose A.","family":"Belloch","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Adrian","family":"Amor-Martin","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Valery","family":"Naranjo","sequence":"additional","affiliation":[],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"297","published-online":{"date-parts":[[2024,12,27]]},"reference":[{"key":"6807_CR1","doi-asserted-by":"publisher","DOI":"10.1007\/978-981-10-5209-5","volume-title":"Deep learning in natural language processing","author":"L Deng","year":"2018","unstructured":"Deng L, Liu Y (2018) Deep learning in natural language processing. Springer"},{"key":"6807_CR2","doi-asserted-by":"crossref","unstructured":"He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770\u2013778 (2016)","DOI":"10.1109\/CVPR.2016.90"},{"issue":"1","key":"6807_CR3","doi-asserted-by":"publisher","first-page":"187","DOI":"10.1146\/annurev-control-060117-105157","volume":"1","author":"W Schwarting","year":"2018","unstructured":"Schwarting W, Alonso-Mora J, Rus D (2018) Planning and decision-making for autonomous vehicles. Annual Rev Control, Robot Auto Syst 1(1):187\u2013210","journal-title":"Annual Rev Control, Robot Auto Syst"},{"key":"6807_CR4","unstructured":"O\u2019shea, K., Nash, R.: An introduction to convolutional neural networks. arXiv preprint arXiv:1511.08458 (2015)"},{"key":"6807_CR5","unstructured":"Dosovitskiy, A.: An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)"},{"key":"6807_CR6","volume-title":"A Survey of Embedded Machine Learning for Smart and Sustainable Healthcare Applications","author":"S An","year":"2023","unstructured":"An S, Tuncel Y, Basaklar T, Ogras UY (2023) A Survey of Embedded Machine Learning for Smart and Sustainable Healthcare Applications. Springer"},{"issue":"3","key":"6807_CR7","doi-asserted-by":"publisher","first-page":"167","DOI":"10.1080\/21681015.2021.1965665","volume":"39","author":"RL Silva","year":"2022","unstructured":"Silva RL, Canciglieri Junior O, Rudek M (2022) A road map for planning-deploying machine vision artifacts in the context of industry 4.0. Journal of Industrial and Production Engineering 39(3):167\u2013180","journal-title":"Journal of Industrial and Production Engineering"},{"key":"6807_CR8","doi-asserted-by":"crossref","unstructured":"de Sousa, F.L.M., da Silva, M.J., de Meira\u00a0Santos, R.C.C., Silva, M.C., Oliveira, R.A.R.: Deep-Learning-Based Embedded ADAS System. IEEE (2021)","DOI":"10.1109\/SBESC53686.2021.9628316"},{"key":"6807_CR9","doi-asserted-by":"publisher","first-page":"71","DOI":"10.1016\/j.iotcps.2023.02.004","volume":"3","author":"R Singh","year":"2023","unstructured":"Singh R, Gill SS (2023) Edge AI: a survey. Internet of Things and Cyber-Phys Syst 3:71\u201392","journal-title":"Internet of Things and Cyber-Phys Syst"},{"key":"6807_CR10","doi-asserted-by":"crossref","unstructured":"Aafaq, N., Saleem, M., Khan, J.T., Abbasi, I.H.: Convolutional neural networks for deep spoken keyword spotting. In: 2023 3rd International Conference on Artificial Intelligence (ICAI), pp. 170\u2013175 (2023). 10.1109\/ICAI58407.2023.10136648","DOI":"10.1109\/ICAI58407.2023.10136648"},{"key":"6807_CR11","doi-asserted-by":"crossref","unstructured":"Sainath, T.N., Parada, C.: Convolutional neural networks for small-footprint keyword spotting. In: Proc. Interspeech 2015, pp. 1478\u20131482 (2015). 10.21437\/Interspeech.2015-352","DOI":"10.21437\/Interspeech.2015-352"},{"key":"6807_CR12","doi-asserted-by":"crossref","unstructured":"Chen, G., Parada, C., Sainath, T.N.: Query-by-example keyword spotting using long short-term memory networks. In: 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 5236\u20135240 (2015). 10.1109\/ICASSP.2015.7178970","DOI":"10.1109\/ICASSP.2015.7178970"},{"key":"6807_CR13","doi-asserted-by":"publisher","first-page":"220","DOI":"10.1007\/978-3-540-74695-9_23","volume-title":"Artificial Neural Networks - ICANN 2007","author":"S Fern\u00e1ndez","year":"2007","unstructured":"Fern\u00e1ndez S, Graves A, Schmidhuber J (2007) An application of recurrent neural networks to discriminative keyword spotting. In: de S\u00e1 JM, Alexandre LA, Duch W, Mandic D (eds) Artificial Neural Networks - ICANN 2007. Springer, Berlin, Heidelberg, pp 220\u2013229"},{"key":"6807_CR14","unstructured":"Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., Houlsby, N.: An image is worth 16x16 words: Transformers for image recognition at scale. CoRR arxiv:abs\/2010.11929 (2020)"},{"key":"6807_CR15","doi-asserted-by":"crossref","unstructured":"Gong, Y., Chung, Y., Glass, J.R.: AST: Audio Spectrogram Transformer. CoRR arxiv:abs\/2104.01778 (2021)","DOI":"10.21437\/Interspeech.2021-698"},{"key":"6807_CR16","doi-asserted-by":"crossref","unstructured":"Berg, A., O\u2019Connor, M., Cruz, M.T.: Keyword Transformer: A Self-Attention Model for Keyword Spotting. ISCA (2021). 10.21437\/interspeech.2021-1286","DOI":"10.21437\/Interspeech.2021-1286"},{"key":"6807_CR17","doi-asserted-by":"crossref","unstructured":"Gong, Y., Lai, C.-I.J., Chung, Y.-A., Glass, J.: SSAST: Self-Supervised Audio Spectrogram Transformer (2022)","DOI":"10.21437\/Interspeech.2021-698"},{"key":"6807_CR18","unstructured":"Warden, P.: Speech commands: A dataset for limited-vocabulary speech recognition. arXiv preprint arXiv:1804.03209 (2018)"},{"key":"6807_CR19","unstructured":"NVIDIA: NVIDIA Jetson Orin Nano Developer Kit User Guide. https:\/\/manuals.plus\/nvidia\/jetson-orin-nano-developer-kit-manual (April 2023)"},{"key":"6807_CR20","unstructured":"Barrachina, S., Barreda, M., Catal\u00e1n, S., Dolz, M.F., Fabregat, G., Mayo, R., Quintana-Ort\u00ed, E.: An integrated framework for power-performance analysis of parallel scientific workloads. Energy, 114\u2013119 (2013)"},{"key":"6807_CR21","unstructured":"NVIDIA: NVIDIA Jetson Linux Developer Guide. Release 35.4.1. (2023). https:\/\/docs.nvidia.com\/jetson\/archives\/r35.4.1\/DeveloperGuide\/"}],"container-title":["The Journal of Supercomputing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-024-06807-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11227-024-06807-1\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-024-06807-1.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2024,12,27]],"date-time":"2024-12-27T17:04:18Z","timestamp":1735319058000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11227-024-06807-1"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2024,12,27]]},"references-count":21,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2025,1]]}},"alternative-id":["6807"],"URL":"https:\/\/doi.org\/10.1007\/s11227-024-06807-1","relation":{},"ISSN":["0920-8542","1573-0484"],"issn-type":[{"value":"0920-8542","type":"print"},{"value":"1573-0484","type":"electronic"}],"subject":[],"published":{"date-parts":[[2024,12,27]]},"assertion":[{"value":"4 December 2024","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"27 December 2024","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Conflict of interest"}}],"article-number":"349"}}