{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,4,1]],"date-time":"2025-04-01T13:05:16Z","timestamp":1743512716634,"version":"3.37.3"},"reference-count":26,"publisher":"Springer Science and Business Media LLC","issue":"12","license":[{"start":{"date-parts":[[2022,4,8]],"date-time":"2022-04-08T00:00:00Z","timestamp":1649376000000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2022,4,8]],"date-time":"2022-04-08T00:00:00Z","timestamp":1649376000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["PID2020- 113785RB-100","PID2019-106455GBC21"],"award-info":[{"award-number":["PID2020- 113785RB-100","PID2019-106455GBC21"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100014440","name":"Ministerio de Ciencia, Innovaci\u00f3n y Universidades","doi-asserted-by":"publisher","award":["PID2020-113656RB-C21"],"award-info":[{"award-number":["PID2020-113656RB-C21"]}],"id":[{"id":"10.13039\/100014440","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/100012818","name":"Comunidad de Madrid","doi-asserted-by":"publisher","award":["PEJD-2019-PRE\/TIC-16327","2022\/00024\/001"],"award-info":[{"award-number":["PEJD-2019-PRE\/TIC-16327","2022\/00024\/001"]}],"id":[{"id":"10.13039\/100012818","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Universidad Carlos III"}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Supercomput"],"published-print":{"date-parts":[[2022,8]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The emerging Multi-Processor System-on-Chip (MPSoC) technology, which combines heterogeneous computing with the high performance of field programmable gate arrays (FPGA), is a promising platform for a large number of applications, including wireless communications and vehicular technology. In this specific application context, when multiple-input multiple-output (MIMO) scenarios are considered, the system usually has to manage a large number of communication links among sensors and antennas involving different vehicles and users. Millimeter wave (mmWave) communications are one of the key technology enablers toward achieving high data rates in beyond 5G systems (B5G). Communication at these frequency bands usually involves the use of large antenna arrays, often requiring high computational resources. One of the candidate platforms able to manage a huge number of communications is the Xilinx Zynq UltraScale+ EG Heterogeneous MPSoC, which is composed of a dual-core Cortex-R5, a quad-core ARM Cortex-A53, a graphics processing unit (GPU) and a high-end FPGA. This work analyzes the computational performance that requires a recent mmWave MIMO channel estimation algorithm in a platform of this kind. As a first approach, we will focus our work on the performance that can be achieved via the quad-core ARM Cortex-A53. To this end, we will use the libraries for numerical algebra (BLAS and LAPACK). The results show that our reference implementation is able to manage a large MIMO communication system with 256 antennas without exhausting platform resources.<\/jats:p>","DOI":"10.1007\/s11227-022-04479-3","type":"journal-article","created":{"date-parts":[[2022,4,8]],"date-time":"2022-04-08T05:08:34Z","timestamp":1649394514000},"page":"14756-14767","update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":3,"title":["Performance analysis of a millimeter wave MIMO channel estimation method in an embedded multi-core processor"],"prefix":"10.1007","volume":"78","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-0414-5365","authenticated-orcid":false,"given":"Pablo M.","family":"Aviles","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Diego","family":"Lloria","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jose A.","family":"Belloch","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Sandra","family":"Roger","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Almudena","family":"Lindoso","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Maximo","family":"Cobos","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"297","published-online":{"date-parts":[[2022,4,8]]},"reference":[{"key":"4479_CR1","doi-asserted-by":"crossref","unstructured":"Eldredge M, Hughes TJ, Ferencz RM, Rifai SM, Raefsky A, Herndon B (1997) \u201cHigh-performance parallel computing in industry,\u201d Parallel Computing, vol.\u00a023, no.\u00a09, pp. 1217\u20131233, parallel computing methods in applied fluid mechanics. [Online]. Available: https:\/\/www.sciencedirect.com\/science\/article\/pii\/S0167819197000495","DOI":"10.1016\/S0167-8191(97)00049-5"},{"issue":"1","key":"4479_CR2","doi-asserted-by":"publisher","first-page":"1686","DOI":"10.1007\/s11227-018-02739-9","volume":"75","author":"JA Belloch","year":"2019","unstructured":"Belloch JA, Amor-Martin A, Garcia-Donoro D, Mart\u00ednez-Zald\u00edvar FJ, Garcia-Castillo LE (2019) On the use of many-core machines for the acceleration of a mesh truncation technique for fem. J Supercomput 75(1):1686\u20131696","journal-title":"J Supercomput"},{"issue":"1","key":"4479_CR3","doi-asserted-by":"publisher","first-page":"1284","DOI":"10.1007\/s11227-018-2422-6","volume":"75","author":"JM Badia","year":"2019","unstructured":"Badia JM, Belloch JA, Cobos M, Igual F, Quintana-Orti ES (2019) Accelerating the srp-phat algorithm on multi- and many-core platforms using opencl. J Supercomput 75(1):1284\u20131297","journal-title":"J Supercomput"},{"issue":"8","key":"4479_CR4","doi-asserted-by":"publisher","first-page":"3796","DOI":"10.1109\/TVT.2012.2210576","volume":"61","author":"S Roger","year":"2012","unstructured":"Roger S, Ramiro C, Gonzalez A, Almenar V, Vidal AM (2012) Fully parallel GPU implementation of a fixed-complexity soft-output MIMO detector. IEEE Trans Vehicular Technol 61(8):3796\u20133800","journal-title":"IEEE Trans Vehicular Technol"},{"issue":"3","key":"4479_CR5","doi-asserted-by":"publisher","first-page":"1010","DOI":"10.1007\/s11227-012-0839-x","volume":"65","author":"C Ramiro","year":"2013","unstructured":"Ramiro C, Roger S, Gonzalez A, Almenar V, Vidal AM (2013) Multicore implementation of a fixed-complexity tree-search detector for MIMO communications. J Supercomput 65(3):1010\u20131019","journal-title":"J Supercomput"},{"issue":"2","key":"4479_CR6","doi-asserted-by":"publisher","first-page":"106","DOI":"10.1109\/MCOM.2014.6736750","volume":"52","author":"W Roh","year":"2014","unstructured":"Roh W, Seol J, Park J, Lee B, Lee J, Kim Y, Cho J, Cheun K, Aryanfar F (2014) Millimeter-wave beamforming as an enabling technology for 5G cellular communications: theoretical feasibility and prototype results. IEEE Communi Magaz 52(2):106\u2013113","journal-title":"IEEE Communi Magaz"},{"issue":"3","key":"4479_CR7","doi-asserted-by":"publisher","first-page":"55","DOI":"10.1109\/MCOM.001.1900411","volume":"58","author":"M Giordani","year":"2020","unstructured":"Giordani M, Polese M, Mezzavilla M, Rangan S, Zorzi M (2020) Toward 6G networks: use cases and technologies. IEEE Commun Magaz 58(3):55\u201361","journal-title":"IEEE Commun Magaz"},{"issue":"1","key":"4479_CR8","doi-asserted-by":"publisher","first-page":"2124","DOI":"10.1007\/s11227-020-03342-7","volume":"7","author":"JA Belloch","year":"2021","unstructured":"Belloch JA, Leon G, Badia JM, Lindoso A, San Millan E (2021) Evaluating the computational performance of the xilinx ultrascale+ eg heterogeneous mpsoc. J Supercomput 7(1):2124\u20132137","journal-title":"J Supercomput"},{"issue":"9","key":"4479_CR9","doi-asserted-by":"publisher","first-page":"5926","DOI":"10.1109\/TWC.2021.3071315","volume":"20","author":"S Roger","year":"2021","unstructured":"Roger S, Cobos M, Botella-Mascarell C, Fodor G (2021) Fast channel estimation in the transformed spatial domain for analog millimeter wave systems. IEEE Trans Wireless Commun 20(9):5926\u20135941","journal-title":"IEEE Trans Wireless Commun"},{"issue":"5","key":"4479_CR10","doi-asserted-by":"publisher","first-page":"831","DOI":"10.1109\/JSTSP.2014.2334278","volume":"8","author":"A Alkhateeb","year":"2014","unstructured":"Alkhateeb A, El Ayach O, Leus G, Heath RW (2014) Channel estimation and hybrid precoding for millimeter wave cellular systems. IEEE J Selected Topics Signal Process 8(5):831\u2013846","journal-title":"IEEE J Selected Topics Signal Process"},{"key":"4479_CR11","doi-asserted-by":"crossref","unstructured":"Zhang C, Guo D, Fan P (2016) \u201cTracking angles of departure and arrival in a mobile millimeter wave channel,\u201d in 2016 IEEE International Conference on Communications (ICC), May pp. 1\u20136","DOI":"10.1109\/ICC.2016.7510902"},{"issue":"6","key":"4479_CR12","doi-asserted-by":"publisher","first-page":"1164","DOI":"10.1109\/JSAC.2014.2328154","volume":"32","author":"MR Akdeniz","year":"2014","unstructured":"Akdeniz MR, Liu Y, Samimi MK, Sun S, Rangan S, Rappaport TS, Erkip E (2014) Millimeter wave channel modeling and cellular capacity evaluation. IEEE J Selected Areas Commun 32(6):1164\u20131179","journal-title":"IEEE J Selected Areas Commun"},{"key":"4479_CR13","unstructured":"Xilinx Inc, \u201cZynq UtraScale+ MPSoC Data Sheet: Overview,\u201d DS891 (v1.7),(2018)"},{"key":"4479_CR14","unstructured":"ARM, \u201cCortex-R5. Technical Reference Manual. Revision r1p2,\u201d(2011)"},{"key":"4479_CR15","unstructured":"ARM \u201cARM Cortex-A53 MPCore Processor. Technical Reference Manual. Revision r0p4,\u201d (2016)"},{"key":"4479_CR16","unstructured":"Olson T (2010)\u201cMali-400 MP: a scalable GPU for mobile devices,\u201d in Hot3D Session. Proc. International Conference on High Performance Graphics"},{"key":"4479_CR17","unstructured":"Xilinx Inc., \u201cEmbedded System Tools Reference Manual,\u201d UG1043 (v2019.1), (2019)"},{"key":"4479_CR18","unstructured":"Xilinx Inc, \u201cPetalinux Tools Documentation,\u201d UG1144 (v2018.3), (2018)"},{"key":"4479_CR19","unstructured":"Trompouki MM, Kosmidis L (2016) \u201cTowards general purpose computations on low-end mobile GPUs,\u201d in 2016 Design, Automation & Test in Europe Conference & Exhibition, DATE 2016, Dresden, Germany, March 14-18, 2016, pp. 539\u2013542"},{"key":"4479_CR20","doi-asserted-by":"crossref","unstructured":"Trompouki MM, Kosmidis L (2017)\u201cOptimisation opportunities and evaluation for GPGPU applications on low-end mobile GPUs,\u201d in Design, Automation & Test in Europe Conference & Exhibition, DATE 2017, Lausanne, Switzerland, March 27-31, 2017, pp. 950\u2013953","DOI":"10.23919\/DATE.2017.7927128"},{"key":"4479_CR21","doi-asserted-by":"crossref","unstructured":"Trompouki M.M, Kosmidis L (2018) \u201cBrook auto: high-level certification-friendly programming for GPU-powered automotive systems,\u201d in Proceedings of the 55th Annual Design Automation Conference, DAC 2018, San Francisco, CA, USA, June 24-29, 2018pp. 100:1\u2013100:6","DOI":"10.1109\/DAC.2018.8465869"},{"key":"4479_CR22","unstructured":"Avnet Inc, \u201cUltra96-V2 Hardware user guide. Version 1.0,\u201d (2019)"},{"key":"4479_CR23","unstructured":"Xilinx Inc, \u201cSDSoC Environment User Guide,\u201d UG1027 (v2017.4), (2018)"},{"key":"4479_CR24","unstructured":"Tomov S , Dongarra J, Baboulin M (2008) \u201cTowards dense linear algebra for hybrid gpu accelerated manycore systems.\u201d LAPACK Working Note, Tech. Rep. 210, Oct. [Online]. Available: http:\/\/www.netlib.org\/lapack\/lawnspdf\/lawn210.pdf"},{"key":"4479_CR25","doi-asserted-by":"publisher","first-page":"2","DOI":"10.1145\/1057935.1057936","volume":"65","author":"J Dongarra","year":"1985","unstructured":"Dongarra J, Croz JD, Hammarling S, Hanson RJ (1985) A proposal for an extended set of Fortran basic linear algebra subprograms. ACM Signum Newslett 65:2\u201318","journal-title":"ACM Signum Newslett"},{"key":"4479_CR26","doi-asserted-by":"crossref","unstructured":"Frigo M, Johnson SG (2005) \u201cThe design and implementation of FFTW3,\u201d Proceedings of the IEEE, vol.\u00a093, no.\u00a02, pp. 216\u2013231 special issue on \u201cProgram Generation, Optimization, and Platform Adaptation\u201d","DOI":"10.1109\/JPROC.2004.840301"}],"container-title":["The Journal of Supercomputing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-022-04479-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1007\/s11227-022-04479-3\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1007\/s11227-022-04479-3.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,7,18]],"date-time":"2022-07-18T16:46:38Z","timestamp":1658162798000},"score":1,"resource":{"primary":{"URL":"https:\/\/link.springer.com\/10.1007\/s11227-022-04479-3"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,8]]},"references-count":26,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2022,8]]}},"alternative-id":["4479"],"URL":"https:\/\/doi.org\/10.1007\/s11227-022-04479-3","relation":{},"ISSN":["0920-8542","1573-0484"],"issn-type":[{"type":"print","value":"0920-8542"},{"type":"electronic","value":"1573-0484"}],"subject":[],"published":{"date-parts":[[2022,4,8]]},"assertion":[{"value":"22 March 2022","order":1,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"8 April 2022","order":2,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}}]}}