{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,16]],"date-time":"2026-01-16T12:55:28Z","timestamp":1768568128427,"version":"3.49.0"},"reference-count":31,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2020,4,17]],"date-time":"2020-04-17T00:00:00Z","timestamp":1587081600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/100010664","name":"H2020 Future and Emerging Technologies","doi-asserted-by":"publisher","award":["754337 and 824064"],"award-info":[{"award-number":["754337 and 824064"]}],"id":[{"id":"10.13039\/100010664","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Computation"],"abstract":"<jats:p>New challenges in Astronomy and Astrophysics (AA) are urging the need for many exceptionally computationally intensive simulations. \u201cExascale\u201d (and beyond) computational facilities are mandatory to address the size of theoretical problems and data coming from the new generation of observational facilities in AA. Currently, the High-Performance Computing (HPC) sector is undergoing a profound phase of innovation, in which the primary challenge to the achievement of the \u201cExascale\u201d is the power consumption. The goal of this work is to give some insights about performance and energy footprint of contemporary architectures for a real astrophysical application in an HPC context. We use a state-of-the-art N-body application that we re-engineered and optimized to exploit the heterogeneous underlying hardware fully. We quantitatively evaluate the impact of computation on energy consumption when running on four different platforms. Two of them represent the current HPC systems (Intel-based and equipped with NVIDIA GPUs), one is a micro-cluster based on ARM-MPSoC, and one is a \u201cprototype towards Exascale\u201d equipped with ARM-MPSoCs tightly coupled with FPGAs. We investigate the behavior of the different devices where the high-end GPUs excel in terms of time-to-solution while MPSoC-FPGA systems outperform GPUs in power consumption. Our experience reveals that considering FPGAs for computationally intensive application seems very promising, as their performance is improving to meet the requirements of scientific applications. This work can be a reference for future platform development for astrophysics applications where computationally intensive calculations are required.<\/jats:p>","DOI":"10.3390\/computation8020034","type":"journal-article","created":{"date-parts":[[2020,4,21]],"date-time":"2020-04-21T03:23:06Z","timestamp":1587439386000},"page":"34","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":12,"title":["Performance and Energy Footprint Assessment of FPGAs and GPUs on HPC Systems Using Astrophysics Application"],"prefix":"10.3390","volume":"8","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9808-2283","authenticated-orcid":false,"given":"David","family":"Goz","sequence":"first","affiliation":[{"name":"INAF-Osservatorio Astronomico di Trieste, 35122 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9776-985X","authenticated-orcid":false,"given":"Georgios","family":"Ieronymakis","sequence":"additional","affiliation":[{"name":"ICS-FORTH, GR-700 13 Heraklion, Crete, Greece"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5443-6470","authenticated-orcid":false,"given":"Vassilis","family":"Papaefstathiou","sequence":"additional","affiliation":[{"name":"ICS-FORTH, GR-700 13 Heraklion, Crete, Greece"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5788-0892","authenticated-orcid":false,"given":"Nikolaos","family":"Dimou","sequence":"additional","affiliation":[{"name":"ICS-FORTH, GR-700 13 Heraklion, Crete, Greece"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2386-623X","authenticated-orcid":false,"given":"Sara","family":"Bertocco","sequence":"additional","affiliation":[{"name":"INAF-Osservatorio Astronomico di Trieste, 35122 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7955-1491","authenticated-orcid":false,"given":"Francesco","family":"Simula","sequence":"additional","affiliation":[{"name":"INFN-Sezione di Roma, 00185 Rome, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8106-2742","authenticated-orcid":false,"given":"Antonio","family":"Ragagnin","sequence":"additional","affiliation":[{"name":"INAF-Osservatorio Astronomico di Trieste, 35122 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-1751-0130","authenticated-orcid":false,"given":"Luca","family":"Tornatore","sequence":"additional","affiliation":[{"name":"INAF-Osservatorio Astronomico di Trieste, 35122 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9374-3249","authenticated-orcid":false,"given":"Igor","family":"Coretti","sequence":"additional","affiliation":[{"name":"INAF-Osservatorio Astronomico di Trieste, 35122 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-4211-6816","authenticated-orcid":false,"given":"Giuliano","family":"Taffoni","sequence":"additional","affiliation":[{"name":"INAF-Osservatorio Astronomico di Trieste, 35122 Padova, Italy"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2020,4,17]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","unstructured":"Dutot, P., Georgiou, Y., Glesser, D., Lefevre, L., Poquet, M., and Rais, I. (2017, January 14\u201317). Towards Energy Budget Control in HPC. Proceedings of the 2017 17th IEEE\/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID), Madrid, Spain.","DOI":"10.1109\/CCGRID.2017.16"},{"key":"ref_2","first-page":"7206595","article-title":"Power-Efficient Computing: Experiences from the COSA Project","volume":"2017","author":"Cesini","year":"2017","journal-title":"Sci. Program."},{"key":"ref_3","unstructured":"Ammendola, R., Biagioni, A., Capuani, F., Cretaro, P., Bonis, G.D., Cicero, F.L., Lonardo, A., Martinelli, M., Paolucci, P.S., and Pastorelli, E. (2018). The Brain on Low Power Architectures\u2014Efficient Simulation of Cortical Slow Waves and Asynchronous States. arXiv."},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Simula, F., Pastorelli, E., Paolucci, P.S., Martinelli, M., Lonardo, A., Biagioni, A., Capone, C., Capuani, F., Cretaro, P., and De Bonis, G. (2019, January 13\u201315). Real-Time Cortical Simulations: Energy and Interconnect Scaling on Distributed Systems. Proceedings of the 2019 27th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Pavia, Italy.","DOI":"10.1109\/EMPDP.2019.8671627"},{"key":"ref_5","doi-asserted-by":"crossref","unstructured":"Hunold, S., Costan, A., Gim\u00e9nez, D., Iosup, A., Ricci, L., G\u00f3mez Requena, M.E., Scarano, V., Varbanescu, A.L., Scott, S.L., and Lankes, S. (2015). Energy-Performance Tradeoffs for HPC Applications on Low Power Processors. Euro-Par 2015: Parallel Processing Workshops, Springer International Publishing.","DOI":"10.1007\/978-3-319-27308-2"},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Nikolskiy, V.P., Stegailov, V.V., and Vecher, V.S. (2016, January 18\u201322). Efficiency of the Tegra K1 and X1 systems-on-chip for classical molecular dynamics. Proceedings of the 2016 International Conference on High Performance Computing Simulation (HPCS), Innsbruck, Austria.","DOI":"10.1109\/HPCSim.2016.7568401"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Morganti, L., Cesini, D., and Ferraro, A. (2016, January 17\u201319). Evaluating Systems on Chip through HPC Bioinformatic and Astrophysic Applications. Proceedings of the 2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP), Crete, Greece.","DOI":"10.1109\/PDP.2016.82"},{"key":"ref_8","unstructured":"Arai, K., Bhatia, R., and Kapoor, S. Low Power High Performance Computing on Arm System-on-Chip in Astrophysics. Proceedings of the Future Technologies Conference (FTC) 2019."},{"key":"ref_9","unstructured":"Taffoni, G., Murante, G., Tornatore, L., Goz, D., Borgani, S., Katevenis, M., Chrysos, N., and Marazakis, M. (2019). Shall numerical astrophysics step into the era of Exascale computing?. arXiv."},{"key":"ref_10","unstructured":"Katevenis, M., Chrysos, N., Marazakis, M., Mavroidis, I., Chaix, F., Kallimanis, N., Navaridas, J., Goodacre, J., Vicini, P., and Biagioni, A. (September, January 31). The ExaNeSt Project: Interconnects, Storage, and Packaging for Exascale Systems. Proceedings of the 19th Euromicro Conference on Digital System Design, DSD, Limassol, Cyprus."},{"key":"ref_11","doi-asserted-by":"crossref","unstructured":"Ammendola, R., Biagioni, A., Cretaro, P., Frezza, O., Cicero, F.L., Lonardo, A., Martinelli, M., Paolucci, P.S., Pastorelli, E., and Simula, F. (September, January 30). The Next Generation of Exascale-Class Systems: The ExaNeSt Project. Proceedings of the 2017 Euromicro Conference on Digital System Design (DSD), Vienna, Austria.","DOI":"10.1109\/DSD.2017.20"},{"key":"ref_12","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1016\/j.micpro.2018.05.009","article-title":"Next generation of Exascale-class systems: ExaNeSt project and the status of its interconnect and storage development","volume":"61","author":"Katevenis","year":"2018","journal-title":"Microprocess. Microsyst."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Spera, M., and Capuzzo-Dolcetta, R. (2017). Rapid mass segregation in small stellar clusters. Astrophys. Space Sci., 362.","DOI":"10.1007\/s10509-017-3209-6"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"4086","DOI":"10.1093\/mnras\/stv1161","article-title":"The Mass Spectrum of Compact Remnants From the Parsec Stellar Evolution Tracks","volume":"451","author":"Spera","year":"2015","journal-title":"Mon. Not. R. Astron. Soc."},{"key":"ref_15","unstructured":"Bertocco, S., Goz, D., Tornatore, L., Ragagnin, A., Maggio, G., Gasparo, F., Vuerli, C., Taffoni, G., and Molinaro, M. (2019). INAF Trieste Astronomical Observatory Information Technology Framework. arXiv."},{"key":"ref_16","unstructured":"Taffoni, G., Becciani, U., Garilli, B., Maggio, G., Pasian, F., Umana, G., Smareglia, R., and Vitello, F. (2020). CHIPP: INAF pilot project for HTC, HPC and HPDA. arXiv."},{"key":"ref_17","unstructured":"Bertocco, S., Goz, D., Tornatore, L., and Taffoni, G. (2020, April 17). Available online: https:\/\/www.ict.inaf.it\/index.php\/31-doi\/96-2018-4."},{"key":"ref_18","doi-asserted-by":"crossref","unstructured":"Frachtenberg, E., and Schwiegelshohn, U. (2009). Effects of Topology-Aware Allocation Policies on Scheduling Performance. Job Scheduling Strategies for Parallel Processing, Springer.","DOI":"10.1007\/978-3-642-16505-4"},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Chaix, F., Ioannou, A., Kossifidis, N., Dimou, N., Ieronymakis, G., Marazakis, M., Papaefstathiou, V., Flouris, V., Ligerakis, M., and Ailamakis, G. (2019, January 17). Implementation and Impact of an Ultra-Compact Multi-FPGA Board for Large System Prototyping. Proceedings of the 2019 IEEE\/ACM International Workshop on Heterogeneous High-performance Reconfigurable Computing (H2RC), Denver, CO, USA.","DOI":"10.1109\/H2RC49586.2019.00010"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"40","DOI":"10.1109\/MC.2005.380","article-title":"High-performance, power-aware distributed computing for scientific applications","volume":"38","author":"Cameron","year":"2005","journal-title":"Computer"},{"key":"ref_21","unstructured":"Arai, K., Bhatia, R., and Kapoor, S. (2019). Direct N-body Code on Low-Power Embedded ARM GPUs. Intelligent Computing, Springer International Publishing."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Goz, D., Ieronymakis, G., Papaefstathiou, V., Dimou, N., Bertocco, S., Ragagnin, A., Tornatore, L., Taffoni, G., and Coretti, I. (2019). Direct N-body application on low-power and energy-efficient parallel architectures. arXiv.","DOI":"10.3233\/APC200088"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"580","DOI":"10.1016\/j.jcp.2012.11.013","article-title":"A Fully Parallel, High Precision, N-Body Code Running on Hybrid Computing Platforms","volume":"236","author":"Spera","year":"2013","journal-title":"J. Comput. Phys."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"2528","DOI":"10.1016\/j.cpc.2013.07.005","article-title":"A performance comparison of different graphics processing units running direct N-body simulations","volume":"184","author":"Spera","year":"2013","journal-title":"Comput. Phys. Commun."},{"key":"ref_25","unstructured":"Spera, M. (2014). Using Graphics Processing Units to solve the classical N-body problem in physics and astrophysics. arXiv."},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"498","DOI":"10.1016\/j.newast.2008.01.010","article-title":"Sixth- and eighth-order Hermite integrator for N-body simulations","volume":"13","author":"Nitadori","year":"2008","journal-title":"New Astron."},{"key":"ref_27","doi-asserted-by":"crossref","unstructured":"Thall, A. (2006). Extended-Precision Floating-Point Numbers for GPU Computation, Association for Computing Machinery.","DOI":"10.1145\/1179622.1179682"},{"key":"ref_28","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1109\/MCSE.2007.53","article-title":"IPython: A System for Interactive Scientific Computing","volume":"9","author":"Granger","year":"2007","journal-title":"Comput. Sci. Eng."},{"key":"ref_29","doi-asserted-by":"crossref","unstructured":"Virtanen, P., Gommers, R., Oliphant, T.E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., and Bright, J. (2020). Scipy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods.","DOI":"10.1038\/s41592-020-0772-5"},{"key":"ref_30","doi-asserted-by":"crossref","first-page":"22","DOI":"10.1109\/MCSE.2011.37","article-title":"The NumPy Array: A Structure for Efficient Numerical Computation","volume":"13","author":"Colbert","year":"2011","journal-title":"Comput. Sci. Eng."},{"key":"ref_31","doi-asserted-by":"crossref","first-page":"90","DOI":"10.1109\/MCSE.2007.55","article-title":"Matplotlib: A 2D Graphics Environment","volume":"9","author":"Hunter","year":"2007","journal-title":"Comput. Sci. Eng."}],"container-title":["Computation"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2079-3197\/8\/2\/34\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,13]],"date-time":"2025-10-13T13:45:01Z","timestamp":1760363101000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2079-3197\/8\/2\/34"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,17]]},"references-count":31,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2020,6]]}},"alternative-id":["computation8020034"],"URL":"https:\/\/doi.org\/10.3390\/computation8020034","relation":{},"ISSN":["2079-3197"],"issn-type":[{"value":"2079-3197","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,4,17]]}}}