{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,2]],"date-time":"2025-03-02T05:45:27Z","timestamp":1740894327829,"version":"3.38.0"},"reference-count":37,"publisher":"SAGE Publications","issue":"6","license":[{"start":{"date-parts":[[2017,4,5]],"date-time":"2017-04-05T00:00:00Z","timestamp":1491350400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2018,11]]},"abstract":"<jats:p> Oil and gas companies rely on high performance computing to process seismic imaging algorithms such as reverse time migration. Graphics processing units are used to accelerate reverse time migration, but these deployments suffer from limitations such as the lack of high graphics processing unit memory capacity, frequent CPU-GPU communications that may be bottlenecked by the PCI bus transfer rate, and high power consumptions. Recently, AMD has launched the Accelerated Processing Unit (APU): a processor that merges a CPU and a graphics processing unit on the same die featuring a unified CPU-GPU memory. In this paper, we explore how efficiently may the APU be applicable to reverse time migration. Using OpenCL (along with MPI and OpenMP), a CPU\/APU\/GPU comparative study is conducted on a single node for the 3D acoustic reverse time migration, and then extended on up to 16 nodes. We show the relevance of overlapping the I\/O and MPI communications with the computations for the APU and graphics processing unit clusters, that performance results of APUs range between those of CPUs and those of graphics processing units, and that the APU power efficiency is greater than or equal to the graphics processing unit one. <\/jats:p>","DOI":"10.1177\/1094342017696562","type":"journal-article","created":{"date-parts":[[2017,4,5]],"date-time":"2017-04-05T10:07:27Z","timestamp":1491386847000},"page":"819-837","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":7,"title":["Leveraging the accelerated processing units for seismic imaging: A performance and power efficiency comparison against CPUs and GPUs"],"prefix":"10.1177","volume":"32","author":[{"given":"Issam","family":"Said","sequence":"first","affiliation":[{"name":"Sorbonne Universit\u00e9s, Paris, France"}]},{"given":"Pierre","family":"Fortin","sequence":"additional","affiliation":[{"name":"Sorbonne Universit\u00e9s, Paris, France"}]},{"given":"Jean\u2013Luc","family":"Lamotte","sequence":"additional","affiliation":[{"name":"Sorbonne Universit\u00e9s, Paris, France"}]},{"given":"Henri","family":"Calandra","sequence":"additional","affiliation":[{"name":"Total E&P, Houston, USA"}]}],"member":"179","published-online":{"date-parts":[[2017,4,5]]},"reference":[{"volume-title":"Acc\u00e9l\u00e9ration mat\u00e9rielle pour l\u2019imagerie sismique: Mod\u00e9lisation, migration et interpr\u00e9tation","year":"2013","author":"Abdelkhalek R","key":"bibr1-1094342017696562"},{"key":"bibr2-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.1875"},{"key":"bibr3-1094342017696562","unstructured":"AMD (2015a) AMD APP SDK OpenCL optimization guide, rev 1.0. Available at: http:\/\/amd-dev.wpengine.netdna-cdn.com\/wordpress\/media\/2013\/12\/AMD_OpenCL_Programming_Optimization_Guide2.pdf"},{"key":"bibr4-1094342017696562","unstructured":"AMD (2015b) AMD APP SDK OpenCL user guide, rev 01. Available at: http:\/\/amd-dev.wpengine.netdna-cdn.com\/wordpress\/media\/2013\/12\/AMD_OpenCL_Programming_User_Guide2.pdf"},{"volume-title":"SEG\/EAGE 3-D Modeling Series","year":"1997","author":"Aminzadeh F","key":"bibr5-1094342017696562"},{"key":"bibr6-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1006\/jcph.1994.1159"},{"key":"bibr7-1094342017696562","doi-asserted-by":"crossref","unstructured":"Chunlei C, Stoffa P, Roustam S (2009) 3D Seismic Modeling and Reverse-Time Migration with the Parallel Fourier Method using Non-Blocking Collective Communications. Tulsa, OK, USA: Society of Exploration Geophysicists.","DOI":"10.1190\/1.3255403"},{"key":"bibr8-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1190\/1.1440185"},{"key":"bibr9-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1190\/1.3284053"},{"key":"bibr10-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1147\/rd.112.0215"},{"key":"bibr11-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2016.37"},{"key":"bibr12-1094342017696562","first-page":"43","volume-title":"International workshop on high-performance stencil computations (HiStencils)","author":"Eberhart P","year":"2014"},{"volume-title":"Preparing depth imaging applications for exascale challenges and impacts","year":"2014","author":"Farjallah A","key":"bibr13-1094342017696562"},{"key":"bibr14-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1190\/1.3255428"},{"volume-title":"Cray users group conference","year":"2011","author":"Hager G","key":"bibr15-1094342017696562"},{"key":"bibr16-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1201\/EBK1439811924"},{"key":"bibr17-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1002\/cjg2.1530"},{"key":"bibr18-1094342017696562","unstructured":"Intel (2017) Intel MPI library for Linux OS - Developer reference, update 1. Available at: https:\/\/software.intel.com\/en-us\/mpi-developer-reference-linux"},{"key":"bibr19-1094342017696562","unstructured":"Jiang Z, Bonham K, Bancroft J, (2009) Overcoming Computational Cost Problems of Reverse-Time Migration. CREWES Research Report, vol. 21."},{"key":"bibr20-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1111\/j.1365-2478.2011.01032.x"},{"key":"bibr21-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1145\/2442516.2442550"},{"issue":"1","key":"bibr22-1094342017696562","first-page":"389","volume":"182","author":"Mich\u00e9a D","year":"2010","journal-title":"Geophysical Journal International"},{"key":"bibr23-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1145\/1513895.1513905"},{"key":"bibr24-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1016\/S0065-2687(06)48008-0"},{"key":"bibr25-1094342017696562","first-page":"379","volume-title":"10th Biennial International Conference and Exposition on Petroleum Geophysics","author":"Narang A","year":"2013"},{"key":"bibr26-1094342017696562","unstructured":"NVIDIA (2016) NVIDIA Tesla P100 - The most advanced data center accelerator ever built. Featuring Pascal P100, the world\u2019s fastest GPU. Whitepaper. Available at: https:\/\/images.nvidia.com\/content\/pdf\/tesla\/whitepaper\/pascal-architecture-whitepaper.pdf"},{"key":"bibr27-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-16405-7_24"},{"key":"bibr28-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1109\/SBAC-PAD.2009.29"},{"key":"bibr29-1094342017696562","unstructured":"Perrone M, Lu L, Liu L, (2011) Fast Scalable Reverse Time Migration Seismic Imaging on Blue Gene\/P Technical Report SuperComputing\u201911 from IBM."},{"volume-title":"Proceedings of the Cray User Group Conference","year":"2012","author":"Pritchard H","key":"bibr30-1094342017696562"},{"volume-title":"Proceedings of the Cray User Group Conference","year":"2012","author":"Pritchard H","key":"bibr31-1094342017696562"},{"key":"bibr32-1094342017696562","doi-asserted-by":"publisher","DOI":"10.3997\/2214-4609.201413106"},{"key":"bibr33-1094342017696562","unstructured":"Said I (2015) Contributions of hybrid architectures to depth imaging: A CPU, APU and GPU comparative study. PhD Thesis, Universit\u00e9 Pierre et Marie Curie, France."},{"volume-title":"Proceedings of the 24th High Performance Computing Symposium HPC \u201916","year":"2016","author":"Said I","key":"bibr34-1094342017696562"},{"volume-title":"Basics of Seismic Imaging","year":"2010","author":"Schuster GT","key":"bibr35-1094342017696562"},{"key":"bibr36-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1190\/1.2742686"},{"key":"bibr37-1094342017696562","doi-asserted-by":"publisher","DOI":"10.1145\/2807591.2807602"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342017696562","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1094342017696562","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342017696562","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,1]],"date-time":"2025-03-01T14:07:37Z","timestamp":1740838057000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1094342017696562"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2017,4,5]]},"references-count":37,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2018,11]]}},"alternative-id":["10.1177\/1094342017696562"],"URL":"https:\/\/doi.org\/10.1177\/1094342017696562","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"type":"print","value":"1094-3420"},{"type":"electronic","value":"1741-2846"}],"subject":[],"published":{"date-parts":[[2017,4,5]]}}}