{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,3,3]],"date-time":"2025-03-03T05:46:12Z","timestamp":1740980772381,"version":"3.38.0"},"reference-count":52,"publisher":"SAGE Publications","issue":"1","license":[{"start":{"date-parts":[[2016,7,27]],"date-time":"2016-07-27T00:00:00Z","timestamp":1469577600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/journals.sagepub.com\/page\/policies\/text-and-data-mining-license"}],"content-domain":{"domain":["journals.sagepub.com"],"crossmark-restriction":true},"short-container-title":["The International Journal of High Performance Computing Applications"],"published-print":{"date-parts":[[2017,1]]},"abstract":"<jats:p> We carry out a comparative performance study of multi-core CPUs, GPUs and Intel Xeon Phi (Many Integrated Core (MIC)) with a microscopy image analysis application. We experimentally evaluate the performance of computing devices on core operations of the application. We correlate the observed performance with the characteristics of computing devices and data access patterns, computation complexities, and parallelization forms of the operations. The results show a significant variability in the performance of operations with respect to the device used. The performances of operations with regular data access are comparable or sometimes better on a MIC than that on a GPU. GPUs are more efficient than MICs for operations that access data irregularly, because of the lower bandwidth of the MIC for random data accesses. We propose new performance-aware scheduling strategies that consider variabilities in operation speedups. Our scheduling strategies significantly improve application performance compared with classic strategies in hybrid configurations. <\/jats:p>","DOI":"10.1177\/1094342015594519","type":"journal-article","created":{"date-parts":[[2015,7,29]],"date-time":"2015-07-29T00:29:56Z","timestamp":1438129796000},"page":"32-51","update-policy":"https:\/\/doi.org\/10.1177\/sage-journals-update-policy","source":"Crossref","is-referenced-by-count":8,"title":["Application performance analysis and efficient execution on systems with multi-core CPUs, GPUs and MICs: a case study with microscopy image analysis"],"prefix":"10.1177","volume":"31","author":[{"given":"George","family":"Teodoro","sequence":"first","affiliation":[{"name":"Department of Computer Science, University of Bras\u00edlia, Bras\u00edlia, DF, Brazil"}]},{"given":"Tahsin","family":"Kurc","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA"},{"name":"Scientific Data Group, Oak Ridge National Laboratory, Oak Ridge, TN, USA"}]},{"given":"Guilherme","family":"Andrade","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Federal University of Minas Gerais, Belo Horizonte, MG, Brazil"}]},{"given":"Jun","family":"Kong","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, Emory University, Atlanta, GA, USA"}]},{"given":"Renato","family":"Ferreira","sequence":"additional","affiliation":[{"name":"Department of Computer Science, Federal University of Minas Gerais, Belo Horizonte, MG, Brazil"}]},{"given":"Joel","family":"Saltz","sequence":"additional","affiliation":[{"name":"Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY, USA"},{"name":"Scientific Data Group, Oak Ridge National Laboratory, Oak Ridge, TN, USA"}]}],"member":"179","published-online":{"date-parts":[[2016,7,27]]},"reference":[{"key":"bibr1-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33518-1_40"},{"key":"bibr2-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-03869-3_80"},{"key":"bibr3-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTER.2011.51"},{"key":"bibr4-1094342015594519","unstructured":"Bradski G (2000) The OpenCV Library. Dr. Dobb\u2019s Journal of Software Tools. Available at: http:\/\/www.drdobbs.com\/open-source\/the-opencv-library\/184404319"},{"key":"bibr5-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2012.58"},{"key":"bibr6-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1002\/sam.10129"},{"key":"bibr7-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/TBME.2010.2060338"},{"key":"bibr8-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1136\/amiajnl-2011-000700"},{"journal-title":"Many-core applications research community symposium","year":"2012","author":"Cramer T","key":"bibr9-1094342015594519"},{"key":"bibr10-1094342015594519","first-page":"137","author":"Dean J","year":"2004","journal-title":"The sixth symposium on operating system design and implementation"},{"key":"bibr11-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/1383422.1383447"},{"volume-title":"TACC-Intel highly parallel computing symposium","year":"2012","author":"Eisenlohr J","key":"bibr12-1094342015594519"},{"key":"bibr13-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/2464996.2465445"},{"key":"bibr14-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/1375527.1375533"},{"key":"bibr15-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/HIPC.2010.5713173"},{"key":"bibr16-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/1454115.1454152"},{"key":"bibr17-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2013.113"},{"key":"bibr18-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/HiPC.2011.6152715"},{"key":"bibr19-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-38750-0_4"},{"key":"bibr20-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0081049"},{"key":"bibr21-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/BIBM.2013.6732495"},{"key":"bibr22-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2008.38"},{"key":"bibr23-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/1353536.1346318"},{"key":"bibr24-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/1669112.1669121"},{"key":"bibr25-1094342015594519","first-page":"19","volume-title":"IEEE Computer Society Technical Committee on Computer Architecture Newsletter","author":"McCalpin JD","year":"1995"},{"key":"bibr26-1094342015594519","unstructured":"NVIDIA (2012) GPU Accelerated Applications. Available at: http:\/\/www.nvidia.co.uk\/content\/gpu-applications\/PDF\/GPU-apps-catalog-mar2015.pdf"},{"journal-title":"SIBGRAPI","year":"2010","author":"Oliveira VMA","key":"bibr27-1094342015594519"},{"key":"bibr28-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.899"},{"key":"bibr29-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/1810085.1810106"},{"key":"bibr30-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/2043556.2043579"},{"issue":"4","key":"bibr31-1094342015594519","first-page":"291","volume":"23","author":"Ruifrok AC","year":"2001","journal-title":"Analytical and Quantitative Cytology and Histology"},{"key":"bibr32-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1177\/1094342013488260"},{"journal-title":"CoRR","year":"2013","author":"Saule E","key":"bibr33-1094342015594519"},{"key":"bibr34-1094342015594519","first-page":"923","volume-title":"Proceedings of the SPIE annual medical imaging meetings","author":"Sharp R","year":"2006"},{"key":"bibr35-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1108\/09653560110381796"},{"key":"bibr36-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/321879.321884"},{"key":"bibr37-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/1851476.1851479"},{"key":"bibr38-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1007\/s10586-010-0151-6"},{"key":"bibr39-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2014.111"},{"key":"bibr40-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2012.101"},{"key":"bibr41-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2013.03.001"},{"key":"bibr42-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPS.2013.11"},{"key":"bibr43-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTR.2009.5289193"},{"key":"bibr44-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-013-0329-7"},{"key":"bibr45-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1007\/s12524-008-0004-4"},{"key":"bibr46-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1007\/s10707-010-0113-4"},{"key":"bibr47-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1007\/s10707-010-0113-4"},{"key":"bibr48-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-89689-0_100"},{"key":"bibr49-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/IGARSS.2011.6049338"},{"key":"bibr50-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/83.217222"},{"key":"bibr51-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1109\/36.508406"},{"key":"bibr52-1094342015594519","doi-asserted-by":"publisher","DOI":"10.1145\/1498765.1498785"}],"container-title":["The International Journal of High Performance Computing Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342015594519","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/full-xml\/10.1177\/1094342015594519","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/journals.sagepub.com\/doi\/pdf\/10.1177\/1094342015594519","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,2]],"date-time":"2025-03-02T16:46:03Z","timestamp":1740933963000},"score":1,"resource":{"primary":{"URL":"https:\/\/journals.sagepub.com\/doi\/10.1177\/1094342015594519"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2016,7,27]]},"references-count":52,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2017,1]]}},"alternative-id":["10.1177\/1094342015594519"],"URL":"https:\/\/doi.org\/10.1177\/1094342015594519","relation":{},"ISSN":["1094-3420","1741-2846"],"issn-type":[{"type":"print","value":"1094-3420"},{"type":"electronic","value":"1741-2846"}],"subject":[],"published":{"date-parts":[[2016,7,27]]}}}