{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:17:46Z","timestamp":1750306666946,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":34,"publisher":"ACM","license":[{"start":{"date-parts":[[2015,2,7]],"date-time":"2015-02-07T00:00:00Z","timestamp":1423267200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2015,2,7]]},"DOI":"10.1145\/2716282.2716285","type":"proceedings-article","created":{"date-parts":[[2015,2,3]],"date-time":"2015-02-03T13:43:17Z","timestamp":1422970997000},"page":"90-98","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1,"title":["High performance computing of fiber scattering simulation"],"prefix":"10.1145","author":[{"given":"Leiming","family":"Yu","sequence":"first","affiliation":[{"name":"Northeastern University, USA"}]},{"given":"Yan","family":"Zhang","sequence":"additional","affiliation":[{"name":"Northeastern University, USA"}]},{"given":"Xiang","family":"Gong","sequence":"additional","affiliation":[{"name":"Northeastern University, USA"}]},{"given":"Nilay","family":"Roy","sequence":"additional","affiliation":[{"name":"Northeastern University, USA"}]},{"given":"Lee","family":"Makowski","sequence":"additional","affiliation":[{"name":"Northeastern University, USA"}]},{"given":"David","family":"Kaeli","sequence":"additional","affiliation":[{"name":"Northeastern University, USA"}]}],"member":"320","published-online":{"date-parts":[[2015,2,7]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"CUDA C Programming Guide. NVIDIA Corporation Feb 2014.  CUDA C Programming Guide. NVIDIA Corporation Feb 2014."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1145\/2493123.2462915"},{"key":"e_1_3_2_1_3_1","volume-title":"Springer","author":"Arnold Axel","year":"2013","unstructured":"Axel Arnold , Olaf Lenz , Stefan Kesselheim , Rudolf Weeber , Florian Fahrenberger , Dominic Roehm , Peter Ko\u0161ovan , and Christian Holm . Espresso 3.1 : Molecular dynamics software for coarse-grained models. In Meshfree methods for partial differential equations VI, pages 1\u201323 . Springer , 2013 . Axel Arnold, Olaf Lenz, Stefan Kesselheim, Rudolf Weeber, Florian Fahrenberger, Dominic Roehm, Peter Ko\u0161ovan, and Christian Holm. Espresso 3.1: Molecular dynamics software for coarse-grained models. In Meshfree methods for partial differential equations VI, pages 1\u201323. Springer, 2013."},{"key":"e_1_3_2_1_4_1","volume-title":"Procs. 12th Inter. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2011","author":"Athanasopoulos Andreas","year":"2011","unstructured":"Andreas Athanasopoulos , Anastasios Dimou , Vasileios Mezaris , and Ioannis Kompatsiaris . Gpu acceleration for support vector machines . In Procs. 12th Inter. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2011 ), Delft, Netherlands , 2011 . Andreas Athanasopoulos, Anastasios Dimou, Vasileios Mezaris, and Ioannis Kompatsiaris. Gpu acceleration for support vector machines. In Procs. 12th Inter. Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS 2011), Delft, Netherlands, 2011."},{"key":"e_1_3_2_1_5_1","volume-title":"International Society for Optics and Photonics","author":"DiMarco Jeffrey","year":"2013","unstructured":"Jeffrey DiMarco and Michela Taufer . Performance impact of dynamic parallelism on different clustering algorithms. In SPIE Defense, Security, and Sensing, pages 87520E\u201387520E . International Society for Optics and Photonics , 2013 . Jeffrey DiMarco and Michela Taufer. Performance impact of dynamic parallelism on different clustering algorithms. In SPIE Defense, Security, and Sensing, pages 87520E\u201387520E. International Society for Optics and Photonics, 2013."},{"key":"e_1_3_2_1_6_1","volume-title":"Temporal data mining for neuroscience. GPU Computing Gems Emerald Edition, page 211","author":"Cao Yong","year":"2011","unstructured":"Wu-chun Feng, Yong Cao , Debprakash Patnaik , and Naren Ramakrishnan . Temporal data mining for neuroscience. GPU Computing Gems Emerald Edition, page 211 , 2011 . Wu-chun Feng, Yong Cao, Debprakash Patnaik, and Naren Ramakrishnan. Temporal data mining for neuroscience. GPU Computing Gems Emerald Edition, page 211, 2011."},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1107\/S0021889877013879"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1107\/S0021889879012139"},{"key":"e_1_3_2_1_9_1","volume-title":"National renewable energy laboratory","author":"Richard V Greene and Biosciences Center Director.","year":"2013","unstructured":"Richard V Greene and Biosciences Center Director. National renewable energy laboratory . 2013 . Richard V Greene and Biosciences Center Director. National renewable energy laboratory. 2013."},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0006-3495(93)81393-6"},{"key":"e_1_3_2_1_12_1","volume-title":"Multiscale deconstruction of molecular architecture in corn stover. Scientific reports, 4","author":"Inouye Hideyo","year":"2014","unstructured":"Hideyo Inouye , Yan Zhang , Lin Yang , Nagarajan Venugopalan , Robert F Fischetti , S Charlotte Gleber , Stefan Vogt , W Fowle , Bryan Makowski , Melvin Tucker , Multiscale deconstruction of molecular architecture in corn stover. Scientific reports, 4 , 2014 . Hideyo Inouye, Yan Zhang, Lin Yang, Nagarajan Venugopalan, Robert F Fischetti, S Charlotte Gleber, Stefan Vogt, W Fowle, Bryan Makowski, Melvin Tucker, et al. Multiscale deconstruction of molecular architecture in corn stover. Scientific reports, 4, 2014."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/1735688.1735696"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.jcp.2010.06.024"},{"key":"e_1_3_2_1_15_1","volume-title":"Clint Chapple, and Lee Makowski. Tissue specific specialization of the nanoscale architecture of arabidopsis. Journal of structural biology, 184(2):103\u2013114","author":"Liu Jiliang","year":"2013","unstructured":"Jiliang Liu , Hideyo Inouye , Nagarajan Venugopalan , Robert F Fischetti , S Charlotte Gleber , Stefan Vogt , Joanne C Cusumano , Jeong Im Kim , Clint Chapple, and Lee Makowski. Tissue specific specialization of the nanoscale architecture of arabidopsis. Journal of structural biology, 184(2):103\u2013114 , 2013 . Jiliang Liu, Hideyo Inouye, Nagarajan Venugopalan, Robert F Fischetti, S Charlotte Gleber, Stefan Vogt, Joanne C Cusumano, Jeong Im Kim, Clint Chapple, and Lee Makowski. Tissue specific specialization of the nanoscale architecture of arabidopsis. Journal of structural biology, 184(2):103\u2013114, 2013."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPDS.2007.1069"},{"key":"e_1_3_2_1_17_1","unstructured":"Massachusetts Green High Performance Computing Center. http:\/\/www.northeastern.edu\/rc.  Massachusetts Green High Performance Computing Center. http:\/\/www.northeastern.edu\/rc."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1021\/ie00004a026"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1021\/ja0257319"},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1021\/ja037055w"},{"key":"e_1_3_2_1_21_1","volume-title":"Kepler TM GK110","author":"NVIDIA.","year":"2012","unstructured":"NVIDIA. Nvidia\u2019s Next Generation CUDA TM Compute Architecture , Kepler TM GK110 , 2012 . NVIDIA. Nvidia\u2019s Next Generation CUDA TM Compute Architecture, Kepler TM GK110, 2012."},{"key":"e_1_3_2_1_22_1","unstructured":"NVIDIA. Profiler Compute Visual August 2014.  NVIDIA. Profiler Compute Visual August 2014."},{"key":"e_1_3_2_1_23_1","volume-title":"Occupancy calculator","author":"Vidia CUDA","year":"2009","unstructured":"CUDA N Vidia . Occupancy calculator , 2009 . CUDA NVidia. Occupancy calculator, 2009."},{"key":"e_1_3_2_1_24_1","volume-title":"GTC2013","author":"Panda DK","year":"2013","unstructured":"DK Panda . Mvapich2 : A high performance mpi library for nvidia gpu clusters with infiniband . GTC2013 ,( March 20, 2013 ), 2013. DK Panda. Mvapich2: A high performance mpi library for nvidia gpu clusters with infiniband. GTC2013,(March 20, 2013), 2013."},{"key":"e_1_3_2_1_25_1","first-page":"509","volume-title":"Computer Vision and Graphics","author":"Paravecino Fanny Nina","unstructured":"Fanny Nina Paravecino and David Kaeli . Accelerated connected component labeling using cuda framework . In Computer Vision and Graphics , pages 502\u2013 509 . Springer, 2014. Fanny Nina Paravecino and David Kaeli. Accelerated connected component labeling using cuda framework. In Computer Vision and Graphics, pages 502\u2013509. Springer, 2014."},{"key":"e_1_3_2_1_26_1","volume-title":"IBM Redbooks","author":"Quintero Dino","year":"2014","unstructured":"Dino Quintero , Luis Carlos Cruz , Ricardo Machado Picone , Dusan Smolej , Daniel de Souza Casali , Gheorghe Tudor , Joanna Wong , IBM Platform Computing Solutions Reference Architectures and Best Practices . IBM Redbooks , 2014 . Dino Quintero, Luis Carlos Cruz, Ricardo Machado Picone, Dusan Smolej, Daniel de Souza Casali, Gheorghe Tudor, Joanna Wong, et al. IBM Platform Computing Solutions Reference Architectures and Best Practices. IBM Redbooks, 2014."},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1155\/2011\/403892"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/InPar.2012.6339605"},{"key":"e_1_3_2_1_29_1","unstructured":"Top500. List of top 500 supercomputers. http:\/\/www.top500.org\/lists\/2014\/06\/ 2014.  Top500. List of top 500 supercomputers. http:\/\/www.top500.org\/lists\/2014\/06\/ 2014."},{"key":"e_1_3_2_1_30_1","first-page":"37","volume-title":"Proceedings of the Conference on High Performance Graphics","author":"Tzeng Stanley","unstructured":"Stanley Tzeng , Anjul Patney , and John D Owens . Task management for irregular-parallel workloads on the gpu . In Proceedings of the Conference on High Performance Graphics , pages 29\u2013 37 . Eurographics Association, 2010. Stanley Tzeng, Anjul Patney, and John D Owens. Task management for irregular-parallel workloads on the gpu. In Proceedings of the Conference on High Performance Graphics, pages 29\u201337. Eurographics Association, 2010."},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342014526907"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CLUSTER.2011.42"},{"key":"e_1_3_2_1_33_1","volume-title":"Floating point and ieee 754 compliance for nvidia gpus. rn (A+ B), 21:1\u2013\u20131874919424","author":"Whitehead Nathan","year":"2011","unstructured":"Nathan Whitehead and Alex Fit-Florea . Precision & performance : Floating point and ieee 754 compliance for nvidia gpus. rn (A+ B), 21:1\u2013\u20131874919424 , 2011 . Nathan Whitehead and Alex Fit-Florea. Precision & performance: Floating point and ieee 754 compliance for nvidia gpus. rn (A+ B), 21:1\u2013\u20131874919424, 2011."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/NEBEC.2014.6972999"},{"key":"e_1_3_2_1_35_1","volume-title":"G-blastn: accelerating nucleotide alignment by graphics processors. Bioinformatics, page btu047","author":"Zhao Kaiyong","year":"2014","unstructured":"Kaiyong Zhao and Xiaowen Chu . G-blastn: accelerating nucleotide alignment by graphics processors. Bioinformatics, page btu047 , 2014 . Kaiyong Zhao and Xiaowen Chu. G-blastn: accelerating nucleotide alignment by graphics processors. Bioinformatics, page btu047, 2014."}],"event":{"name":"GPGPU-8: General-purpose Processing with Graphics Processing Units 8","sponsor":["SIGPLAN ACM Special Interest Group on Programming Languages"],"location":"San Francisco CA USA","acronym":"GPGPU-8"},"container-title":["Proceedings of the 8th Workshop on General Purpose Processing using GPUs"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2716282.2716285","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2716282.2716285","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:00:42Z","timestamp":1750230042000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2716282.2716285"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2015,2,7]]},"references-count":34,"alternative-id":["10.1145\/2716282.2716285","10.1145\/2716282"],"URL":"https:\/\/doi.org\/10.1145\/2716282.2716285","relation":{},"subject":[],"published":{"date-parts":[[2015,2,7]]},"assertion":[{"value":"2015-02-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}