{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:34:04Z","timestamp":1750221244916,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":28,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,7,15]],"date-time":"2018-07-15T00:00:00Z","timestamp":1531612800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Korea government (MSIP)","award":["2015R1A2A1A10056051"],"award-info":[{"award-number":["2015R1A2A1A10056051"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,7,15]]},"DOI":"10.1145\/3229631.3229649","type":"proceedings-article","created":{"date-parts":[[2019,1,14]],"date-time":"2019-01-14T13:15:25Z","timestamp":1547471725000},"page":"10-17","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Massively parallel computation of linear recurrence equations with graphics processing units"],"prefix":"10.1145","author":[{"given":"Wonyong","family":"Sung","sequence":"first","affiliation":[{"name":"Seoul National University, Seoul, Korea"}]},{"given":"Dong-hwan","family":"Lee","sequence":"additional","affiliation":[{"name":"Seoul National University, Seoul, Korea"}]},{"given":"Kyuyeon","family":"Hwang","sequence":"additional","affiliation":[{"name":"Seoul National University, Seoul, Korea"}]}],"member":"320","published-online":{"date-parts":[[2018,7,15]]},"reference":[{"key":"e_1_3_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/SIPS.2009.5336230"},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/12.42122"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPPS.1992.223009"},{"key":"e_1_3_2_1_4_1","unstructured":"J. Bradbury S. Merity C. Xiong and R. Socher. 2016. Quasi-Recurrent Neural Networks. ArXiv e-prints (Nov. 2016). arXiv:1611.01576  J. Bradbury S. Merity C. Xiong and R. Socher. 2016. Quasi-Recurrent Neural Networks. ArXiv e-prints (Nov. 2016). arXiv:1611.01576"},{"key":"e_1_3_2_1_5_1","unstructured":"Ian Buck. 2015. Nvidia's next-gen Pascal GPU architecture to provide 10x speedup for deep learning apps. (2015).  Ian Buck. 2015. Nvidia's next-gen Pascal GPU architecture to provide 10x speedup for deep learning apps. (2015)."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCT.1971.1083368"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1109\/T-C.1975.224291"},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1145\/1375527.1375559"},{"key":"e_1_3_2_1_9_1","volume-title":"Proc. of IEEE International Symposium on Circuits and Systems (ISCAS)","volume":"2","author":"Feng Wuchun","year":"2010","unstructured":"Wuchun Feng and Shucai Xiao . 2010 . To GPU synchronize or not GPU synchronize? . In Proc. of IEEE International Symposium on Circuits and Systems (ISCAS) , Vol. 2 . 3801--3804. Wuchun Feng and Shucai Xiao. 2010. To GPU synchronize or not GPU synchronize?. In Proc. of IEEE International Symposium on Circuits and Systems (ISCAS), Vol. 2. 3801--3804."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/TC.1981.1675755"},{"key":"e_1_3_2_1_11_1","unstructured":"M. Harris S. Sengupta and J.D. Owens. 2007. Parallel prefix sum (scan) with CUDA. GPU Gems.  M. Harris S. Sengupta and J.D. Owens. 2007. Parallel prefix sum (scan) with CUDA. GPU Gems."},{"key":"e_1_3_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/1555815.1555775"},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/322017.322030"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TC.1973.5009159"},{"key":"e_1_3_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2010.5495519"},{"key":"e_1_3_2_1_16_1","unstructured":"T. Lei Y. Zhang and Y. Artzi. 2017. Training RNNs as Fast as CNNs. ArXiv e-prints (Sept. 2017). arXiv:cs.CL\/1709.02755  T. Lei Y. Zhang and Y. Artzi. 2017. Training RNNs as Fast as CNNs. ArXiv e-prints (Sept. 2017). arXiv:cs.CL\/1709.02755"},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1854273.1854344"},{"key":"e_1_3_2_1_19_1","unstructured":"NVIDIA Corporation. 2018. NVIDIA Corporation. NVIDIA CUDA (Compute Unified Device Architecture) Programming Guide. (2018). http:\/\/developer.nvidia.com\/object\/cuda.html  NVIDIA Corporation. 2018. NVIDIA Corporation. NVIDIA CUDA (Compute Unified Device Architecture) Programming Guide. (2018). http:\/\/developer.nvidia.com\/object\/cuda.html"},{"key":"e_1_3_2_1_20_1","unstructured":"NVIDIA Corporation. 2018. NVIDIA CUDA C\/C++ SDK code samples. (2018). http:\/\/developer.nvidia.com\/cuda-cc-sdk-code-samples  NVIDIA Corporation. 2018. NVIDIA CUDA C\/C++ SDK code samples. (2018). http:\/\/developer.nvidia.com\/cuda-cc-sdk-code-samples"},{"key":"e_1_3_2_1_21_1","unstructured":"NVIDIA Corporation. 2018. NVIDIA GeForce GTX 285. (2018). http:\/\/www.nvidia.com\/object\/product_geforce_gtx_285_us.\\html  NVIDIA Corporation. 2018. NVIDIA GeForce GTX 285. (2018). http:\/\/www.nvidia.com\/object\/product_geforce_gtx_285_us.\\html"},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2008.917757"},{"key":"e_1_3_2_1_23_1","volume-title":"Computer Organization and Design","author":"Patterson David","year":"1803","unstructured":"David Patterson and JL Hennessy . 2004. Computer Organization and Design . Morgan Kaufmann Publishers , MA 0 1803 USA. David Patterson and JL Hennessy. 2004. Computer Organization and Design. Morgan Kaufmann Publishers, MA 01803 USA."},{"volume-title":"Proc. of the 22nd ACM SIGGRAPH\/EUROGRAPHICS symposium on Graphics hardware. 97--106","author":"Sengupta S.","key":"e_1_3_2_1_24_1","unstructured":"S. Sengupta , M. Harris , Y. Zhang , and J.D. Owens . 2007. Scan primitives for GPU computing . In Proc. of the 22nd ACM SIGGRAPH\/EUROGRAPHICS symposium on Graphics hardware. 97--106 . S. Sengupta, M. Harris, Y. Zhang, and J.D. Owens. 2007. Scan primitives for GPU computing. In Proc. of the 22nd ACM SIGGRAPH\/EUROGRAPHICS symposium on Graphics hardware. 97--106."},{"key":"e_1_3_2_1_25_1","volume-title":"Proc. of IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP)","volume":"11","author":"Sung Wonyong","unstructured":"Wonyong Sung and S. Mitra . 1986. Efficient multi-processor implementation of recursive digital filters . In Proc. of IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP) , Vol. 11 . 257--260. Wonyong Sung and S. Mitra. 1986. Efficient multi-processor implementation of recursive digital filters. In Proc. of IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), Vol. 11. 257--260."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1109\/PROC.1987.13881","article-title":"Implementation of digital filtering algorithms using pipelined vector processors","volume":"75","author":"Sung Wonyong","year":"1987","unstructured":"Wonyong Sung and S.K. Mitra . 1987 . Implementation of digital filtering algorithms using pipelined vector processors . Proc. IEEE 75 , 9 (Sept. 1987), 1293-1303. Wonyong Sung and S.K. Mitra. 1987. Implementation of digital filtering algorithms using pipelined vector processors. Proc. IEEE 75, 9 (Sept. 1987), 1293-1303.","journal-title":"Proc. IEEE"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/71.113086"},{"key":"e_1_3_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-540-39924-7_32"},{"key":"e_1_3_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.parco.2004.09.007"}],"event":{"name":"SAMOS XVIII: Architectures, Modeling, and Simulation","acronym":"SAMOS XVIII","location":"Pythagorion Greece"},"container-title":["Proceedings of the 18th International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3229631.3229649","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3229631.3229649","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:07:38Z","timestamp":1750212458000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3229631.3229649"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,7,15]]},"references-count":28,"alternative-id":["10.1145\/3229631.3229649","10.1145\/3229631"],"URL":"https:\/\/doi.org\/10.1145\/3229631.3229649","relation":{},"subject":[],"published":{"date-parts":[[2018,7,15]]},"assertion":[{"value":"2018-07-15","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}