{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,10]],"date-time":"2026-02-10T12:16:24Z","timestamp":1770725784338,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":35,"publisher":"ACM","license":[{"start":{"date-parts":[[2018,11,1]],"date-time":"2018-11-01T00:00:00Z","timestamp":1541030400000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2018,11]]},"DOI":"10.1145\/3243176.3243184","type":"proceedings-article","created":{"date-parts":[[2018,10,10]],"date-time":"2018-10-10T13:32:32Z","timestamp":1539178352000},"page":"1-12","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":29,"title":["E-PUR"],"prefix":"10.1145","author":[{"given":"Franyell","family":"Silfa","sequence":"first","affiliation":[{"name":"Universitat Polit\u00e8cnica de Catalunya, Barcelona, Spain"}]},{"given":"Gem","family":"Dot","sequence":"additional","affiliation":[{"name":"Universitat Polit\u00e8cnica de Catalunya, Barcelona, Spain"}]},{"given":"Jose-Maria","family":"Arnau","sequence":"additional","affiliation":[{"name":"Universitat Polit\u00e8cnica de Catalunya, Barcelona, Spain"}]},{"given":"Antonio","family":"Gonz\u00e0lez","sequence":"additional","affiliation":[{"name":"Universitat Polit\u00e8cnica de Catalunya, Barcelona, Spain"}]}],"member":"320","published-online":{"date-parts":[[2018,11]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"Rami Al-Rfou Guillaume Alain Amjad Almahairi Christof Angermueller Dzmitry Bahdanau Nicolas Ballas Fr\u00e9d\u00e9ric Bastien Justin Bayer Anatoly Belikov Alexander Belopolsky Yoshua Bengio Arnaud Bergeron James Bergstra Valentin Bisson Josh Bleecher Snyder Nicolas Bouchard Nicolas Boulanger-Lewandowski Xavier Bouthillier Alexandre de Br\u00e9bisson Olivier Breuleux Pierre-Luc Carrier Kyunghyun Cho Jan Chorowski Paul Christiano Tim Cooijmans Marc-Alexandre C\u00f4t\u00e9 Myriam C\u00f4t\u00e9 Aaron Courville Yann N. Dauphin Olivier Delalleau Julien Demouth Guillaume Desjardins Sander Dieleman Laurent Dinh M\u00e9lanie Ducoffe Vincent Dumoulin Samira Ebrahimi Kahou Dumitru Erhan Ziye Fan Orhan Firat Mathieu Germain Xavier Glorot Ian Goodfellow Matt Graham Caglar Gulcehre Philippe Hamel Iban Harlouchet Jean-Philippe Heng Bal\u00e1zs Hidasi Sina Honari Arjun Jain S\u00e9bastien Jean Kai Jia Mikhail Korobov Vivek Kulkarni Alex Lamb Pascal Lamblin Eric Larsen C\u00e9sar Laurent Sean Lee Simon Lefrancois Simon Lemieux Nicholas L\u00e9onard Zhouhan Lin Jesse A. Livezey Cory Lorenz Jeremiah Lowin Qianli Ma Pierre-Antoine Manzagol Olivier Mastropietro Robert T. McGibbon Roland Memisevic Bart van Merri\u00ebnboer Vincent Michalski Mehdi Mirza Alberto Orlandi Christopher Pal Razvan Pascanu Mohammad Pezeshki Colin Raffel Daniel Renshaw Matthew Rocklin Adriana Romero Markus Roth Peter Sadowski John Salvatier Fran\u00e7ois Savard Jan Schl\u00fcter John Schulman Gabriel Schwartz Iulian Vlad Serban Dmitriy Serdyuk Samira Shabanian \u00c9tienne Simon Sigurd Spieckermann S. Ramana Subramanyam Jakub Sygnowski J\u00e9r\u00e9mie Tanguay Gijs van Tulder Joseph Turian Sebastian Urban Pascal Vincent Francesco Visin Harm de Vries David Warde-Farley Dustin J. Webb Matthew Willson Kelvin Xu Lijun Xue Li Yao Saizheng Zhang and Ying Zhang. 2016. Theano: A Python framework for fast computation of mathematical expressions. arXiv e-prints abs\/1605.02688 (May 2016). http:\/\/arxiv.org\/abs\/1605.02688  Rami Al-Rfou Guillaume Alain Amjad Almahairi Christof Angermueller Dzmitry Bahdanau Nicolas Ballas Fr\u00e9d\u00e9ric Bastien Justin Bayer Anatoly Belikov Alexander Belopolsky Yoshua Bengio Arnaud Bergeron James Bergstra Valentin Bisson Josh Bleecher Snyder Nicolas Bouchard Nicolas Boulanger-Lewandowski Xavier Bouthillier Alexandre de Br\u00e9bisson Olivier Breuleux Pierre-Luc Carrier Kyunghyun Cho Jan Chorowski Paul Christiano Tim Cooijmans Marc-Alexandre C\u00f4t\u00e9 Myriam C\u00f4t\u00e9 Aaron Courville Yann N. Dauphin Olivier Delalleau Julien Demouth Guillaume Desjardins Sander Dieleman Laurent Dinh M\u00e9lanie Ducoffe Vincent Dumoulin Samira Ebrahimi Kahou Dumitru Erhan Ziye Fan Orhan Firat Mathieu Germain Xavier Glorot Ian Goodfellow Matt Graham Caglar Gulcehre Philippe Hamel Iban Harlouchet Jean-Philippe Heng Bal\u00e1zs Hidasi Sina Honari Arjun Jain S\u00e9bastien Jean Kai Jia Mikhail Korobov Vivek Kulkarni Alex Lamb Pascal Lamblin Eric Larsen C\u00e9sar Laurent Sean Lee Simon Lefrancois Simon Lemieux Nicholas L\u00e9onard Zhouhan Lin Jesse A. Livezey Cory Lorenz Jeremiah Lowin Qianli Ma Pierre-Antoine Manzagol Olivier Mastropietro Robert T. McGibbon Roland Memisevic Bart van Merri\u00ebnboer Vincent Michalski Mehdi Mirza Alberto Orlandi Christopher Pal Razvan Pascanu Mohammad Pezeshki Colin Raffel Daniel Renshaw Matthew Rocklin Adriana Romero Markus Roth Peter Sadowski John Salvatier Fran\u00e7ois Savard Jan Schl\u00fcter John Schulman Gabriel Schwartz Iulian Vlad Serban Dmitriy Serdyuk Samira Shabanian \u00c9tienne Simon Sigurd Spieckermann S. Ramana Subramanyam Jakub Sygnowski J\u00e9r\u00e9mie Tanguay Gijs van Tulder Joseph Turian Sebastian Urban Pascal Vincent Francesco Visin Harm de Vries David Warde-Farley Dustin J. Webb Matthew Willson Kelvin Xu Lijun Xue Li Yao Saizheng Zhang and Ying Zhang. 2016. Theano: A Python framework for fast computation of mathematical expressions. arXiv e-prints abs\/1605.02688 (May 2016). http:\/\/arxiv.org\/abs\/1605.02688"},{"key":"e_1_3_2_1_2_1","volume-title":"Optimizing Performance of Recurrent Neural Networks on GPUs. CoRR abs\/1604.01946","author":"Appleyard Jeremy","year":"2016","unstructured":"Jeremy Appleyard , Tom\u00e1s Kocisk\u00fd , and Phil Blunsom . 2016. Optimizing Performance of Recurrent Neural Networks on GPUs. CoRR abs\/1604.01946 ( 2016 ). arXiv:1604.01946 http:\/\/arxiv.org\/abs\/1604.01946 Jeremy Appleyard, Tom\u00e1s Kocisk\u00fd, and Phil Blunsom. 2016. Optimizing Performance of Recurrent Neural Networks on GPUs. CoRR abs\/1604.01946 (2016). arXiv:1604.01946 http:\/\/arxiv.org\/abs\/1604.01946"},{"key":"e_1_3_2_1_3_1","volume-title":"Sequence Learning","author":"Baldi Pierre","unstructured":"Pierre Baldi , S\u00f8ren Brunak , Paolo Frasconi , Gianluca Pollastri , and Giovanni Soda . 2001. Bidirectional dynamics for protein secondary structure prediction . In Sequence Learning . Springer , 80--104. Pierre Baldi, S\u00f8ren Brunak, Paolo Frasconi, Gianluca Pollastri, and Giovanni Soda. 2001. Bidirectional dynamics for protein secondary structure prediction. In Sequence Learning. Springer, 80--104."},{"key":"e_1_3_2_1_4_1","volume-title":"Recurrent neural networks hardware implementation on FPGA. arXiv preprint arXiv:1511.05552","author":"Ming Chang Andre Xian","year":"2015","unstructured":"Andre Xian Ming Chang , Berin Martini , and Eugenio Culurciello . 2015. Recurrent neural networks hardware implementation on FPGA. arXiv preprint arXiv:1511.05552 ( 2015 ). Andre Xian Ming Chang, Berin Martini, and Eugenio Culurciello. 2015. Recurrent neural networks hardware implementation on FPGA. arXiv preprint arXiv:1511.05552 (2015)."},{"key":"e_1_3_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCA.2016.40"},{"key":"e_1_3_2_1_6_1","volume-title":"cuDNN: Efficient Primitives for Deep Learning. CoRR abs\/1410.0759","author":"Chetlur Sharan","year":"2014","unstructured":"Sharan Chetlur , Cliff Woolley , Philippe Vandermersch , Jonathan Cohen , John Tran , Bryan Catanzaro , and Evan Shelhamer . 2014. cuDNN: Efficient Primitives for Deep Learning. CoRR abs\/1410.0759 ( 2014 ). arXiv:1410.0759 http:\/\/arxiv.org\/abs\/1410.0759 Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, and Evan Shelhamer. 2014. cuDNN: Efficient Primitives for Deep Learning. CoRR abs\/1410.0759 (2014). arXiv:1410.0759 http:\/\/arxiv.org\/abs\/1410.0759"},{"key":"e_1_3_2_1_7_1","unstructured":"Fran\u00e7ois Chollet and Others. {n. d.}. Keras.  Fran\u00e7ois Chollet and Others. {n. d.}. Keras."},{"key":"e_1_3_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298878"},{"key":"e_1_3_2_1_9_1","unstructured":"Merlin Friesen. {n. d.}. Linux Power Management Optimization on the Nvidia Jetson Platform. https:\/\/events.static.linuxfound.org\/sites\/events\/files\/slides\/Linux_Low_Power_ELC_SanDiego.pdf.  Merlin Friesen. {n. d.}. Linux Power Management Optimization on the Nvidia Jetson Platform. https:\/\/events.static.linuxfound.org\/sites\/events\/files\/slides\/Linux_Low_Power_ELC_SanDiego.pdf."},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2000.861302"},{"key":"e_1_3_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/IJCNN.2005.1556215"},{"key":"e_1_3_2_1_12_1","volume-title":"LSTM: A search space odyssey","author":"Greff Klaus","year":"2016","unstructured":"Klaus Greff , Rupesh K Srivastava , Jan Koutn\u00edk , Bas R Steunebrink , and J\u00fcrgen Schmidhuber . 2016 . LSTM: A search space odyssey . IEEE transactions on neural networks and learning systems (2016). Klaus Greff, Rupesh K Srivastava, Jan Koutn\u00edk, Bas R Steunebrink, and J\u00fcrgen Schmidhuber. 2016. LSTM: A search space odyssey. IEEE transactions on neural networks and learning systems (2016)."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASPDAC.2017.7858394"},{"key":"e_1_3_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3020078.3021745"},{"key":"e_1_3_2_1_15_1","unstructured":"Song Han Jeff Pool John Tran and William Dally. 2015. Learning both weights and connections for efficient neural network. In Advances in Neural Information Processing Systems. 1135--1143.   Song Han Jeff Pool John Tran and William Dally. 2015. Learning both weights and connections for efficient neural network. In Advances in Neural Information Processing Systems . 1135--1143."},{"key":"e_1_3_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.1997.9.8.1735"},{"key":"e_1_3_2_1_17_1","volume-title":"Residual LSTM: Design of a Deep Recurrent Architecture for Distant Speech Recognition. arXiv preprint arXiv:1701.03360","author":"Kim Jaeyoung","year":"2017","unstructured":"Jaeyoung Kim , Mostafa El-Khamy , and Jungwon Lee . 2017. Residual LSTM: Design of a Deep Recurrent Architecture for Distant Speech Recognition. arXiv preprint arXiv:1701.03360 ( 2017 ). Jaeyoung Kim, Mostafa El-Khamy, and Jungwon Lee. 2017. Residual LSTM: Design of a Deep Recurrent Architecture for Distant Speech Recognition. arXiv preprint arXiv:1701.03360 (2017)."},{"key":"e_1_3_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/SiPS.2016.48"},{"key":"e_1_3_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/FCCM.2015.50"},{"key":"e_1_3_2_1_20_1","volume-title":"Fixed Point Quantization of Deep Convolutional Networks. CoRR abs\/1511.06393","author":"Lin Darryl Dexu","year":"2015","unstructured":"Darryl Dexu Lin , Sachin S. Talathi , and V. Sreekanth Annapureddy . 2015. Fixed Point Quantization of Deep Convolutional Networks. CoRR abs\/1511.06393 ( 2015 ). arXiv:1511.06393 http:\/\/arxiv.org\/abs\/1511.06393 Darryl Dexu Lin, Sachin S. Talathi, and V. Sreekanth Annapureddy. 2015. Fixed Point Quantization of Deep Convolutional Networks. CoRR abs\/1511.06393 (2015). arXiv:1511.06393 http:\/\/arxiv.org\/abs\/1511.06393"},{"key":"e_1_3_2_1_21_1","volume-title":"Learning to diagnose with LSTM recurrent neural networks. arXiv preprint arXiv:1511.03677","author":"Lipton Zachary C","year":"2015","unstructured":"Zachary C Lipton , David C Kale , Charles Elkan , and Randall Wetzell . 2015. Learning to diagnose with LSTM recurrent neural networks. arXiv preprint arXiv:1511.03677 ( 2015 ). Zachary C Lipton, David C Kale, Charles Elkan, and Randall Wetzell. 2015. Learning to diagnose with LSTM recurrent neural networks. arXiv preprint arXiv:1511.03677 (2015)."},{"key":"e_1_3_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2015.7404790"},{"key":"e_1_3_2_1_23_1","unstructured":"Micron Inc. {n. d.}. TN-53-01: LPDDR4 System Power Calculator. https:\/\/www.micron.com\/support\/tools-and-utilities\/power-calc.  Micron Inc. {n. d.}. TN-53-01: LPDDR4 System Power Calculator. https:\/\/www.micron.com\/support\/tools-and-utilities\/power-calc."},{"key":"e_1_3_2_1_24_1","volume-title":"A tool to model large caches. HP Laboratories","author":"Muralimanohar Naveen","year":"2009","unstructured":"Naveen Muralimanohar , Rajeev Balasubramonian , and Norman P Jouppi . 2009. CACTI 6.0 : A tool to model large caches. HP Laboratories ( 2009 ), 22--31. Naveen Muralimanohar, Rajeev Balasubramonian, and Norman P Jouppi. 2009. CACTI 6.0: A tool to model large caches. HP Laboratories (2009), 22--31."},{"key":"e_1_3_2_1_25_1","unstructured":"NVIDIA. {n. d.}. NVIDIA TEGRA X1 new mobile superchip. http:\/\/international.download.nvidia.com\/pdf\/tegra\/Tegra-X1-whitepaper-v1.0.pdf.  NVIDIA. {n. d.}. NVIDIA TEGRA X1 new mobile superchip. http:\/\/international.download.nvidia.com\/pdf\/tegra\/Tegra-X1-whitepaper-v1.0.pdf."},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/78.650093"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2012-65"},{"key":"e_1_3_2_1_28_1","unstructured":"Ilya Sutskever Oriol Vinyals and Quoc V Le. 2014. Sequence to sequence learning with neural networks. In Advances in neural information processing systems. 3104--3112.   Ilya Sutskever Oriol Vinyals and Quoc V Le. 2014. Sequence to sequence learning with neural networks. In Advances in neural information processing systems . 3104--3112."},{"key":"e_1_3_2_1_29_1","volume-title":"26th International Conference on. IEEE\/ACM.","author":"Tabani Hamid","year":"2017","unstructured":"Hamid Tabani , Jose-Maria Arnau , Jordi Tubella , and Antonio Gonzalez . 2017 . An Ultra Low-power Hardware Accelerator for Acoustic Scoring in Speech Recognition. In Parallel Architecture and Compilation Techniques (PACT) , 26th International Conference on. IEEE\/ACM. Hamid Tabani, Jose-Maria Arnau, Jordi Tubella, and Antonio Gonzalez. 2017. An Ultra Low-power Hardware Accelerator for Acoustic Scoring in Speech Recognition. In Parallel Architecture and Compilation Techniques (PACT), 26th International Conference on. IEEE\/ACM."},{"key":"e_1_3_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.515"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298935"},{"key":"e_1_3_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/GreenCom-CPSCom.2010.102"},{"key":"e_1_3_2_1_33_1","unstructured":"Yonghui Wu Mike Schuster Zhifeng Chen Quoc V Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao Klaus Macherey etal 2016. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144 (2016).  Yonghui Wu Mike Schuster Zhifeng Chen Quoc V Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao Klaus Macherey et al. 2016. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144 (2016)."},{"key":"e_1_3_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.5555\/3195638.3195696"},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299101"}],"event":{"name":"PACT '18: International conference on Parallel Architectures and Compilation Techniques","location":"Limassol Cyprus","acronym":"PACT '18","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture","IFIP WG 10.3 IFIP WG 10.3","IEEE CS"]},"container-title":["Proceedings of the 27th International Conference on Parallel Architectures and Compilation Techniques"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3243176.3243184","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3243176.3243184","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:57:39Z","timestamp":1750208259000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3243176.3243184"}},"subtitle":["an energy-efficient processing unit for recurrent neural networks"],"short-title":[],"issued":{"date-parts":[[2018,11]]},"references-count":35,"alternative-id":["10.1145\/3243176.3243184","10.1145\/3243176"],"URL":"https:\/\/doi.org\/10.1145\/3243176.3243184","relation":{},"subject":[],"published":{"date-parts":[[2018,11]]},"assertion":[{"value":"2018-11-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}