{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,6,2]],"date-time":"2026-06-02T21:50:52Z","timestamp":1780437052319,"version":"3.54.1"},"reference-count":179,"publisher":"Association for Computing Machinery (ACM)","issue":"5","license":[{"start":{"date-parts":[[2018,9,18]],"date-time":"2018-09-18T00:00:00Z","timestamp":1537228800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Comput. Surv."],"published-print":{"date-parts":[[2019,9,30]]},"abstract":"<jats:p>The field of machine learning is witnessing its golden era as deep learning slowly becomes the leader in this domain. Deep learning uses multiple layers to represent the abstractions of data to build computational models. Some key enabler deep learning algorithms such as generative adversarial networks, convolutional neural networks, and model transfers have completely changed our perception of information processing. However, there exists an aperture of understanding behind this tremendously fast-paced domain, because it was never previously represented from a multiscope perspective. The lack of core understanding renders these powerful methods as black-box machines that inhibit development at a fundamental level. Moreover, deep learning has repeatedly been perceived as a silver bullet to all stumbling blocks in machine learning, which is far from the truth. This article presents a comprehensive review of historical and recent state-of-the-art approaches in visual, audio, and text processing; social network analysis; and natural language processing, followed by the in-depth analysis on pivoting and groundbreaking advances in deep learning applications. It was also undertaken to review the issues faced in deep learning such as unsupervised learning, black-box models, and online learning and to illustrate how these challenges can be transformed into prolific future research avenues.<\/jats:p>","DOI":"10.1145\/3234150","type":"journal-article","created":{"date-parts":[[2018,9,18]],"date-time":"2018-09-18T12:11:32Z","timestamp":1537272692000},"page":"1-36","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":1161,"title":["A Survey on Deep Learning"],"prefix":"10.1145","volume":"51","author":[{"given":"Samira","family":"Pouyanfar","sequence":"first","affiliation":[{"name":"Florida International University, Miami, FL"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Saad","family":"Sadiq","sequence":"additional","affiliation":[{"name":"University of Miami, Coral Gables, FL"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yilin","family":"Yan","sequence":"additional","affiliation":[{"name":"University of Miami, Coral Gables, FL"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Haiman","family":"Tian","sequence":"additional","affiliation":[{"name":"Florida International University, Miami, FL"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yudong","family":"Tao","sequence":"additional","affiliation":[{"name":"University of Miami, Coral Gables, FL"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Maria Presa","family":"Reyes","sequence":"additional","affiliation":[{"name":"Florida International University, Miami, FL"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Mei-Ling","family":"Shyu","sequence":"additional","affiliation":[{"name":"University of Miami, Coral Gables, FL"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-9209-390X","authenticated-orcid":false,"given":"Shu-Ching","family":"Chen","sequence":"additional","affiliation":[{"name":"Florida International University"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"S. S.","family":"Iyengar","sequence":"additional","affiliation":[{"name":"Florida International University"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2018,9,18]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Tensorflow: Large-scale machine learning on heterogeneous distributed systems. CoRR abs\/1603.04467","author":"Abadi Mart\u00edn","year":"2016","unstructured":"Mart\u00edn Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Man\u00e9, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah, Mike Schuster, Jonathon Shlens, Benoit Steiner, Ilya Sutskever, Kunal Talwar, Paul Tucker, Vincent Vanhoucke, Vijay Vasudevan, Fernanda Vi\u00e9gas, Oriol Vinyals, Pete Warden, Martin Wattenberg, Martin Wicke, Yuan Yu, and Xiaoqiang Zheng. 2016. Tensorflow: Large-scale machine learning on heterogeneous distributed systems. CoRR abs\/1603.04467 (2016). Retrieved from http:\/\/arxiv.org\/abs\/1603.04467."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2014.2339736"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/WASPAA.2017.8170027"},{"key":"e_1_2_1_4_1","volume-title":"YouTube-8M: A large-scale video classification benchmark. CoRR abs\/1609.08675","author":"Abu-El-Haija Sami","year":"2016","unstructured":"Sami Abu-El-Haija, Nisarg Kothari, Joonseok Lee, Paul Natsev, George Toderici, Balakrishnan Varadarajan, and Sudheendra Vijayanarasimhan. 2016. YouTube-8M: A large-scale video classification benchmark. CoRR abs\/1609.08675 (2016). Retrieved from http:\/\/arxiv.org\/abs\/1609.08675."},{"key":"e_1_2_1_5_1","unstructured":"Rami Al-Rfou Guillaume Alain Amjad Almahairi Christof Angermueller Dzmitry Bahdanau Nicolas Ballas Fr\u00e9d\u00e9ric Bastien Justin Bayer Anatoly Belikov Alexander Belopolsky Yoshua Bengio Arnaud Bergeron James Bergstra Valentin Bisson Josh Bleecher Snyder Nicolas Bouchard Nicolas Boulanger-Lewandowski Xavier Bouthillier Alexandre de Br\u00e9bisson Olivier Breuleux Pierre-Luc Carrier Kyunghyun Cho Jan Chorowski Paul Christiano Tim Cooijmans Marc-Alexandre C\u00f4t\u00e9 Myriam C\u00f4t\u00e9 Aaron Courville Yann N. Dauphin Olivier Delalleau Julien Demouth Guillaume Desjardins Sander Dieleman Laurent Dinh M\u00e9lanie Ducoffe Vincent Dumoulin Samira Ebrahimi Kahou Dumitru Erhan Ziye Fan Orhan Firat Mathieu Germain Xavier Glorot Ian Goodfellow Matt Graham Caglar Gulcehre Philippe Hamel Iban Harlouchet Jean-Philippe Heng Bal\u00e1zs Hidasi Sina Honari Arjun Jain S\u00e9bastien Jean Kai Jia Mikhail Korobov Vivek Kulkarni Alex Lamb Pascal Lamblin Eric Larsen C\u00e9sar Laurent Sean Lee Simon Lefrancois Simon Lemieux Nicholas L\u00e9onard Zhouhan Lin Jesse A. Livezey Cory Lorenz Jeremiah Lowin Qianli Ma Pierre-Antoine Manzagol Olivier Mastropietro Robert T. McGibbon Roland Memisevic Bart van Merri\u00ebnboer Vincent Michalski Mehdi Mirza Alberto Orlandi Christopher Pal Razvan Pascanu Mohammad Pezeshki Colin Raffel Daniel Renshaw Matthew Rocklin Adriana Romero Markus Roth Peter Sadowski John Salvatier Fran\u00e7ois Savard Jan Schl\u00fcter John Schulman Gabriel Schwartz Iulian Vlad Serban Dmitriy Serdyuk Samira Shabanian \u00c9tienne Simon Sigurd Spieckermann S. Ramana Subramanyam Jakub Sygnowski J\u00e9r\u00e9mie Tanguay Gijs van Tulder Joseph Turian Sebastian Urban Pascal Vincent Francesco Visin Harm de Vries David Warde-Farley Dustin J. Webb Matthew Willson Kelvin Xu Lijun Xue Li Yao Saizheng Zhang and Ying Zhang. 2016. Theano: A Python framework for fast computation of mathematical expressions. CoRR abs\/1605.02688 (2016). Retrieved from http:\/\/arxiv.org\/abs\/1605.02688."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.5555\/3045390.3045410"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.15252\/msb.20156651"},{"key":"e_1_2_1_8_1","volume-title":"Neurostream: Scalable and energy efficient deep learning with smart memory cubes. CoRR abs\/1701.06420","author":"Azarkhish Erfan","year":"2017","unstructured":"Erfan Azarkhish, Davide Rossi, Igor Loi, and Luca Benini. 2017. Neurostream: Scalable and energy efficient deep learning with smart memory cubes. CoRR abs\/1701.06420 (2017). Retrieved from http:\/\/arxiv.org\/abs\/1701.06420."},{"key":"e_1_2_1_9_1","volume-title":"Neural machine translation by jointly learning to align and translate. CoRR abs\/1409.0473","author":"Bahdanau Dzmitry","year":"2014","unstructured":"Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. CoRR abs\/1409.0473 (2014). Retrieved from http:\/\/arxiv.org\/abs\/1409.0473."},{"key":"e_1_2_1_10_1","volume-title":"Courville","author":"Ballas Nicolas","year":"2015","unstructured":"Nicolas Ballas, Li Yao, Chris Pal, and Aaron C. Courville. 2015. Delving deeper into convolutional networks for learning video representations. CoRR abs\/1511.06432 (2015). Retrieved from http:\/\/arxiv.org\/abs\/1511.06432."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.5555\/3044805.3044918"},{"key":"e_1_2_1_12_1","volume-title":"Empirical Methods in Natural Language Processing","volume":"2","author":"Berant Jonathan","year":"2013","unstructured":"Jonathan Berant, Andrew Chou, Roy Frostig, and Percy Liang. 2013. Semantic parsing on freebase from question-answer pairs. In Empirical Methods in Natural Language Processing, Vol. 2. Association for Computational Linguistics, 6."},{"key":"e_1_2_1_13_1","first-page":"81","article-title":"Statistical modeling: The two cultures","volume":"48","author":"Breiman Leo","year":"2003","unstructured":"Leo Breiman. 2003. Statistical modeling: The two cultures. Quality Control and Applied Statistics 48, 1 (2003), 81--82.","journal-title":"Quality Control and Applied Statistics"},{"key":"e_1_2_1_14_1","volume-title":"Can we open the black box of AI? Nature 538, 7623","author":"Castelvecchi Davide","year":"2016","unstructured":"Davide Castelvecchi. 2016. Can we open the black box of AI? Nature 538, 7623 (2016), 20--23."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.312"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-10581-9_3"},{"key":"e_1_2_1_17_1","volume-title":"MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems. CoRR abs\/1512.01274","author":"Chen Tianqi","year":"2015","unstructured":"Tianqi Chen, Mu Li, Yutian Li, Min Lin, Naiyan Wang, Minjie Wang, Tianjun Xiao, Bing Xu, Chiyuan Zhang, and Zheng Zhang. 2015. MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems. CoRR abs\/1512.01274 (2015). Retrieved from http:\/\/arxiv.org\/abs\/1512.01274."},{"key":"e_1_2_1_18_1","volume-title":"cuDNN: Efficient primitives for deep learning. CoRR abs\/1410.0759","author":"Chetlur Sharan","year":"2014","unstructured":"Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, and Evan Shelhamer. 2014. cuDNN: Efficient primitives for deep learning. CoRR abs\/1410.0759 (2014). Retrieved from http:\/\/arxiv.org\/abs\/1410.0759."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNNLS.2013.2242090"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/D14-1179"},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNN.2006.881710"},{"key":"e_1_2_1_22_1","volume-title":"Accessed","author":"CIFAR.","year":"2009","unstructured":"CIFAR. 2009. CIFAR-10 and CIFAR-100 datasets. Retrieved from https:\/\/www.cs.toronto.edu\/&sim;kriz\/cifar.html. Accessed April 18, 2017."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-40763-5_51"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5555\/3042817.3043086"},{"key":"e_1_2_1_25_1","volume-title":"Torch: A Modular Machine Learning Software Library. Idiap-RR Idiap-RR-46-2002. Idiap.","author":"Collobert Ronan","year":"2002","unstructured":"Ronan Collobert, Samy Bengio, and Johnny Mari\u00e9thoz. 2002. Torch: A Modular Machine Learning Software Library. Idiap-RR Idiap-RR-46-2002. Idiap."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASL.2011.2134090"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2005.177"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.5555\/2999134.2999271"},{"key":"e_1_2_1_29_1","first-page":"1","article-title":"A tutorial survey of architectures, algorithms, and applications for deep learning","volume":"3","author":"Deng Li","year":"2014","unstructured":"Li Deng. 2014. A tutorial survey of architectures, algorithms, and applications for deep learning. APSIPA Transactions on Signal and Information Processing 3 (2014), 1--29.","journal-title":"APSIPA Transactions on Signal and Information Processing"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6638239"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220355.1220406"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298878"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.3115\/v1\/P15-1026"},{"key":"e_1_2_1_34_1","volume-title":"International Conference on Learning Representations Workshop. 1--4.","author":"Dozat Timothy","year":"2016","unstructured":"Timothy Dozat. 2016. Incorporating Nesterov momentum into Adam. In International Conference on Learning Representations Workshop. 1--4."},{"key":"e_1_2_1_35_1","volume-title":"Conference on Learning Theory. Omnipress, 257--269","author":"Duchi John C.","year":"2010","unstructured":"John C. Duchi, Elad Hazan, and Yoram Singer. 2010. Adaptive subgradient methods for online learning and stochastic optimization. In Conference on Learning Theory. Omnipress, 257--269."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.patcog.2010.09.020"},{"key":"e_1_2_1_37_1","volume-title":"International Conference on Machine Learning. Omnipress.","author":"Fakoor Rasool","year":"2013","unstructured":"Rasool Fakoor, Faisal Ladhak, Azade Nazi, and Manfred Huber. 2013. Using deep learning to enhance cancer diagnosis and classification. In International Conference on Machine Learning. Omnipress."},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.213"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ASRU.2015.7404872"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF00344251"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.5555\/1873781.1873820"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1177\/0278364913491297"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.81"},{"key":"e_1_2_1_46_1","volume-title":"The 13th International Conference on Artificial Intelligence and Statistics","volume":"9","author":"Glorot Xavier","year":"2010","unstructured":"Xavier Glorot and Yoshua Bengio. 2010. Understanding the difficulty of training deep feedforward neural networks. In The 13th International Conference on Artificial Intelligence and Statistics, Vol. 9. JMLR.org, 249--256."},{"key":"e_1_2_1_47_1","volume-title":"IEEE International Conference on Neural Networks","volume":"1","author":"Goller Christoph","year":"1996","unstructured":"Christoph Goller and Andreas Kuchler. 1996. Learning task-dependent distributed representations by backpropagation through structure. In IEEE International Conference on Neural Networks, Vol. 1. IEEE, 347--352."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.5555\/3086952"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","unstructured":"Ian Goodfellow Jean Pouget-Abadie Mehdi Mirza Bing Xu David Warde-Farley Sherjil Ozair Aaron Courville and Yoshua Bengio. 2014. Generative adversarial nets. In Advances in Neural Information Processing Systems. Curran Associates 2672--2680.","DOI":"10.5555\/2969033.2969125"},{"key":"e_1_2_1_50_1","volume-title":"Accessed","year":"2016","unstructured":"Google. 2016. Alphago. Retrieved from https:\/\/deepmind.com\/research\/alphago. Accessed April 18, 2017."},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6638947"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1109\/TMI.2016.2553401"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.5555\/3104322.3104374"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.25148\/etd.FIDC000162"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1109\/IROS.2008.4651217"},{"key":"e_1_2_1_56_1","doi-asserted-by":"crossref","unstructured":"Kun Han Dong Yu and Ivan Tashev. 2014. Speech emotion recognition using deep neural network and extreme learning machine. In Interspeech. ISCA 223--227.","DOI":"10.21437\/Interspeech.2014-57"},{"key":"e_1_2_1_57_1","volume-title":"Mask R-CNN. In IEEE International Conference on Computer Vision. IEEE, 2980--2988","author":"He Kaiming","year":"2017","unstructured":"Kaiming He, Georgia Gkioxari, Piotr Doll\u00e1r, and Ross Girshick. 2017. Mask R-CNN. In IEEE International Conference on Computer Vision. IEEE, 2980--2988."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_1_59_1","volume-title":"Early visual concept learning with unsupervised deep learning. CoRR abs\/1606.05579","author":"Higgins Irina","year":"2016","unstructured":"Irina Higgins, Loic Matthey, Xavier Glorot, Arka Pal, Benigno Uria, Charles Blundell, Shakir Mohamed, and Alexander Lerchner. 2016. Early visual concept learning with unsupervised deep learning. CoRR abs\/1606.05579 (2016). Retrieved from http:\/\/arxiv.org\/abs\/1606.05579."},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSP.2012.2205597"},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.4249\/scholarpedia.5947"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1162\/neco.2006.18.7.1527"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1145\/1815961.1815998"},{"key":"e_1_2_1_64_1","volume-title":"Weinberger","author":"Huang Gao","year":"2016","unstructured":"Gao Huang, Yu Sun, Zhuang Liu, Daniel Sedra, and Kilian Q. Weinberger. 2016. Deep networks with stochastic depth. In European Conference on Computer Vision. Springer, 646--661."},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/2505515.2505665"},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2014.6853860"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1113\/jphysiol.1962.sp006837"},{"key":"e_1_2_1_68_1","volume-title":"Accessed","year":"2017","unstructured":"ImageNet. 2017. Retrieved from http:\/\/image-Net.org. Accessed April 18, 2017."},{"key":"e_1_2_1_69_1","volume-title":"Neon deep learning framework.","author":"Systems Intel Nervana","year":"2017","unstructured":"Intel Nervana Systems. 2017. Neon deep learning framework. Retrieved from https:\/\/www.nervanasys.com\/technology\/neon. Accessed April 4, 2017."},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.1145\/3042064"},{"key":"e_1_2_1_71_1","volume-title":"Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication. Science 304, 5667","author":"Jaeger Herbert","year":"2004","unstructured":"Herbert Jaeger and Harald Haas. 2004. Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless communication. Science 304, 5667 (2004), 78--80."},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/2647868.2654889"},{"key":"e_1_2_1_73_1","doi-asserted-by":"publisher","DOI":"10.1016\/S0166-4115(97)80111-2"},{"key":"e_1_2_1_74_1","volume-title":"Robustness in Automatic Speech Recognition: Fundamentals and Applications","author":"Junqua Jean-Claude","unstructured":"Jean-Claude Junqua and Jean-Paul Haton. 2012. Robustness in Automatic Speech Recognition: Fundamentals and Applications, Vol. 341. Springer Science 8 Business Media."},{"key":"e_1_2_1_75_1","volume-title":"2nd Workshop on Continuous Vector Space Models and their Compositionality. Citeseer, Association for Computational Linguistics, 31--39","author":"K\u00e5geb\u00e4ck Mikael","year":"2014","unstructured":"Mikael K\u00e5geb\u00e4ck, Olof Mogren, Nina Tahmasebi, and Devdatt Dubhashi. 2014. Extractive summarization using continuous vector space models. In 2nd Workshop on Continuous Vector Space Models and their Compositionality. Citeseer, Association for Computational Linguistics, 31--39."},{"key":"e_1_2_1_76_1","volume-title":"One model to learn them all. CoRR abs\/1706.05137","author":"Kaiser Lukasz","year":"2017","unstructured":"Lukasz Kaiser, Aidan N. Gomez, Noam Shazeer, Ashish Vaswani, Niki Parmar, Llion Jones, and Jakob Uszkoreit. 2017. One model to learn them all. CoRR abs\/1706.05137 (2017). Retrieved from http:\/\/arxiv.org\/abs\/1706.05137."},{"key":"e_1_2_1_77_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.223"},{"key":"e_1_2_1_78_1","volume-title":"The kinetics human action video dataset. CoRR abs\/1705.06950","author":"Kay Will","year":"2017","unstructured":"Will Kay, Jo\u00e3o Carreira, Karen Simonyan, Brian Zhang, Chloe Hillier, Sudheendra Vijayanarasimhan, Fabio Viola, Tim Green, Trevor Back, Paul Natsev, Mustafa Suleyman, and Andrew Zisserman. 2017. The kinetics human action video dataset. CoRR abs\/1705.06950 (2017). Retrieved from http:\/\/arxiv.org\/abs\/1705.06950."},{"key":"e_1_2_1_79_1","volume-title":"Convolutional neural networks for sentence classification. CoRR abs\/1408.5882","author":"Kim Yoon","year":"2014","unstructured":"Yoon Kim. 2014. Convolutional neural networks for sentence classification. CoRR abs\/1408.5882 (2014). Retrieved from http:\/\/arxiv.org\/abs\/1408.5882."},{"key":"e_1_2_1_80_1","volume-title":"Kingma and Jimmy Ba","author":"Diederik","year":"2014","unstructured":"Diederik P. Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. CoRR abs\/1412.6980 (2014). Retrieved from http:\/\/arxiv.org\/abs\/1412.6980."},{"key":"e_1_2_1_81_1","volume-title":"Kingma and Max Welling","author":"Diederik","year":"2013","unstructured":"Diederik P. Kingma and Max Welling. 2013. Auto-encoding variational bayes. CoRR abs\/1312.6114 (2013). Retrieved from http:\/\/arxiv.org\/abs\/1312.6114."},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2016.2628641"},{"key":"e_1_2_1_83_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463372.2463509"},{"key":"e_1_2_1_84_1","volume-title":"The 13th International Conference on Pattern Recognition and Information Processing.","author":"Kovalev Vassili","year":"2016","unstructured":"Vassili Kovalev, Alexander Kalinovsky, and Sergey Kovalev. 2016. Deep learning with Theano, Torch, Caffe, Tensorflow, and Deeplearning4J: Which one is the best in speed and accuracy? In The 13th International Conference on Pattern Recognition and Information Processing."},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46487-9_19"},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.5555\/2999134.2999257"},{"key":"e_1_2_1_87_1","doi-asserted-by":"publisher","DOI":"10.1080\/00949655.2014.929131"},{"key":"e_1_2_1_88_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2013.6639343"},{"key":"e_1_2_1_89_1","doi-asserted-by":"publisher","DOI":"10.5555\/303568.303704"},{"key":"e_1_2_1_90_1","volume-title":"Deep learning. Nature 521, 7553","author":"LeCun Yann","year":"2015","unstructured":"Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature 521, 7553 (2015), 436--444."},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.1109\/5.726791"},{"key":"e_1_2_1_92_1","doi-asserted-by":"publisher","DOI":"10.5555\/2685048.2685095"},{"key":"e_1_2_1_93_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2015.7178826"},{"key":"e_1_2_1_94_1","volume-title":"Deep reinforcement learning: An overview. CoRR abs\/1701.07274","author":"Yuxi Li.","year":"2017","unstructured":"Yuxi Li. 2017. Deep reinforcement learning: An overview. CoRR abs\/1701.07274 (2017). Retrieved from https:\/\/arxiv.org\/abs\/1701.07274."},{"key":"e_1_2_1_95_1","doi-asserted-by":"publisher","unstructured":"Jifeng Dai Yi Li Kaiming He and Jian Sun. 2016. R-FCN: Object detection via region-based fully convolutional networks. In Advances in Neural Information Processing Systems. Curran Associates 379--387.","DOI":"10.5555\/3157096.3157139"},{"key":"e_1_2_1_96_1","volume-title":"European Conference on Computer Vision. Springer, 740--755","author":"Lin Tsung-Yi","unstructured":"Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Doll\u00e1r, and C. Lawrence Zitnick. 2014. Microsoft COCO: Common objects in context. In European Conference on Computer Vision. Springer, 740--755."},{"key":"e_1_2_1_97_1","volume-title":"Arnaud Arindra Adiyoso Setio, Francesco Ciompi, Mohsen Ghafoorian, Jeroen A. W. M. van der Laak, Bram van Ginneken, and Clara I. S\u00e1nchez.","author":"Litjens Geert","year":"2017","unstructured":"Geert Litjens, Thijs Kooi, Babak Ehteshami Bejnordi, Arnaud Arindra Adiyoso Setio, Francesco Ciompi, Mohsen Ghafoorian, Jeroen A. W. M. van der Laak, Bram van Ginneken, and Clara I. S\u00e1nchez. 2017. A survey on deep learning in medical image analysis. CoRR abs\/1702.05747 (2017). Retrieved from http:\/\/arxiv.org\/abs\/1702.05747."},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.1038\/srep26286"},{"key":"e_1_2_1_99_1","doi-asserted-by":"publisher","DOI":"10.3390\/e17042140"},{"key":"e_1_2_1_100_1","volume-title":"Berg","author":"Liu Wei","year":"2016","unstructured":"Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C. Berg. 2016. SSD: Single shot multibox detector. In European Conference on Computer Vision. Springer, 21--37."},{"key":"e_1_2_1_101_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298965"},{"key":"e_1_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.5555\/850924.851523"},{"key":"e_1_2_1_103_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSSC.2014.2356197"},{"key":"e_1_2_1_104_1","doi-asserted-by":"publisher","DOI":"10.1038\/nmeth.4627"},{"key":"e_1_2_1_105_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pone.0119044"},{"key":"e_1_2_1_106_1","doi-asserted-by":"publisher","DOI":"10.1145\/2911451.2926732"},{"key":"e_1_2_1_107_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF02478259"},{"key":"e_1_2_1_108_1","volume-title":"Federated learning of deep networks using model averaging. CoRR abs\/1602.05629","author":"McMahan H. Brendan","year":"2016","unstructured":"H. Brendan McMahan, Eider Moore, Daniel Ramage, and Blaise Ag\u00fcera y Arcas. 2016. Federated learning of deep networks using model averaging. CoRR abs\/1602.05629 (2016). Retrieved from http:\/\/arxiv.org\/abs\/1602.05629."},{"key":"e_1_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNN.2008.2010350"},{"key":"e_1_2_1_110_1","volume-title":"International Conference on Learning Representations.","author":"Mirhoseini Azalia","year":"2018","unstructured":"Azalia Mirhoseini, Anna Goldie, Hieu Pham, Benoit Steiner, Quoc V. Le, and Jeff Dean. 2018. A hierarchical model for device placement. In International Conference on Learning Representations."},{"key":"e_1_2_1_111_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.29"},{"key":"e_1_2_1_112_1","volume-title":"Accessed","author":"MNIST.","year":"2017","unstructured":"MNIST. 2017. The MNIST database of handwritten digits. Retrieved from http:\/\/yann.lecun.com\/exdb\/mnist\/. Accessed April 18, 2017."},{"key":"e_1_2_1_113_1","volume-title":"Accessed","author":"Mordvintsev Alexander","year":"2015","unstructured":"Alexander Mordvintsev, Christopher Olah, and Mike Tyka. 2015. Inceptionism: Going deeper into neural networks. Google Research Blog. Retrieved from https:\/\/research.googleblog.com\/2015\/06\/inceptionism-going-deeper-into-neural.html. Accessed March 26, 2018."},{"key":"e_1_2_1_114_1","volume-title":"Multilingual Twitter sentiment classification: The role of human annotators. CoRR abs\/1602.07563","author":"Mozetic Igor","year":"2016","unstructured":"Igor Mozetic, Miha Grcar, and Jasmina Smailovic. 2016. Multilingual Twitter sentiment classification: The role of human annotators. CoRR abs\/1602.07563 (2016). arxiv:1602.07563. Retrieved from http:\/\/arxiv.org\/abs\/1602.07563."},{"key":"e_1_2_1_115_1","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-014-0007-7"},{"key":"e_1_2_1_116_1","doi-asserted-by":"publisher","DOI":"10.18653\/v1\/S16-1001"},{"key":"e_1_2_1_117_1","doi-asserted-by":"publisher","DOI":"10.1109\/FPT.2011.6132679"},{"key":"e_1_2_1_118_1","volume-title":"Attentive convolutional neural network based speech emotion recognition: A study on the impact of input features, signal length, and acted speech. CoRR abs\/1706.00612","author":"Neumann Michael","year":"2017","unstructured":"Michael Neumann and Ngoc Thang Vu. 2017. Attentive convolutional neural network based speech emotion recognition: A study on the impact of input features, signal length, and acted speech. CoRR abs\/1706.00612 (2017). arxiv:1706.00612. Retrieved from http:\/\/arxiv.org\/abs\/1706.00612."},{"key":"e_1_2_1_119_1","doi-asserted-by":"publisher","DOI":"10.1038\/ni.3485"},{"key":"e_1_2_1_120_1","volume-title":"Applications of online deep learning for crisis response using social media information. CoRR abs\/1610.01030","author":"Nguyen Dat Tien","year":"2016","unstructured":"Dat Tien Nguyen, Shafiq R. Joty, Muhammad Imran, Hassan Sajjad, and Prasenjit Mitra. 2016. Applications of online deep learning for crisis response using social media information. CoRR abs\/1610.01030 (2016). Retrieved from http:\/\/arxiv.org\/abs\/1610.01030."},{"key":"e_1_2_1_121_1","doi-asserted-by":"publisher","DOI":"10.1109\/GLOCOMW.2016.7849067"},{"key":"e_1_2_1_122_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.178"},{"key":"e_1_2_1_123_1","volume-title":"Accessed","author":"Pascal","year":"2012","unstructured":"Pascal VOC. 2012. The PASCAL Visual Object Classes. Retrieved from http:\/\/host.robots.ox.ac.uk\/pascal\/VOC\/. Accessed April 18, 2017."},{"key":"e_1_2_1_124_1","volume-title":"How to construct deep recurrent neural networks. CoRR abs\/1312.6026","author":"Pascanu Razvan","year":"2013","unstructured":"Razvan Pascanu, Caglar Gulcehre, Kyunghyun Cho, and Yoshua Bengio. 2013. How to construct deep recurrent neural networks. CoRR abs\/1312.6026 (2013). Retrieved from http:\/\/arxiv.org\/abs\/1312.6026."},{"key":"e_1_2_1_125_1","volume-title":"SEGAN: Speech enhancement generative adversarial network. CoRR abs\/1703.09452","author":"Pascual Santiago","year":"2017","unstructured":"Santiago Pascual, Antonio Bonafonte, and Joan Serr\u00e0. 2017. SEGAN: Speech enhancement generative adversarial network. CoRR abs\/1703.09452 (2017). arxiv:1703.09452. Retrieved from http:\/\/arxiv.org\/abs\/1703.09452."},{"key":"e_1_2_1_126_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41551-018-0195-0"},{"key":"e_1_2_1_127_1","doi-asserted-by":"publisher","DOI":"10.1142\/S1793351X17400050"},{"key":"e_1_2_1_128_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigMM.2017.36"},{"key":"e_1_2_1_129_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICME.2017.8019447"},{"key":"e_1_2_1_130_1","volume-title":"Unsupervised representation learning with deep convolutional generative adversarial networks. CoRR abs\/1511.06434","author":"Radford Alec","year":"2015","unstructured":"Alec Radford, Luke Metz, and Soumith Chintala. 2015. Unsupervised representation learning with deep convolutional generative adversarial networks. CoRR abs\/1511.06434 (2015). Retrieved from http:\/\/arxiv.org\/abs\/1511.06434."},{"key":"e_1_2_1_131_1","volume-title":"Blei","author":"Ranganath Rajesh","year":"2016","unstructured":"Rajesh Ranganath, Adler J. Perotte, No\u00e9mie Elhadad, and David M. Blei. 2016. Deep survival analysis. In Machine Learning in Health Care. JMLR.org, 101--114."},{"key":"e_1_2_1_132_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_2_1_133_1","doi-asserted-by":"publisher","DOI":"10.5555\/2969239.2969250"},{"key":"e_1_2_1_134_1","doi-asserted-by":"publisher","DOI":"10.1037\/h0042519"},{"key":"e_1_2_1_135_1","unstructured":"Ruslan Salakhutdinov and Geoffrey Hinton. 2009. Deep Boltzmann machines. In Artificial Intelligence and Statistics. PMLR 448--455."},{"key":"e_1_2_1_136_1","doi-asserted-by":"publisher","DOI":"10.1162\/NECO_a_00311"},{"key":"e_1_2_1_137_1","doi-asserted-by":"publisher","DOI":"10.5555\/1886436.1886447"},{"key":"e_1_2_1_138_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.neunet.2014.09.003"},{"key":"e_1_2_1_139_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2011-169"},{"key":"e_1_2_1_140_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.465"},{"key":"e_1_2_1_141_1","doi-asserted-by":"publisher","DOI":"10.1145\/2567948.2577348"},{"key":"e_1_2_1_142_1","volume-title":"Very deep convolutional networks for large-scale image recognition. CoRR abs\/1409.1556","author":"Simonyan Karen","year":"2014","unstructured":"Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. CoRR abs\/1409.1556 (2014). Retrieved from http:\/\/arxiv.org\/abs\/1409.1556."},{"key":"e_1_2_1_143_1","volume-title":"Accessed","year":"2017","unstructured":"Skymind. 2017. Deeplearning4j deep learning framework. Retrieved from https:\/\/deeplearning4j.org. Accessed April 18, 2017."},{"key":"e_1_2_1_145_1","doi-asserted-by":"publisher","DOI":"10.5555\/2986459.2986549"},{"key":"e_1_2_1_146_1","doi-asserted-by":"publisher","DOI":"10.5555\/3104482.3104499"},{"key":"e_1_2_1_147_1","volume-title":"Conference on Empirical Methods in Natural Language Processing. Citeseer, Association for Computational Linguistics, 1631--1642","author":"Socher Richard","year":"2013","unstructured":"Richard Socher, Alex Perelygin, Jean Y. Wu, Jason Chuang, Christopher D. Manning, Andrew Y. Ng, and Christopher Potts. 2013. Recursive deep models for semantic compositionality over a sentiment treebank. In Conference on Empirical Methods in Natural Language Processing. Citeseer, Association for Computational Linguistics, 1631--1642."},{"key":"e_1_2_1_148_1","volume-title":"Amir Roshan Zamir, and Mubarak Shah","author":"Soomro Khurram","year":"2012","unstructured":"Khurram Soomro, Amir Roshan Zamir, and Mubarak Shah. 2012. UCF101: A dataset of 101 human actions classes from videos in the wild. CoRR abs\/1212.0402 (2012). Retrieved from http:\/\/arxiv.org\/abs\/1212.0402."},{"key":"e_1_2_1_149_1","volume-title":"Experiments on parallel training of deep neural network using model averaging. CoRR abs\/1507.01239","author":"Su Hang","year":"2015","unstructured":"Hang Su and Haoyu Chen. 2015. Experiments on parallel training of deep neural network using model averaging. CoRR abs\/1507.01239 (2015). Retrieved from http:\/\/arxiv.org\/abs\/1507.01239."},{"key":"e_1_2_1_150_1","doi-asserted-by":"publisher","DOI":"10.5555\/3042817.3043064"},{"key":"e_1_2_1_151_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298594"},{"key":"e_1_2_1_152_1","doi-asserted-by":"publisher","DOI":"10.1145\/2812802"},{"key":"e_1_2_1_153_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigMM.2017.30"},{"key":"e_1_2_1_154_1","doi-asserted-by":"publisher","DOI":"10.1109\/BigMM.2017.31"},{"key":"e_1_2_1_155_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.128"},{"key":"e_1_2_1_156_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.510"},{"key":"e_1_2_1_157_1","volume-title":"Accessed","author":"Learning Transfer","year":"2017","unstructured":"Transfer Learning. 2017. Convolutional Neural Network for Visual Recognition. Retrieved from http:\/\/cs231n.github.io\/transfer-learning\/. Accessed April 25, 2017."},{"key":"e_1_2_1_158_1","volume-title":"Accessed","year":"2017","unstructured":"Trecvid. 2017. TREC Video Retrieval Evaluation. Retrieved from http:\/\/trecvid.nist.gov. Accessed April 18, 2017."},{"key":"e_1_2_1_159_1","doi-asserted-by":"publisher","DOI":"10.2352\/ISSN.2470-1173.2017.16.CVAS-344"},{"key":"e_1_2_1_160_1","volume-title":"Fast convolutional nets with fbfft: A GPU performance evaluation. CoRR abs\/1412.7580","author":"Vasilache Nicolas","year":"2014","unstructured":"Nicolas Vasilache, Jeff Johnson, Micha\u00ebl Mathieu, Soumith Chintala, Serkan Piantino, and Yann LeCun. 2014. Fast convolutional nets with fbfft: A GPU performance evaluation. CoRR abs\/1412.7580 (2014). Retrieved from http:\/\/arxiv.org\/abs\/1412.7580."},{"key":"e_1_2_1_161_1","doi-asserted-by":"publisher","DOI":"10.1145\/2911451.2914762"},{"key":"e_1_2_1_162_1","doi-asserted-by":"publisher","DOI":"10.1109\/TCAD.2016.2587683"},{"key":"e_1_2_1_163_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11432-014-5237-y"},{"key":"e_1_2_1_164_1","volume-title":"International Joint Conference on Neural Networks. IEEE, 2384--2391","author":"Wehrmann Joonatas","unstructured":"Joonatas Wehrmann, Willian Becker, Henry E. L. Cagnini, and Rodrigo C. Barros. 2017. A character-based convolutional neural network for language-agnostic Twitter sentiment analysis. In International Joint Conference on Neural Networks. IEEE, 2384--2391."},{"key":"e_1_2_1_165_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2015.2444659"},{"key":"e_1_2_1_166_1","volume-title":"Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. CoRR abs\/1609.08144","author":"Wu Yonghui","year":"2016","unstructured":"Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, Lukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith, Jason Riesa, Alex Rudnick, Oriol Vinyals, Greg Corrado, Macduff Hughes, and Jeffrey Dean. 2016. Google\u2019s neural machine translation system: Bridging the gap between human and machine translation. CoRR abs\/1609.08144 (2016). arxiv:1609.08144. Retrieved from http:\/\/arxiv.org\/abs\/1609.08144."},{"key":"e_1_2_1_167_1","volume-title":"Aggregated residual transformations for deep neural networks. CoRR abs\/1611.05431","author":"Xie Saining","year":"2016","unstructured":"Saining Xie, Ross Girshick, Piotr Doll\u00e1r, Zhuowen Tu, and Kaiming He. 2016. Aggregated residual transformations for deep neural networks. CoRR abs\/1611.05431 (2016). Retrieved from http:\/\/arxiv.org\/abs\/1611.05431."},{"key":"e_1_2_1_168_1","volume-title":"Multi-GPU training of ConvNets. CoRR abs\/1312.5853","author":"Yadan Omry","year":"2013","unstructured":"Omry Yadan, Keith Adams, Yaniv Taigman, and Marc\u2019Aurelio Ranzato. 2013. Multi-GPU training of ConvNets. CoRR abs\/1312.5853 (2013)."},{"key":"e_1_2_1_169_1","doi-asserted-by":"publisher","DOI":"10.4018\/IJMDEM.2017010101"},{"key":"e_1_2_1_170_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISM.2015.126"},{"key":"e_1_2_1_171_1","doi-asserted-by":"publisher","DOI":"10.1109\/IRI.2016.88"},{"key":"e_1_2_1_172_1","volume-title":"ABCNN: Attention-based convolutional neural network for modeling sentence pairs. CoRR abs\/1512.05193","author":"Yin Wenpeng","year":"2015","unstructured":"Wenpeng Yin, Hinrich Sch\u00fctze, Bing Xiang, and Bowen Zhou. 2015. ABCNN: Attention-based convolutional neural network for modeling sentence pairs. CoRR abs\/1512.05193 (2015). arxiv:1512.05193. Retrieved from http:\/\/arxiv.org\/abs\/1512.05193."},{"key":"e_1_2_1_173_1","volume-title":"The 15th Annual Conference of the International Speech Communication Association. ISCA.","author":"Yu Dong","year":"2014","unstructured":"Dong Yu, Adam Eversole, Michael L. Seltzer, Kaisheng Yao, Brian Guenter, Oleksii Kuchaiev, Frank Seide, Huaming Wang, Jasha Droppo, Zhiheng Huang, Geoffrey Zweig, Christopher J. Rossbach, and Jon Currey. 2014. An introduction to computational networks and the computational network toolkit. In The 15th Annual Conference of the International Speech Communication Association. ISCA."},{"key":"e_1_2_1_174_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7952154"},{"key":"e_1_2_1_175_1","doi-asserted-by":"publisher","DOI":"10.1109\/CCGrid.2015.114"},{"key":"e_1_2_1_176_1","volume-title":"ADADELTA: An adaptive learning rate method. CoRR abs\/1212.5701","author":"Zeiler Matthew D.","year":"2012","unstructured":"Matthew D. Zeiler. 2012. ADADELTA: An adaptive learning rate method. CoRR abs\/1212.5701 (2012). Retrieved from http:\/\/arxiv.org\/abs\/1212.5701."},{"key":"e_1_2_1_177_1","doi-asserted-by":"publisher","DOI":"10.1145\/2684746.2689060"},{"key":"e_1_2_1_178_1","doi-asserted-by":"publisher","DOI":"10.1109\/TASLP.2017.2687104"},{"key":"e_1_2_1_179_1","doi-asserted-by":"publisher","unstructured":"Xiang Zhang Junbo Zhao and Yann LeCun. 2015. Character-level convolutional networks for text classification. In Advances in Neural Information Processing Systems. 649--657.","DOI":"10.5555\/2969239.2969312"},{"key":"e_1_2_1_180_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2017.7953168"},{"key":"e_1_2_1_181_1","doi-asserted-by":"publisher","DOI":"10.21437\/Interspeech.2016-354"}],"container-title":["ACM Computing Surveys"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3234150","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3234150","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:08:03Z","timestamp":1750212483000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3234150"}},"subtitle":["Algorithms, Techniques, and Applications"],"short-title":[],"issued":{"date-parts":[[2018,9,18]]},"references-count":179,"journal-issue":{"issue":"5","published-print":{"date-parts":[[2019,9,30]]}},"alternative-id":["10.1145\/3234150"],"URL":"https:\/\/doi.org\/10.1145\/3234150","relation":{},"ISSN":["0360-0300","1557-7341"],"issn-type":[{"value":"0360-0300","type":"print"},{"value":"1557-7341","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,9,18]]},"assertion":[{"value":"2017-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2018-09-18","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}