{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,14]],"date-time":"2026-05-14T23:23:13Z","timestamp":1778800993419,"version":"3.51.4"},"reference-count":86,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T00:00:00Z","timestamp":1602460800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"National Science Foundation","award":["1660071"],"award-info":[{"award-number":["1660071"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Storage"],"published-print":{"date-parts":[[2020,11,30]]},"abstract":"<jats:p>The growing volume of data produced continuously in the Cloud and at the Edge poses significant challenges for large-scale AI applications to extract and learn useful information from the data in a timely and efficient way. The goal of this article is to explore the use of computational storage to address such challenges by distributed near-data processing. We describe Newport, a high-performance and energy-efficient computational storage developed for realizing the full potential of in-storage processing. To the best of our knowledge, Newport is the first commodity SSD that can be configured to run a server-like operating system, greatly minimizing the effort for creating and maintaining applications running inside the storage. We analyze the benefits of using Newport by running complex AI applications such as image similarity search and object tracking on a large visual dataset. The results demonstrate that data-intensive AI workloads can be efficiently parallelized and offloaded, even to a small set of Newport drives with significant performance gains and energy savings. In addition, we introduce a comprehensive taxonomy of existing computational storage solutions together with a realistic cost analysis for high-volume production, giving a good big picture of the economic feasibility of the computational storage technology.<\/jats:p>","DOI":"10.1145\/3415580","type":"journal-article","created":{"date-parts":[[2020,10,12]],"date-time":"2020-10-12T22:25:20Z","timestamp":1602541520000},"page":"1-37","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":48,"title":["Cost-effective, Energy-efficient, and Scalable Storage Computing for Large-scale AI Applications"],"prefix":"10.1145","volume":"16","author":[{"given":"Jaeyoung","family":"Do","sequence":"first","affiliation":[{"name":"Microsoft Research, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Victor C.","family":"Ferreira","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5494-3926","authenticated-orcid":false,"given":"Hossein","family":"Bobarshad","sequence":"additional","affiliation":[{"name":"NGD Systems, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-7765-3064","authenticated-orcid":false,"given":"Mahdi","family":"Torabzadehkashi","sequence":"additional","affiliation":[{"name":"NGD Systems, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Siavash","family":"Rezaei","sequence":"additional","affiliation":[{"name":"University of California, Irvine, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-6218-8745","authenticated-orcid":false,"given":"Ali","family":"Heydarigorji","sequence":"additional","affiliation":[{"name":"University of California, Irvine, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Diego","family":"Souza","sequence":"additional","affiliation":[{"name":"Wespa Intelligent Systems"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Brunno F.","family":"Goldstein","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Leandro","family":"Santiago","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Min Soo","family":"Kim","sequence":"additional","affiliation":[{"name":"University of California, Irvine, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Priscila M. V.","family":"Lima","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Felipe M. G.","family":"Fran\u00e7a","sequence":"additional","affiliation":[{"name":"Federal University of Rio de Janeiro, Brazil"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Vladimir","family":"Alves","sequence":"additional","affiliation":[{"name":"NGD Systems, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,10,12]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Youtube-8m: A large-scale video classification benchmark. Arxiv Preprint Arxiv:1609.08675","author":"Abu-El-Haija Sami","year":"2016","unstructured":"Sami Abu-El-Haija , Nisarg Kothari , Joonseok Lee , Paul Natsev , George Toderici , Balakrishnan Varadarajan , and Sudheendra Vijayanarasimhan . 2016. Youtube-8m: A large-scale video classification benchmark. Arxiv Preprint Arxiv:1609.08675 ( 2016 ). Sami Abu-El-Haija, Nisarg Kothari, Joonseok Lee, Paul Natsev, George Toderici, Balakrishnan Varadarajan, and Sudheendra Vijayanarasimhan. 2016. Youtube-8m: A large-scale video classification benchmark. Arxiv Preprint Arxiv:1609.08675 (2016)."},{"key":"e_1_2_1_2_1","volume-title":"Proceedings of the 17th European Symposium on Artificial Neural Networks.","author":"Aleksander I.","unstructured":"I. Aleksander , M. De Gregorio , F. Maia Galv\u00e3o Fran\u00e7a, P. Machado Vieira Lima, and H. Morton. 2009. A brief introduction to weightless neural systems . In Proceedings of the 17th European Symposium on Artificial Neural Networks. Retrieved from https:\/\/www.elen.ucl.ac.be\/Proceedings\/esann\/esannpdf\/es2009-6.pdf. I. Aleksander, M. De Gregorio, F. Maia Galv\u00e3o Fran\u00e7a, P. Machado Vieira Lima, and H. Morton. 2009. A brief introduction to weightless neural systems. In Proceedings of the 17th European Symposium on Artificial Neural Networks. Retrieved from https:\/\/www.elen.ucl.ac.be\/Proceedings\/esann\/esannpdf\/es2009-6.pdf."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1108\/eb007637"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.is.2019.02.006"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the Symposium on Operating Systems Design and Implementation","volume":"10","author":"Beaver Doug","year":"2010","unstructured":"Doug Beaver , Sanjeev Kumar , Harry C. Li , Jason Sobel , Peter Vajgel et \u00a0al . 2010 . Finding a needle in haystack: Facebook\u2019s photo storage . In Proceedings of the Symposium on Operating Systems Design and Implementation , Vol. 10 . 1--8. Doug Beaver, Sanjeev Kumar, Harry C. Li, Jason Sobel, Peter Vajgel et\u00a0al. 2010. Finding a needle in haystack: Facebook\u2019s photo storage. In Proceedings of the Symposium on Operating Systems Design and Implementation, Vol. 10. 1--8."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2010.5539960"},{"key":"e_1_2_1_7_1","volume-title":"Salient object detection: A survey. Comput. Vis. Media","author":"Borji Ali","year":"2019","unstructured":"Ali Borji , Ming-Ming Cheng , Qibin Hou , Huaizu Jiang , and Jia Li. 2019. Salient object detection: A survey. Comput. Vis. Media ( 2019 ), 1--34. Ali Borji, Ming-Ming Cheng, Qibin Hou, Huaizu Jiang, and Jia Li. 2019. Salient object detection: A survey. Comput. Vis. Media (2019), 1--34."},{"key":"e_1_2_1_8_1","volume-title":"Understanding the OSI 7-layer model. PC Netw. Advis. 120, 2","author":"Briscoe Neil","year":"2000","unstructured":"Neil Briscoe . 2000. Understanding the OSI 7-layer model. PC Netw. Advis. 120, 2 ( 2000 ). Neil Briscoe. 2000. Understanding the OSI 7-layer model. PC Netw. Advis. 120, 2 (2000)."},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1027794.1027801"},{"key":"e_1_2_1_10_1","volume-title":"Proceedings of the 27th International ACM Conference on International Conference on Supercomputing. ACM, 91--102","author":"Cho Sangyeun","unstructured":"Sangyeun Cho , Chanik Park , Hyunok Oh , Sungchan Kim , Youngmin Yi , and Gregory R. Ganger . 2013. Active disk meets flash: A case for intelligent SSDs . In Proceedings of the 27th International ACM Conference on International Conference on Supercomputing. ACM, 91--102 . Sangyeun Cho, Chanik Park, Hyunok Oh, Sungchan Kim, Youngmin Yi, and Gregory R. Ganger. 2013. Active disk meets flash: A case for intelligent SSDs. In Proceedings of the 27th International ACM Conference on International Conference on Supercomputing. ACM, 91--102."},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.sysarc.2009.03.005"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2380656.2380672"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1109\/99.660313"},{"key":"e_1_2_1_14_1","volume-title":"Nearest-neighbor Methods in Learning and Vision: Theory and Practice","author":"Darrell Trevor","unstructured":"Trevor Darrell , Piotr Indyk , and Gregory Shakhnarovich . 2005. Nearest-neighbor Methods in Learning and Vision: Theory and Practice . The MIT Press . Trevor Darrell, Piotr Indyk, and Gregory Shakhnarovich. 2005. Nearest-neighbor Methods in Learning and Vision: Theory and Practice. The MIT Press."},{"key":"e_1_2_1_15_1","volume-title":"Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM, 1221--1230","author":"Do Jaeyoung","unstructured":"Jaeyoung Do , Yang-Suk Kee , Jignesh M. Patel , Chanik Park , Kwanghyun Park , and David J . DeWitt. 2013. Query processing on smart SSDs: Opportunities and challenges . In Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM, 1221--1230 . Jaeyoung Do, Yang-Suk Kee, Jignesh M. Patel, Chanik Park, Kwanghyun Park, and David J. DeWitt. 2013. Query processing on smart SSDs: Opportunities and challenges. In Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM, 1221--1230."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/3286588"},{"key":"e_1_2_1_17_1","volume-title":"Proceedings of the European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. 978\u2013287587014\u20138. DOI:https:\/\/doi.org\/10","author":"Do Nascimiento Daniel N.","unstructured":"Daniel N. Do Nascimiento , Rafael Lima De Carvalho , Felix Mora-Camino , Priscila V. M. Lima , and Felipe M. G. Franca . 2015. A WiSARD-based multi-term memory framework for online tracking of objects . In Proceedings of the European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. 978\u2013287587014\u20138. DOI:https:\/\/doi.org\/10 .13140\/RG.2.1.3387.5687 10.13140\/RG.2.1.3387.5687 Daniel N. Do Nascimiento, Rafael Lima De Carvalho, Felix Mora-Camino, Priscila V. M. Lima, and Felipe M. G. Franca. 2015. A WiSARD-based multi-term memory framework for online tracking of objects. In Proceedings of the European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. 978\u2013287587014\u20138. DOI:https:\/\/doi.org\/10.13140\/RG.2.1.3387.5687"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2002.1033219"},{"key":"e_1_2_1_19_1","unstructured":"Eideticom. 2020. Retrieved from https:\/\/www.eideticom.com\/.  Eideticom. 2020. Retrieved from https:\/\/www.eideticom.com\/."},{"key":"e_1_2_1_20_1","volume-title":"Inside Solid State Drives (SSDs)","author":"Eshghi K.","unstructured":"K. Eshghi and Rino Micheloni . 2013. SSD architecture and PCI express interface . In Inside Solid State Drives (SSDs) . Springer , 19--45. K. Eshghi and Rino Micheloni. 2013. SSD architecture and PCI express interface. In Inside Solid State Drives (SSDs). Springer, 19--45."},{"key":"e_1_2_1_21_1","unstructured":"A. E. Eshratifar M. S. Abrishami and M. Pedram. 2019. JointDNN: An efficient training and inference engine for intelligent mobile cloud computing services. IEEE Trans. Mob. Comput. (2019) 1--1.  A. E. Eshratifar M. S. Abrishami and M. Pedram. 2019. JointDNN: An efficient training and inference engine for intelligent mobile cloud computing services. IEEE Trans. Mob. Comput. (2019) 1--1."},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the Linux Symposium","volume":"1","author":"Fasheh Mark","year":"2006","unstructured":"Mark Fasheh . 2006 . OCFS2: The Oracle Clustered File System, version 2 . In Proceedings of the Linux Symposium , Vol. 1 . 289--302. Mark Fasheh. 2006. OCFS2: The Oracle Clustered File System, version 2. In Proceedings of the Linux Symposium, Vol. 1. 289--302."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICARCV.2010.5707913"},{"key":"e_1_2_1_24_1","volume-title":"The digital universe in 2020: Big data, bigger digital shadows, and biggest growth in the Far East. IDC iView: IDC Analyze the future","author":"Gantz John","year":"2007","unstructured":"John Gantz and David Reinsel . 2012. The digital universe in 2020: Big data, bigger digital shadows, and biggest growth in the Far East. IDC iView: IDC Analyze the future 2007 , 2012 (2012), 1--16. John Gantz and David Reinsel. 2012. The digital universe in 2020: Big data, bigger digital shadows, and biggest growth in the Far East. IDC iView: IDC Analyze the future 2007, 2012 (2012), 1--16."},{"key":"e_1_2_1_25_1","volume-title":"Squyres","author":"Graham Richard L.","year":"2006","unstructured":"Richard L. Graham , Timothy S. Woodall , and Jeffrey M . Squyres . 2006 . Open MPI: A flexible high performance MPI. In Parallel Processing and Applied Mathematics, Roman Wyrzykowski, Jack Dongarra, Norbert Meyer, and Jerzy Wa\u015bniewski (Eds.). Springer Berlin , 228--239. Richard L. Graham, Timothy S. Woodall, and Jeffrey M. Squyres. 2006. Open MPI: A flexible high performance MPI. In Parallel Processing and Applied Mathematics, Roman Wyrzykowski, Jack Dongarra, Norbert Meyer, and Jerzy Wa\u015bniewski (Eds.). Springer Berlin, 228--239."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1016\/0167-8191(96)00024-5"},{"key":"e_1_2_1_27_1","unstructured":"SSD Form Factor Working Group. 2011. Retrieved from http:\/\/www.ssdformfactor.org\/docs\/SSD_Form_Factor_Version1_00.pdf.  SSD Form Factor Working Group. 2011. Retrieved from http:\/\/www.ssdformfactor.org\/docs\/SSD_Form_Factor_Version1_00.pdf."},{"key":"e_1_2_1_28_1","volume-title":"Sangyeun Cho et\u00a0al","author":"Gu Boncheol","year":"2016","unstructured":"Boncheol Gu , Andre S. Yoon , Duck-Ho Bae , Insoon Jo , Jinyoung Lee , Jonghyun Yoon , Jeong-Uk Kang , Moonsang Kwon , Chanho Yoon , Sangyeun Cho et\u00a0al . 2016 . Biscuit : A framework for near-data processing of big data workloads. In ACM SIGARCH Comput. Archit. News, Vol. 44 . IEEE Press , 153--165. Boncheol Gu, Andre S. Yoon, Duck-Ho Bae, Insoon Jo, Jinyoung Lee, Jonghyun Yoon, Jeong-Uk Kang, Moonsang Kwon, Chanho Yoon, Sangyeun Cho et\u00a0al. 2016. Biscuit: A framework for near-data processing of big data workloads. In ACM SIGARCH Comput. Archit. News, Vol. 44. IEEE Press, 153--165."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/1508244.1508271"},{"key":"e_1_2_1_30_1","volume-title":"Proceedings of the International Conference on Advances in Neural Information Processing Systems. 409--415","author":"Hastie Trevor","year":"1996","unstructured":"Trevor Hastie and Robert Tibshirani . 1996 . Discriminant adaptive nearest neighbor classification and regression . In Proceedings of the International Conference on Advances in Neural Information Processing Systems. 409--415 . Trevor Hastie and Robert Tibshirani. 1996. Discriminant adaptive nearest neighbor classification and regression. In Proceedings of the International Conference on Advances in Neural Information Processing Systems. 409--415."},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-46448-0_45"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2345390"},{"key":"e_1_2_1_33_1","volume-title":"Chou","author":"HeydariGorji Ali","year":"2020","unstructured":"Ali HeydariGorji , Siavash Rezaei , Mahdi Torabzadehkashi , Hossein Bobarshad , Vladimir Alves , and Pai H . Chou . 2020 . HyperTune: Dynamic hyperparameter tuning for efficient distribution of DNN training over heterogeneous systems. Arxiv Preprint Arxiv :2007.08077 (2020). Ali HeydariGorji, Siavash Rezaei, Mahdi Torabzadehkashi, Hossein Bobarshad, Vladimir Alves, and Pai H. Chou. 2020. HyperTune: Dynamic hyperparameter tuning for efficient distribution of DNN training over heterogeneous systems. Arxiv Preprint Arxiv:2007.08077 (2020)."},{"key":"e_1_2_1_34_1","volume-title":"Chou","author":"HeydariGorji Ali","year":"2020","unstructured":"Ali HeydariGorji , Mahdi Torabzadehkashi , Siavash Rezaei , Hossein Bobarshad , Vladimir Alves , and Pai H . Chou . 2020 . STANNIS : Low-Power Acceleration of Deep Neural Network Training Using Computational Storage Devices. Retrieved from arxiv:cs.DC\/2002.07215. Ali HeydariGorji, Mahdi Torabzadehkashi, Siavash Rezaei, Hossein Bobarshad, Vladimir Alves, and Pai H. Chou. 2020. STANNIS: Low-Power Acceleration of Deep Neural Network Training Using Computational Storage Devices. Retrieved from arxiv:cs.DC\/2002.07215."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.243"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2010.57"},{"key":"e_1_2_1_38_1","volume-title":"Caffe: Convolutional architecture for fast feature embedding. Arxiv Preprint Arxiv:1408.5093","author":"Jia Yangqing","year":"2014","unstructured":"Yangqing Jia , Evan Shelhamer , Jeff Donahue , Sergey Karayev , Jonathan Long , Ross Girshick , Sergio Guadarrama , and Trevor Darrell . 2014 . Caffe: Convolutional architecture for fast feature embedding. Arxiv Preprint Arxiv:1408.5093 (2014). Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Caffe: Convolutional architecture for fast feature embedding. Arxiv Preprint Arxiv:1408.5093 (2014)."},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.14778\/2994509.2994512"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2749469.2750412"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSST.2013.6558444"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2015.07.056"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123939.3124553"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2016.2516982"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2018.00935"},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of the 22nd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. 541--546","author":"De Carvalho Rafael Lima","unstructured":"Rafael Lima De Carvalho , Danilo S. C. Carvalho , Felix Mora-Camino , Priscila V. M. Lima , and Felipe M. G. Fran\u00e7a . 2014. Online tracking of multiple objects using WiSARD . In Proceedings of the 22nd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. 541--546 . Retrieved from https:\/\/hal-enac.archives-ouvertes.fr\/hal-01059678. Rafael Lima De Carvalho, Danilo S. C. Carvalho, Felix Mora-Camino, Priscila V. M. Lima, and Felipe M. G. Fran\u00e7a. 2014. Online tracking of multiple objects using WiSARD. In Proceedings of the 22nd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. 541--546. Retrieved from https:\/\/hal-enac.archives-ouvertes.fr\/hal-01059678."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:VISI.0000029664.99615.94"},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1049\/el:19961141"},{"key":"e_1_2_1_49_1","volume-title":"Proceedings of the Flash Memory Summit.","author":"Mehra Pankaj","year":"2019","unstructured":"Pankaj Mehra . 2019 . Samsung smartSSD: Accelerating data-rich applications . In Proceedings of the Flash Memory Summit. Pankaj Mehra. 2019. Samsung smartSSD: Accelerating data-rich applications. In Proceedings of the Flash Memory Summit."},{"key":"e_1_2_1_50_1","first-page":"2","article-title":"Docker: Lightweight Linux containers for consistent development and deployment","volume":"239","author":"Merkel Dirk","year":"2014","unstructured":"Dirk Merkel . 2014 . Docker: Lightweight Linux containers for consistent development and deployment . Linux J. 2014, 239 (2014), 2 . Dirk Merkel. 2014. Docker: Lightweight Linux containers for consistent development and deployment. Linux J. 2014, 239 (2014), 2.","journal-title":"Linux"},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.465"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1145\/2928275.2928278"},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.1145\/1862876.1862877"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1109\/ISCAS.2017.8050867"},{"key":"e_1_2_1_55_1","volume-title":"Proceedings of the Flash Memory Summit.","author":"Ohshima Shigeo","year":"2016","unstructured":"Shigeo Ohshima and Yoichiro Tanaka . 2016 . New 3D flash technologies offer both low cost and low power solutions . In Proceedings of the Flash Memory Summit. Shigeo Ohshima and Yoichiro Tanaka. 2016. New 3D flash technologies offer both low cost and low power solutions. In Proceedings of the Flash Memory Summit."},{"key":"e_1_2_1_56_1","unstructured":"ONFI online. 2017. Open NAND Flash interface specification. Retrieved from http:\/\/www.onfi.org\/specifications.  ONFI online. 2017. Open NAND Flash interface specification. Retrieved from http:\/\/www.onfi.org\/specifications."},{"key":"e_1_2_1_57_1","unstructured":"PCI-SIG. 2020. Retrieved from https:\/\/pcisig.com\/specifications\/pciexpress\/M.2_Specification\/.  PCI-SIG. 2020. Retrieved from https:\/\/pcisig.com\/specifications\/pciexpress\/M.2_Specification\/."},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.91"},{"key":"e_1_2_1_59_1","volume-title":"Yolov3: An incremental improvement. Arxiv Preprint Arxiv:1804.02767","author":"Redmon Joseph","year":"2018","unstructured":"Joseph Redmon and Ali Farhadi . 2018. Yolov3: An incremental improvement. Arxiv Preprint Arxiv:1804.02767 ( 2018 ). Joseph Redmon and Ali Farhadi. 2018. Yolov3: An incremental improvement. Arxiv Preprint Arxiv:1804.02767 (2018)."},{"key":"e_1_2_1_60_1","unstructured":"IDC Report. 2018. The digitization of the world from edge to core. Retrieved from https:\/\/www.seagate.com\/files\/www-content\/our-story\/trends\/files\/idc-seagate-dataage-whitepaper.pdf.  IDC Report. 2018. The digitization of the world from edge to core. Retrieved from https:\/\/www.seagate.com\/files\/www-content\/our-story\/trends\/files\/idc-seagate-dataage-whitepaper.pdf."},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.1109\/ReConFig48160.2019.8994771"},{"key":"e_1_2_1_62_1","volume-title":"Proceedings of the International Conference on ReConFigurable Computing and FPGAs (ReConFig\u201916)","author":"Rezaei S.","year":"2016","unstructured":"S. Rezaei , C. Hernandez-Calderon , S. Mirzamohammadi , E. Bozorgzadeh , A. Veidenbaum , A. Nicolau , and M. J. Prather . 2016. Data-rate-aware FPGA-based acceleration framework for streaming applications . In Proceedings of the International Conference on ReConFigurable Computing and FPGAs (ReConFig\u201916) . 1--6. DOI:https:\/\/doi.org\/10.1109\/ReConFig. 2016 .7857162 10.1109\/ReConFig.2016.7857162 S. Rezaei, C. Hernandez-Calderon, S. Mirzamohammadi, E. Bozorgzadeh, A. Veidenbaum, A. Nicolau, and M. J. Prather. 2016. Data-rate-aware FPGA-based acceleration framework for streaming applications. In Proceedings of the International Conference on ReConFigurable Computing and FPGAs (ReConFig\u201916). 1--6. DOI:https:\/\/doi.org\/10.1109\/ReConFig.2016.7857162"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCD.2018.00063"},{"key":"e_1_2_1_64_1","unstructured":"Steve Roddy. 2019. Arm NN: the Easy Way to Deploy Edge ML. Retrieved from https:\/\/community.arm.com\/developer\/tools-software\/tools\/b\/tools-software-ides-blog\/posts\/arm-nn-the-easy-way-to-deploy-edge-ml?_ga=2.9822706.6940669.1579038789-277442185.1570226249.  Steve Roddy. 2019. Arm NN: the Easy Way to Deploy Edge ML. Retrieved from https:\/\/community.arm.com\/developer\/tools-software\/tools\/b\/tools-software-ides-blog\/posts\/arm-nn-the-easy-way-to-deploy-edge-ml?_ga=2.9822706.6940669.1579038789-277442185.1570226249."},{"key":"e_1_2_1_65_1","volume-title":"CodeX: Bit-flexible encoding for streaming-based FPGA acceleration of DNNs. CoRR abs\/1901.05582","author":"Samragh Mohammad","year":"2019","unstructured":"Mohammad Samragh , Mojan Javaheripi , and Farinaz Koushanfar . 2019. CodeX: Bit-flexible encoding for streaming-based FPGA acceleration of DNNs. CoRR abs\/1901.05582 ( 2019 ). Mohammad Samragh, Mojan Javaheripi, and Farinaz Koushanfar. 2019. CodeX: Bit-flexible encoding for streaming-based FPGA acceleration of DNNs. CoRR abs\/1901.05582 (2019)."},{"key":"e_1_2_1_66_1","unstructured":"Scaleflux. 2020. Retrieved from http:\/\/scaleflux.com\/index.html.  Scaleflux. 2020. Retrieved from http:\/\/scaleflux.com\/index.html."},{"key":"e_1_2_1_67_1","volume-title":"Proceedings of the Symposium on Operating Systems Design and Implementation. 67--80","author":"Seshadri Sudharsan","year":"2014","unstructured":"Sudharsan Seshadri , Mark Gahagan , Meenakshi Sundaram Bhaskaran , Trevor Bunker , Arup De , Yanqin Jin , Yang Liu , and Steven Swanson . 2014 . Willow: A user-programmable SSD . In Proceedings of the Symposium on Operating Systems Design and Implementation. 67--80 . Sudharsan Seshadri, Mark Gahagan, Meenakshi Sundaram Bhaskaran, Trevor Bunker, Arup De, Yanqin Jin, Yang Liu, and Steven Swanson. 2014. Willow: A user-programmable SSD. In Proceedings of the Symposium on Operating Systems Design and Implementation. 67--80."},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSST.2010.5496972"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2003.1238663"},{"key":"e_1_2_1_70_1","unstructured":"SNIA. 2019. Computational Storage Technical Working Group. Retrieved from https:\/\/www.snia.org\/computational.  SNIA. 2019. Computational Storage Technical Working Group. Retrieved from https:\/\/www.snia.org\/computational."},{"key":"e_1_2_1_71_1","unstructured":"SolarWinds. 2018. Can gzip Compression Really Improve Web Performance? Retrieved from https:\/\/royal.pingdom.com\/can-gzip-compression-really-improve-web-performance\/.  SolarWinds. 2018. Can gzip Compression Really Improve Web Performance? Retrieved from https:\/\/royal.pingdom.com\/can-gzip-compression-really-improve-web-performance\/."},{"key":"e_1_2_1_72_1","first-page":"1","article-title":"The global file system: A file system for shared disk storage","volume":"1","author":"Soltis Steven R.","year":"1997","unstructured":"Steven R. Soltis , G. M. Erickson , Kenneth W. Preslan , Matthew T. O\u2019Keefe , and Thomas M. Ruwart . 1997 . The global file system: A file system for shared disk storage . IEEE Transactions on Parallel and Distributed Systems 1 (1997), 1 . Steven R. Soltis, G. M. Erickson, Kenneth W. Preslan, Matthew T. O\u2019Keefe, and Thomas M. Ruwart. 1997. The global file system: A file system for shared disk storage. IEEE Transactions on Parallel and Distributed Systems 1 (1997), 1.","journal-title":"IEEE Transactions on Parallel and Distributed Systems"},{"key":"e_1_2_1_73_1","unstructured":"Mohan Srinivasan. 2014. Flashcache. Retrieved from https:\/\/github.com\/facebookarchive\/flashcache.  Mohan Srinivasan. 2014. Flashcache. Retrieved from https:\/\/github.com\/facebookarchive\/flashcache."},{"key":"e_1_2_1_74_1","unstructured":"NGD Systems. 2020. Retrieved from https:\/\/www.ngdsystems.com.  NGD Systems. 2020. Retrieved from https:\/\/www.ngdsystems.com."},{"key":"e_1_2_1_75_1","unstructured":"Ted Friedman Thomas Bittman Neil MacDonald. 2019. How to Overcome Four Major Challenges in Edge Computing. Retrieved from https:\/\/www.gartner.com\/doc\/reprints?id=1-1XWDQ2PW8ct=1912108st=sb.  Ted Friedman Thomas Bittman Neil MacDonald. 2019. How to Overcome Four Major Challenges in Edge Computing. Retrieved from https:\/\/www.gartner.com\/doc\/reprints?id=1-1XWDQ2PW8ct=1912108st=sb."},{"key":"e_1_2_1_76_1","volume-title":"Proceedings of the USENIX Conference on File and Storage Technologies. 119--132","author":"Tiwari Devesh","year":"2013","unstructured":"Devesh Tiwari , Simona Boboila , Sudharshan S. Vazhkudai , Youngjae Kim , Xiaosong Ma , Peter Desnoyers , and Yan Solihin . 2013 . Active flash: Towards energy-efficient, in-situ data analytics on extreme-scale machines . In Proceedings of the USENIX Conference on File and Storage Technologies. 119--132 . Devesh Tiwari, Simona Boboila, Sudharshan S. Vazhkudai, Youngjae Kim, Xiaosong Ma, Peter Desnoyers, and Yan Solihin. 2013. Active flash: Towards energy-efficient, in-situ data analytics on extreme-scale machines. In Proceedings of the USENIX Conference on File and Storage Technologies. 119--132."},{"key":"e_1_2_1_77_1","volume-title":"Proceedings of the IEEE 21st International Conference on High Performance Computing and Communications","author":"Torabzadehkashi Mahdi","year":"1878","unstructured":"Mahdi Torabzadehkashi , Ali Heydarigorji , Siavash Rezaei , Hosein Bobarshad , Vladimir Alves , and Nader Bagherzadeh . 2019. Accelerating HPC applications using computational storage devices . In Proceedings of the IEEE 21st International Conference on High Performance Computing and Communications ; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC\/SmartCity\/DSS\u201919). IEEE , 1878 --1885. Mahdi Torabzadehkashi, Ali Heydarigorji, Siavash Rezaei, Hosein Bobarshad, Vladimir Alves, and Nader Bagherzadeh. 2019. Accelerating HPC applications using computational storage devices. In Proceedings of the IEEE 21st International Conference on High Performance Computing and Communications; IEEE 17th International Conference on Smart City; IEEE 5th International Conference on Data Science and Systems (HPCC\/SmartCity\/DSS\u201919). IEEE, 1878--1885."},{"key":"e_1_2_1_78_1","doi-asserted-by":"publisher","DOI":"10.1109\/IPDPSW.2018.00195"},{"key":"e_1_2_1_79_1","doi-asserted-by":"publisher","DOI":"10.1109\/EMPDP.2019.8671589"},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-019-0265-5"},{"key":"e_1_2_1_81_1","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2805--2813","author":"Valmadre Jack","unstructured":"Jack Valmadre , Luca Bertinetto , Jo\u00e3o Henriques , Andrea Vedaldi , and Philip H. S. Torr . 2017. End-to-end representation learning for correlation filter based tracking . In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2805--2813 . Jack Valmadre, Luca Bertinetto, Jo\u00e3o Henriques, Andrea Vedaldi, and Philip H. S. Torr. 2017. End-to-end representation learning for correlation filter based tracking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2805--2813."},{"key":"e_1_2_1_82_1","unstructured":"DRAM Exchange Website. 2020. DRAM Exchange. Retrieved from https:\/\/www.dramexchange.com.  DRAM Exchange Website. 2020. DRAM Exchange. Retrieved from https:\/\/www.dramexchange.com."},{"key":"e_1_2_1_83_1","unstructured":"Part Stock Website. 2020. Part Stock Specs for Xilinx XCZU19EG-2FFVC1760E. Retrieved from http:\/\/www.part-stock.com\/product-part\/xilinx__XCZU19EG-2FFVC1760E.html.  Part Stock Website. 2020. Part Stock Specs for Xilinx XCZU19EG-2FFVC1760E. Retrieved from http:\/\/www.part-stock.com\/product-part\/xilinx__XCZU19EG-2FFVC1760E.html."},{"key":"e_1_2_1_84_1","volume-title":"Saul","author":"Weinberger Kilian Q.","year":"2009","unstructured":"Kilian Q. Weinberger and Lawrence K . Saul . 2009 . Distance metric learning for large margin nearest neighbor classification. J. Mach. Learn. Res . 10(Feb. 2009), 207--244. Kilian Q. Weinberger and Lawrence K. Saul. 2009. Distance metric learning for large margin nearest neighbor classification. J. Mach. Learn. Res. 10(Feb. 2009), 207--244."},{"key":"e_1_2_1_85_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2388226"},{"key":"e_1_2_1_86_1","doi-asserted-by":"publisher","DOI":"10.1109\/SMARTCOMP.2014.7043841"},{"key":"e_1_2_1_87_1","volume-title":"Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST\u201913)","author":"Zhao Kai","year":"2013","unstructured":"Kai Zhao , Wenzhe Zhao , Hongbin Sun , Xiaodong Zhang , Nanning Zheng , and Tong Zhang . 2013 . LDPC-in-SSD: Making advanced error correction codes work effectively in solid state drives . In Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST\u201913) . 243--256. Kai Zhao, Wenzhe Zhao, Hongbin Sun, Xiaodong Zhang, Nanning Zheng, and Tong Zhang. 2013. LDPC-in-SSD: Making advanced error correction codes work effectively in solid state drives. In Proceedings of the 11th USENIX Conference on File and Storage Technologies (FAST\u201913). 243--256."}],"container-title":["ACM Transactions on Storage"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3415580","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3415580","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3415580","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:31:53Z","timestamp":1750195913000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3415580"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,10,12]]},"references-count":86,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2020,11,30]]}},"alternative-id":["10.1145\/3415580"],"URL":"https:\/\/doi.org\/10.1145\/3415580","relation":{},"ISSN":["1553-3077","1553-3093"],"issn-type":[{"value":"1553-3077","type":"print"},{"value":"1553-3093","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,10,12]]},"assertion":[{"value":"2020-02-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-08-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-10-12","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}