{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,2]],"date-time":"2026-07-02T19:46:12Z","timestamp":1783021572127,"version":"3.54.6"},"reference-count":116,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2023,9,8]],"date-time":"2023-09-08T00:00:00Z","timestamp":1694131200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Emerg. Technol. Comput. Syst."],"published-print":{"date-parts":[[2023,10,31]]},"abstract":"<jats:p>The number of parameters in deep neural networks (DNNs) is scaling at about 5\u00d7 the rate of Moore\u2019s Law. To sustain this growth, photonic computing is a promising avenue, as it enables higher throughput in dominant general matrix-matrix multiplication (GEMM) operations in DNNs than their electrical counterpart. However, purely photonic systems face several challenges including lack of photonic memory and accumulation of noise. In this article, we present an electro-photonic accelerator, ADEPT, which leverages a photonic computing unit for performing GEMM operations, a vectorized digital electronic application-specific integrated circuits for performing non-GEMM operations, and SRAM arrays for storing DNN parameters and activations. In contrast to prior works in photonic DNN accelerators, we adopt a system-level perspective and show that the gains while large are tempered relative to prior expectations. Our goal is to encourage architects to explore photonic technology in a more pragmatic way considering the system as a whole to understand its general applicability in accelerating today\u2019s DNNs. Our evaluation shows that ADEPT can provide, on average, 5.73\u00d7 higher throughput per watt compared to the traditional systolic arrays in a full-system, and at least 6.8\u00d7 and 2.5\u00d7 better throughput per watt, compared to state-of-the-art electronic and photonic accelerators, respectively.<\/jats:p>","DOI":"10.1145\/3606949","type":"journal-article","created":{"date-parts":[[2023,7,12]],"date-time":"2023-07-12T11:44:19Z","timestamp":1689162259000},"page":"1-31","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":54,"title":["An Electro-Photonic System for Accelerating Deep Neural Networks"],"prefix":"10.1145","volume":"19","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-1418-7422","authenticated-orcid":false,"given":"Cansu","family":"Demirkiran","sequence":"first","affiliation":[{"name":"Boston University, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0349-6959","authenticated-orcid":false,"given":"Furkan","family":"Eris","sequence":"additional","affiliation":[{"name":"Boston University, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0002-4800-7781","authenticated-orcid":false,"given":"Gongyu","family":"Wang","sequence":"additional","affiliation":[{"name":"Lightmatter, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-2200-9586","authenticated-orcid":false,"given":"Jonathan","family":"Elmhurst","sequence":"additional","affiliation":[{"name":"Lightmatter, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0001-2787-4678","authenticated-orcid":false,"given":"Nick","family":"Moore","sequence":"additional","affiliation":[{"name":"Lightmatter, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-3009-563X","authenticated-orcid":false,"given":"Nicholas C.","family":"Harris","sequence":"additional","affiliation":[{"name":"Lightmatter, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0009-0003-4980-259X","authenticated-orcid":false,"given":"Ayon","family":"Basumallik","sequence":"additional","affiliation":[{"name":"Lightmatter, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5259-7721","authenticated-orcid":false,"given":"Vijay Janapa","family":"Reddi","sequence":"additional","affiliation":[{"name":"Harvard University, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3256-9942","authenticated-orcid":false,"given":"Ajay","family":"Joshi","sequence":"additional","affiliation":[{"name":"Boston University, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-8218-5656","authenticated-orcid":false,"given":"Darius","family":"Bunandar","sequence":"additional","affiliation":[{"name":"Lightmatter, USA"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2023,9,8]]},"reference":[{"key":"e_1_3_2_2_2","unstructured":"(nd). Ansys. Retrieved from https:\/\/www.ansys.com\/products\/photonics"},{"key":"e_1_3_2_3_2","unstructured":"(nd). Genus Synthesis Solution. Retrieved from https:\/\/www.cadence.com\/en_US\/home\/tools\/digital-design-and-signoff\/synthesis\/genus-synthesis-solution.html"},{"key":"e_1_3_2_4_2","unstructured":"(nd). GF22nm FD-SOI Technology. Retrieved from https:\/\/globalfoundries.com\/sites\/default\/files\/product-briefs\/pb-22fdx-26-web.pdf"},{"key":"e_1_3_2_5_2","unstructured":"(nd). Intel Xeon Gold 6242 Processor (22m Cache 2.80 GHz) Product Specifications. Retrieved from https:\/\/ark.intel.com\/content\/www\/us\/en\/ark\/products\/192440\/intel-xeon-gold-6242-processor-22m-cache-2-80-ghz.html"},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1364\/OE.20.002911"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1063\/5.0070992"},{"key":"e_1_3_2_8_2","unstructured":"Dario Amodei. 2020. AI and Compute. Retrieved from https:\/\/openai.com\/blog\/ai-and-compute\/"},{"key":"e_1_3_2_9_2","unstructured":"Andrew Anderson Aravind Vasudevan Cormac Keane and David Gregg. 2017. Low-memory GEMM-based convolution algorithms for deep neural networks. CoRR abs\/1709.03395 (2017). http:\/\/arxiv.org\/abs\/1709.03395"},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1364\/OE.20.012014"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.1364\/OE.423949"},{"key":"e_1_3_2_12_2","doi-asserted-by":"publisher","DOI":"10.1364\/OPTICA.424052"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSTQE.2019.2945540"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1007\/s12274-010-0082-9"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1109\/MDAT.2022.3161126"},{"key":"e_1_3_2_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSPEC.1986.6371053"},{"key":"e_1_3_2_17_2","doi-asserted-by":"publisher","DOI":"10.1002\/lpor.201100017"},{"key":"e_1_3_2_18_2","doi-asserted-by":"publisher","DOI":"10.1364\/AO.26.004039"},{"issue":"1","key":"e_1_3_2_19_2","doi-asserted-by":"crossref","first-page":"12324","DOI":"10.1038\/s41598-018-30619-y","article-title":"Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification","volume":"8","author":"Chang Julie","year":"2018","unstructured":"Julie Chang, Vincent Sitzmann, Xiong Dun, Wolfgang Heidrich, and Gordon Wetzstein. 2018. Hybrid optical-electronic convolutional neural networks with optimized diffractive optics for image classification. Sci. Rep. 8, 1 (2018), 12324.","journal-title":"Sci. Rep."},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSTQE.2012.2228170"},{"key":"e_1_3_2_21_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSTQE.2012.2228170"},{"key":"e_1_3_2_22_2","doi-asserted-by":"publisher","DOI":"10.1145\/2654822.2541967"},{"key":"e_1_3_2_23_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSSC.2016.2616357"},{"key":"e_1_3_2_24_2","first-page":"609","volume-title":"Proceedings of the 47th Annual IEEE\/ACM International Symposium on Microarchitecture","author":"Chen Y.","year":"2014","unstructured":"Y. Chen, T. Luo, S. Liu, S. Zhang, L. He, J. Wang, L. Li, T. Chen, Z. Xu, N. Sun, and O. Temam. 2014. DaDianNao: A machine-learning supercomputer. In Proceedings of the 47th Annual IEEE\/ACM International Symposium on Microarchitecture. 609\u2013622. 10.1109\/MICRO.2014.58"},{"key":"e_1_3_2_25_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.eng.2020.01.007"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1109\/JETCAS.2019.2910232"},{"key":"e_1_3_2_27_2","first-page":"1552","volume-title":"Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS\u201914)","author":"Chen Zhilu","year":"2014","unstructured":"Zhilu Chen, Jing Wang, Haibo He, and Xinming Huang. 2014. A fast deep learning system using GPU. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS\u201914). 1552\u20131555. 10.1109\/ISCAS.2014.6865444"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1109\/JPROC.2020.2968184"},{"key":"e_1_3_2_29_2","doi-asserted-by":"publisher","DOI":"10.1109\/MM.2021.3061394"},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1364\/OPTICA.3.001460"},{"key":"e_1_3_2_31_2","article-title":"Optical convolutional neural networks\u2013combining silicon photonics and fourier optics for computer vision","author":"Cottle Edward","year":"2020","unstructured":"Edward Cottle, Florent Michel, Joseph Wilson, Nick New, and Iman Kundu. 2020. Optical convolutional neural networks\u2013combining silicon photonics and fourier optics for computer vision. arXiv:2103.09044. Retrieved from https:\/\/arxiv.org\/abs\/2103.09044","journal-title":"arXiv:2103.09044"},{"key":"e_1_3_2_32_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSSC.2016.2596773"},{"key":"e_1_3_2_33_2","unstructured":"Jacob Devlin Ming-Wei Chang Kenton Lee and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. CoRR abs\/1810.04805 (2018). http:\/\/arxiv.org\/abs\/1810.04805"},{"key":"e_1_3_2_34_2","doi-asserted-by":"publisher","DOI":"10.1364\/OE.18.009852"},{"key":"e_1_3_2_35_2","doi-asserted-by":"crossref","first-page":"399","DOI":"10.1017\/CBO9781139042918.020","article-title":"Large-scale fpga-based convolutional networks","author":"Farabet Cl\u00e9ment","year":"2011","unstructured":"Cl\u00e9ment Farabet, Yann LeCun, Koray Kavukcuoglu, Eugenio Culurciello, Berin Martini, Polina Akselrod, and Selcuk Talay. 2011. Large-scale fpga-based convolutional networks. In Scaling Up Machine Learning: Parallel and Distributed Approaches, 399\u2013419.","journal-title":"Scaling Up Machine Learning: Parallel and Distributed Approaches"},{"key":"e_1_3_2_36_2","first-page":"109","volume-title":"Proceedings of the Computer Vision and Pattern Recognition (CVPR\u201911) Workshops","author":"Farabet Cl\u00e9ment","year":"2011","unstructured":"Cl\u00e9ment Farabet, Berin Martini, Benoit Corda, Polina Akselrod, Eugenio Culurciello, and Yann LeCun. 2011. Neuflow: A runtime reconfigurable dataflow processor for vision. In Proceedings of the Computer Vision and Pattern Recognition (CVPR\u201911) Workshops. 109\u2013116. 10.1109\/CVPRW.2011.5981829"},{"key":"e_1_3_2_37_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-020-03070-1"},{"key":"e_1_3_2_38_2","first-page":"1","volume-title":"Proceedings of the International Conference on Field-Programmable Technology (ICFPT\u201919)","author":"Fox Sean","year":"2019","unstructured":"Sean Fox, Julian Faraone, David Boland, Kees Vissers, and Philip H. W. Leong. 2019. Training deep neural networks in low-precision with high accuracy using FPGAs. In Proceedings of the International Conference on Field-Programmable Technology (ICFPT\u201919). 1\u20139. 10.1109\/ICFPT47387.2019.00009"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSTQE.2019.2908790"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.3012699"},{"key":"e_1_3_2_41_2","doi-asserted-by":"crossref","unstructured":"Ryan Hamerly Saumil Bandyopadhyay and Dirk Englund. 2022. Accurate self-configuration of rectangular multiport interferometers. Phys. Rev. Appl. 18 2 (2022) 024019. 10.1103\/PhysRevApplied.18.024019","DOI":"10.1103\/PhysRevApplied.18.024019"},{"key":"e_1_3_2_42_2","doi-asserted-by":"crossref","unstructured":"Ryan Hamerly Saumil Bandyopadhyay and Dirk Englund. 2022. Stability of self-configuring large multiport interferometers. Phys. Rev. Appl. 18 2 (2022) 024018. 10.1103\/PhysRevApplied.18.024018","DOI":"10.1103\/PhysRevApplied.18.024018"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.1364\/OE.22.010487"},{"key":"e_1_3_2_44_2","first-page":"770","volume-title":"Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916)","author":"He K.","year":"2016","unstructured":"K. He, X. Zhang, S. Ren, and J. Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR\u201916). 770\u2013778. 10.1109\/CVPR.2016.90"},{"key":"e_1_3_2_45_2","unstructured":"Yanzhang He Tara N. Sainath Rohit Prabhavalkar Ian McGraw Raziel Alvarez Ding Zhao David Rybach Anjuli Kannan Yonghui Wu Ruoming Pang Qiao Liang Deepti Bhatia Yuan Shangguan Bo Li Golan Pundak Khe Chai Sim Tom Bagby Shuo yiin Chang Kanishka Rao and Alexander Gruenstein. 2018. Streaming end-to-end speech recognition for mobile devices. CoRR abs\/1811.06621 (2018). http:\/\/arxiv.org\/abs\/1811.06621"},{"key":"e_1_3_2_46_2","first-page":"10","volume-title":"Proceedings of the IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC\u201914)","author":"Horowitz M.","year":"2014","unstructured":"M. Horowitz. 2014. Computing\u2019s energy problem (and what we can do about it). In Proceedings of the IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC\u201914). 10\u201314."},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSSC.2021.3079111"},{"key":"e_1_3_2_48_2","first-page":"1","volume-title":"Proceedings of the 26th International Conference on Field Programmable Logic and Applications (FPL\u201916)","author":"Li Huimin","year":"2016","unstructured":"Huimin Li, Xitian Fan, Li Jiao, Wei Cao, Xuegong Zhou, and Lingli Wang. 2016. A high performance fpga-based accelerator for large-scale convolutional neural networks. In Proceedings of the 26th International Conference on Field Programmable Logic and Applications (FPL\u201916). 1\u20139. 10.1109\/FPL.2016.7577308"},{"key":"e_1_3_2_49_2","doi-asserted-by":"crossref","first-page":"82","DOI":"10.1145\/3081333.3081360","volume-title":"Proceedings of the 15th Annual International Conference on Mobile Systems, Applications, and Services (MobiSys\u201917)","author":"Huynh Loc N.","year":"2017","unstructured":"Loc N. Huynh, Youngki Lee, and Rajesh Krishna Balan. 2017. DeepMon: Mobile GPU-based deep learning framework for continuous vision applications. In Proceedings of the 15th Annual International Conference on Mobile Systems, Applications, and Services (MobiSys\u201917). Association for Computing Machinery, New York, NY, 82\u201395. 10.1145\/3081333.3081360"},{"key":"e_1_3_2_50_2","volume-title":"Classical Electrodynamics","author":"Jackson John David","year":"1975","unstructured":"John David Jackson. 1975. Classical Electrodynamics. Wiley, New York, NY."},{"key":"e_1_3_2_51_2","doi-asserted-by":"publisher","DOI":"10.1364\/OPTICA.6.000084"},{"key":"e_1_3_2_52_2","first-page":"124","volume-title":"Proceedings of the 3rd ACM\/IEEE International Symposium on Networks-on-Chip","author":"Joshi Ajay","year":"2009","unstructured":"Ajay Joshi, Christopher Batten, Yong-Jin Kwon, Scott Beamer, Imran Shamim, Krste Asanovic, and Vladimir Stojanovic. 2009. Silicon-photonic clos networks for global on-chip communication. In Proceedings of the 3rd ACM\/IEEE International Symposium on Networks-on-Chip. 124\u2013133. 10.1109\/NOCS.2009.5071460"},{"key":"e_1_3_2_53_2","doi-asserted-by":"publisher","DOI":"10.1145\/3360307"},{"key":"e_1_3_2_54_2","first-page":"1","volume-title":"Proceedings of the 44th Annual International Symposium on Computer Architecture (ISCA\u201917)","author":"Jouppi Norman P.","year":"2017","unstructured":"Norman P. Jouppi, Cliff Young, Nishant Patil, David Patterson, Gaurav Agrawal, Raminder Bajwa, Sarah Bates, Suresh Bhatia, Nan Boden, Al Borchers, Rick Boyle, Pierre-luc Cantin, Clifford Chao, Chris Clark, Jeremy Coriell, Mike Daley, Matt Dau, Jeffrey Dean, Ben Gelb, Tara Vazir Ghaemmaghami, Rajendra Gottipati, William Gulland, Robert Hagmann, C. Richard Ho, Doug Hogberg, John Hu, Robert Hundt, Dan Hurt, Julian Ibarz, Aaron Jaffey, Alek Jaworski, Alexander Kaplan, Harshit Khaitan, Daniel Killebrew, Andy Koch, Naveen Kumar, Steve Lacy, James Laudon, James Law, Diemthu Le, Chris Leary, Zhuyuan Liu, Kyle Lucke, Alan Lundin, Gordon MacKean, Adriana Maggiore, Maire Mahony, Kieran Miller, Rahul Nagarajan, Ravi Narayanaswami, Ray Ni, Kathy Nix, Thomas Norrie, Mark Omernick, Narayana Penukonda, Andy Phelps, Jonathan Ross, Matt Ross, Amir Salek, Emad Samadiani, Chris Severn, Gregory Sizikov, Matthew Snelham, Jed Souter, Dan Steinberg, Andy Swing, Mercedes Tan, Gregory Thorson, Bo Tian, Horia Toma, Erick Tuttle, Vijay Vasudevan, Richard Walter, Walter Wang, Eric Wilcox, and Doe Hyun Yoon. 2017. In-datacenter performance analysis of a tensor processing unit. In Proceedings of the 44th Annual International Symposium on Computer Architecture (ISCA\u201917). Association for Computing Machinery, New York, NY, 1\u201312. 10.1145\/3079856.3080246"},{"key":"e_1_3_2_55_2","unstructured":"Sangpyo Kim Jongmin Kim Michael Jaemin Kim Wonkyung Jung Minsoo Rhu John Kim and Jung Ho Ahn. 2021. BTS: An accelerator for bootstrappable fully homomorphic encryption. CoRR abs\/2112.15479 (2021). https:\/\/arxiv.org\/abs\/2112.15479"},{"key":"e_1_3_2_56_2","unstructured":"Raghuraman Krishnamoorthi. 2018. Quantizing deep convolutional networks for efficient inference: A whitepaper. CoRR abs\/1806.08342 (2018). http:\/\/arxiv.org\/abs\/1806.08342"},{"key":"e_1_3_2_57_2","volume-title":"Powering Extreme-Scale HPC with Cerebras WaferScale Accelerators","author":"Lavely Adam","year":"2022","unstructured":"Adam Lavely. 2022. Powering Extreme-Scale HPC with Cerebras WaferScale Accelerators. Technical Report. Cerebras Systems."},{"key":"e_1_3_2_58_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSSC.2018.2865489"},{"key":"e_1_3_2_59_2","doi-asserted-by":"publisher","DOI":"10.1088\/1674-4926\/41\/11\/111404"},{"key":"e_1_3_2_60_2","doi-asserted-by":"publisher","DOI":"10.1126\/science.aat8084"},{"key":"e_1_3_2_61_2","doi-asserted-by":"publisher","DOI":"10.1364\/OE.23.027213"},{"key":"e_1_3_2_62_2","first-page":"1483","volume-title":"Proceedings of the Design, Automation Test in Europe Conference Exhibition (DATE\u201919)","author":"Liu Weichen","year":"2019","unstructured":"Weichen Liu, Wenyang Liu, Yichen Ye, Qian Lou, Yiyuan Xie, and Lei Jiang. 2019. HolyLight: A nanophotonic accelerator for deep learning in data centers. In Proceedings of the Design, Automation Test in Europe Conference Exhibition (DATE\u201919). 1483\u20131488. 10.23919\/DATE.2019.8715195"},{"key":"e_1_3_2_63_2","doi-asserted-by":"publisher","DOI":"10.1109\/socc.2018.8618542"},{"key":"e_1_3_2_64_2","doi-asserted-by":"publisher","DOI":"10.1109\/JLT.2019.2892512"},{"key":"e_1_3_2_65_2","doi-asserted-by":"publisher","DOI":"10.1364\/PRJ.1.000001"},{"key":"e_1_3_2_66_2","doi-asserted-by":"publisher","DOI":"10.1364\/OPTICA.2.000747"},{"key":"e_1_3_2_67_2","doi-asserted-by":"crossref","unstructured":"Mario Miscuglio Zibo Hu Shurui Li Jonathan K. George Roberto Capanna Hamed Dalir Philippe M. Bardet Puneet Gupta and Volker J. Sorger. 2020. Massively parallel amplitude-only fourier neural network. Optica 7 12 (2020) 1812\u20131819. https:\/\/opg.optica.org\/optica\/abstract.cfm?URI=optica-7-12-1812","DOI":"10.1364\/OPTICA.408659"},{"key":"e_1_3_2_68_2","doi-asserted-by":"publisher","DOI":"10.1038\/nphoton.2013.75"},{"issue":"4","key":"e_1_3_2_69_2","doi-asserted-by":"crossref","first-page":"269","DOI":"10.1515\/nanoph-2013-0013","article-title":"Resolving the thermal challenges for silicon microring resonator devices","volume":"3","author":"Padmaraju Kishore","year":"2014","unstructured":"Kishore Padmaraju and Keren Bergman. 2014. Resolving the thermal challenges for silicon microring resonator devices. Nanophotonics 3, 4-5 (2014), 269\u2013281.","journal-title":"Nanophotonics"},{"key":"e_1_3_2_70_2","first-page":"5206","volume-title":"Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201915)","author":"Panayotov Vassil","year":"2015","unstructured":"Vassil Panayotov, Guoguo Chen, Daniel Povey, and Sanjeev Khudanpur. 2015. Librispeech: An ASR corpus based on public domain audio books. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP\u201915). 5206\u20135210. 10.1109\/ICASSP.2015.7178964"},{"key":"e_1_3_2_71_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.sysarc.2022.102561"},{"key":"e_1_3_2_72_2","volume-title":"Proceedings of the 49th International Conference on Parallel Processing (ICPP\u201920)","author":"Peng Jiaxin","year":"2020","unstructured":"Jiaxin Peng, Yousra Alkabani, Shuai Sun, Volker J. Sorger, and Tarek El-Ghazawi. 2020. DNNARA: A deep neural network accelerator using residue arithmetic and integrated photonics. In Proceedings of the 49th International Conference on Parallel Processing (ICPP\u201920). Association for Computing Machinery, New York, NY. 10.1145\/3404397.3404467"},{"key":"e_1_3_2_73_2","doi-asserted-by":"publisher","DOI":"10.1063\/1.4864257"},{"key":"e_1_3_2_74_2","unstructured":"Powerapi-Ng. (nd). Powerapi-ng\/pyrapl: A library to measure the python energy consumption of python code. Retrieved from https:\/\/github.com\/powerapi-ng\/pyRAPL"},{"key":"e_1_3_2_75_2","first-page":"2383","volume-title":"Proceedings of the Conference on Empirical Methods in Natural Language Processing","author":"Rajpurkar Pranav","year":"2016","unstructured":"Pranav Rajpurkar, Jian Zhang, Konstantin Lopyrev, and Percy Liang. 2016. SQuAD: 100,000+ questions for machine comprehension of text. In Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, 2383\u20132392. 10.18653\/v1\/D16-1264"},{"key":"e_1_3_2_76_2","doi-asserted-by":"publisher","DOI":"10.1109\/HCS49909.2020.9220525"},{"issue":"5","key":"e_1_3_2_77_2","doi-asserted-by":"crossref","first-page":"467","DOI":"10.1109\/LPT.2018.2799004","article-title":"Low-power 56Gb\/s NRZ microring modulator driver in 28nm FDSOI CMOS","volume":"30","author":"Ramon Hannes","year":"2018","unstructured":"Hannes Ramon, Michael Vanhoecke, Jochem Verbist, Wouter Soenen, Peter De Heyn, Yoojin Ban, Marianna Pantouvaki, Joris Van Campenhout, Peter Ossieur, Xin Yin, et\u00a0al. 2018. Low-power 56Gb\/s NRZ microring modulator driver in 28nm FDSOI CMOS. IEEE Photon. Technol. Lett. 30, 5 (2018), 467\u2013470.","journal-title":"IEEE Photon. Technol. Lett."},{"key":"e_1_3_2_78_2","doi-asserted-by":"publisher","DOI":"10.1103\/PhysRevLett.73.58"},{"key":"e_1_3_2_79_2","first-page":"446","volume-title":"Proceedings of the ACM\/IEEE 47th Annual International Symposium on Computer Architecture (ISCA\u201920)","author":"Reddi V. J.","year":"2020","unstructured":"V. J. Reddi, C. Cheng, D. Kanter, P. Mattson, G. Schmuelling, C. Wu, B. Anderson, M. Breughe, M. Charlebois, W. Chou, R. Chukka, C. Coleman, S. Davis, P. Deng, G. Diamos, J. Duke, D. Fick, J. S. Gardner, I. Hubara, S. Idgunji, T. B. Jablin, J. Jiao, T. S. John, P. Kanwar, D. Lee, J. Liao, A. Lokhmotov, F. Massa, P. Meng, P. Micikevicius, C. Osborne, G. Pekhimenko, A. T. R. Rajan, D. Sequeira, A. Sirasao, F. Sun, H. Tang, M. Thomson, F. Wei, E. Wu, L. Xu, K. Yamada, B. Yu, G. Yuan, A. Zhong, P. Zhang, and Y. Zhou. 2020. MLPerf inference benchmark. In Proceedings of the ACM\/IEEE 47th Annual International Symposium on Computer Architecture (ISCA\u201920). 446\u2013459. 10.1109\/ISCA45697.2020.00045"},{"key":"e_1_3_2_80_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSPEC.1965.5531775"},{"key":"e_1_3_2_81_2","doi-asserted-by":"publisher","DOI":"10.1016\/S1369-7021(07)70178-5"},{"key":"e_1_3_2_82_2","first-page":"234","volume-title":"Medical Image Computing and Computer-Assisted Intervention (MICCAI\u201915)","author":"Ronneberger Olaf","year":"2015","unstructured":"Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-NET: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention (MICCAI\u201915), Nassir Navab, Joachim Hornegger, William M. Wells, and Alejandro F. Frangi (Eds.). Springer International Publishing, Cham, 234\u2013241."},{"key":"e_1_3_2_83_2","doi-asserted-by":"publisher","DOI":"10.1007\/s11263-015-0816-y"},{"key":"e_1_3_2_84_2","unstructured":"Ananda Samajdar Yuhao Zhu Paul N. Whatmough Matthew Mattina and Tushar Krishna. 2018. SCALE-sim: Systolic CNN accelerator. CoRR abs\/1811.02883 (2018). http:\/\/arxiv.org\/abs\/1811.02883"},{"key":"e_1_3_2_85_2","first-page":"1","volume-title":"Proceedings of the IEEE International Symposium on Parallel and Distributed Processing","author":"Sancho Jose Carlos","year":"2008","unstructured":"Jose Carlos Sancho and Darren J. Kerbyson. 2008. Analysis of double buffering on two different multicore architectures: Quad-core Opteron and the Cell-BE. In Proceedings of the IEEE International Symposium on Parallel and Distributed Processing. 1\u201312. 10.1109\/IPDPS.2008.4536316"},{"key":"e_1_3_2_86_2","first-page":"53","volume-title":"Proceedings of the 20th IEEE International Conference on Application-specific Systems, Architectures and Processors","author":"Sankaradas M.","year":"2009","unstructured":"M. Sankaradas, V. Jakkula, S. Cadambi, S. Chakradhar, I. Durdanovic, E. Cosatto, and H. P. Graf. 2009. A massively parallel coprocessor for convolutional neural networks. In Proceedings of the 20th IEEE International Conference on Application-specific Systems, Architectures and Processors. 53\u201360. 10.1109\/ASAP.2009.25"},{"key":"e_1_3_2_87_2","first-page":"351","volume-title":"Proceedings of the Great Lakes Symposium on VLSI","author":"Shafiee Amin","year":"2022","unstructured":"Amin Shafiee, Sanmitra Banerjee, Krishnendu Chakrabarty, Sudeep Pasricha, and Mahdi Nikdast. 2022. LoCI: An analysis of the impact of optical loss and crosstalk noise in integrated silicon-photonic neural networks. In Proceedings of the Great Lakes Symposium on VLSI. 351\u2013355."},{"key":"e_1_3_2_88_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41566-020-00754-y"},{"key":"e_1_3_2_89_2","doi-asserted-by":"publisher","DOI":"10.1038\/nphoton.2017.93"},{"key":"e_1_3_2_90_2","doi-asserted-by":"publisher","DOI":"10.1051\/matecconf\/201713900066"},{"key":"e_1_3_2_91_2","doi-asserted-by":"publisher","DOI":"10.1109\/JSTQE.2019.2945548"},{"key":"e_1_3_2_92_2","first-page":"860","volume-title":"Proceedings of the ACM\/IEEE 48th Annual International Symposium on Computer Architecture (ISCA\u201921)","author":"Shiflett Kyle","year":"2021","unstructured":"Kyle Shiflett, Avinash Karanth, Razvan Bunescu, and Ahmed Louri. 2021. Albireo: Energy-efficient acceleration of convolutional neural networks via silicon photonics. In Proceedings of the ACM\/IEEE 48th Annual International Symposium on Computer Architecture (ISCA\u201921). 860\u2013873. 10.1109\/ISCA52012.2021.00072"},{"key":"e_1_3_2_93_2","first-page":"474","volume-title":"Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA\u201920)","author":"Shiflett K.","year":"2020","unstructured":"K. Shiflett, D. Wright, A. Karanth, and A. Louri. 2020. PIXEL: Photonic neural network accelerator. In Proceedings of the IEEE International Symposium on High Performance Computer Architecture (HPCA\u201920). 474\u2013487. 10.1109\/HPCA47549.2020.00046"},{"issue":"16","key":"e_1_3_2_94_2","doi-asserted-by":"crossref","first-page":"23495","DOI":"10.1364\/OE.395441","article-title":"The diamond mesh, a phase-error-and loss-tolerant field-programmable MZI-based optical processor for optical neural networks","volume":"28","author":"Shokraneh Farhad","year":"2020","unstructured":"Farhad Shokraneh, Simon Geoffroy-Gagnon, and Odile Liboiron-Ladouceur. 2020. The diamond mesh, a phase-error-and loss-tolerant field-programmable MZI-based optical processor for optical neural networks. Opt. Expr. 28, 16 (2020), 23495\u201323508.","journal-title":"Opt. Expr."},{"key":"e_1_3_2_95_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2987333"},{"key":"e_1_3_2_96_2","doi-asserted-by":"crossref","first-page":"534","DOI":"10.1038\/nature16454","article-title":"Single-chip microprocessor that communicates directly using light","volume":"528","author":"Sun Chen","year":"2015","unstructured":"Chen Sun, M. Wade, Yunsup Lee, J. Orcutt, L. Alloatti, M. Georgas, Andrew Waterman, J. Shainline, Rimas Avizienis, Sen Lin, B. Moss, R. Kumar, F. Pavanello, A. Atabaki, Henry Cook, Albert J. Ou, J. Leu, Yu hsin Chen, K. Asanovi\u0107, Rajeev J. Ram, M. Popovic, and V. Stojanovi\u0107. 2015. Single-chip microprocessor that communicates directly using light. Nature 528 (2015), 534\u2013538.","journal-title":"Nature"},{"issue":"1","key":"e_1_3_2_97_2","doi-asserted-by":"crossref","first-page":"110","DOI":"10.1109\/JLT.2018.2878327","article-title":"A 128 Gb\/s PAM4 silicon microring modulator with integrated thermo-optic resonance tuning","volume":"37","author":"Sun Jie","year":"2018","unstructured":"Jie Sun, Ranjeet Kumar, Meer Sakib, Jeffrey B Driscoll, Hasitha Jayatilleka, and Haisheng Rong. 2018. A 128 Gb\/s PAM4 silicon microring modulator with integrated thermo-optic resonance tuning. J. Lightw. Technol. 37, 1 (2018), 110\u2013115.","journal-title":"J. Lightw. Technol."},{"key":"e_1_3_2_98_2","doi-asserted-by":"crossref","unstructured":"Febin Sunny Asif Mirza Mahdi Nikdast and Sudeep Pasricha. 2021. CrossLight: A Cross-layer optimized silicon photonic neural network accelerator. CoRR abs\/2102.06960 (2021). https:\/\/arxiv.org\/abs\/2102.06960","DOI":"10.1109\/DAC18074.2021.9586161"},{"issue":"1","key":"e_1_3_2_99_2","first-page":"1","article-title":"Neuromorphic photonic networks using silicon photonic weight banks","volume":"7","author":"Tait Alexander N.","year":"2017","unstructured":"Alexander N. Tait, Thomas Ferreira De Lima, Ellen Zhou, Allie X. Wu, Mitchell A. Nahmias, Bhavin J. Shastri, and Paul R. Prucnal. 2017. Neuromorphic photonic networks using silicon photonic weight banks. Sci. Rep. 7, 1 (2017), 1\u201310.","journal-title":"Sci. Rep."},{"issue":"21","key":"e_1_3_2_100_2","first-page":"3427","article-title":"Broadcast and weight: An integrated network for scalable photonic spike processing","volume":"32","author":"Tait Alexander N.","year":"2014","unstructured":"Alexander N. Tait, Mitchell A. Nahmias, Bhavin J. Shastri, and Paul R. Prucnal. 2014. Broadcast and weight: An integrated network for scalable photonic spike processing. J. Lightw. Technol. 32, 21 (2014), 3427\u20133439.","journal-title":"J. Lightw. Technol."},{"key":"e_1_3_2_101_2","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2017.29"},{"key":"e_1_3_2_102_2","first-page":"350","volume-title":"Proceedings of the IEEE International Solid-State Circuits Conference (ISSCC\u201918)","author":"Thonnart Yvain","year":"2018","unstructured":"Yvain Thonnart, Mounir Zid, Jos\u00e9 Luis Gonzalez-Jimenez, Guillaume Waltener, Robert Polster, Olivier Dubray, Florent Lepin, St\u00e9phane Bernab\u00e9, Sylvie Menezo, Gabriel Par\u00e8s, Olivier Castany, Laura Boutafa, Philippe Grosse, Beno\u00eet Charbonnier, and Charles Baudot. 2018. A 10Gb\/s Si-photonic transceiver with 150 \\(\\mu\\) W 120 \\(\\mu\\) s-lock-time digitally supervised analog microring wavelength stabilization for 1Tb\/s\/mm2 die-to-die optical networks. In Proceedings of the IEEE International Solid-State Circuits Conference (ISSCC\u201918). 350\u2013352. 10.1109\/ISSCC.2018.8310328"},{"key":"e_1_3_2_103_2","doi-asserted-by":"publisher","DOI":"10.1038\/nphoton.2017.14"},{"key":"e_1_3_2_104_2","doi-asserted-by":"publisher","DOI":"10.3390\/mi10010051"},{"issue":"11","key":"e_1_3_2_105_2","doi-asserted-by":"crossref","first-page":"4337","DOI":"10.1109\/TCAD.2022.3197538","article-title":"Photonic reconfigurable accelerators for efficient inference of cnns with mixed-sized tensors","volume":"41","author":"Vatsavai Sairam Sri","year":"2022","unstructured":"Sairam Sri Vatsavai and Ishan G. Thakkar. 2022. Photonic reconfigurable accelerators for efficient inference of cnns with mixed-sized tensors. IEEE Trans. Comput.-Aid. Des. Integr. Circ. Syst. 41, 11 (2022), 4337\u20134348.","journal-title":"IEEE Trans. Comput.-Aid. Des. Integr. Circ. Syst."},{"key":"e_1_3_2_106_2","doi-asserted-by":"publisher","DOI":"10.1364\/OL.38.000733"},{"key":"e_1_3_2_107_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-020-2973-6"},{"key":"e_1_3_2_108_2","doi-asserted-by":"publisher","DOI":"10.1364\/OL.41.005318"},{"key":"e_1_3_2_109_2","unstructured":"J. Wilson. (nd). The multiply and fourier transform unit: A micro-scale optical processor. https:\/\/optalysys.com\/wp-content\/uploads\/2022\/04\/Multiply_and_Fourier_Transform_white_paper_12_12_20.pdf"},{"key":"e_1_3_2_110_2","unstructured":"Hao Wu Patrick Judd Xiaojie Zhang Mikhail Isaev and Paulius Micikevicius. 2020. Integer quantization for deep learning inference: Principles and empirical evaluation. arxiv:2004.09602. Retrieved from https:\/\/arxiv.org\/abs\/2004.09602"},{"key":"e_1_3_2_111_2","doi-asserted-by":"publisher","DOI":"10.1109\/LPT.2017.2779489"},{"key":"e_1_3_2_112_2","doi-asserted-by":"publisher","DOI":"10.1038\/s41586-020-03063-0"},{"issue":"2","key":"e_1_3_2_113_2","doi-asserted-by":"crossref","first-page":"023901","DOI":"10.1103\/PhysRevLett.123.023901","article-title":"Fourier-space diffractive deep neural network","volume":"123","author":"Yan Tao","year":"2019","unstructured":"Tao Yan, Jiamin Wu, Tiankuang Zhou, Hao Xie, Feng Xu, Jingtao Fan, Lu Fang, Xing Lin, and Qionghai Dai. 2019. Fourier-space diffractive deep neural network. Phys. Rev. Lett. 123, 2 (2019), 023901.","journal-title":"Phys. Rev. Lett."},{"key":"e_1_3_2_114_2","unstructured":"Guandao Yang Tianyi Zhang Polina Kirichenko Junwen Bai Andrew Gordon Wilson and Christopher De Sa. 2019. SWALP: Stochastic weight averaging in low-precision training. CoRR abs\/1904.11943 (2019). http:\/\/arxiv.org\/abs\/1904.11943"},{"issue":"5","key":"e_1_3_2_115_2","doi-asserted-by":"crossref","first-page":"367","DOI":"10.1038\/s41566-021-00796-w","article-title":"Large-scale neuromorphic optoelectronic computing with a reconfigurable diffractive processing unit","volume":"15","author":"Zhou Tiankuang","year":"2021","unstructured":"Tiankuang Zhou, Xing Lin, Jiamin Wu, Yitong Chen, Hao Xie, Yipeng Li, Jingtao Fan, Huaqiang Wu, Lu Fang, and Qionghai Dai. 2021. Large-scale neuromorphic optoelectronic computing with a reconfigurable diffractive processing unit. Nat. Photon. 15, 5 (2021), 367\u2013373.","journal-title":"Nat. Photon."},{"key":"e_1_3_2_116_2","first-page":"1","volume-title":"Proceedings of the 39th International Conference on Computer-Aided Design","author":"Zhu Ying","year":"2020","unstructured":"Ying Zhu, Grace Li Zhang, Bing Li, Xunzhao Yin, Cheng Zhuo, Huaxi Gu, Tsung-Yi Ho, and Ulf Schlichtmann. 2020. Countering variations and thermal effects for accurate optical neural networks. In Proceedings of the 39th International Conference on Computer-Aided Design. 1\u20137."},{"key":"e_1_3_2_117_2","doi-asserted-by":"publisher","DOI":"10.1364\/OPTICA.6.001132"}],"container-title":["ACM Journal on Emerging Technologies in Computing Systems"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3606949","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3606949","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T17:48:52Z","timestamp":1750182532000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3606949"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,9,8]]},"references-count":116,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2023,10,31]]}},"alternative-id":["10.1145\/3606949"],"URL":"https:\/\/doi.org\/10.1145\/3606949","relation":{},"ISSN":["1550-4832","1550-4840"],"issn-type":[{"value":"1550-4832","type":"print"},{"value":"1550-4840","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,9,8]]},"assertion":[{"value":"2022-12-05","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-05-26","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-09-08","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}