{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,29]],"date-time":"2026-05-29T17:36:23Z","timestamp":1780076183182,"version":"3.54.0"},"reference-count":47,"publisher":"Association for Computing Machinery (ACM)","issue":"4","license":[{"start":{"date-parts":[[2018,7,30]],"date-time":"2018-07-30T00:00:00Z","timestamp":1532908800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2018,8,31]]},"abstract":"<jats:p>\n            Shallow depth-of-field is commonly used by photographers to isolate a subject from a distracting background. However, standard cell phone cameras cannot produce such images optically, as their short focal lengths and small apertures capture nearly all-in-focus images. We present a system to computationally synthesize shallow depth-of-field images with a single mobile camera and a single button press. If the image is of a person, we use a person segmentation network to separate the person and their accessories from the background. If available, we also use dense\n            <jats:italic>dual-pixel<\/jats:italic>\n            auto-focus hardware, effectively a 2-sample light field with an approximately 1 millimeter baseline, to compute a dense depth map. These two signals are combined and used to render a defocused image. Our system can process a 5.4 megapixel image in 4 seconds on a mobile phone, is fully automatic, and is robust enough to be used by non-experts. The modular nature of our system allows it to degrade naturally in the absence of a dual-pixel sensor or a human subject.\n          <\/jats:p>","DOI":"10.1145\/3197517.3201329","type":"journal-article","created":{"date-parts":[[2018,7,31]],"date-time":"2018-07-31T15:56:23Z","timestamp":1533052583000},"page":"1-13","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":151,"title":["Synthetic depth-of-field with a single-camera mobile phone"],"prefix":"10.1145","volume":"37","author":[{"given":"Neal","family":"Wadhwa","sequence":"first","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Rahul","family":"Garg","sequence":"additional","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"David E.","family":"Jacobs","sequence":"additional","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Bryan E.","family":"Feldman","sequence":"additional","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Nori","family":"Kanazawa","sequence":"additional","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Robert","family":"Carroll","sequence":"additional","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yair","family":"Movshovitz-Attias","sequence":"additional","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Jonathan T.","family":"Barron","sequence":"additional","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yael","family":"Pritch","sequence":"additional","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Marc","family":"Levoy","sequence":"additional","affiliation":[{"name":"Google Research"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"320","published-online":{"date-parts":[[2018,7,30]]},"reference":[{"key":"e_1_2_2_1_1","unstructured":"Mart\u00edn Abadi Ashish Agarwal Paul Barham Eugene Brevdo etal 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. https:\/\/www.tensorflow.org\/  Mart\u00edn Abadi Ashish Agarwal Paul Barham Eugene Brevdo et al. 2015. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems. https:\/\/www.tensorflow.org\/"},{"key":"e_1_2_2_2_1","volume-title":"Spatiotemporal energy models for the perception of motion. JOSA A","author":"Adelson Edward H","year":"1985","unstructured":"Edward H Adelson and James R Bergen . 1985. Spatiotemporal energy models for the perception of motion. JOSA A ( 1985 ). Edward H Adelson and James R Bergen. 1985. Spatiotemporal energy models for the perception of motion. JOSA A (1985)."},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/34.121783"},{"key":"e_1_2_2_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/2980179.2980257"},{"key":"e_1_2_2_5_1","volume-title":"Fast bilateral-space stereo for synthetic defocus. CVPR","author":"Barron Jonathan T","year":"2015","unstructured":"Jonathan T Barron , Andrew Adams , YiChang Shih , and Carlos Hern\u00e1ndez . 2015. Fast bilateral-space stereo for synthetic defocus. CVPR ( 2015 ). Jonathan T Barron, Andrew Adams, YiChang Shih, and Carlos Hern\u00e1ndez. 2015. Fast bilateral-space stereo for synthetic defocus. CVPR (2015)."},{"key":"e_1_2_2_6_1","volume":"201","author":"Barron J. T.","unstructured":"J. T. Barron and J. Malik. 201 5. Shape, illumination, and reflectance from shading. TPAMI (2015). J. T. Barron and J. Malik. 2015. Shape, illumination, and reflectance from shading. TPAMI (2015).","journal-title":"J. Malik."},{"key":"e_1_2_2_7_1","volume-title":"The fast bilateral solver. ECCV","author":"Barron Jonathan T","year":"2016","unstructured":"Jonathan T Barron and Ben Poole . 2016. The fast bilateral solver. ECCV ( 2016 ). Jonathan T Barron and Ben Poole. 2016. The fast bilateral solver. ECCV (2016)."},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2013.18"},{"key":"e_1_2_2_9_1","volume-title":"Depth Map Prediction from a Single Image Using a Multi-scale Deep Network. NIPS","author":"Eigen David","year":"2014","unstructured":"David Eigen , Christian Puhrsch , and Rob Fergus . 2014. Depth Map Prediction from a Single Image Using a Multi-scale Deep Network. NIPS ( 2014 ). David Eigen, Christian Puhrsch, and Rob Fergus. 2014. Depth Map Prediction from a Single Image Using a Multi-scale Deep Network. NIPS (2014)."},{"key":"e_1_2_2_10_1","volume-title":"Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue. ECCV","author":"Garg Ravi","year":"2016","unstructured":"Ravi Garg , Vijay Kumar B.G. , Gustavo Carneiro , and Ian Reid . 2016. Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue. ECCV ( 2016 ). Ravi Garg, Vijay Kumar B.G., Gustavo Carneiro, and Ian Reid. 2016. Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue. ECCV (2016)."},{"key":"e_1_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2015.169"},{"key":"e_1_2_2_12_1","volume":"201","author":"Godard Cl\u00e9ment","unstructured":"Cl\u00e9ment Godard , Oisin Mac Aodha , and Gabriel J. Brostow. 201 7. Unsupervised Monocular Depth Estimation with Left-Right Consistency. CVPR (2017). Cl\u00e9ment Godard, Oisin Mac Aodha, and Gabriel J. Brostow. 2017. Unsupervised Monocular Depth Estimation with Left-Right Consistency. CVPR (2017).","journal-title":"Gabriel J. Brostow."},{"key":"e_1_2_2_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237200"},{"key":"e_1_2_2_14_1","volume-title":"High-quality Depth from Uncalibrated Small Motion Clip. CVPR","author":"Ha Hyowon","year":"2016","unstructured":"Hyowon Ha , Sunghoon Im , Jaesik Park , Hae-Gon Jeon , and In So Kweon . 2016. High-quality Depth from Uncalibrated Small Motion Clip. CVPR ( 2016 ). Hyowon Ha, Sunghoon Im, Jaesik Park, Hae-Gon Jeon, and In So Kweon. 2016. High-quality Depth from Uncalibrated Small Motion Clip. CVPR (2016)."},{"key":"e_1_2_2_15_1","volume-title":"Burst photography for high dynamic range and low-light imaging on mobile cameras. SIGGRAPH","author":"Hasinoff Samuel W","year":"2016","unstructured":"Samuel W Hasinoff , Dillon Sharlet , Ryan Geiss , Andrew Adams , Jonathan T Barron , Florian Kainz , Jiawen Chen , and Marc Levoy . 2016. Burst photography for high dynamic range and low-light imaging on mobile cameras. SIGGRAPH ( 2016 ). Samuel W Hasinoff, Dillon Sharlet, Ryan Geiss, Andrew Adams, Jonathan T Barron, Florian Kainz, Jiawen Chen, and Marc Levoy. 2016. Burst photography for high dynamic range and low-light imaging on mobile cameras. SIGGRAPH (2016)."},{"key":"e_1_2_2_16_1","unstructured":"Kaiming He Georgia Gkioxari Piotr Doll\u00e1r and Ross Girshick. 2017. Mask R-CNN. ICCV (2017).  Kaiming He Georgia Gkioxari Piotr Doll\u00e1r and Ross Girshick. 2017. Mask R-CNN. ICCV (2017)."},{"key":"e_1_2_2_17_1","unstructured":"Carlos Hern\u00e1ndez. 2014. Lens Blur in the new Google Camera app. http:\/\/research.googleblog.com\/2014\/04\/lens-blur-in-new-google-camera-app.html.  Carlos Hern\u00e1ndez. 2014. Lens Blur in the new Google Camera app. http:\/\/research.googleblog.com\/2014\/04\/lens-blur-in-new-google-camera-app.html."},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/1186822.1073232"},{"key":"e_1_2_2_19_1","volume-title":"Obtaining shape from shading information. The Psychology of Computer Vision","author":"Horn B. K. P.","year":"1975","unstructured":"B. K. P. Horn . 1975. Obtaining shape from shading information. The Psychology of Computer Vision ( 1975 ). B. K. P. Horn. 1975. Obtaining shape from shading information. The Psychology of Computer Vision (1975)."},{"key":"e_1_2_2_21_1","doi-asserted-by":"crossref","unstructured":"H. G. Jeon J. Park G. Choe J. Park Y. Bok Y. W. Tai and I. S. Kweon. 2015. Accurate depth map estimation from a lenslet light field camera. CVPR (2015).  H. G. Jeon J. Park G. Choe J. Park Y. Bok Y. W. Tai and I. S. Kweon. 2015. Accurate depth map estimation from a lenslet light field camera. CVPR (2015).","DOI":"10.1109\/CVPR.2015.7298762"},{"key":"e_1_2_2_22_1","unstructured":"Neel Joshi and Larry Zitnick. 2014. Micro-Baseline Stereo. Technical Report.  Neel Joshi and Larry Zitnick. 2014. Micro-Baseline Stereo. Technical Report."},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1145\/1276377.1276497"},{"key":"e_1_2_2_24_1","doi-asserted-by":"crossref","unstructured":"M. Kraus and M. Strengert. 2007. Depth-of-Field Rendering by Pyramidal Image Processing. Computer Graphics Forum (2007).  M. Kraus and M. Strengert. 2007. Depth-of-Field Rendering by Pyramidal Image Processing. Computer Graphics Forum (2007).","DOI":"10.1111\/j.1467-8659.2007.01088.x"},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2008.106"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237199"},{"key":"e_1_2_2_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2015.2505283"},{"key":"e_1_2_2_28_1","volume-title":"Fully Convolutional Networks for Semantic Segmentation. CVPR","author":"Long Jonathan","year":"2015","unstructured":"Jonathan Long , Evan Shelhamer , and Trevor Darrell . 2015. Fully Convolutional Networks for Semantic Segmentation. CVPR ( 2015 ). Jonathan Long, Evan Shelhamer, and Trevor Darrell. 2015. Fully Convolutional Networks for Semantic Segmentation. CVPR (2015)."},{"key":"e_1_2_2_29_1","volume-title":"Stacked Hourglass Networks for Human Pose Estimation. ECCV","author":"Newell Alejandro","year":"2016","unstructured":"Alejandro Newell , Kaiyu Yang , and Jia Deng . 2016. Stacked Hourglass Networks for Human Pose Estimation. ECCV ( 2016 ). Alejandro Newell, Kaiyu Yang, and Jia Deng. 2016. Stacked Hourglass Networks for Human Pose Estimation. ECCV (2016)."},{"key":"e_1_2_2_30_1","unstructured":"Ren Ng Marc Levoy Mathieu Br\u00e9dif Gene Duval Mark Horowitz and Pat Hanrahan. 2005. Light field photography with a hand-held plenoptic camera. (2005).  Ren Ng Marc Levoy Mathieu Br\u00e9dif Gene Duval Mark Horowitz and Pat Hanrahan. 2005. Light field photography with a hand-held plenoptic camera. (2005)."},{"key":"e_1_2_2_31_1","volume-title":"Towards Accurate Multi-person Pose Estimation in the Wild. CVPR","author":"Papandreou George","year":"2017","unstructured":"George Papandreou , Tyler Zhu , Nori Kanazawa , Alexander Toshev , Jonathan Tompson . Chris Bregler , and Kevin Murphy . 2017. Towards Accurate Multi-person Pose Estimation in the Wild. CVPR ( 2017 ). George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson. Chris Bregler, and Kevin Murphy. 2017. Towards Accurate Multi-person Pose Estimation in the Wild. CVPR (2017)."},{"key":"e_1_2_2_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/2499370.2462176"},{"key":"e_1_2_2_33_1","volume-title":"U-Net: Convolutional Networks for Biomedical Image Segmentation. MICCAI","author":"Ronneberger Olaf","year":"2015","unstructured":"Olaf Ronneberger , Philipp Fischer , and Thomas Brox . 2015. U-Net: Convolutional Networks for Biomedical Image Segmentation. MICCAI ( 2015 ). Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-Net: Convolutional Networks for Biomedical Image Segmentation. MICCAI (2015)."},{"key":"e_1_2_2_34_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2008.132"},{"key":"e_1_2_2_35_1","doi-asserted-by":"publisher","DOI":"10.1023\/A:1014573219977"},{"key":"e_1_2_2_36_1","volume-title":"Automatic portrait segmentation for image stylization. Computer Graphics Forum","author":"Shen Xiaoyong","year":"2016","unstructured":"Xiaoyong Shen , Aaron Hertzmann , Jiaya Jia , Sylvain Paris , Brian Price , Eli Shechtman , and Ian Sachs . 2016a. Automatic portrait segmentation for image stylization. Computer Graphics Forum ( 2016 ). Xiaoyong Shen, Aaron Hertzmann, Jiaya Jia, Sylvain Paris, Brian Price, Eli Shechtman, and Ian Sachs. 2016a. Automatic portrait segmentation for image stylization. Computer Graphics Forum (2016)."},{"key":"e_1_2_2_37_1","volume-title":"Deep Automatic Portrait Matting. ECCV","author":"Shen Xiaoyong","year":"2016","unstructured":"Xiaoyong Shen , Xin Tao , Hongyun Gao , Chao Zhou , and Jiaya Jia . 2016b. Deep Automatic Portrait Matting. ECCV ( 2016 ). Xiaoyong Shen, Xin Tao, Hongyun Gao, Chao Zhou, and Jiaya Jia. 2016b. Deep Automatic Portrait Matting. ECCV (2016)."},{"key":"e_1_2_2_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.205"},{"key":"e_1_2_2_39_1","doi-asserted-by":"crossref","unstructured":"S. Suwajanakorn C. Hernandez and S. M. Seitz. 2015. Depth from focus with your mobile phone. CVPR (2015).  S. Suwajanakorn C. Hernandez and S. M. Seitz. 2015. Depth from focus with your mobile phone. CVPR (2015).","DOI":"10.1109\/CVPR.2015.7298972"},{"key":"e_1_2_2_40_1","doi-asserted-by":"crossref","unstructured":"H. Tang S. Cohen B. Price S. Schiller and K. N. Kutulakos. 2017. Depth from Defocus in the Wild. CVPR (2017).  H. Tang S. Cohen B. Price S. Schiller and K. N. Kutulakos. 2017. Depth from Defocus in the Wild. CVPR (2017).","DOI":"10.1109\/CVPR.2017.507"},{"key":"e_1_2_2_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2013.89"},{"key":"e_1_2_2_42_1","volume-title":"Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation. NIPS","author":"Tompson Jonathan","year":"2014","unstructured":"Jonathan Tompson , Arjun Jain , Yann LeCun , and Christoph Bregler . 2014. Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation. NIPS ( 2014 ). Jonathan Tompson, Arjun Jain, Yann LeCun, and Christoph Bregler. 2014. Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation. NIPS (2014)."},{"key":"e_1_2_2_43_1","volume":"201","author":"Tripathi Subarna","unstructured":"Subarna Tripathi , Maxwell Collins , Matthew Brown , and Serge J. Belongie. 201 7. Pose2Instance: Harnessing Keypoints for Person Instance Segmentation. CoRR abs\/1704.01152 (2017). Subarna Tripathi, Maxwell Collins, Matthew Brown, and Serge J. Belongie. 2017. Pose2Instance: Harnessing Keypoints for Person Instance Segmentation. CoRR abs\/1704.01152 (2017).","journal-title":"Serge J. Belongie."},{"key":"e_1_2_2_44_1","volume-title":"Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks. ECCV","author":"Xie Junyuan","year":"2016","unstructured":"Junyuan Xie , Ross Girshick , and Ali Farhadi . 2016. Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks. ECCV ( 2016 ). Junyuan Xie, Ross Girshick, and Ali Farhadi. 2016. Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks. ECCV (2016)."},{"key":"e_1_2_2_45_1","doi-asserted-by":"crossref","unstructured":"N. Xu B. Price S. Cohen and T. Huang. 2017. Deep Image Matting. CVPR (2017).  N. Xu B. Price S. Cohen and T. Huang. 2017. Deep Image Matting. CVPR (2017).","DOI":"10.1109\/CVPR.2017.41"},{"key":"e_1_2_2_46_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2014.509"},{"key":"e_1_2_2_47_1","volume-title":"Lowe","author":"Zhou Tinghui","year":"2017","unstructured":"Tinghui Zhou , Matthew Brown , Noah Snavely , and David G . Lowe . 2017 . Unsupervised Learning of Depth and Ego-Motion from Video. CVPR (2017). Tinghui Zhou, Matthew Brown, Noah Snavely, and David G. Lowe. 2017. Unsupervised Learning of Depth and Ego-Motion from Video. CVPR (2017)."},{"key":"e_1_2_2_48_1","doi-asserted-by":"publisher","DOI":"10.1145\/3123266.3123286"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3197517.3201329","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3197517.3201329","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T01:39:44Z","timestamp":1750210784000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3197517.3201329"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,7,30]]},"references-count":47,"journal-issue":{"issue":"4","published-print":{"date-parts":[[2018,8,31]]}},"alternative-id":["10.1145\/3197517.3201329"],"URL":"https:\/\/doi.org\/10.1145\/3197517.3201329","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,7,30]]},"assertion":[{"value":"2018-07-30","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}