{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,22]],"date-time":"2026-03-22T06:44:38Z","timestamp":1774161878015,"version":"3.50.1"},"reference-count":25,"publisher":"Association for Computing Machinery (ACM)","issue":"6","license":[{"start":{"date-parts":[[2018,12,4]],"date-time":"2018-12-04T00:00:00Z","timestamp":1543881600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Graph."],"published-print":{"date-parts":[[2018,12,31]]},"abstract":"<jats:p>We aim to generate high resolution shallow depth-of-field (DoF) images from a single all-in-focus image with controllable focal distance and aperture size. To achieve this, we propose a novel neural network model comprised of a depth prediction module, a lens blur module, and a guided upsampling module. All modules are differentiable and are learned from data. To train our depth prediction module, we collect a dataset of 2462 RGB-D images captured by mobile phones with a dual-lens camera, and use existing segmentation datasets to improve border prediction. We further leverage a synthetic dataset with known depth to supervise the lens blur and guided upsampling modules. The effectiveness of our system and training strategies are verified in the experiments. Our method can generate high-quality shallow DoF images at high resolution, and produces significantly fewer artifacts than the baselines and existing solutions for single image shallow DoF synthesis. 
Compared with the iPhone portrait mode, which is a state-of-the-art shallow DoF solution based on a dual-lens depth camera, our method generates comparable results, while allowing for greater flexibility to choose focal points and aperture size, and is not limited to one capture setup.<\/jats:p>","DOI":"10.1145\/3272127.3275013","type":"journal-article","created":{"date-parts":[[2018,11,28]],"date-time":"2018-11-28T19:16:10Z","timestamp":1543432570000},"page":"1-11","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":43,"title":["DeepLens"],"prefix":"10.1145","volume":"37","author":[{"given":"Lijun","family":"Wang","sequence":"first","affiliation":[{"name":"Dalian University of Technology"}]},{"given":"Xiaohui","family":"Shen","sequence":"additional","affiliation":[{"name":"ByteDance AI Lab"}]},{"given":"Jianming","family":"Zhang","sequence":"additional","affiliation":[{"name":"Adobe Research"}]},{"given":"Oliver","family":"Wang","sequence":"additional","affiliation":[{"name":"Adobe Research"}]},{"given":"Zhe","family":"Lin","sequence":"additional","affiliation":[{"name":"Adobe Research"}]},{"given":"Chih-Yao","family":"Hsieh","sequence":"additional","affiliation":[{"name":"Adobe Systems"}]},{"given":"Sarah","family":"Kong","sequence":"additional","affiliation":[{"name":"Adobe Systems"}]},{"given":"Huchuan","family":"Lu","sequence":"additional","affiliation":[{"name":"Dalian University of Technology"}]}],"member":"320","published-online":{"date-parts":[[2018,12,4]]},"reference":[{"key":"e_1_2_2_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299076"},{"key":"e_1_2_2_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/TPAMI.2014.2345401"},{"key":"e_1_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2009.5206848"},{"key":"e_1_2_2_4_1","unstructured":"David Eigen Christian Puhrsch and Rob Fergus. 2014. Depth map prediction from a single image using a multi-scale deep network. 
In Advances in neural information processing systems. 2366--2374."},{"key":"e_1_2_2_5_1","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.","author":"Geiger Andreas","year":"2012","unstructured":"Andreas Geiger, Philip Lenz, and Raquel Urtasun. 2012. Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_2_2_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.584"},{"key":"e_1_2_2_7_1","doi-asserted-by":"publisher","DOI":"10.1145\/97880.97913"},{"key":"e_1_2_2_8_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-15549-9_1"},{"key":"e_1_2_2_9_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_2_10_1","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition","author":"Isola Phillip","year":"2017","unstructured":"Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. 2017. Image-to-Image Translation with Conditional Adversarial Networks. 
Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (2017), 5967--5976."},{"key":"e_1_2_2_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/2766920"},{"key":"e_1_2_2_13_1","volume-title":"Computer Graphics Forum","author":"Kraus Martin","unstructured":"Martin Kraus and Magnus Strengert. 2007. Depth-of-Field Rendering by Pyramidal Image Processing. In Computer Graphics Forum, Vol. 26. Wiley Online Library, 645--654."},{"key":"e_1_2_2_14_1","volume-title":"2016 Fourth International Conference on. IEEE, 239--248","author":"Laina Iro","year":"2016","unstructured":"Iro Laina, Christian Rupprecht, Vasileios Belagiannis, Federico Tombari, and Nassir Navab. 2016. Deeper depth prediction with fully convolutional residual networks. In 3D Vision, 2016 Fourth International Conference on. IEEE, 239--248."},{"key":"e_1_2_2_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/1778765.1778802"},{"key":"e_1_2_2_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2008.106"},{"key":"e_1_2_2_17_1","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.","author":"Li Zhengqi","year":"2018","unstructured":"Zhengqi Li and Noah Snavely. 2018. MegaDepth: Learning Single-View Depth Prediction from Internet Photos. 
In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7299152"},{"key":"e_1_2_2_19_1","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-33715-4_54"},{"key":"e_1_2_2_20_1","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.","author":"Srinivasan Pratul P.","unstructured":"Pratul P. Srinivasan, Rahul Garg, Neal Wadhwa, Ren Ng, and Jonathan T. Barron. 2018. Aperture Supervision for Monocular Depth Estimation. In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_2_2_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICCV.2017.246"},{"key":"e_1_2_2_22_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2015.7298972"},{"key":"e_1_2_2_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.404"},{"key":"e_1_2_2_24_1","volume-title":"Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.","author":"Xu Ning","year":"2017","unstructured":"Ning Xu, Brian Price, Scott Cohen, and Thomas Huang. 2017. Deep image matting. 
In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition."},{"key":"e_1_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1155\/2016\/4125909"},{"key":"e_1_2_2_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2017.660"}],"container-title":["ACM Transactions on Graphics"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3272127.3275013","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3272127.3275013","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T00:44:03Z","timestamp":1750207443000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3272127.3275013"}},"subtitle":["shallow depth of field from a single image"],"short-title":[],"issued":{"date-parts":[[2018,12,4]]},"references-count":25,"journal-issue":{"issue":"6","published-print":{"date-parts":[[2018,12,31]]}},"alternative-id":["10.1145\/3272127.3275013"],"URL":"https:\/\/doi.org\/10.1145\/3272127.3275013","relation":{},"ISSN":["0730-0301","1557-7368"],"issn-type":[{"value":"0730-0301","type":"print"},{"value":"1557-7368","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,12,4]]},"assertion":[{"value":"2018-12-04","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}