{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,28]],"date-time":"2026-01-28T21:50:05Z","timestamp":1769637005503,"version":"3.49.0"},"reference-count":31,"publisher":"Association for Computing Machinery (ACM)","issue":"2s","license":[{"start":{"date-parts":[[2020,4,30]],"date-time":"2020-04-30T00:00:00Z","timestamp":1588204800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Multimedia Comput. Commun. Appl."],"published-print":{"date-parts":[[2020,4,30]]},"abstract":"<jats:p>\n            As a new form of volumetric media, Light Field (LF) can provide users with a true six degrees of freedom immersive experience because LF captures the scene with photo-realism, including aperture-limited changes in viewpoint. But uncompressed LF data is too large for network transmission, which is the reason why LF compression has become an important research topic. One of the more recent approaches for LF compression is to reduce the angular resolution of the input LF during compression and to use LF reconstruction to recover the discarded viewpoints during decompression. Following this approach, we propose a new LF reconstruction algorithm based on Graph Neural Networks; we show that it can achieve higher compression and better quality compared to existing reconstruction methods, although suffering from the same problem as those methods\u2014the inability to deal effectively with high-frequency image components. To solve this problem, we propose an adaptive two-layer compression architecture that separates high-frequency and low-frequency components and compresses each with a different strategy so that the performance can become robust and controllable. Experiments with multiple datasets\n            <jats:sup>1<\/jats:sup>\n            show that our proposed scheme is capable of providing a decompression quality of above 40 dB, and can significantly improve compression efficiency compared with similar LF reconstruction schemes.\n          <\/jats:p>","DOI":"10.1145\/3395620","type":"journal-article","created":{"date-parts":[[2020,6,22]],"date-time":"2020-06-22T02:49:20Z","timestamp":1592794160000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["An Adaptive Two-Layer Light Field Compression Scheme Using GNN-Based Reconstruction"],"prefix":"10.1145","volume":"16","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-8304-9720","authenticated-orcid":false,"given":"Xinjue","family":"Hu","sequence":"first","affiliation":[{"name":"Beijing University of Posts and Telecommunication, Haidian, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingming","family":"Shan","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunication, Haidian, Beijing, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yu","family":"Liu","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunication; Research Center of Networks and Communications, Peng Cheng Laboratory, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lin","family":"Zhang","sequence":"additional","affiliation":[{"name":"Beijing University of Posts and Telecommunication; Research Center of Networks and Communications, Peng Cheng Laboratory, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Shervin","family":"Shirmohammadi","sequence":"additional","affiliation":[{"name":"University of Ottawa, Ottawa, ON, Canada"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2020,6,21]]},"reference":[{"key":"e_1_2_1_1_1","volume-title":"Bergen","author":"Adelson Edward H.","year":"1991","unstructured":"Edward H. Adelson and James R . Bergen . 1991 . The Plenoptic Function and the Elements of Early Vision. Vol. 2 . Vision and Modeling Group, Media Laboratory, Massachusetts Institute of Technology . Edward H. Adelson and James R. Bergen. 1991. The Plenoptic Function and the Elements of Early Vision. Vol. 2. Vision and Modeling Group, Media Laboratory, Massachusetts Institute of Technology."},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1109\/JDT.2011.2159359"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1109\/DCC.2018.00050"},{"key":"e_1_2_1_4_1","volume-title":"Stanford University","author":"Laboratory Computer Graphics","year":"2008","unstructured":"Computer Graphics Laboratory , Stanford University 2008 . Light Field Datasets. Retrieved from http:\/\/lightfield.stanford.edu\/lfs.html. Computer Graphics Laboratory, Stanford University 2008. Light Field Datasets. Retrieved from http:\/\/lightfield.stanford.edu\/lfs.html."},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.image.2016.01.008"},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/CVPR.2013.137"},{"key":"e_1_2_1_7_1","volume-title":"23rd Annual Conference on Computer Graphics and Interactive Techniques. 43--54","author":"Gortler Steven J.","unstructured":"Steven J. Gortler , Radek Grzeszczuk , Richard Szeliski , and Michael F. Cohen . 1996. The lumigraph . In 23rd Annual Conference on Computer Graphics and Interactive Techniques. 43--54 . Steven J. Gortler, Radek Grzeszczuk, Richard Szeliski, and Michael F. Cohen. 1996. The lumigraph. In 23rd Annual Conference on Computer Graphics and Interactive Techniques. 43--54."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.1109\/DCC.2018.00019"},{"key":"e_1_2_1_9_1","unstructured":"Will Hamilton Zhitao Ying and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems. 1024--1034.  Will Hamilton Zhitao Ying and Jure Leskovec. 2017. Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems. 1024--1034."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1109\/BMSB.2017.7986144"},{"key":"e_1_2_1_11_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICIP.2017.8296883"},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/3304109.3306228"},{"key":"e_1_2_1_13_1","first-page":"1","article-title":"Light field compression with homography-based low rank approximation","volume":"99","author":"Jiang Xiaoran","year":"2017","unstructured":"Xiaoran Jiang , Mika\u00ebl Le Pendu , Reuben A. Farrugia , and Christine Guillemot . 2017 . Light field compression with homography-based low rank approximation . IEEE Journal of Selected Topics in Signal Processing PP , 99 (2017), 1 -- 1 . Xiaoran Jiang, Mika\u00ebl Le Pendu, Reuben A. Farrugia, and Christine Guillemot. 2017. Light field compression with homography-based low rank approximation. IEEE Journal of Selected Topics in Signal Processing PP, 99 (2017), 1--1.","journal-title":"IEEE Journal of Selected Topics in Signal Processing PP"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2012.6288140"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1145\/237170.237199"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSTSP.2017.2725198"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICASSP.2014.6853654"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICMEW.2016.7574674"},{"key":"e_1_2_1_19_1","volume-title":"Signal Processing Conference. 11--15","author":"Lucas Lu\u00eds F. R.","unstructured":"Lu\u00eds F. R. Lucas , Caroline Conti , Paulo Nunes , Lu\u00eds Ducla Soares , Nuno M. M. Rodrigues , Carla L. Pagliari , Eduardo A. B. Da Silva , and S\u00e9rgio M. M . De Faria. 2014. Locally linear embedding-based prediction for 3D holoscopic image coding using HEVC . In Signal Processing Conference. 11--15 . Lu\u00eds F. R. Lucas, Caroline Conti, Paulo Nunes, Lu\u00eds Ducla Soares, Nuno M. M. Rodrigues, Carla L. Pagliari, Eduardo A. B. Da Silva, and S\u00e9rgio M. M. De Faria. 2014. Locally linear embedding-based prediction for 3D holoscopic image coding using HEVC. In Signal Processing Conference. 11--15."},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICMEW.2016.7574670"},{"key":"e_1_2_1_22_1","volume-title":"International Conference on Machine Learning.","author":"Niepert Mathias","year":"2016","unstructured":"Mathias Niepert , Mohamed Ahmed , and Konstantin Kutzkov . 2016 . Learning convolutional neural networks for graphs . In International Conference on Machine Learning. Mathias Niepert, Mohamed Ahmed, and Konstantin Kutzkov. 2016. Learning convolutional neural networks for graphs. In International Conference on Machine Learning."},{"key":"e_1_2_1_23_1","volume-title":"IEEE International Conference on Multimedia and Expo Workshops. 1--4.","author":"Perra C.","unstructured":"C. Perra and P. Assuncao . 2016. High efficiency coding of light field images based on tiling and pseudo-temporal data arrangement . In IEEE International Conference on Multimedia and Expo Workshops. 1--4. C. Perra and P. Assuncao. 2016. High efficiency coding of light field images based on tiling and pseudo-temporal data arrangement. In IEEE International Conference on Multimedia and Expo Workshops. 1--4."},{"key":"e_1_2_1_24_1","volume-title":"JPEG 2000 compression of unfocused light field images based on lenslet array slicing. In IEEE International Conference on Consumer Electronics. 27--28","author":"Perra Cristian","year":"2017","unstructured":"Cristian Perra and Daniele Giusto . 2017 . JPEG 2000 compression of unfocused light field images based on lenslet array slicing. In IEEE International Conference on Consumer Electronics. 27--28 . Cristian Perra and Daniele Giusto. 2017. JPEG 2000 compression of unfocused light field images based on lenslet array slicing. In IEEE International Conference on Consumer Electronics. 27--28."},{"key":"e_1_2_1_25_1","volume-title":"IEEE International Symposium on Broadband Multimedia Systems and Broadcasting.","author":"Rassool Reza","year":"2017","unstructured":"Reza Rassool . 2017 . VMAF reproducibility: Validating a perceptual practical video quality metric . In IEEE International Symposium on Broadband Multimedia Systems and Broadcasting. Reza Rassool. 2017. VMAF reproducibility: Validating a perceptual practical video quality metric. In IEEE International Symposium on Broadband Multimedia Systems and Broadcasting."},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/TNN.2008.2005605"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/2682631"},{"key":"e_1_2_1_28_1","volume-title":"Pascal Frossard, and Touradj Ebrahimi.","author":"Viola Irene","year":"2018","unstructured":"Irene Viola , Hermina Petric Maretic , Pascal Frossard, and Touradj Ebrahimi. 2018 . A graph learning approach for light field image compression. Applications of Digital Image Processing XLISpie-Int Soc Optical Engineering ( 2018), 12. Irene Viola, Hermina Petric Maretic, Pascal Frossard, and Touradj Ebrahimi. 2018. A graph learning approach for light field image compression. Applications of Digital Image Processing XLISpie-Int Soc Optical Engineering (2018), 12."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/JSTSP.2017.2747126"},{"key":"e_1_2_1_30_1","volume-title":"Asian Conference on Computer Vision. Springer, 3--15","author":"Xu Shan","year":"2014","unstructured":"Shan Xu , Zhi-Liang Zhou , and Nicholas Devaney . 2014 . Multi-view image restoration from plenoptic raw images . In Asian Conference on Computer Vision. Springer, 3--15 . Shan Xu, Zhi-Liang Zhou, and Nicholas Devaney. 2014. Multi-view image restoration from plenoptic raw images. In Asian Conference on Computer Vision. Springer, 3--15."},{"key":"e_1_2_1_31_1","doi-asserted-by":"crossref","unstructured":"Wei Zhang Dong Liu Zhiwei Xiong and Jizheng Xu. 2018. SIFT-based adaptive prediction structure for light field compression. In Visual Communications and Image Processing. 1--4.  Wei Zhang Dong Liu Zhiwei Xiong and Jizheng Xu. 2018. SIFT-based adaptive prediction structure for light field compression. In Visual Communications and Image Processing. 1--4.","DOI":"10.1109\/VCIP.2017.8305107"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1109\/JETCAS.2018.2883479"}],"container-title":["ACM Transactions on Multimedia Computing, Communications, and Applications"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3395620","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3395620","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T22:38:45Z","timestamp":1750199925000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3395620"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,4,30]]},"references-count":31,"journal-issue":{"issue":"2s","published-print":{"date-parts":[[2020,4,30]]}},"alternative-id":["10.1145\/3395620"],"URL":"https:\/\/doi.org\/10.1145\/3395620","relation":{},"ISSN":["1551-6857","1551-6865"],"issn-type":[{"value":"1551-6857","type":"print"},{"value":"1551-6865","type":"electronic"}],"subject":[],"published":{"date-parts":[[2020,4,30]]},"assertion":[{"value":"2019-12-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-04-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2020-06-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}