{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,11]],"date-time":"2025-09-11T17:02:37Z","timestamp":1757610157720,"version":"3.44.0"},"reference-count":58,"publisher":"Association for Computing Machinery (ACM)","issue":"8","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2025,4]]},"abstract":"<jats:p>Error-bounded lossy compression has been widely adopted in many scientific domains because it can address the challenges in storing, transferring, and analyzing unprecedented amounts of scientific data. However, general error-bounded lossy compressors may fail to meet additional quality requirements for downstream analysis, a.k.a. Quantities of Interest (QoIs). This may lead to uncertainties and even misinterpretations in scientific discoveries, significantly limiting the use of lossy compression in practice. In this paper, we propose QPET, a novel, versatile, and portable framework for QoI-preserving error-bounded lossy compression, which overcomes the challenges of modeling diverse QoIs by leveraging numerical strategies. QPET features (1) high portability to multiple existing lossy compressors, (2) versatile preservation to most differentiable univariate and multivariate QoIs, and (3) significant compression improvements in QoI-preservation tasks. Experiments with six real-world datasets demonstrate that integrating QPET into state-of-the-art error-bounded lossy compressors can gain 2x to 10x compression speedups of existing QoI-preserving error-bounded lossy compression solutions, up to 1000% compression ratio improvements to general-purpose compressors, and up to 133% compression ratio improvements to existing QoI-integrated scientific compressors.<\/jats:p>","DOI":"10.14778\/3742728.3742739","type":"journal-article","created":{"date-parts":[[2025,9,3]],"date-time":"2025-09-03T13:32:53Z","timestamp":1756906373000},"page":"2440-2453","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["QPET: A Versatile and Portable Quantity-of-Interest-Preservation Framework for Error-Bounded Lossy Compression"],"prefix":"10.14778","volume":"18","author":[{"given":"Jinyang","family":"Liu","sequence":"first","affiliation":[{"name":"University of Houston, Houston, TX, USA"}]},{"given":"Pu","family":"Jiao","sequence":"additional","affiliation":[{"name":"University of Kentucky, Lexington, KY, USA"}]},{"given":"Kai","family":"Zhao","sequence":"additional","affiliation":[{"name":"Florida State University, Tallahassee, FL, USA"}]},{"given":"Xin","family":"Liang","sequence":"additional","affiliation":[{"name":"University of Kentucky, Lexington, KY, USA"}]},{"given":"Sheng","family":"Di","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL, USA"}]},{"given":"Franck","family":"Cappello","sequence":"additional","affiliation":[{"name":"Argonne National Laboratory, Lemont, IL, USA"}]}],"member":"320","published-online":{"date-parts":[[2025,9,3]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"[n.d.]. HDF5. http:\/\/www.hdfgroup.org\/HDF5. Last Accessed: 2025-05-25."},{"key":"e_1_2_1_2_1","unstructured":"[n.d.]. nvCOMP. https:\/\/github.com\/NVIDIA\/nvcomp. Last Accessed: 2025-05-25."},{"key":"e_1_2_1_3_1","unstructured":"2020. EXAALT: Malecular Dynamics at the Exascale. https:\/\/www.exascaleproject.org\/wp-content\/uploads\/2019\/10\/EXAALT.pdf. Online Last Accessed: 2025-05-25."},{"key":"e_1_2_1_4_1","unstructured":"2021. Team at Princeton Plasma Physics Laboratory employs DOE supercomputers to understand heat-load width requirements of future ITER device. https:\/\/www.olcf.ornl.gov\/2021\/02\/18\/scientists-use-supercomputers-to-study-reliable-fusion-reactor-design-operation. Online Last Accessed: 2025-05-25."},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the 26th Symposium on Operating Systems Principles. 647\u2013664","author":"Agrawal Nitin","year":"2017","unstructured":"Nitin Agrawal and Ashish Vulimiri. 2017. Low-latency analytics on colossal data streams with summarystore. In Proceedings of the 26th Symposium on Operating Systems Principles. 647\u2013664."},{"key":"e_1_2_1_6_1","doi-asserted-by":"crossref","first-page":"A2146","DOI":"10.1137\/18M1208885","article-title":"Multilevel techniques for compression and reduction of scientific data-quantitative control of accuracy in derived quantities","volume":"41","author":"Ainsworth Mark","year":"2019","unstructured":"Mark Ainsworth, Ozan Tugluk, Ben Whitney, and Scott Klasky. 2019. Multilevel techniques for compression and reduction of scientific data-quantitative control of accuracy in derived quantities. SIAM Journal on Scientific Computing 41, 4 (2019), A2146\u2013A2171.","journal-title":"SIAM Journal on Scientific Computing"},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/3231935","article-title":"Brotli: A general-purpose data compressor","volume":"37","author":"Alakuijala Jyrki","year":"2018","unstructured":"Jyrki Alakuijala, Andrea Farruggia, Paolo Ferragina, Eugene Kliuchnikov, Robert Obryk, Zoltan Szabadka, and Lode Vandevenne. 2018. Brotli: A general-purpose data compressor. ACM Transactions on Information Systems (TOIS) 37, 1 (2018), 1\u201330.","journal-title":"ACM Transactions on Information Systems (TOIS)"},{"key":"e_1_2_1_8_1","doi-asserted-by":"crossref","first-page":"10","DOI":"10.1145\/1239971.1239974","article-title":"XQueC: A query-conscious compressed XML database","volume":"7","author":"Arion Andrei","year":"2007","unstructured":"Andrei Arion, Angela Bonifati, Ioana Manolescu, and Andrea Pugliese. 2007. XQueC: A query-conscious compressed XML database. ACM Transactions on Internet Technology (TOIT) 7, 2 (2007), 10\u2013es.","journal-title":"ACM Transactions on Internet Technology (TOIT)"},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","first-page":"3736","DOI":"10.1109\/TCSVT.2021.3101953","article-title":"Overview of the versatile video coding (VVC) standard and its applications","volume":"31","author":"Bross Benjamin","year":"2021","unstructured":"Benjamin Bross, Ye-Kui Wang, Yan Ye, Shan Liu, Jianle Chen, Gary J Sullivan, and Jens-Rainer Ohm. 2021. Overview of the versatile video coding (VVC) standard and its applications. IEEE Transactions on Circuits and Systems for Video Technology 31, 10 (2021), 3736\u20133764.","journal-title":"IEEE Transactions on Circuits and Systems for Video Technology"},{"key":"e_1_2_1_10_1","volume-title":"Proceedings of the 2001 ACM SIGMOD international conference on Management of data. 271\u2013282","author":"Chen Zhiyuan","year":"2001","unstructured":"Zhiyuan Chen, Johannes Gehrke, and Flip Korn. 2001. Query optimization in compressed database systems. In Proceedings of the 2001 ACM SIGMOD international conference on Management of data. 271\u2013282."},{"key":"e_1_2_1_11_1","unstructured":"Yann Collet. 2015. Zstandard - Real-time data compression algorithm. http:\/\/facebook.github.io\/zstd\/ (2015)."},{"key":"e_1_2_1_12_1","doi-asserted-by":"crossref","unstructured":"L Peter Deutsch. 1996. GZIP file format specification version 4.3.","DOI":"10.17487\/rfc1952"},{"key":"e_1_2_1_13_1","first-page":"1","article-title":"Online Piece-Wise Linear Approximation of Numerical Streams with Precision Guarantees","volume":"2","author":"Elmeleegy Hazem","year":"2009","unstructured":"Hazem Elmeleegy, Ahmed K. Elmagarmid, Emmanuel Cecchet, Walid G. Aref, and Willy Zwaenepoel. 2009. Online Piece-Wise Linear Approximation of Numerical Streams with Precision Guarantees. Proc. VLDB Endow. 2, 1 (Aug. 2009), 145\u2013156.","journal-title":"Proc. VLDB Endow."},{"key":"e_1_2_1_14_1","volume-title":"European conference on parallel processing. Springer, 3\u201319","author":"Foster Ian","year":"2017","unstructured":"Ian Foster, Mark Ainsworth, Bryce Allen, Julie Bessac, Franck Cappello, Jong Youl Choi, Emil Constantinescu, Philip E Davis, Sheng Di, Wendy Di, Hanqi Guo, Scott Klasky, Kerstin Kleese Van Dam, Tahsin Kurc, Qing Liu, Abid Malik, Kshitij Mehta, Klaus Mueller, Todd Munson, George Ostouchov, Manish Parashar, Tom Peterka, Line Pouchard, Dingwen Tao, Ozan Tugluk, Stefan Wild, Matthew Wolf, Justin M. Wozniak, Wei Xu, and Shinjae Yoo. 2017. Computing just what you need: Online data analysis and reduction at extreme scales. In European conference on parallel processing. Springer, 3\u201319."},{"key":"e_1_2_1_15_1","volume-title":"Smoky Mountains Computational Sciences and Engineering Conference. Springer, 22\u201339","author":"Gong Qian","year":"2021","unstructured":"Qian Gong, Xin Liang, Ben Whitney, Jong Youl Choi, Jieyang Chen, Lipeng Wan, St\u00e9phane Ethier, Seung-Hoe Ku, R Michael Churchill, C-S Chang, et al. 2021. Maintaining trust in reduction: Preserving the accuracy of quantities of interest for lossy compression. In Smoky Mountains Computational Sciences and Engineering Conference. Springer, 22\u201339."},{"key":"e_1_2_1_16_1","volume-title":"Smoky Mountains Computational Sciences and Engineering Conference. Springer, 22\u201339","author":"Gong Qian","year":"2021","unstructured":"Qian Gong, Xin Liang, Ben Whitney, Jong Youl Choi, Jieyang Chen, Lipeng Wan, St\u00e9phane Ethier, Seung-Hoe Ku, R Michael Churchill, C-S Chang, et al. 2021. Maintaining trust in reduction: Preserving the accuracy of quantities of interest for lossy compression. In Smoky Mountains Computational Sciences and Engineering Conference. Springer, 22\u201339."},{"key":"e_1_2_1_17_1","doi-asserted-by":"crossref","first-page":"13","DOI":"10.1080\/01621459.1963.10500830","article-title":"Probability Inequalities for Sums of Bounded Random Variables","volume":"58","author":"Hoeffding Wassily","year":"1963","unstructured":"Wassily Hoeffding. 1963. Probability Inequalities for Sums of Bounded Random Variables. J. Amer. Statist. Assoc. 58, 301 (1963), 13\u201330. http:\/\/www.jstor.org\/stable\/2282952","journal-title":"J. Amer. Statist. Assoc."},{"volume-title":"Computer Graphics Forum","author":"Ibarria Lawrence","key":"e_1_2_1_18_1","unstructured":"Lawrence Ibarria, Peter Lindstrom, Jarek Rossignac, and Andrzej Szymczak. 2003. Out-of-core compression and decompression of large n-dimensional scalar fields. In Computer Graphics Forum, Vol. 22. Wiley Online Library, 343\u2013348."},{"key":"e_1_2_1_19_1","doi-asserted-by":"crossref","first-page":"1688","DOI":"10.14778\/3236187.3236215","article-title":"Modelardb: Modular model-based time series management with spark and cassandra","volume":"11","author":"Jensen S\u00f8ren Kejser","year":"2018","unstructured":"S\u00f8ren Kejser Jensen, Torben Bach Pedersen, and Christian Thomsen. 2018. Modelardb: Modular model-based time series management with spark and cassandra. Proceedings of the VLDB Endowment 11, 11 (2018), 1688\u20131701.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_2_1_20_1","doi-asserted-by":"crossref","first-page":"697","DOI":"10.14778\/3574245.3574255","article-title":"Toward Quantity-of-Interest Preserving Lossy Compression for Scientific Data","volume":"16","author":"Jiao Pu","year":"2022","unstructured":"Pu Jiao, Sheng Di, Hanqi Guo, Kai Zhao, Jiannan Tian, Dingwen Tao, Xin Liang, and Franck Cappello. 2022. Toward Quantity-of-Interest Preserving Lossy Compression for Scientific Data. Proceedings of the VLDB Endowment 16, 4 (2022), 697\u2013710.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_2_1_21_1","volume-title":"Torben Bach Pedersen, and Christian Thomsen","author":"Jensen S\u00f8ren Kejser","year":"2019","unstructured":"S\u00f8ren Kejser Jensen, Torben Bach Pedersen, and Christian Thomsen. 2019. Scalable Model-Based Management of Correlated Dimensional Time Series in ModelarDB+. arXiv e-prints (2019), arXiv-1903."},{"key":"e_1_2_1_22_1","volume-title":"2021 Data Compression Conference (DCC). IEEE, 103\u2013112","author":"Knorr Fabian","year":"2021","unstructured":"Fabian Knorr, Peter Thoman, and Thomas Fahringer. 2021. ndzip: A high-throughput parallel lossless compressor for scientific data. In 2021 Data Compression Conference (DCC). IEEE, 103\u2013112."},{"key":"e_1_2_1_23_1","volume-title":"Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. 1\u201314","author":"Knorr Fabian","year":"2021","unstructured":"Fabian Knorr, Peter Thoman, and Thomas Fahringer. 2021. ndzip-gpu: efficient lossless compression of scientific floating-point data on GPUs. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. 1\u201314."},{"key":"e_1_2_1_24_1","volume-title":"Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405)","author":"Lazaridis I.","year":"2003","unstructured":"I. Lazaridis and S. Mehrotra. 2003. Capturing sensor-generated time series with quality guarantees. In Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405). 429\u2013440. 10.1109\/ICDE.2003.1260811"},{"key":"e_1_2_1_25_1","volume-title":"2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, 1007\u20131017","author":"Li Shaomeng","year":"2023","unstructured":"Shaomeng Li, Peter Lindstrom, and John Clyne. 2023. Lossy scientific data compression with SPERR. In 2023 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, 1007\u20131017."},{"key":"e_1_2_1_26_1","doi-asserted-by":"crossref","first-page":"3058","DOI":"10.14778\/3551793.3551852","article-title":"Chimp: efficient lossless floating point compression for time series databases","volume":"15","author":"Liakos Panagiotis","year":"2022","unstructured":"Panagiotis Liakos, Katia Papakonstantinopoulou, and Yannis Kotidis. 2022. Chimp: efficient lossless floating point compression for time series databases. Proceedings of the VLDB Endowment 15, 11 (2022), 3058\u20133070.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_2_1_27_1","doi-asserted-by":"crossref","first-page":"5434","DOI":"10.1109\/TVCG.2022.3214821","article-title":"Toward feature-preserving vector field compression","volume":"29","author":"Liang Xin","year":"2022","unstructured":"Xin Liang, Sheng Di, Franck Cappello, Mukund Raj, Chunhui Liu, Kenji Ono, Zizhong Chen, Tom Peterka, and Hanqi Guo. 2022. Toward feature-preserving vector field compression. IEEE Transactions on Visualization and Computer Graphics 29, 12 (2022), 5434\u20135450.","journal-title":"IEEE Transactions on Visualization and Computer Graphics"},{"key":"e_1_2_1_28_1","volume-title":"2018 IEEE International Conference on Cluster Computing (CLUSTER). IEEE, 179\u2013189","author":"Liang Xin","year":"2018","unstructured":"Xin Liang, Sheng Di, Dingwen Tao, Zizhong Chen, and Franck Cappello. 2018. An efficient transformation scheme for lossy data compression with point-wise relative error bound. In 2018 IEEE International Conference on Cluster Computing (CLUSTER). IEEE, 179\u2013189."},{"key":"e_1_2_1_29_1","volume-title":"Error-Controlled Lossy Compression Optimized for High Compression Ratios of Scientific Datasets. In 2018 IEEE International Conference on Big Data. IEEE.","author":"Liang Xin","year":"2018","unstructured":"Xin Liang, Sheng Di, Dingwen Tao, Sihuan Li, Shaomeng Li, Hanqi Guo, Zizhong Chen, and Franck Cappello. 2018. Error-Controlled Lossy Compression Optimized for High Compression Ratios of Scientific Datasets. In 2018 IEEE International Conference on Big Data. IEEE."},{"key":"e_1_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Xin Liang Hanqi Guo Sheng Di Franck Cappello Mukund Raj Chunhui Liu Kenji Ono Zizhong Chen and Tom Peterka. 2020. Toward Feature-Preserving 2D and 3D Vector Field Compression.. In Pacific Vis. 81\u201390.","DOI":"10.1109\/PacificVis48177.2020.6431"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/TBDATA.2022.3201176"},{"key":"e_1_2_1_32_1","volume-title":"Fixed-rate compressed floating-point arrays","author":"Lindstrom Peter","year":"2014","unstructured":"Peter Lindstrom. 2014. Fixed-rate compressed floating-point arrays. IEEE transactions on visualization and computer graphics 20, 12 (2014), 2674\u20132683."},{"key":"e_1_2_1_33_1","unstructured":"Peter G Lindstrom et al. 2017. Fpzip. Technical Report. Lawrence Livermore National Lab.(LLNL) Livermore CA (United States)."},{"key":"e_1_2_1_34_1","doi-asserted-by":"crossref","first-page":"2586","DOI":"10.14778\/3476249.3476305","article-title":"Decomposed bounded floats for fast compression and queries","volume":"14","author":"Liu Chunwei","year":"2021","unstructured":"Chunwei Liu, Hao Jiang, John Paparrizos, and Aaron J Elmore. 2021. Decomposed bounded floats for fast compression and queries. Proceedings of the VLDB Endowment 14, 11 (2021), 2586\u20132598.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_2_1_35_1","volume-title":"2022 SC22: International Conference for High Performance Computing, Networking, Storage and Analysis (SC). IEEE Computer Society, 892\u2013906","author":"Liu Jinyang","year":"2022","unstructured":"Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, and Franck Cappello. 2022. Dynamic quality metric oriented error bounded lossy compression for scientific datasets. In 2022 SC22: International Conference for High Performance Computing, Networking, Storage and Analysis (SC). IEEE Computer Society, 892\u2013906."},{"key":"e_1_2_1_36_1","first-page":"1","article-title":"High-performance effective scientific error-bounded lossy compression with auto-tuned multi-component interpolation","volume":"2","author":"Liu Jinyang","year":"2024","unstructured":"Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Sian Jin, Zizhe Jian, Jiajun Huang, Shixun Wu, Zizhong Chen, and Franck Cappello. 2024. High-performance effective scientific error-bounded lossy compression with auto-tuned multi-component interpolation. Proceedings of the ACM on Management of Data 2, 1 (2024), 1\u201327.","journal-title":"Proceedings of the ACM on Management of Data"},{"key":"e_1_2_1_37_1","doi-asserted-by":"crossref","first-page":"181","DOI":"10.1016\/j.ascom.2015.07.002","article-title":"A compression scheme for radio data in high performance computing","volume":"12","author":"Masui Kiyoshi","year":"2015","unstructured":"Kiyoshi Masui, Mandana Amiri, Liam Connor, Meiling Deng, Mateus Fandino, Carolin H\u00f6fer, Mark Halpern, David Hanna, Adam D Hincks, Gary Hinshaw, et al. 2015. A compression scheme for radio data in high performance computing. Astronomy and Computing 12 (2015), 181\u2013190.","journal-title":"Astronomy and Computing"},{"key":"e_1_2_1_38_1","first-page":"12","article-title":"Gorilla: A Fast, Scalable, in-Memory Time Series Database","volume":"8","author":"Tuomas Pelkonen","year":"2015","unstructured":"Tuomas Pelkonen et al. 2015. Gorilla: A Fast, Scalable, in-Memory Time Series Database. Proc. VLDB Endow. 8, 12 (Aug. 2015), 1816\u20131827.","journal-title":"Proc. VLDB Endow."},{"key":"e_1_2_1_39_1","doi-asserted-by":"crossref","unstructured":"X Carol Song Preston Smith Rajesh Kalyanam Xiao Zhu Eric Adams Kevin Colby Patrick Finnegan Erik Gough Elizabett Hillery Rick Irvine et al. 2022. Anvil-system architecture and experiences from deployment and early user operations. In Practice and experience in advanced research computing. 1\u20139.","DOI":"10.1145\/3491418.3530766"},{"key":"e_1_2_1_40_1","volume-title":"2022 IEEE\/ACM 8th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD). IEEE, 44\u201353","author":"Su Zhaoyuan","year":"2022","unstructured":"Zhaoyuan Su, Sheng Di, Ali Murat Gok, Yue Cheng, and Franck Cappello. 2022. Understanding impact of lossy compression on derivative-related metrics in scientific datasets. In 2022 IEEE\/ACM 8th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD). IEEE, 44\u201353."},{"volume-title":"Integrated circuit and systems, algorithms and architectures.","author":"Sze Vivienne","key":"e_1_2_1_41_1","unstructured":"Vivienne Sze, Madhukar Budagavi, and Gary J Sullivan. 2014. High efficiency video coding (HEVC). In Integrated circuit and systems, algorithms and architectures. Vol. 39. Springer, 40."},{"key":"e_1_2_1_42_1","volume-title":"2017 IEEE International Parallel and Distributed Processing Symposium. IEEE, 1129\u20131139","author":"Tao Dingwen","year":"2017","unstructured":"Dingwen Tao, Sheng Di, Zizhong Chen, and Franck Cappello. 2017. Significantly improving lossy compression for scientific data sets based on multidimensional prediction and error-controlled quantization. In 2017 IEEE International Parallel and Distributed Processing Symposium. IEEE, 1129\u20131139."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1177\/1094342017737147"},{"key":"e_1_2_1_44_1","doi-asserted-by":"crossref","first-page":"1336","DOI":"10.1109\/JPROC.2002.800725","article-title":"JPEG2000: Standard for interactive imaging","volume":"90","author":"Taubman David S","year":"2002","unstructured":"David S Taubman and Michael W Marcellin. 2002. JPEG2000: Standard for interactive imaging. Proc. IEEE 90, 8 (2002), 1336\u20131357.","journal-title":"Proc. IEEE"},{"key":"e_1_2_1_45_1","volume-title":"cuSZ (x): Optimizing Error-Bounded Lossy Compression for Scientific Data on GPUs. CoRR","author":"Tian Jiannan","year":"2021","unstructured":"Jiannan Tian, Sheng Di, Xiaodong Yu, Cody Rivera, Kai Zhao, Sian Jin, Yunhe Feng, Xin Liang, Dingwen Tao, and Franck Cappello. 2021. cuSZ (x): Optimizing Error-Bounded Lossy Compression for Scientific Data on GPUs. CoRR (2021)."},{"key":"e_1_2_1_46_1","volume-title":"Proceedings of the ACM International Conference on Parallel Architectures and Compilation Techniques","author":"Tian Jiannan","year":"2020","unstructured":"Jiannan Tian, Sheng Di, Kai Zhao, Cody Rivera, Megan Hickman Fulp, Robert Underwood, Sian Jin, Xin Liang, Jon Calhoun, Dingwen Tao, and Franck Cappello. 2020. cuSZ: An Efficient GPU-Based Error-Bounded Lossy Compression Framework for Scientific Data. In Proceedings of the ACM International Conference on Parallel Architectures and Compilation Techniques (Virtual Event, GA, USA) (PACT '20). Association for Computing Machinery, New York, NY, USA, 3\u201315. 10.1145\/3410463.3414624"},{"key":"e_1_2_1_47_1","volume-title":"OptZConfig: Efficient Parallel Optimization of Lossy Compression Configuration","author":"Underwood Robert","year":"2022","unstructured":"Robert Underwood, Jon C Calhoun, Sheng Di, Amy Apon, and Franck Cappello. 2022. OptZConfig: Efficient Parallel Optimization of Lossy Compression Configuration. IEEE Transactions on Parallel and Distributed Systems (2022)."},{"key":"e_1_2_1_48_1","volume-title":"2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, 567\u2013577","author":"Underwood Robert","year":"2020","unstructured":"Robert Underwood, Sheng Di, Jon C Calhoun, and Franck Cappello. 2020. Fraz: A generic high-fidelity fixed-ratio lossy compression framework for scientific floating-point data. In 2020 IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE, 567\u2013577."},{"volume-title":"High-dimensional probability: An introduction with applications in data science","author":"Vershynin Roman","key":"e_1_2_1_49_1","unstructured":"Roman Vershynin. 2018. High-dimensional probability: An introduction with applications in data science. Vol. 47. Cambridge university press."},{"volume-title":"High-dimensional statistics: A non-asymptotic viewpoint","author":"Wainwright Martin J","key":"e_1_2_1_50_1","unstructured":"Martin J Wainwright. 2019. High-dimensional statistics: A non-asymptotic viewpoint. Vol. 48. Cambridge university press."},{"key":"e_1_2_1_51_1","doi-asserted-by":"crossref","first-page":"30","DOI":"10.1145\/103085.103089","article-title":"The JPEG still picture compression standard","volume":"34","author":"Wallace Gregory K","year":"1991","unstructured":"Gregory K Wallace. 1991. The JPEG still picture compression standard. Commun. ACM 34, 4 (1991), 30\u201344.","journal-title":"Commun. ACM"},{"key":"e_1_2_1_52_1","doi-asserted-by":"crossref","first-page":"560","DOI":"10.1109\/TCSVT.2003.815165","article-title":"Overview of the H. 264\/AVC video coding standard","volume":"13","author":"Wiegand Thomas","year":"2003","unstructured":"Thomas Wiegand, Gary J Sullivan, Gisle Bjontegaard, and Ajay Luthra. 2003. Overview of the H. 264\/AVC video coding standard. IEEE Transactions on circuits and systems for video technology 13, 7 (2003), 560\u2013576.","journal-title":"IEEE Transactions on circuits and systems for video technology"},{"key":"e_1_2_1_53_1","volume-title":"2024 SC24: International Conference for High Performance Computing, Networking, Storage and Analysis SC. IEEE Computer Society, 1368\u20131383","author":"Wu Xuan","year":"2024","unstructured":"Xuan Wu, Qian Gong, Jieyang Chen, Qing Liu, Norbert Podhorszki, Xin Liang, and Scott Klasky. 2024. Error-controlled Progressive Retrieval of Scientific Data under Derivable Quantities of Interest. In 2024 SC24: International Conference for High Performance Computing, Networking, Storage and Analysis SC. IEEE Computer Society, 1368\u20131383."},{"key":"e_1_2_1_54_1","doi-asserted-by":"crossref","unstructured":"Mingze Xia Sheng Di Franck Cappello Pu Jiao Kai Zhao Jinyang Liu Xuan Wu Xin Liang and Hanqi Guo. 2024. Preserving Topological Feature with Sign-of-Determinant Predicates in Lossy Compression: A Case Study of Vector Field Critical Points. In 2024 IEEE 40th International Conference on Data Engineering (ICDE). IEEE 4979\u20134992.","DOI":"10.1109\/ICDE60146.2024.00378"},{"key":"e_1_2_1_55_1","volume-title":"TopoSZ: Preserving topology in error-bounded lossy compression","author":"Yan Lin","year":"2023","unstructured":"Lin Yan, Xin Liang, Hanqi Guo, and Bei Wang. 2023. TopoSZ: Preserving topology in error-bounded lossy compression. IEEE Transactions on Visualization and Computer Graphics (2023)."},{"key":"e_1_2_1_56_1","volume-title":"2021 IEEE 37th International Conference on Data Engineering (ICDE). IEEE, 1679\u20131690","author":"Zhang Feng","year":"2021","unstructured":"Feng Zhang, Zaifeng Pan, Yanliang Zhou, Jidong Zhai, Xipeng Shen, Onur Mutlu, and Xiaoyong Du. 2021. G-TADOC: Enabling efficient GPU-based text analytics without decompression. In 2021 IEEE 37th International Conference on Data Engineering (ICDE). IEEE, 1679\u20131690."},{"key":"e_1_2_1_57_1","doi-asserted-by":"crossref","first-page":"1522","DOI":"10.14778\/3236187.3236203","article-title":"Efficient document analytics on compressed data: Method, challenges, algorithms, insights","volume":"11","author":"Zhang Feng","year":"2018","unstructured":"Feng Zhang, Jidong Zhai, Xipeng Shen, Onur Mutlu, and Wenguang Chen. 2018. Efficient document analytics on compressed data: Method, challenges, algorithms, insights. Proceedings of the VLDB Endowment 11, 11 (2018), 1522\u20131535.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_2_1_58_1","volume-title":"2021 IEEE 37th International Conference on Data Engineering (ICDE). 1643\u20131654","author":"Zhao Kai","year":"2021","unstructured":"Kai Zhao, Sheng Di, Maxim Dmitriev, Thierry-Laurent D. Tonellot, Zizhong Chen, and Franck Cappello. 2021. Optimizing Error-Bounded Lossy Compression for Scientific Data by Dynamic Spline Interpolation. In 2021 IEEE 37th International Conference on Data Engineering (ICDE). 1643\u20131654. 10.1109\/ICDE51399.2021.00145"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3742728.3742739","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,3]],"date-time":"2025-09-03T13:36:33Z","timestamp":1756906593000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3742728.3742739"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,4]]},"references-count":58,"journal-issue":{"issue":"8","published-print":{"date-parts":[[2025,4]]}},"alternative-id":["10.14778\/3742728.3742739"],"URL":"https:\/\/doi.org\/10.14778\/3742728.3742739","relation":{},"ISSN":["2150-8097"],"issn-type":[{"type":"print","value":"2150-8097"}],"subject":[],"published":{"date-parts":[[2025,4]]},"assertion":[{"value":"2025-09-03","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}