{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,8,30]],"date-time":"2025-08-30T00:06:20Z","timestamp":1756512380277,"version":"3.44.0"},"reference-count":109,"publisher":"Association for Computing Machinery (ACM)","issue":"7","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2025,3]]},"abstract":"<jats:p>\n            Cardinality estimation is a fundamental functionality in database systems. Most existing cardinality estimators focus on handling predicates over numeric or categorical data. They have largely omitted an important data type, set-valued data, which frequently occur in contemporary applications such as information retrieval and recommender systems. The few existing estimators for such data either favor high-frequency elements or rely on a\n            <jats:italic toggle=\"yes\">partial independence assumption<\/jats:italic>\n            , which limits their practical applicability.\n          <\/jats:p>\n          <jats:p>We propose ACE, an Attention-based Cardinality Estimator for estimating the cardinality of queries over set-valued data. We first design a distillation-based data encoder to condense the dataset into a compact matrix. We then design an attention-based query analyzer to capture correlations among query elements. To handle variable-sized queries, a pooling module is introduced, followed by a regression model (MLP) to generate final cardinality estimates. We evaluate ACE on three datasets with varying query element distributions, demonstrating that ACE outperforms the state-of-the-art competitors in terms of both accuracy and efficiency.<\/jats:p>","DOI":"10.14778\/3734839.3734848","type":"journal-article","created":{"date-parts":[[2025,8,29]],"date-time":"2025-08-29T16:01:06Z","timestamp":1756483266000},"page":"2112-2125","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["ACE: A Cardinality Estimator for Set-Valued Queries"],"prefix":"10.14778","volume":"18","author":[{"given":"Yufan","family":"Sheng","sequence":"first","affiliation":[{"name":"University of New South Wales, Sydney, Australia"}]},{"given":"Xin","family":"Cao","sequence":"additional","affiliation":[{"name":"University of New South Wales, Sydney, Australia"}]},{"given":"Kaiqi","family":"Zhao","sequence":"additional","affiliation":[{"name":"The University of Auckland, Auckland, New Zealand"}]},{"given":"Yixiang","family":"Fang","sequence":"additional","affiliation":[{"name":"The Chinese University of Hong Kong, Shenzhen, China"}]},{"given":"Jianzhong","family":"Qi","sequence":"additional","affiliation":[{"name":"The University of Melbourne, Melbourne, Australia"}]},{"given":"Wenjie","family":"Zhang","sequence":"additional","affiliation":[{"name":"University of New South Wales, Sydney, Australia"}]},{"given":"Christian S.","family":"Jensen","sequence":"additional","affiliation":[{"name":"Aalborg University, Aalborg, Denmark"}]}],"member":"320","published-online":{"date-parts":[[2025,8,29]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"crossref","first-page":"e363","DOI":"10.1002\/sta4.363","article-title":"RankFromSets: Scalable set recommendation with optimal recall","volume":"10","author":"Altosaar Jaan","year":"2021","unstructured":"Jaan Altosaar, Rajesh Ranganath, and Wesley Tansey. 2021. RankFromSets: Scalable set recommendation with optimal recall. Stat 10, 1 (2021), e363.","journal-title":"Stat"},{"key":"e_1_2_1_2_1","unstructured":"Alvin. 2021. Recipes and Reviews. https:\/\/www.kaggle.com\/datasets\/irkaal\/foodcom-recipes-and-reviews"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/3639309"},{"key":"e_1_2_1_4_1","volume-title":"Jamie Ryan Kiros, and Geoffrey E. Hinton","author":"Ba Jimmy Lei","year":"2016","unstructured":"Jimmy Lei Ba, Jamie Ryan Kiros, and Geoffrey E. Hinton. 2016. Layer Normalization. arXiv preprint arXiv:1607.06450 (2016)."},{"key":"e_1_2_1_5_1","unstructured":"Dzmitry Bahdanau Kyunghyun Cho and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In ICLR."},{"key":"e_1_2_1_6_1","volume-title":"Pattern recognition and machine learning","author":"Bishop Christopher M.","unstructured":"Christopher M. Bishop. 2006. Pattern recognition and machine learning. Springer New York."},{"key":"e_1_2_1_7_1","doi-asserted-by":"crossref","unstructured":"Lucas Bourtoule Varun Chandrasekaran Christopher A Choquette-Choo Hengrui Jia Adelin Travers Baiwu Zhang David Lie and Nicolas Papernot. 2021. Machine unlearning. In S&P. 141\u2013159.","DOI":"10.1109\/SP40001.2021.00019"},{"key":"e_1_2_1_8_1","doi-asserted-by":"crossref","unstructured":"S. Castro P. Meena Kumari S. Muthumari and J. Suganthi. 2023. Information Retrieval using Set-based Model Methods Tools and Applications in Medical Data Analysis. Machine Learning for Healthcare Systems: Foundations and Applications (2023) 187.","DOI":"10.1201\/9781003438816-11"},{"key":"e_1_2_1_9_1","doi-asserted-by":"crossref","unstructured":"Lisi Chen Gao Cong Xin Cao and Kian-Lee Tan. 2015. Temporal spatial-keyword top-k publish\/subscribe. In ICDE. 255\u2013266.","DOI":"10.1109\/ICDE.2015.7113289"},{"key":"e_1_2_1_10_1","doi-asserted-by":"crossref","unstructured":"Yankai Chen Yixiang Fang Yifei Zhang and Irwin King. 2023. Bipartite graph convolutional hashing for effective and efficient top-n search in hamming space. In WWW. 3164\u20133172.","DOI":"10.1145\/3543507.3583219"},{"key":"e_1_2_1_11_1","unstructured":"Krzysztof Choromanski Valerii Likhosherstov David Dohan Xingyou Song Andreea Gane Tamas Sarlos Peter Hawkins Jared Davis Afroz Mohiuddin Lukasz Kaiser et al. 2021. Rethinking attention with performers. In ICLR."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1145\/376284.375685"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.14778\/3329772.3329780"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.ins.2016.08.084"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11390-021-1350-8"},{"key":"e_1_2_1_16_1","doi-asserted-by":"crossref","unstructured":"Lise Getoor Benjamin Taskar and Daphne Koller. 2001. Selectivity estimation using probabilistic models. In SIGMOD. 461\u2013472.","DOI":"10.1145\/375663.375727"},{"key":"e_1_2_1_17_1","volume-title":"Pooling methods in deep neural networks, a review. arXiv preprint arXiv:2009.07485","author":"Gholamalinezhad Hossein","year":"2020","unstructured":"Hossein Gholamalinezhad and Hossein Khosravi. 2020. Pooling methods in deep neural networks, a review. arXiv preprint arXiv:2009.07485 (2020)."},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/3377000.3377005"},{"key":"e_1_2_1_19_1","doi-asserted-by":"crossref","unstructured":"Arthur Gretton Karsten Borgwardt Malte Rasch Bernhard Sch\u00f6lkopf and Alex Smola. 2006. A kernel method for the two-sample-problem. In NIPS. 513\u2013520.","DOI":"10.7551\/mitpress\/7503.003.0069"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/335191.335448"},{"key":"e_1_2_1_21_1","volume-title":"Swami","author":"Haas Peter J.","year":"1994","unstructured":"Peter J. Haas, Jeffrey F. Naughton, and Arun N. Swami. 1994. On the relative cost of sampling for join selectivity estimation. In PODS. 14\u201324."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.14778\/1453856.1453883"},{"key":"e_1_2_1_23_1","doi-asserted-by":"crossref","unstructured":"Max Halford Philippe Saint-Pierre and Franck Morvan. 2019. An approach based on bayesian networks for query selectivity estimation. In DASFAA. 3\u201319.","DOI":"10.1007\/978-3-030-18579-4_1"},{"key":"e_1_2_1_24_1","unstructured":"Will Hamilton Zhitao Ying and Jure Leskovec. 2017. Inductive representation learning on large graphs. In NIPS. 1025\u20131035."},{"key":"e_1_2_1_25_1","first-page":"752","article-title":"Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation","volume":"15","author":"Han Yuxing","year":"2022","unstructured":"Yuxing Han, Ziniu Wu, Peizhi Wu, Rong Zhu, Jingyi Yang, Tan Wei Liang, Kai Zeng, Gao Cong, Yanzhao Qin, Andreas Pfadler, Zhengping Qian, Jingren Zhou, Jiangneng Li, and Bin Cui. 2022. Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation. Proceedings of the VLDB Endowment 15,4 (2022), 752\u2013765.","journal-title":"Proceedings of the VLDB Endowment"},{"key":"e_1_2_1_26_1","unstructured":"Yu Hao Xin Cao Yufan Sheng Yixiang Fang and Wei Wang. 2021. KS-GNN: Keywords Search Over Incomplete Graphs via Graphs Neural Network. In NeurIPS. 1700\u20131712."},{"key":"e_1_2_1_27_1","doi-asserted-by":"crossref","unstructured":"Shohedul Hasan Saravanan Thirumuruganathan Jees Augustine Nick Koudas and Gautam Das. 2020. Deep learning models for selectivity estimation of multi-attribute queries. In SIGMOD. 1035\u20131050.","DOI":"10.1145\/3318464.3389741"},{"key":"e_1_2_1_28_1","volume-title":"Friedman","author":"Hastie Trevor","year":"2009","unstructured":"Trevor Hastie, Robert Tibshirani, Jerome H. Friedman, and Jerome H. Friedman. 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Vol. 2. Springer."},{"key":"e_1_2_1_29_1","doi-asserted-by":"crossref","unstructured":"Kaiming He Xiangyu Zhang Shaoqing Ren and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR. 770\u2013778.","DOI":"10.1109\/CVPR.2016.90"},{"key":"e_1_2_1_30_1","doi-asserted-by":"crossref","unstructured":"Benjamin Hilprecht Andreas Schmidt Moritz Kulessa Alejandro Molina Kristian Kersting and Carsten Binnig. 2020. DeepDB: Learn from Data not from Queries! Proceedings of the VLDB Endowment 13 7 992\u20131005.","DOI":"10.14778\/3384345.3384349"},{"key":"e_1_2_1_31_1","unstructured":"IBM. 2024. Array Types and Values. https:\/\/www.ibm.com\/docs\/en\/db2-for-zos\/13?topic=types-array-values"},{"key":"e_1_2_1_32_1","unstructured":"IMDb. 2024. Non-Commercial Datasets. https:\/\/developer.imdb.com\/non-commercial-datasets\/"},{"key":"e_1_2_1_33_1","doi-asserted-by":"crossref","unstructured":"Yannis Ioannidis. 2003. The history of histograms (abridged). In VLDB. 19\u201330.","DOI":"10.1016\/B978-012722442-8\/50011-2"},{"key":"e_1_2_1_34_1","doi-asserted-by":"crossref","unstructured":"Yannis E Ioannidis and Stavros Christodoulakis. 1991. On the propagation of errors in the size of join results. In SIGMOD. 268\u2013277.","DOI":"10.1145\/119995.115835"},{"key":"e_1_2_1_35_1","volume-title":"Perceiver: General perception with iterative attention. In ICML. 4651\u20134664.","author":"Jaegle Andrew","year":"2021","unstructured":"Andrew Jaegle, Felix Gimeno, Andy Brock, Oriol Vinyals, Andrew Zisserman, and Joao Carreira. 2021. Perceiver: General perception with iterative attention. In ICML. 4651\u20134664."},{"key":"e_1_2_1_36_1","volume-title":"Deep Neural Networks as Gaussian Processes. ICLR","author":"Jaehoon Lee","year":"2018","unstructured":"Lee Jaehoon, Bahri Yasaman, Novak Roman, Schoenholz Sam, Pennington Jeffrey, and Sohl-dickstein Jascha. 2018. Deep Neural Networks as Gaussian Processes. ICLR (2018)."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.14778\/3151106.3151112"},{"key":"e_1_2_1_38_1","volume-title":"Adam: A Method for Stochastic Optimization. In ICLR.","author":"Kingma Diederik P","year":"2015","unstructured":"Diederik P Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In ICLR."},{"key":"e_1_2_1_39_1","volume-title":"Designing Data-intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems","author":"Kleppmann Martin","unstructured":"Martin Kleppmann. 2017. Designing Data-intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems. O'Reilly Media, Inc."},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.14257\/ijdta.2016.9.10.24"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1090\/conm\/352\/06369"},{"key":"e_1_2_1_42_1","volume-title":"Jeffrey Dean, and Neoklis Polyzotis.","author":"Kraska Tim","year":"2018","unstructured":"Tim Kraska, Alex Beutel, Ed H. Chi, Jeffrey Dean, and Neoklis Polyzotis. 2018. The case for learned index structures. In SIGMOD. 489\u2013504."},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.chb.2015.05.027"},{"key":"e_1_2_1_44_1","unstructured":"Balaji Lakshminarayanan Alexander Pritzel and Charles Blundell. 2017. Simple and scalable predictive uncertainty estimation using deep ensembles. In NIPS. 6405\u20136416."},{"key":"e_1_2_1_45_1","doi-asserted-by":"crossref","unstructured":"Geon Lee Chanyoung Park and Kijung Shin. 2022. Set2Box: Similarity Preserving Representation Learning for Sets. In ICDM. 1023\u20131028.","DOI":"10.1109\/ICDM54844.2022.00125"},{"key":"e_1_2_1_46_1","unstructured":"Juho Lee Yoonho Lee Jungtaek Kim Adam Kosiorek Seungjin Choi and Yee Whye Teh. 2019. Set transformer: A framework for attention-based permutation-invariant neural networks. In ICML. 3744\u20133753."},{"key":"e_1_2_1_47_1","doi-asserted-by":"publisher","DOI":"10.14778\/2850583.2850594"},{"key":"e_1_2_1_48_1","doi-asserted-by":"crossref","unstructured":"Feifei Li Bin Wu Ke Yi and Zhuoyue Zhao. 2016. Wander join: Online aggregation for joins. In SIGMOD. 2121\u20132124.","DOI":"10.1145\/2882903.2899413"},{"key":"e_1_2_1_49_1","doi-asserted-by":"crossref","unstructured":"Guang Li Ren Togo Takahiro Ogawa and Miki Haseyama. 2020. Soft-label anonymous gastric x-ray image distillation. In ICIP. 305\u2013309.","DOI":"10.1109\/ICIP40778.2020.9191357"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.14778\/3626292.3626302"},{"key":"e_1_2_1_51_1","unstructured":"Shuyang Li. 2020. Recipes and Interactions. https:\/\/www.kaggle.com\/datasets\/shuyangli94\/food-com-recipes-and-user-interactions"},{"key":"e_1_2_1_52_1","volume-title":"SHARE: A System for Hierarchical Assistive Recipe Editing. In EMNLP. 11077\u201311090.","author":"Li Shuyang","year":"2022","unstructured":"Shuyang Li, Yufei Li, Jianmo Ni, and Julian McAuley. 2022. SHARE: A System for Hierarchical Assistive Recipe Editing. In EMNLP. 11077\u201311090."},{"key":"e_1_2_1_53_1","volume-title":"Schneider","author":"Lipton Richard J.","year":"1990","unstructured":"Richard J. Lipton, Jeffrey F. Naughton, and Donovan A. Schneider. 1990. Practical selectivity estimation through adaptive sampling. In SIGMOD. 1\u201311."},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.14778\/3476249.3476254"},{"key":"e_1_2_1_55_1","unstructured":"Tianyi Liu Minshuo Chen Mo Zhou Simon S. Du Enlu Zhou and Tuo Zhao. 2019. Towards understanding the importance of shortcut connections in residual networks. In NeurIPS. 7892\u20137902."},{"key":"e_1_2_1_56_1","unstructured":"Xiaodong Liu Pengcheng He Weizhu Chen and Jianfeng Gao. 2019. MultiTask Deep Neural Networks for Natural Language Understanding. In ACL. 4487\u20134496."},{"key":"e_1_2_1_57_1","volume-title":"Jordan","author":"Long Mingsheng","year":"2017","unstructured":"Mingsheng Long, Han Zhu, Jianmin Wang, and Michael I. Jordan. 2017. Deep transfer learning with joint adaptation networks. In ICML. 2208\u20132217."},{"key":"e_1_2_1_58_1","unstructured":"Bodhisattwa Prasad Majumder Shuyang Li Jianmo Ni and Julian McAuley. 2019. Generating Personalized Recipes from Historical User Preferences. In EMNLP-IJCNLP. 5976\u20135982."},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3626755"},{"key":"e_1_2_1_60_1","unstructured":"Guido Moerkotte and Axel Hertzschuch. 2020. alpha to omega: the G(r)eek Alphabet of Sampling. In CIDR."},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.14778\/1687627.1687738"},{"key":"e_1_2_1_62_1","volume-title":"DeWitt","author":"Muralikrishna M.","year":"1988","unstructured":"M. Muralikrishna and David J. DeWitt. 1988. Equi-depth multidimensional histograms. In SIGMOD. 28\u201336."},{"key":"e_1_2_1_63_1","unstructured":"MySQL. 2017. SET Data Type. http:\/\/download.nust.na\/pub6\/mysql\/tech-resources\/articles\/mysql-set-datatype.html"},{"key":"e_1_2_1_64_1","unstructured":"Oracle. 2023. PL\/SQL Collections and Records. https:\/\/docs.oracle.com\/en\/database\/oracle\/oracle-database\/23\/lnpls\/plsql-collections-and-records.html"},{"key":"e_1_2_1_65_1","unstructured":"Adam Paszke Sam Gross Francisco Massa Adam Lerer James Bradbury Gregory Chanan Trevor Killeen Zeming Lin Natalia Gimelshein Luca Antiga et al. 2019. PyTorch: An imperative style high-performance deep learning library. In NeurIPS. 8026\u20138037."},{"key":"e_1_2_1_66_1","volume-title":"ICCV Workshops. 689\u2013690","author":"Poon Hoifung","year":"2011","unstructured":"Hoifung Poon and Pedro Domingos. 2011. Sum-product networks: A new deep architecture. In ICCV Workshops. 689\u2013690."},{"key":"e_1_2_1_67_1","unstructured":"Viswanath Poosala and Yannis E Ioannidis. 1997. Selectivity estimation without the attribute value independence assumption. In VLDB. 486\u2013495."},{"key":"e_1_2_1_68_1","unstructured":"PostgreSQL. 2024. Array Functions and Operators. https:\/\/www.postgresql.org\/docs\/17\/functions-array.html"},{"key":"e_1_2_1_69_1","unstructured":"UNSW Sydney PVC (Research Infrastructure). 2010. Katana. (2010)."},{"key":"e_1_2_1_70_1","volume-title":"Online algorithms and stochastic approximations","author":"Saad David","unstructured":"David Saad. 1998. Online algorithms and stochastic approximations. Vol. 5. 6."},{"key":"e_1_2_1_71_1","doi-asserted-by":"crossref","unstructured":"P Griffiths Selinger Morton M Astrahan Donald D Chamberlin Raymond A Lorie and Thomas G Price. 1979. Access path selection in a relational database management system. In SIGMOD. 23\u201334.","DOI":"10.1145\/582095.582099"},{"key":"e_1_2_1_72_1","unstructured":"Andreas Seltenreich Bo Tang and Sjoerd Mullender. 2022. Bug Squashing with SQLsmith. https:\/\/github.com\/anse1\/sqlsmith"},{"key":"e_1_2_1_73_1","unstructured":"SQL server. 2023. Table-valued Parameters in SQL Server. https:\/\/learn.microsoft.com\/en-us\/sql\/relational-databases\/tables\/use-table-valued-parameters-database-engine?view=sql-server-ver16"},{"key":"e_1_2_1_74_1","volume-title":"GLU Variants Improve Transformer. arXiv preprint arXiv:2002.05202","author":"Shazeer Noam","year":"2020","unstructured":"Noam Shazeer. 2020. GLU Variants Improve Transformer. arXiv preprint arXiv:2002.05202 (2020)."},{"key":"e_1_2_1_75_1","doi-asserted-by":"crossref","unstructured":"Zhuoran Shen Mingyuan Zhang Haiyu Zhao Shuai Yi and Hongsheng Li. 2021. Efficient attention: Attention with linear complexities. In WACV. 3531\u20133539.","DOI":"10.1109\/WACV48630.2021.00357"},{"key":"e_1_2_1_76_1","doi-asserted-by":"publisher","DOI":"10.1145\/3589332"},{"key":"e_1_2_1_77_1","volume-title":"Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al.","author":"Silver David","year":"2016","unstructured":"David Silver, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, et al. 2016. Mastering the Game of Go with Deep Neural Networks and Tree Search. Nature 529, 7587 (2016), 484\u2013489."},{"key":"e_1_2_1_78_1","doi-asserted-by":"crossref","first-page":"4067581","DOI":"10.1155\/2022\/4067581","article-title":"Pooling operations in deep learning: from \"invariable\"' to \"variable","volume":"2022","author":"Tao Zhou","year":"2022","unstructured":"Zhou Tao, Chang XiaoYu, Lu HuiLing, Ye XinYu, Liu YunCan, and Zheng XiaoMin. 2022. Pooling operations in deep learning: from \"invariable\"' to \"variable\"'. BioMed Research International 2022, 1 (2022), 4067581.","journal-title":"'. BioMed Research International"},{"key":"e_1_2_1_79_1","volume-title":"Augmenting Convolutional networks with attention-based aggregation. arXiv preprint arXiv:2112.13692","author":"Touvron Hugo","year":"2021","unstructured":"Hugo Touvron, Matthieu Cord, Alaaeldin El-Nouby, Piotr Bojanowski, Armand Joulin, Gabriel Synnaeve, and Herv\u00e9 J\u00e9gou. 2021. Augmenting Convolutional networks with attention-based aggregation. arXiv preprint arXiv:2112.13692 (2021)."},{"key":"e_1_2_1_80_1","doi-asserted-by":"publisher","DOI":"10.14778\/3402707.3402724"},{"key":"e_1_2_1_81_1","unstructured":"Ashish Vaswani Noam Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan N. Gomez \u0141ukasz Kaiser and Illia Polosukhin. 2017. Attention is all you need. In NIPS. 6000\u20136010."},{"key":"e_1_2_1_82_1","doi-asserted-by":"publisher","DOI":"10.1145\/2629489"},{"key":"e_1_2_1_83_1","doi-asserted-by":"crossref","unstructured":"Fei Wang Mengqing Jiang Chen Qian Shuo Yang Cheng Li Honggang Zhang Xiaogang Wang and Xiaoou Tang. 2017. Residual attention network for image classification. In CVPR. 3156\u20133164.","DOI":"10.1109\/CVPR.2017.683"},{"key":"e_1_2_1_84_1","volume-title":"Conference of the Centre for Advanced Studies on Collaborative research. 328\u2013342","author":"Wang Hai","unstructured":"Hai Wang and Kenneth C. Sevcik. 2003. A multi-dimensional histogram for selectivity estimation and fast approximate query answering. In Conference of the Centre for Advanced Studies on Collaborative research. 328\u2013342."},{"key":"e_1_2_1_85_1","volume-title":"arXiv preprint arXiv:1811.10959","author":"Wang Tongzhou","year":"2018","unstructured":"Tongzhou Wang, Jun-Yan Zhu, Antonio Torralba, and Alexei A Efros. 2018. Dataset Distillation. arXiv preprint arXiv:1811.10959 (2018)."},{"key":"e_1_2_1_86_1","doi-asserted-by":"crossref","first-page":"4594","DOI":"10.1109\/TGRS.2018.2829625","article-title":"A Deep Neural Network With Spatial Pooling (DNNSP) for 3-D Point Cloud Classification","volume":"56","author":"Wang Zhen","year":"2018","unstructured":"Zhen Wang, Liqiang Zhang, Liang Zhang, Roujing Li, Yibo Zheng, and Zidong Zhu. 2018. A Deep Neural Network With Spatial Pooling (DNNSP) for 3-D Point Cloud Classification. IEEE Transactions on Geoscience and Remote Sensing 56, 8 (2018), 4594\u20134604.","journal-title":"IEEE Transactions on Geoscience and Remote Sensing"},{"key":"e_1_2_1_87_1","unstructured":"Benjamin Warner. 2022. Tinkering With Attention Pooling. https:\/\/benjaminwarner.dev\/2022\/07\/14\/tinkering-with-attention-pooling."},{"key":"e_1_2_1_88_1","unstructured":"Alexander Wei. 2024. Recipes with Ingredients and Tags. https:\/\/www.kaggle.com\/datasets\/realalexanderwei\/food-com-recipes-with-ingredients-and-tags"},{"key":"e_1_2_1_89_1","unstructured":"Widenex. 2024. Widenex GPTs. https:\/\/gpts.widenex.com\/"},{"key":"e_1_2_1_90_1","doi-asserted-by":"crossref","unstructured":"Peizhi Wu and Gao Cong. 2021. A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation. In SIGMOD. 2009\u20132022.","DOI":"10.1145\/3448016.3452830"},{"key":"e_1_2_1_91_1","doi-asserted-by":"publisher","DOI":"10.14778\/3626292.3626300"},{"key":"e_1_2_1_92_1","volume-title":"BayesCard: Revitilizing Bayesian Frameworks for Cardinality Estimation. arXiv preprint arXiv:2012.14743","author":"Wu Ziniu","year":"2020","unstructured":"Ziniu Wu, Amir Shaikhha, Rong Zhu, Kai Zeng, Yuxing Han, and Jingren Zhou. 2020. BayesCard: Revitilizing Bayesian Frameworks for Cardinality Estimation. arXiv preprint arXiv:2012.14743 (2020)."},{"key":"e_1_2_1_93_1","doi-asserted-by":"crossref","unstructured":"Tong Xiao Yinqiao Li Jingbo Zhu Zhengtao Yu and Tongran Liu. 2019. Sharing Attention Weights for Fast Transformer. In IJCAI. 5292\u20135298.","DOI":"10.24963\/ijcai.2019\/735"},{"key":"e_1_2_1_94_1","doi-asserted-by":"crossref","unstructured":"Yuanyuan Xu Wenjie Zhang Ying Zhang Maria Orlowska and Xuemin Lin. 2024. TimeSGN: Scalable and Effective Temporal Graph Neural Network. In ICDE. 3297\u20133310.","DOI":"10.1109\/ICDE60146.2024.00255"},{"key":"e_1_2_1_95_1","volume-title":"Proceedings of the SIGKDD Conference on Knowledge Discovery and Data Mining (KDD","author":"Xu Yuanyuan","year":"2025","unstructured":"Yuanyuan Xu, Wenjie Zhang, Ying Zhang, Xiwei Xu, and Xuemin Lin. 2025. Fast and Accurate Temporal Hypergraph Representation for Hyperedge Prediction. In Proceedings of the SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2025). ACM."},{"key":"e_1_2_1_96_1","doi-asserted-by":"publisher","DOI":"10.1007\/s41019-019-00104-1"},{"key":"e_1_2_1_97_1","doi-asserted-by":"publisher","DOI":"10.14778\/3421424.3421432"},{"key":"e_1_2_1_98_1","doi-asserted-by":"publisher","DOI":"10.14778\/3368289.3368294"},{"key":"e_1_2_1_99_1","doi-asserted-by":"crossref","unstructured":"Rex Ying Ruining He Kaifeng Chen Pong Eksombatchai William L Hamilton and Jure Leskovec. 2018. Graph convolutional neural networks for web-scale recommender systems. In KDD. 974\u2013983.","DOI":"10.1145\/3219819.3219890"},{"key":"e_1_2_1_100_1","doi-asserted-by":"publisher","DOI":"10.3390\/app12178643"},{"key":"e_1_2_1_101_1","doi-asserted-by":"crossref","unstructured":"Hansong Zhang Shikun Li Pengju Wang Dan Zeng and Shiming Ge. 2024. M3D: Dataset Condensation by Minimizing Maximum Mean Discrepancy. In AAAI. 9314\u20139322.","DOI":"10.1609\/aaai.v38i8.28784"},{"key":"e_1_2_1_102_1","doi-asserted-by":"publisher","DOI":"10.1145\/3639292"},{"key":"e_1_2_1_103_1","volume-title":"MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction. In Findings of the Association for Computational Linguistics: ACL","author":"Zhang Linhan","year":"2022","unstructured":"Linhan Zhang, Qian Chen, Wen Wang, Chong Deng, ShiLiang Zhang, Bing Li, Wei Wang, and Xin Cao. 2022. MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction. In Findings of the Association for Computational Linguistics: ACL 2022. Association for Computational Linguistics, 396\u2013409."},{"key":"e_1_2_1_104_1","unstructured":"Yan Zhang Jonathon Hare and Adam Prugel-Bennett. 2019. Deep Set Prediction Networks. In NeurIPS. 3212\u20133222."},{"key":"e_1_2_1_105_1","doi-asserted-by":"crossref","unstructured":"Zeyu Zhang Jiamou Liu Kaiqi Zhao Song Yang Xianda Zheng and Yifei Wang. 2023. Contrastive learning for signed bipartite graphs. In SIGIR. 1629\u20131638.","DOI":"10.1145\/3539618.3591655"},{"key":"e_1_2_1_106_1","volume-title":"Konda Reddy Mopuri, and Hakan Bilen","author":"Zhao Bo","year":"2021","unstructured":"Bo Zhao, Konda Reddy Mopuri, and Hakan Bilen. 2021. Dataset Condensation with Gradient Matching. In ICLR."},{"key":"e_1_2_1_107_1","volume-title":"Zongyan He, Rui Li, and Hao Zhang.","author":"Zhao Kangfei","year":"2022","unstructured":"Kangfei Zhao, Jeffrey Xu Yu, Zongyan He, Rui Li, and Hao Zhang. 2022. Lightweight and accurate cardinality estimation by neural network gaussian process. In SIGMOD. 973\u2013987."},{"key":"e_1_2_1_108_1","volume-title":"GMAN: A graph multi-attention network for traffic prediction. In AAAI. 1234\u20131241.","author":"Zheng Chuanpan","year":"2020","unstructured":"Chuanpan Zheng, Xiaoliang Fan, Cheng Wang, and Jianzhong Qi. 2020. GMAN: A graph multi-attention network for traffic prediction. In AAAI. 1234\u20131241."},{"key":"e_1_2_1_109_1","doi-asserted-by":"publisher","DOI":"10.14778\/3461535.3461539"}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3734839.3734848","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,8,29]],"date-time":"2025-08-29T16:03:00Z","timestamp":1756483380000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3734839.3734848"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,3]]},"references-count":109,"journal-issue":{"issue":"7","published-print":{"date-parts":[[2025,3]]}},"alternative-id":["10.14778\/3734839.3734848"],"URL":"https:\/\/doi.org\/10.14778\/3734839.3734848","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2025,3]]},"assertion":[{"value":"2025-08-29","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}