{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,1,23]],"date-time":"2026-01-23T07:11:23Z","timestamp":1769152283085,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":28,"publisher":"ACM","license":[{"start":{"date-parts":[[2021,7,14]],"date-time":"2021-07-14T00:00:00Z","timestamp":1626220800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2021,7,14]]},"DOI":"10.1145\/3472163.3472273","type":"proceedings-article","created":{"date-parts":[[2021,9,7]],"date-time":"2021-09-07T16:46:57Z","timestamp":1631033217000},"page":"194-203","source":"Crossref","is-referenced-by-count":6,"title":["Analysis-oriented Metadata for Data Lakes"],"prefix":"10.1145","author":[{"given":"Yan","family":"Zhao","sequence":"first","affiliation":[{"name":"Universite Toulouse (Capitole) and Institut de Recherche en Informatique de Toulouse, France"}]},{"given":"Franck","family":"Ravat","sequence":"additional","affiliation":[{"name":"Universite Toulouse (Capitole) and Institut de Recherche en Informatique de Toulouse, France"}]},{"given":"Julien","family":"Aligon","sequence":"additional","affiliation":[{"name":"Universite Toulouse (Capitole) and Institut de Recherche en Informatique de Toulouse, France"}]},{"given":"Chantal","family":"Soule-dupuy","sequence":"additional","affiliation":[{"name":"Universite Toulouse (Capitole) and Institut de Recherche en Informatique de Toulouse, France"}]},{"given":"Gabriel","family":"Ferrettini","sequence":"additional","affiliation":[{"name":"Universite Toulouse (Capitole) and Institut de Recherche en Informatique de Toulouse, France"}]},{"given":"Imen","family":"Megdiche","sequence":"additional","affiliation":[{"name":"Institut de Recherche en Informatique de Toulouse, 
France"}]}],"member":"320","published-online":{"date-parts":[[2021,9,7]]},"reference":[{"key":"e_1_3_2_2_1_1","first-page":"125076","article-title":"Approaches to Multi-Objective Feature Selection","volume":"8","author":"Al-Tashi Qasem","year":"2020","journal-title":"A Systematic Literature Review. IEEE Access"},{"key":"e_1_3_2_2_2_1","volume-title":"Towards Information Profiling: Data Lake Content Metadata Management. In 2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW). 178\u2013185","author":"Alserafi Ayman","year":"2016"},{"key":"e_1_3_2_2_3_1","doi-asserted-by":"publisher","DOI":"10.5220\/0005876203310338"},{"key":"e_1_3_2_2_4_1","volume-title":"Dataset Discovery in Data Lakes. In 2020 IEEE 36th International Conference on Data Engineering (ICDE). IEEE Computer Society, 709\u2013720","author":"Bogatu Alex","year":"2020"},{"key":"e_1_3_2_2_5_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10796-020-10010-x"},{"key":"e_1_3_2_2_6_1","volume-title":"Proceedings of the 11th International Conference on Semantic Systems(SEMANTICS \u201915)","author":"Esteves Diego","year":"2015"},{"key":"e_1_3_2_2_7_1","volume-title":"Proceedings of the 28th International Conference on Neural Information Processing Systems -","volume":"2763","author":"Feurer Matthias","year":"2015"},{"key":"e_1_3_2_2_8_1","first-page":"11","article-title":"Does data warehouse end-user metadata add value?Commun","volume":"50","author":"Foshay Neil","year":"2007","journal-title":"ACM"},{"key":"e_1_3_2_2_9_1","first-page":"5","article-title":"Managing Google\u2019s data lake: an overview of the GOODS system. {IEEE} Data","volume":"39","author":"Halevy Alon","year":"2016","journal-title":"Eng. Bull."},{"key":"e_1_3_2_2_10_1","doi-asserted-by":"crossref","unstructured":"Frank Hutter Lars Kotthoff and Joaquin Vanschoren (Eds.). 2019. Automated Machine Learning: Methods Systems Challenges. Springer International Publishing. 
https:\/\/doi.org\/10.1007\/978-3-030-05318-5","DOI":"10.1007\/978-3-030-05318-5"},{"key":"e_1_3_2_2_11_1","doi-asserted-by":"publisher","DOI":"10.4324\/9780203892053"},{"key":"e_1_3_2_2_12_1","volume-title":"C (May","author":"Keet C\u00a0Maria","year":"2015"},{"key":"e_1_3_2_2_13_1","volume-title":"Towards Learned Metadata Extraction for Data Lakes","author":"Langenecker Sven","year":"1842"},{"key":"e_1_3_2_2_14_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2014.2327034"},{"key":"e_1_3_2_2_15_1","volume-title":"\u00a0F. de Carvalho","author":"Mantovani G.","year":"2019"},{"key":"e_1_3_2_2_16_1","doi-asserted-by":"crossref","unstructured":"Imen Megdiche Franck Ravat and Yan Zhao. 2021. Metadata Management on Data Processing in Data Lakes. In SOFSEM 2021: Theory and Practice of Computer Science(Lecture Notes in Computer Science) Tom\u00e1\u0161 Bure\u0161 Riccardo Dondi Johann Gamper Giovanna Guerrini Tomasz Jurdzi\u0144ski Claus Pahl Florian Sikora and Prudence\u00a0W.H. Wong (Eds.). Springer International Publishing Cham 553\u2013562. https:\/\/doi.org\/10.1007\/978-3-030-67731-2_40","DOI":"10.1007\/978-3-030-67731-2_40"},{"key":"e_1_3_2_2_17_1","volume-title":"Doing Data Science: Straight Talk from the Frontline. 
\u201dO\u2019Reilly Media","author":"O\u2019Neil Cathy"},{"key":"e_1_3_2_2_18_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10618-014-0363-0"},{"key":"e_1_3_2_2_19_1","volume-title":"Data Lakes: Trends and Perspectives. In Database and Expert Systems Applications(Lecture Notes in Computer Science), Sven Hartmann, Josef K\u00fcng, Sharma Chakravarthy, Gabriele Anderst-Kotsis, A\u00a0Min Tjoa","author":"Ravat Franck","year":"2019"},{"key":"e_1_3_2_2_20_1","volume-title":"New Trends in Databases and Information Systems(Communications in Computer and Information Science)","author":"Ravat Franck"},{"key":"e_1_3_2_2_21_1","volume-title":"AI 2016: Advances in Artificial Intelligence(Lecture Notes in Computer Science), Byeong\u00a0Ho Kangand Quan Bai (Eds.)","author":"Raynaut William"},{"key":"e_1_3_2_2_22_1","volume-title":"Why Should I Trust You?\u201d: Explaining the Predictions of Any Classifier. arXiv:1602.04938 [cs, stat] (Aug","author":"Ribeiro Marco\u00a0Tulio","year":"2016"},{"key":"e_1_3_2_2_23_1","volume-title":"\u00a0F. de Carvalho","author":"Rivolli Adriano","year":"2018"},{"key":"e_1_3_2_2_24_1","volume-title":"Skluma: An Extensible Metadata Extraction Pipeline for Disorganized Data. In 2018 IEEE 14th International Conference on e-Science (e-Science). 256\u2013266","author":"Skluzacek J.","year":"2018"},{"key":"e_1_3_2_2_25_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10462-019-09682-y"},{"key":"e_1_3_2_2_26_1","volume-title":"Data Warehousing and Knowledge Discovery(Lecture Notes in Computer Science), Ladjel Bellatreche and Mukesh\u00a0K","author":"Varga Jovan"},{"key":"e_1_3_2_2_27_1","volume-title":"Towards better understanding of meta-features contributions. 
arXiv:2002.04276 [cs, stat] (Feb","author":"Wo\u017anica Katarzyna","year":"2020"},{"key":"e_1_3_2_2_28_1","doi-asserted-by":"publisher","DOI":"10.1145\/3318464.3389726"}],"event":{"name":"IDEAS 2021: 25th International Database Engineering & Applications Symposium","location":"Montreal QC Canada","acronym":"IDEAS 2021"},"container-title":["25th International Database Engineering & Applications Symposium"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472163.3472273","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3472163.3472273","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T18:10:00Z","timestamp":1750183800000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3472163.3472273"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,7,14]]},"references-count":28,"alternative-id":["10.1145\/3472163.3472273","10.1145\/3472163"],"URL":"https:\/\/doi.org\/10.1145\/3472163.3472273","relation":{},"subject":[],"published":{"date-parts":[[2021,7,14]]}}}