{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,9]],"date-time":"2025-09-09T21:12:46Z","timestamp":1757452366606,"version":"3.44.0"},"publisher-location":"New York, NY, USA","reference-count":19,"publisher":"ACM","content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2025,6,22]]},"DOI":"10.1145\/3736229.3736263","type":"proceedings-article","created":{"date-parts":[[2025,9,2]],"date-time":"2025-09-02T10:07:28Z","timestamp":1756807648000},"page":"18-22","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["On fixing broken lineage"],"prefix":"10.1145","author":[{"ORCID":"https:\/\/orcid.org\/0000-0001-9486-929X","authenticated-orcid":false,"given":"Witold","family":"Andrzejewski","sequence":"first","affiliation":[{"name":"Poznan University of Technology, Poznan, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4914-9394","authenticated-orcid":false,"given":"Pawe\u0142","family":"Boi\u0144ski","sequence":"additional","affiliation":[{"name":"Poznan University of Technology, Poznan, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-6037-5718","authenticated-orcid":false,"given":"Robert","family":"Wrembel","sequence":"additional","affiliation":[{"name":"Poznan University of Technology, Poznan, Poland"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2025,9,2]]},"reference":[{"key":"e_1_3_3_1_2_2","unstructured":"Daniel Bakkelund. 2009. An LCS-based string metric. University of Oslo Oslo Norway 9\u00a0pages. https:\/\/api.semanticscholar.org\/CorpusID:5116711"},{"key":"e_1_3_3_1_3_2","doi-asserted-by":"crossref","unstructured":"Pawe\u0142 Boi\u0144ski Witold Andrzejewski Mi\u0142osz Grocholewski Tobiasz Gruszczy\u0144ski and Robert Wrembel. 2025. Leveraging machine learning techniques for discovering broken lineage links between database objects. 7\u00a0pages. Under review.","DOI":"10.1145\/3736229.3736263"},{"key":"e_1_3_3_1_4_2","doi-asserted-by":"crossref","unstructured":"Peter Buneman and Wang-Chiew Tan. 2019. Data Provenance: What next? SIGMOD Rec. 47 3 (Feb. 2019) 5\u201316. https:\/\/doi.org\/10.1145\/3316416.3316418","DOI":"10.1145\/3316416.3316418"},{"key":"e_1_3_3_1_5_2","series-title":"(VLDB \u201997)","first-page":"426","volume-title":"Proceedings of the 23rd International Conference on Very Large Data Bases","author":"Ciaccia Paolo","year":"1997","unstructured":"Paolo Ciaccia, Marco Patella, and Pavel Zezula. 1997. M-tree: An Efficient Access Method for Similarity Search in Metric Spaces. In Proceedings of the 23rd International Conference on Very Large Data Bases(VLDB \u201997). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 426\u2013435."},{"key":"e_1_3_3_1_6_2","doi-asserted-by":"crossref","unstructured":"Yingwei Cui Jennifer Widom and Janet\u00a0L. Wiener. 2000. Tracing the lineage of view data in a warehousing environment. ACM Trans. Database Syst. 25 2 (June 2000) 179\u2013227. https:\/\/doi.org\/10.1145\/357775.357777","DOI":"10.1145\/357775.357777"},{"key":"e_1_3_3_1_7_2","first-page":"104","volume-title":"5th Int. Conf. on Information, Process, and Knowledge Management (eKNOW 2013)","author":"Davis Delmar\u00a0B.","year":"2013","unstructured":"Delmar\u00a0B. Davis, Hazeline\u00a0U. Asuncion, Ghaleb Abdulla, and Christopher\u00a0W. Carr. 2013. Towards Recovering Provenance with Experiment Explorer. In 5th Int. Conf. on Information, Process, and Knowledge Management (eKNOW 2013). IARIA, Nice, France, 104\u2013110."},{"key":"e_1_3_3_1_8_2","first-page":"227","volume-title":"Datenbanksysteme in Business, Technologie und Web (BTW 2007) \u2013 12. Fachtagung des GI-Fachbereichs \"Datenbanken und Informationssysteme\" (DBIS)","author":"Glavic Boris","year":"2007","unstructured":"Boris Glavic and Klaus Dittrich. 2007. Data Provenance: A Categorization of Existing Approaches. In Datenbanksysteme in Business, Technologie und Web (BTW 2007) \u2013 12. Fachtagung des GI-Fachbereichs \"Datenbanken und Informationssysteme\" (DBIS). Gesellschaft f\u00fcr Informatik e. V., Bonn, 227\u2013241."},{"key":"e_1_3_3_1_9_2","unstructured":"Milosz Grocholewski and Tobiasz Gruszczynski. 2024. Structure lineage for database objects and code: design implementation and experimental evaluation (in Polish). Master thesis. Poznan University of Technology."},{"key":"e_1_3_3_1_10_2","volume-title":"Int. Conf. on Information Systems, (ICIS)","author":"Hariharan Anuja","year":"2024","unstructured":"Anuja Hariharan, Tianren Zhang, Marvin Motz, and Christof Weinhardt. 2024. Accessible data lineage: A scoping review on open-source data lineage platforms. In Int. Conf. on Information Systems, (ICIS). Association for Information Systems."},{"key":"e_1_3_3_1_11_2","doi-asserted-by":"crossref","unstructured":"Melanie Herschel Ralf Diestelk\u00e4mper and Houssem Ben\u00a0Lahmar. 2017. A survey on provenance: What for? What form? What from? The VLDB Journal 26 (12 2017). https:\/\/doi.org\/10.1007\/s00778-017-0486-1","DOI":"10.1007\/s00778-017-0486-1"},{"key":"e_1_3_3_1_12_2","unstructured":"Hofmann Felipe Alex. 2020. Tracer : a machine learning approach to data lineage. Master thesis. Massachusetts Institute of Technology."},{"key":"e_1_3_3_1_13_2","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9781139924801.004"},{"key":"e_1_3_3_1_14_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-642-35173-0_29"},{"key":"e_1_3_3_1_15_2","doi-asserted-by":"crossref","unstructured":"Victor\u00a0M. Panaretos and Yoav Zemel. 2019. Statistical Aspects of Wasserstein Distances. Annual Review of Statistics and Its Application 6 Volume 6 2019 (2019) 405\u2013431. https:\/\/doi.org\/10.1146\/annurev-statistics-030718-104938","DOI":"10.1146\/annurev-statistics-030718-104938"},{"key":"e_1_3_3_1_16_2","doi-asserted-by":"crossref","unstructured":"Fotis Psallidas and Eugene Wu. 2018. Smoke: fine-grained lineage at interactive speed. Proc. VLDB Endow. 11 6 (Feb. 2018) 719\u2013732. https:\/\/doi.org\/10.14778\/3199517.3199522","DOI":"10.14778\/3184470.3184475"},{"key":"e_1_3_3_1_17_2","doi-asserted-by":"crossref","unstructured":"Mohammed\u00a0Suhail Rehman Silu Huang and Aaron\u00a0J. Elmore. 2021. A demonstration of RELIC: a system for retrospective lineage inference of data workflows. Proc. VLDB Endow. 14 12 (July 2021) 2795\u20132798. https:\/\/doi.org\/10.14778\/3476311.3476347","DOI":"10.14778\/3476311.3476347"},{"key":"e_1_3_3_1_18_2","series-title":"(VLDB \u201990)","first-page":"519","volume-title":"Proceedings of the 16th International Conference on Very Large Data Bases","author":"Wang Y.\u00a0Richard","year":"1990","unstructured":"Y.\u00a0Richard Wang and Stuart\u00a0E. Madnick. 1990. A Polygen Model for Heterogeneous Database Systems: The Source Tagging Perspective. In Proceedings of the 16th International Conference on Very Large Data Bases(VLDB \u201990). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 519\u2013538."},{"key":"e_1_3_3_1_19_2","doi-asserted-by":"crossref","unstructured":"Masaya Yamada Hiroyuki Kitagawa Toshiyuki Amagasa and Akiyoshi Matono. 2022. Augmented lineage: traceability of data analysis including complex UDF processing. The VLDB Journal 32 5 (Nov. 2022) 963\u2013983. https:\/\/doi.org\/10.1007\/s00778-022-00769-7","DOI":"10.1007\/s00778-022-00769-7"},{"key":"e_1_3_3_1_20_2","doi-asserted-by":"crossref","unstructured":"Li Yujian and Liu Bo. 2007. A Normalized Levenshtein Distance Metric. IEEE Trans. Pattern Anal. Mach. Intell. 29 6 (June 2007) 1091\u20131095. https:\/\/doi.org\/10.1109\/TPAMI.2007.1078","DOI":"10.1109\/TPAMI.2007.1078"}],"event":{"name":"PW' 25: International Conference on Management of Data","sponsor":["SIGMOD ACM Special Interest Group on Management of Data"],"location":"Berlin Germany","acronym":"PW' 25"},"container-title":["Proceedings of the ProvenanceWeek 2025"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3736229.3736263","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,2]],"date-time":"2025-09-02T11:59:27Z","timestamp":1756814367000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3736229.3736263"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2025,6,22]]},"references-count":19,"alternative-id":["10.1145\/3736229.3736263","10.1145\/3736229"],"URL":"https:\/\/doi.org\/10.1145\/3736229.3736263","relation":{},"subject":[],"published":{"date-parts":[[2025,6,22]]},"assertion":[{"value":"2025-09-02","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}