{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,7]],"date-time":"2026-03-07T14:17:20Z","timestamp":1772893040883,"version":"3.50.1"},"reference-count":49,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2023,12,21]],"date-time":"2023-12-21T00:00:00Z","timestamp":1703116800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["61972197"],"award-info":[{"award-number":["61972197"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100004608","name":"Natural Science Foundation of Jiangsu Province","doi-asserted-by":"crossref","award":["BK20201292"],"award-info":[{"award-number":["BK20201292"]}],"id":[{"id":"10.13039\/501100004608","id-type":"DOI","asserted-by":"crossref"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"crossref","award":["NG2023005"],"award-info":[{"award-number":["NG2023005"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"crossref"}]},{"name":"Birkbeck BEI School Project (EFFECT), an oversea grant from the State Key Laboratory of Novel Software Technology, Nanjing University","award":["KFKT2022A03"],"award-info":[{"award-number":["KFKT2022A03"]}]},{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"crossref","award":["62272397"],"award-info":[{"award-number":["62272397"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"crossref"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Trans. Softw. Eng. Methodol."],"published-print":{"date-parts":[[2024,2,29]]},"abstract":"<jats:p>\n            A Dockerfile defines a set of instructions to build Docker images, which can then be instantiated to support containerized applications. Recent studies have revealed a considerable amount of quality issues with Dockerfiles. In this article, we propose a novel approach, Dockerfiles Rule mIning and Violation dEtection (\n            <jats:sans-serif>DRIVE<\/jats:sans-serif>\n            ), to mine implicit rules and detect potential violations of such rules in Dockerfiles.\n            <jats:sans-serif>DRIVE<\/jats:sans-serif>\n            first parses Dockerfiles and transforms them to an intermediate representation. It then leverages an efficient sequential pattern mining algorithm to extract potential patterns. With heuristic-based reduction and moderate human intervention, potential rules are identified, which can then be utilized to detect potential violations of Dockerfiles.\n            <jats:sans-serif>DRIVE<\/jats:sans-serif>\n            identifies 34 semantic rules and 19 syntactic rules including 9 new semantic rules that have not been reported elsewhere. Extensive experiments on real-world Dockerfiles demonstrate the efficacy of our approach.\n          <\/jats:p>","DOI":"10.1145\/3617173","type":"journal-article","created":{"date-parts":[[2023,8,21]],"date-time":"2023-08-21T12:16:46Z","timestamp":1692620206000},"page":"1-23","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":6,"title":["DRIVE: Dockerfile Rule Mining and Violation Detection"],"prefix":"10.1145","volume":"33","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3723-7584","authenticated-orcid":false,"given":"Yu","family":"Zhou","sequence":"first","affiliation":[{"name":"Nanjing University of Aeronautics and Astronautics, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-6175-0558","authenticated-orcid":false,"given":"Weilin","family":"Zhan","sequence":"additional","affiliation":[{"name":"Nanjing University of Aeronautics and Astronautics, China"}]},{"ORCID":"https:\/\/orcid.org\/0009-0004-5979-3161","authenticated-orcid":false,"given":"Zi","family":"Li","sequence":"additional","affiliation":[{"name":"Nanjing University of Aeronautics and Astronautics, China"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5648-9624","authenticated-orcid":false,"given":"Tingting","family":"Han","sequence":"additional","affiliation":[{"name":"Birkbeck, University of London, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-5993-1665","authenticated-orcid":false,"given":"Taolue","family":"Chen","sequence":"additional","affiliation":[{"name":"Birkbeck, University of London, UK"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3874-5628","authenticated-orcid":false,"given":"Harald","family":"Gall","sequence":"additional","affiliation":[{"name":"University of Zurich, Switzerland"}]}],"member":"320","published-online":{"date-parts":[[2023,12,21]]},"reference":[{"key":"e_1_3_2_2_2","unstructured":"[n. d.]. Best Practices for Writing Dockerfiles. Retrieved July 22 2022 from https:\/\/docs.docker.com\/develop\/develop-images\/dockerfile_best-practices\/"},{"key":"e_1_3_2_3_2","unstructured":"[n. d.]. Hadolint. Retrieved July 22 2022 from https:\/\/github.com\/hadolint\/hadolint\/"},{"key":"e_1_3_2_4_2","doi-asserted-by":"publisher","DOI":"10.1109\/MS.2015.62"},{"key":"e_1_3_2_5_2","first-page":"497","volume-title":"Proceedings of the IEEE\/ACM 39th International Conference on Software Engineering Companion (ICSE-C\u201917)","author":"Artac Matej","year":"2017","unstructured":"Matej Artac, Tadej Borovssak, Elisabetta Di Nitto, Michele Guerriero, and Damian Andrew Tamburri. 2017. DevOps: Introducing infrastructure-as-code. In Proceedings of the IEEE\/ACM 39th International Conference on Software Engineering Companion (ICSE-C\u201917). IEEE, 497\u2013498."},{"key":"e_1_3_2_6_2","doi-asserted-by":"publisher","DOI":"10.1007\/s10664-021-10081-7"},{"key":"e_1_3_2_7_2","doi-asserted-by":"publisher","DOI":"10.1109\/MCC.2014.51"},{"key":"e_1_3_2_8_2","first-page":"411","volume-title":"Proceedings of the 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering","author":"Bian Pan","year":"2018","unstructured":"Pan Bian, Bin Liang, Wenchang Shi, Jianjun Huang, and Yan Cai. 2018. Nar-miner: Discovering negative association rules from code for bug detection. In Proceedings of the 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 411\u2013422."},{"key":"e_1_3_2_9_2","first-page":"334","volume-title":"Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201916)","author":"Borges Hudson","year":"2016","unstructured":"Hudson Borges, Andre Hora, and Marco Tulio Valente. 2016. Understanding the factors that impact the popularity of GitHub repositories. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201916). IEEE, 334\u2013344."},{"key":"e_1_3_2_10_2","doi-asserted-by":"publisher","DOI":"10.1016\/j.jss.2018.09.016"},{"key":"e_1_3_2_11_2","doi-asserted-by":"publisher","DOI":"10.5555\/3269687.3269696"},{"key":"e_1_3_2_12_2","unstructured":"Matt Carter. [n. d.]. Retrieved July 22 2022 from https:\/\/www.docker.com\/blog\/docker-index-shows-surging-momentum-in-developer-community-activity-again\/"},{"key":"e_1_3_2_13_2","doi-asserted-by":"publisher","DOI":"10.1109\/MSR.2017.67"},{"key":"e_1_3_2_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/3106237.3106241"},{"key":"e_1_3_2_15_2","doi-asserted-by":"publisher","DOI":"10.1145\/3517193"},{"key":"e_1_3_2_16_2","first-page":"449","volume-title":"Proceedings of the IEEE\/ACM 18th International Conference on Mining Software Repositories (MSR\u201921)","author":"Eng Kalvin","year":"2021","unstructured":"Kalvin Eng and Abram Hindle. 2021. Revisiting dockerfiles in open source software over time. In Proceedings of the IEEE\/ACM 18th International Conference on Mining Software Repositories (MSR\u201921). IEEE, 449\u2013459."},{"issue":"1","key":"e_1_3_2_17_2","first-page":"54","article-title":"A survey of sequential pattern mining","volume":"1","author":"Fournier-Viger Philippe","year":"2017","unstructured":"Philippe Fournier-Viger, Jerry Chun-Wei Lin, Rage Uday Kiran, Yun Sing Koh, and Rincy Thomas. 2017. A survey of sequential pattern mining. Data Sci. Pattern Recogn. 1, 1 (2017), 54\u201377.","journal-title":"Data Sci. Pattern Recogn."},{"key":"e_1_3_2_18_2","first-page":"65","volume-title":"FIMI","author":"Grahne G\u00f6sta","year":"2003","unstructured":"G\u00f6sta Grahne and Jianfei Zhu. 2003. Efficiently using prefix-trees in mining frequent itemsets.. In FIMI, Vol. 90. 65."},{"key":"e_1_3_2_19_2","doi-asserted-by":"publisher","DOI":"10.1145\/3238147.3240470"},{"key":"e_1_3_2_20_2","doi-asserted-by":"publisher","DOI":"10.1145\/3377811.3380406"},{"key":"e_1_3_2_21_2","first-page":"1148","volume-title":"Proceedings of the IEEE\/ACM 43rd International Conference on Software Engineering (ICSE\u201921)","author":"Henkel Jordan","year":"2021","unstructured":"Jordan Henkel, Denini Silva, Leopoldo Teixeira, Marcelo d\u2019Amorim, and Thomas Reps. 2021. Shipwright: A human-in-the-loop system for dockerfile repair. In Proceedings of the IEEE\/ACM 43rd International Conference on Software Engineering (ICSE\u201921). IEEE, 1148\u20131160."},{"key":"e_1_3_2_22_2","first-page":"328","volume-title":"Proceedings of the IEEE\/ACM 41st International Conference on Software Engineering (ICSE\u201919)","author":"Horton Eric","year":"2019","unstructured":"Eric Horton and Chris Parnin. 2019. Dockerizeme: Automatic inference of environment dependencies for python code snippets. In Proceedings of the IEEE\/ACM 41st International Conference on Software Engineering (ICSE\u201919). IEEE, 328\u2013338."},{"key":"e_1_3_2_23_2","first-page":"28","volume-title":"Proceedings of the 35th Symposium on Mass Storage Systems and Technologies (MSST\u201919)","author":"Huang Zhuo","year":"2019","unstructured":"Zhuo Huang, Song Wu, Song Jiang, and Hai Jin. 2019. Fastbuild: Accelerating docker image building for efficient development and deployment of container. In Proceedings of the 35th Symposium on Mass Storage Systems and Technologies (MSST\u201919). IEEE, 28\u201337."},{"key":"e_1_3_2_24_2","doi-asserted-by":"publisher","DOI":"10.1109\/ASE51524.2021.9678585"},{"key":"e_1_3_2_25_2","article-title":"DockerMock: Pre-build detection of dockerfile faults through mocking instruction execution","author":"Li Mingjie","year":"2021","unstructured":"Mingjie Li, Xiaoying Bai, Minghua Ma, and Dan Pei. 2021. DockerMock: Pre-build detection of dockerfile faults through mocking instruction execution. arXiv:2104.05490. Retrieved from https:\/\/arxiv.org\/abs\/2104.05490","journal-title":"arXiv:2104.05490"},{"key":"e_1_3_2_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/1095430.1081755"},{"key":"e_1_3_2_27_2","doi-asserted-by":"publisher","DOI":"10.1145\/2884781.2884870"},{"key":"e_1_3_2_28_2","doi-asserted-by":"publisher","DOI":"10.1145\/1095430.1081754"},{"key":"e_1_3_2_29_2","first-page":"50","volume-title":"Proceedings of the International Workshop on Dynamic Analysis: Held in Conjunction with the ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA\u201908)","author":"Lo David","year":"2008","unstructured":"David Lo, Siau-Cheng Khoo, and Chao Liu. 2008. Mining past-time temporal rules from execution traces. In Proceedings of the International Workshop on Dynamic Analysis: Held in Conjunction with the ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA\u201908). 50\u201356."},{"key":"e_1_3_2_30_2","doi-asserted-by":"publisher","DOI":"10.1002\/smr.1918"},{"key":"e_1_3_2_31_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2019.2905424"},{"issue":"6","key":"e_1_3_2_32_2","doi-asserted-by":"crossref","first-page":"e1329","DOI":"10.1002\/widm.1329","article-title":"Frequent itemset mining: A 25 years review","volume":"9","author":"Luna Jos\u00e9 Mar\u00eda","year":"2019","unstructured":"Jos\u00e9 Mar\u00eda Luna, Philippe Fournier-Viger, and Sebasti\u00e1n Ventura. 2019. Frequent itemset mining: A 25 years review. Data Min. Knowl. Discov. 9, 6 (2019), e1329.","journal-title":"Data Min. Knowl. Discov."},{"key":"e_1_3_2_33_2","doi-asserted-by":"publisher","DOI":"10.1145\/1255329.1255347"},{"key":"e_1_3_2_34_2","first-page":"386","volume-title":"Proceedings of the IEEE International Conference on Cloud Engineering","author":"Morabito Roberto","year":"2015","unstructured":"Roberto Morabito, Jimmy Kj\u00e4llman, and Miika Komu. 2015. Hypervisors vs. lightweight virtualization: A performance comparison. In Proceedings of the IEEE International Conference on Cloud Engineering. IEEE, 386\u2013393."},{"key":"e_1_3_2_35_2","first-page":"524","volume-title":"Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201919)","author":"Oumaziz Mohamed A.","year":"2019","unstructured":"Mohamed A. Oumaziz, Jean-R\u00e9my Falleri, Xavier Blanc, Tegawend\u00e9 F. Bissyand\u00e9, and Jacques Klein. 2019. Handling duplicates in dockerfiles families: Learning from experts. In Proceedings of the IEEE International Conference on Software Maintenance and Evolution (ICSME\u201919). IEEE, 524\u2013535."},{"key":"e_1_3_2_36_2","doi-asserted-by":"publisher","DOI":"10.1109\/MCC.2015.51"},{"key":"e_1_3_2_37_2","first-page":"100","volume-title":"Proceedings of the IEEE International Conference on Software Quality, Reliability and Security (QRS\u201916)","author":"Papamichail Michail","year":"2016","unstructured":"Michail Papamichail, Themistoklis Diamantopoulos, and Andreas Symeonidis. 2016. User-perceived source code quality estimation based on static analysis metrics. In Proceedings of the IEEE International Conference on Software Quality, Reliability and Security (QRS\u201916). IEEE, 100\u2013107."},{"key":"e_1_3_2_38_2","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2004.77"},{"key":"e_1_3_2_39_2","doi-asserted-by":"publisher","DOI":"10.1002\/smr.1737"},{"key":"e_1_3_2_40_2","doi-asserted-by":"publisher","DOI":"10.1145\/3133888"},{"key":"e_1_3_2_41_2","doi-asserted-by":"publisher","DOI":"10.1109\/TSE.2022.3176725"},{"key":"e_1_3_2_42_2","article-title":"Efficient analysis of pattern and association rule mining approaches","author":"Slimani Thabet","year":"2014","unstructured":"Thabet Slimani and Amor Lazzez. 2014. Efficient analysis of pattern and association rule mining approaches. arXiv:1402.2892. Retrieved from https:\/\/arxiv.org\/abs\/1402.2892","journal-title":"arXiv:1402.2892"},{"key":"e_1_3_2_43_2","doi-asserted-by":"publisher","DOI":"10.5555\/645337.650382"},{"key":"e_1_3_2_44_2","doi-asserted-by":"publisher","DOI":"10.5555\/2337223.2337360"},{"key":"e_1_3_2_45_2","first-page":"1","volume-title":"Proceedings of the IEEE 15th International Workshop on Software Clones (IWSC\u201921)","author":"Tsuru Tomoaki","year":"2021","unstructured":"Tomoaki Tsuru, Tasuku Nakagawa, Shinsuke Matsumoto, Yoshiki Higo, and Shinji Kusumoto. 2021. Type-2 code clone detection for dockerfiles. In Proceedings of the IEEE 15th International Workshop on Software Clones (IWSC\u201921). IEEE, 1\u20137."},{"key":"e_1_3_2_46_2","doi-asserted-by":"crossref","DOI":"10.1002\/9780470979174","volume-title":"Data Mining and Statistics for Decision Making","author":"Tuff\u00e9ry St\u00e9phane","year":"2011","unstructured":"St\u00e9phane Tuff\u00e9ry. 2011. Data Mining and Statistics for Decision Making. John Wiley & Sons, New York, NY."},{"key":"e_1_3_2_47_2","doi-asserted-by":"publisher","DOI":"10.1109\/ACCESS.2020.2973750"},{"key":"e_1_3_2_48_2","first-page":"247","volume-title":"Proceedings of the 27th Asia-Pacific Software Engineering Conference (APSEC\u201920)","author":"Wu Yiwen","year":"2020","unstructured":"Yiwen Wu, Yang Zhang, Tao Wang, and Huaimin Wang. 2020. Dockerfile changes in practice: A large-scale empirical study of 4,110 projects on GitHub. In Proceedings of the 27th Asia-Pacific Software Engineering Conference (APSEC\u201920). IEEE, 247\u2013256."},{"key":"e_1_3_2_49_2","doi-asserted-by":"publisher","DOI":"10.1109\/COMPSAC.2019.00033"},{"issue":"2","key":"e_1_3_2_50_2","first-page":"1","article-title":"A multi-dimensional analysis of technical lag in Debian-based Docker images","volume":"26","author":"Zerouali Ahmed","year":"2021","unstructured":"Ahmed Zerouali, Tom Mens, Alexandre Decan, Jesus Gonzalez-Barahona, and Gregorio Robles. 2021. A multi-dimensional analysis of technical lag in Debian-based Docker images. Emp. Softw. Eng. 26, 2 (2021), 1\u201345.","journal-title":"Emp. Softw. Eng."}],"container-title":["ACM Transactions on Software Engineering and Methodology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3617173","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3617173","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T16:36:07Z","timestamp":1750178167000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3617173"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,12,21]]},"references-count":49,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2024,2,29]]}},"alternative-id":["10.1145\/3617173"],"URL":"https:\/\/doi.org\/10.1145\/3617173","relation":{},"ISSN":["1049-331X","1557-7392"],"issn-type":[{"value":"1049-331X","type":"print"},{"value":"1557-7392","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,12,21]]},"assertion":[{"value":"2022-12-11","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-07-24","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2023-12-21","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}