{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,7,18]],"date-time":"2026-07-18T02:47:07Z","timestamp":1784342827074,"version":"3.55.0"},"reference-count":30,"publisher":"MDPI AG","issue":"5","license":[{"start":{"date-parts":[[2023,2,24]],"date-time":"2023-02-24T00:00:00Z","timestamp":1677196800000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Fundamental Research Funds for the Central Universities","award":["N2116017"],"award-info":[{"award-number":["N2116017"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>At present, the explosive growth in the volume of software code makes the code review process very labor-intensive and time-consuming. An automated code review model can help improve the efficiency of the process. Tufano et al. designed two automated tasks, based on a deep learning approach, to help improve the efficiency of code review from two different perspectives: that of the developer submitting the code and that of the code reviewer. However, they used only code sequence information and did not explore the logical structure information of the code, which carries richer meaning. To improve the learning of code structure information, a program dependency graph serialization algorithm, PDG2Seq, is proposed, which converts the program dependency graph into a unique graph code sequence in a lossless manner while retaining the program structure information and semantic information. We then designed an automated code review model based on the pre-trained CodeBERT architecture, which strengthens the learning of code information by fusing program structure information and code sequence information, and then fine-tuned the model for the code review scenario to complete the automatic modification of the code. 
To verify the efficiency of the algorithm, the two tasks in the experiment were compared with the best-performing 1-encoder\/2-encoder algorithm. The experimental results show that the proposed model achieves a significant improvement on the BLEU, Levenshtein distance, and ROUGE-L metrics.<\/jats:p>","DOI":"10.3390\/s23052551","type":"journal-article","created":{"date-parts":[[2023,2,27]],"date-time":"2023-02-27T02:10:46Z","timestamp":1677463846000},"page":"2551","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":11,"title":["Automatic Code Review by Learning the Structure Information of Code Graph"],"prefix":"10.3390","volume":"23","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-6798-9293","authenticated-orcid":false,"given":"Ying","family":"Yin","sequence":"first","affiliation":[{"name":"School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Yuhai","family":"Zhao","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-4986-8030","authenticated-orcid":false,"given":"Yiming","family":"Sun","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China"}],"role":[{"vocabulary":"crossref","role":"author"}]},{"given":"Chen","family":"Chen","sequence":"additional","affiliation":[{"name":"School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China"}],"role":[{"vocabulary":"crossref","role":"author"}]}],"member":"1968","published-online":{"date-parts":[[2023,2,24]]},"reference":[{"key":"ref_1","unstructured":"Sadowski, C., S\u00f6derberg, E., Church, L., Sipko, M., and Bacchelli, A. (June, January 27). 
Modern code review: A case study at google. Proceedings of the 40th International Conference on Software Engineering: Software Engineering in Practice, Gothenburg, Sweden."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"111515","DOI":"10.1016\/j.jss.2022.111515","article-title":"A decade of code comment quality assessment: A systematic literature review","volume":"195","author":"Rani","year":"2023","journal-title":"J. Syst. Softw."},{"key":"ref_3","doi-asserted-by":"crossref","unstructured":"Dong, L., Zhang, H., Yang, L., Weng, Z., Yang, X., Zhou, X., and Pan, Z. (2021, January 6\u20139). Survey on Pains and Best Practices of Code Review. Proceedings of the 2021 28th Asia-Pacific Software Engineering Conference (APSEC), Taipei, Taiwan.","DOI":"10.1109\/APSEC53868.2021.00055"},{"key":"ref_4","doi-asserted-by":"crossref","unstructured":"Wessel, M., Serebrenik, A., Wiese, I., Steinmacher, I., and Gerosa, M.A. (2020, January 21\u201323). What to Expect from Code Review Bots on GitHub? A Survey with OSS Maintainers. Proceedings of the XXXIV Brazilian Symposium on Software Engineering, Natal, Brazil.","DOI":"10.1145\/3422392.3422459"},{"key":"ref_5","unstructured":"Dosea, M., Sant\u2019Anna, C., Oliveira, Y., and Junior, M.C. (2020). A Survey of Software Code Review Practices in Brazil. arXiv."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Tufano, R., Pascarella, L., Tufano, M., Poshyvanyk, D., and Bavota, G. (2021, January 22\u201330). Towards Automating Code Review Activities. Proceedings of the 2021 IEEE\/ACM 43rd International Conference on Software Engineering (ICSE), Madrid, Spain.","DOI":"10.1109\/ICSE43902.2021.00027"},{"key":"ref_7","doi-asserted-by":"crossref","unstructured":"Huo, X., Li, M., and Zhou, Z.H. (2020, January 7\u201312). Control Flow Graph Embedding Based on Multi-Instance Decomposition for Bug Localization. 
Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA.","DOI":"10.1609\/aaai.v34i04.5844"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Wan, Y., Zhao, Z., Yang, M., Xu, G., Ying, H., Wu, J., and Yu, P.S. (2018, January 3\u20137). Improving automatic source code summarization via deep reinforcement learning. Proceedings of the 33rd ACM\/IEEE International Conference on Automated Software Engineering, Montpellier, France.","DOI":"10.1145\/3238147.3238206"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Wan, Y., Shu, J., Sui, Y., Xu, G., Zhao, Z., Wu, J., and Yu, P. (2019, January 11\u201315). Multi-modal attention network learning for semantic source code retrieval. Proceedings of the 2019 34th IEEE\/ACM International Conference on Automated Software Engineering (ASE), San Diego, CA, USA.","DOI":"10.1109\/ASE.2019.00012"},{"key":"ref_10","doi-asserted-by":"crossref","unstructured":"Feng, Z., Guo, D., Tang, D., Duan, N., Feng, X., Gong, M., Shou, L., Qin, B., Liu, T., and Jiang, D. (2020). CodeBERT: A pre-trained model for programming and natural languages. arXiv.","DOI":"10.18653\/v1\/2020.findings-emnlp.139"},{"key":"ref_11","unstructured":"Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv."},{"key":"ref_12","first-page":"1","article-title":"Modular tree network for source code representation learning","volume":"29","author":"Wang","year":"2020","journal-title":"ACM Trans. Softw. Eng. Methodol."},{"key":"ref_13","doi-asserted-by":"crossref","first-page":"2179","DOI":"10.1007\/s10664-019-09730-9","article-title":"Deep code comment generation with hybrid lexical and syntactical information","volume":"25","author":"Hu","year":"2020","journal-title":"Empir. Softw. Eng."},{"key":"ref_14","doi-asserted-by":"crossref","unstructured":"Wu, H., Zhao, H., and Zhang, M. (2020). 
SIT3: Code Summarization with Structure-induced Transformer. arXiv.","DOI":"10.18653\/v1\/2021.findings-acl.93"},{"key":"ref_15","doi-asserted-by":"crossref","unstructured":"LeClair, A., Haque, S., Wu, L., and McMillan, C. (2020, January 13\u201315). Improved Code Summarization via a Graph Neural Network. Proceedings of the 28th International Conference on Program Comprehension, Seoul, Republic of Korea.","DOI":"10.1145\/3387904.3389268"},{"key":"ref_16","doi-asserted-by":"crossref","first-page":"3346","DOI":"10.1007\/s10664-018-9602-0","article-title":"Early prediction of merged code changes to prioritize reviewing tasks","volume":"23","author":"Fan","year":"2018","journal-title":"Empir. Softw. Eng."},{"key":"ref_17","doi-asserted-by":"crossref","unstructured":"Uchoa, A., Barbosa, C., Coutinho, D., Oizumi, W., Assuncao, W.K.G., Vergilio, S.R., Pereira, J.A., Oliveira, A., and Garcia, A. (2021, January 17\u201319). Predicting Design Impactful Changes in Modern Code Review: A Large-Scale Empirical Study. Proceedings of the 2021 IEEE\/ACM 18th International Conference on Mining Software Repositories (MSR), Madrid, Spain.","DOI":"10.1109\/MSR52588.2021.00059"},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"32","DOI":"10.1016\/j.infsof.2018.01.015","article-title":"What factors influence the reviewer assignment to pull requests?","volume":"98","author":"Soares","year":"2018","journal-title":"Inf. Softw. Technol."},{"key":"ref_19","unstructured":"Shi, S.T., Li, M., Lo, D., Thung, F., and Huo, X. (February, January 27). Automatic code review by learning the revision of source code. Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA."},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Li, H.Y., Shi, S.T., Thung, F., Huo, X., Xu, B., Li, M., and Lo, D. (2019, January 14\u201317). DeepReview: Automatic code review using deep multi-instance learning. 
Proceedings of the Advances in Knowledge Discovery and Data Mining: 23rd Pacific-Asia Conference, PAKDD 2019, Macau, China.","DOI":"10.1007\/978-3-030-16145-3_25"},{"key":"ref_21","doi-asserted-by":"crossref","unstructured":"Siow, J.K., Gao, C., Fan, L., Chen, S., and Liu, Y. (2020, January 18\u201321). Core: Automating review recommendation for code changes. Proceedings of the 2020 IEEE 27th International Conference on Software Analysis, Evolution and Reengineering (SANER), London, ON, Canada.","DOI":"10.1109\/SANER48275.2020.9054794"},{"key":"ref_22","unstructured":"Hoang, T., Kang, H.J., Lo, D., and Lawall, J. (July, January 27). Cc2vec: Distributed representations of code changes. Proceedings of the ACM\/IEEE 42nd International Conference on Software Engineering, Seoul, Republic of Korea."},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"5619","DOI":"10.1007\/s00500-020-05559-3","article-title":"Recommending pull request reviewers based on code changes","volume":"25","author":"Ye","year":"2021","journal-title":"Soft Comput."},{"key":"ref_24","unstructured":"Lu, L., Ren, X., Qi, L., Cui, C., and Jiao, Y. (2018, January 1\u20133). Target Gene Mining Algorithm Based on gSpan. Proceedings of the Collaborative Computing: Networking, Applications and Worksharing: 14th EAI International Conference, CollaborateCom 2018, Shanghai, China."},{"key":"ref_25","doi-asserted-by":"crossref","first-page":"2469","DOI":"10.1360\/jos182469","article-title":"An Efficient Frequent Subgraph Mining Algorithm","volume":"18","author":"Li","year":"2007","journal-title":"J. Softw."},{"key":"ref_26","doi-asserted-by":"crossref","unstructured":"Jin, W., Liu, X., Ma, Y., Aggarwal, C., and Tang, J. (2022). Feature Overcorrelation in Deep Graph Neural Networks: A New Perspective. arXiv.","DOI":"10.1145\/3534678.3539445"},{"key":"ref_27","unstructured":"Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, \u0141., and Polosukhin, I. 
(2017, January 4\u20139). Attention is all you need. Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA."},{"key":"ref_28","doi-asserted-by":"crossref","unstructured":"Tufano, M., Pantiuchina, J., Watson, C., Bavota, G., and Poshyvanyk, D. (2019, January 25\u201331). On learning meaningful code changes via neural machine translation. Proceedings of the 2019 IEEE\/ACM 41st International Conference on Software Engineering (ICSE), Montreal, QC, Canada.","DOI":"10.1109\/ICSE.2019.00021"},{"key":"ref_29","unstructured":"Lin, C.Y. (2004). Rouge: A Package for Automatic Evaluation of Summaries, Association for Computational Linguistics. Text Summarization Branches Out."},{"key":"ref_30","unstructured":"Sutskever, I., Vinyals, O., and Le, Q.V. (2014, January 8\u201313). Sequence to sequence learning with neural networks. Proceedings of the 27th International Conference on Neural Information Processing Systems, Montreal, QC, Canada."}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/5\/2551\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T18:42:10Z","timestamp":1760121730000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/23\/5\/2551"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,2,24]]},"references-count":30,"journal-issue":{"issue":"5","published-online":{"date-parts":[[2023,3]]}},"alternative-id":["s23052551"],"URL":"https:\/\/doi.org\/10.3390\/s23052551","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,2,24]]}}}