{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,27]],"date-time":"2026-02-27T04:29:02Z","timestamp":1772166542589,"version":"3.50.1"},"reference-count":28,"publisher":"Springer Science and Business Media LLC","issue":"1","license":[{"start":{"date-parts":[[2023,10,14]],"date-time":"2023-10-14T00:00:00Z","timestamp":1697241600000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"},{"start":{"date-parts":[[2023,10,14]],"date-time":"2023-10-14T00:00:00Z","timestamp":1697241600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0"}],"funder":[{"DOI":"10.13039\/501100012429","name":"Central Universities in China","doi-asserted-by":"publisher","award":["No. N2217002"],"award-info":[{"award-number":["No. N2217002"]}],"id":[{"id":"10.13039\/501100012429","id-type":"DOI","asserted-by":"publisher"}]},{"name":"Natural Science Foundation of Liaoning Provincial Department of Science and Technology","award":["No.2022-KF-11-04"],"award-info":[{"award-number":["No.2022-KF-11-04"]}]}],"content-domain":{"domain":["link.springer.com"],"crossmark-restriction":false},"short-container-title":["J Cloud Comp"],"abstract":"<jats:title>Abstract<\/jats:title>\n                  <jats:p>This paper presents an interesting case study on Legacy Data Integration (LDI for short) for a Regional Cloud Arbitration Court. Due to the inconsistent structure and presentation, legacy arbitration cases can hardly integrate into the Cloud Court unless processed manually. In this study, we propose an AI-enabled LDI method to replace the costly manual approach and ensure privacy protection during the process. 
We trained AI models to replace tasks such as reading and understanding legacy cases, removing privacy information, composing new case records, and inputting them through the system interfaces. Our approach employs Optical Character Recognition (OCR), text classification, and Named Entity Recognition (NER) to transform legacy data into a system format. We applied our method to a Cloud Arbitration Court in Liaoning Province, China, and achieved a comparable privacy filtering effect while retaining the maximum amount of information. Our method demonstrated similar effectiveness as the manual LDI, but with greater efficiency, saving 90% of the workforce and achieving a 60%-70% information extraction rate compared to manual work. With the increasing development of informationalization and intelligentization in judgment and arbitration, many courts are adopting ABC technologies, namely Artificial intelligence, Big data, and Cloud computing, to build the court system. Our method provides a practical reference for integrating legal data into the system.<\/jats:p>","DOI":"10.1186\/s13677-023-00500-z","type":"journal-article","created":{"date-parts":[[2023,10,14]],"date-time":"2023-10-14T04:02:11Z","timestamp":1697256131000},"update-policy":"https:\/\/doi.org\/10.1007\/springer_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["AI-enabled legacy data integration with privacy protection: a case study on regional cloud arbitration court"],"prefix":"10.1186","volume":"12","author":[{"given":"Jie","family":"Song","sequence":"first","affiliation":[]},{"given":"Haifei","family":"Fu","sequence":"additional","affiliation":[]},{"given":"Tianzhe","family":"Jiao","sequence":"additional","affiliation":[]},{"given":"Dongqi","family":"Wang","sequence":"additional","affiliation":[]}],"member":"297","published-online":{"date-parts":[[2023,10,14]]},"reference":[{"issue":"2","key":"500_CR1","first-page":"421","volume":"7","author":"A 
Rashid","year":"2019","unstructured":"Rashid A, Chaturvedi A (2019) Cloud computing characteristics and services: a brief review. Int J Comput Sci Eng 7(2):421\u2013426","journal-title":"Int J Comput Sci Eng"},{"issue":"3","key":"500_CR2","doi-asserted-by":"publisher","first-page":"561","DOI":"10.1017\/als.2020.20","volume":"7","author":"GG Zheng","year":"2020","unstructured":"Zheng GG (2020) China\u2019s grand design of people\u2019s smart courts. Asian J Law Soc 7(3):561\u2013582. https:\/\/doi.org\/10.1017\/als.2020.20","journal-title":"Asian J Law Soc"},{"issue":"2","key":"500_CR3","doi-asserted-by":"publisher","first-page":"125","DOI":"10.3390\/math9020125","volume":"9","author":"K Anatoly Tikhanovich","year":"2021","unstructured":"Anatoly Tikhanovich K, Alexander Vladimirovich S, Veronika Aleksandrovna M (2021) On the effectiveness of the digital legal proceedings model in Russia. Mathematics 9(2):125. https:\/\/doi.org\/10.3390\/math9020125","journal-title":"Mathematics"},{"key":"500_CR4","doi-asserted-by":"publisher","unstructured":"Suhanto A, Hidayanto AN, Naisuty M, Bowo WA, Ayuning Budi NF, Phusavat K (2019) Hybrid cloud data integration critical success factors: a case study at PT Pos Indonesia. In: 2019 Fourth International Conference on Informatics and Computing (ICIC). pp 1\u20136. https:\/\/doi.org\/10.1109\/ICIC47613.2019.8985748","DOI":"10.1109\/ICIC47613.2019.8985748"},{"key":"500_CR5","doi-asserted-by":"publisher","DOI":"10.1109\/TII.2022.3170149","author":"X Zhou","year":"2022","unstructured":"Zhou X, Hu Y, Wu J, Liang W, Ma J, Jin Q (2022) Distribution bias aware collaborative generative adversarial network for imbalanced deep learning in industrial IoT. IEEE Trans Industr Inf. 
https:\/\/doi.org\/10.1109\/TII.2022.3170149","journal-title":"IEEE Trans Industr Inf"},{"issue":"9","key":"500_CR6","doi-asserted-by":"publisher","first-page":"6300","DOI":"10.1109\/TII.2022.3154473","volume":"18","author":"Y Jia","year":"2022","unstructured":"Jia Y, Liu B, Dou W, Xiaolong Xu, Zhou X, Qi L, Yan Z (2022) CroApp: a CNN-based resource optimization approach in edge computing environment. IEEE Trans Industr Inf 18(9):6300\u20136307","journal-title":"IEEE Trans Industr Inf"},{"issue":"16","key":"500_CR7","doi-asserted-by":"publisher","first-page":"12588","DOI":"10.1109\/JIOT.2021.3077449","volume":"8","author":"X Zhou","year":"2021","unstructured":"Zhou X, Xu X, Liang W, Zeng Z, Yan Z (2021) Deep-learning-enhanced multitarget detection for end-edge-cloud surveillance in smart IoT. IEEE Internet Things J 8(16):12588\u201312596. https:\/\/doi.org\/10.1109\/JIOT.2021.3077449","journal-title":"IEEE Internet Things J"},{"key":"500_CR8","doi-asserted-by":"publisher","first-page":"91265","DOI":"10.1109\/ACCESS.2019.2927491","volume":"7","author":"H Dhayne","year":"2019","unstructured":"Dhayne H, Haque R, Kilany R, Taher Y (2019) In search of big medical data integration solutions - a comprehensive survey. IEEE Access 7:91265\u201391290. https:\/\/doi.org\/10.1109\/ACCESS.2019.2927491","journal-title":"IEEE Access"},{"key":"500_CR9","doi-asserted-by":"publisher","first-page":"148845","DOI":"10.1109\/ACCESS.2021.3124010","volume":"9","author":"S Leng","year":"2021","unstructured":"Leng S, Lin J-R, Li S-W, Hu Z-Z (2021) A data integration and simplification framework for improving site planning and building design. IEEE Access 9:148845\u2013148861. 
https:\/\/doi.org\/10.1109\/ACCESS.2021.3124010","journal-title":"IEEE Access"},{"issue":"4","key":"500_CR10","doi-asserted-by":"publisher","first-page":"835","DOI":"10.1007\/s00607-021-00988-w","volume":"104","author":"R Reda","year":"2022","unstructured":"Reda R, Piccinini F, Martinelli G, Carbonaro A (2022) Heterogeneous self-tracked health and fitness data integration and sharing according to a linked open data approach. Computing 104(4):835\u2013857. https:\/\/doi.org\/10.1007\/s00607-021-00988-w","journal-title":"Computing"},{"issue":"5","key":"500_CR11","doi-asserted-by":"publisher","first-page":"1952","DOI":"10.3390\/s22051952","volume":"22","author":"K Habib","year":"2022","unstructured":"Habib K, Saad MHM, Hussain A, Sarker MR, Alaghbari KA (2022) An aggregated data integration approach to the web and cloud platforms through a modular REST-based OPC UA middleware. Sensors 22(5):1952. https:\/\/doi.org\/10.3390\/s22051952","journal-title":"Sensors"},{"key":"500_CR12","doi-asserted-by":"publisher","unstructured":"Prasath N, Sreemathy J (2021) A new approach for cloud data migration technique using talend ETL tool. In: 2021 7th International Conference on Advanced Computing and Communication Systems (ICACCS). pp 1674\u20131678. https:\/\/doi.org\/10.1109\/ICACCS51430.2021.9441898","DOI":"10.1109\/ICACCS51430.2021.9441898"},{"key":"500_CR13","doi-asserted-by":"publisher","first-page":"22400","DOI":"10.1109\/ACCESS.W2022.3151098","volume":"10","author":"A Rodriguez","year":"2022","unstructured":"Rodriguez A, Chen Y-L, Argueta C (2022) FADOHS: framework for detection and integration of unstructured data of hate speech on Facebook using sentiment and emotion analysis. IEEE Access 10:22400\u201322419. 
https:\/\/doi.org\/10.1109\/ACCESS.W2022.3151098","journal-title":"IEEE Access"},{"key":"500_CR14","doi-asserted-by":"publisher","unstructured":"Liu J, Abeysinghe R, Zheng F, Cui L (2019) Pattern-based extraction of disease drug combination knowledge from biomedical literature. In: 2019 IEEE International Conference on Healthcare Informatics (ICHI). pp 1\u20137. https:\/\/doi.org\/10.1109\/ICHI.2019.8904473","DOI":"10.1109\/ICHI.2019.8904473"},{"key":"500_CR15","doi-asserted-by":"publisher","first-page":"104100","DOI":"10.1016\/j.engappai.2020.104100","volume":"97","author":"M-T Nguyen","year":"2021","unstructured":"Nguyen M-T, Le DT, Le L (2021) Transformers-based information extraction with limited data for domain-specific business documents. Eng Appl Artif Intell 97:104100. https:\/\/doi.org\/10.1016\/j.engappai.2020.104100","journal-title":"Eng Appl Artif Intell"},{"key":"500_CR16","doi-asserted-by":"publisher","first-page":"389","DOI":"10.1007\/978-3-030-86159-9_28","volume-title":"Document analysis and recognition \u2013 ICDAR 2021 workshops","author":"M Kerroumi","year":"2021","unstructured":"Kerroumi M, Sayem O, Shabou A (2021) VisualWordGrid: information extraction from scanned documents using a multimodal approach. In: Barney Smith EH, Pal U (eds) Document analysis and recognition \u2013 ICDAR 2021 workshops. Springer International Publishing, Cham, pp 389\u2013402"},{"key":"500_CR17","doi-asserted-by":"publisher","unstructured":"Liu S, Ma J, Feng X (2019) Transparent access and integration of heterogeneous encrypted database in hybrid cloud environment. In: ICC 2019 - 2019 IEEE International Conference on Communications (ICC). pp 1\u20136. https:\/\/doi.org\/10.1109\/ICC.2019.8761975","DOI":"10.1109\/ICC.2019.8761975"},{"issue":"2","key":"500_CR18","first-page":"15","volume":"5","author":"AA Alqarni","year":"2021","unstructured":"Alqarni AA (2021) A secure approach for data integration in cloud using Paillier homomorphic encryption. 
J Basic Appl Sci 5(2):15\u201321","journal-title":"J Basic Appl Sci"},{"key":"500_CR19","doi-asserted-by":"publisher","unstructured":"Ren W, Ghazinour K, Lian X (2022) kt-Safety: graph release via k-Anonymity and t-Closeness. IEEE Trans Knowl Data Eng 1\u201312. https:\/\/doi.org\/10.1109\/TKDE.2022.3221333","DOI":"10.1109\/TKDE.2022.3221333"},{"key":"500_CR20","doi-asserted-by":"publisher","unstructured":"Khan P, Khan Y, Kumar S (2021) Single identity clustering-based data anonymization in healthcare. In: Bansal JC, Paprzycki M, Bianchini M, Das S (eds) Computationally intelligent systems and their applications. Springer Singapore, Singapore, pp 1\u20139. https:\/\/doi.org\/10.1007\/978-981-16-0407-2_1","DOI":"10.1007\/978-981-16-0407-2_1"},{"key":"500_CR21","doi-asserted-by":"publisher","first-page":"160","DOI":"10.1016\/j.comcom.2020.07.032","volume":"161","author":"C Iwendi","year":"2020","unstructured":"Iwendi C, Moqurrab SA, Anjum A, Khan S, Mohan S, Srivastava G (2020) N-sanitization: a semantic privacy-preserving framework for unstructured medical datasets. Comput Commun 161:160\u2013171. https:\/\/doi.org\/10.1016\/j.comcom.2020.07.032","journal-title":"Comput Commun"},{"issue":"2","key":"500_CR22","doi-asserted-by":"publisher","first-page":"1","DOI":"10.1145\/3421509","volume":"22","author":"SA Moqurrab","year":"2021","unstructured":"Moqurrab SA, Anjum A, Khan A, Ahmed M, Ahmad A, Jeon G (2021) Deep-confidentiality: an IoT-enabled privacy-preserving framework for unstructured big biomedical data. ACM Trans Internet Technol 22(2):1\u201321. https:\/\/doi.org\/10.1145\/3421509","journal-title":"ACM Trans Internet Technol"},{"key":"500_CR23","doi-asserted-by":"publisher","DOI":"10.1109\/TCSS.2022.3217790","author":"Z Li","year":"2022","unstructured":"Li Z, Xiaolong Xu, Hang T, Xiang H, Cui Y, Qi L, Zhou X (2022) A knowledge-driven anomaly detection framework for social production system. IEEE Trans Comput Soc Syst. 
https:\/\/doi.org\/10.1109\/TCSS.2022.3217790","journal-title":"IEEE Trans Comput Soc Syst"},{"key":"500_CR24","unstructured":"Devlin J, Chang M-W, Lee K, Toutanova K (2018) BERT: pre-training of deep bidirectional transformers for language understanding. CoRR abs\/1810.04805. Available: http:\/\/arxiv.org\/abs\/1810.04805"},{"key":"500_CR25","doi-asserted-by":"publisher","unstructured":"Chang Y, Kong L, Jia K, Meng Q (2021) Chinese named entity recognition method based on BERT. In: 2021 IEEE International Conference on Data Science and Computer Application (ICDSCA). pp 294\u2013299. https:\/\/doi.org\/10.1109\/ICDSCA53499.2021.9650256","DOI":"10.1109\/ICDSCA53499.2021.9650256"},{"key":"500_CR26","unstructured":"Xiao, et al. (2018) CAIL2018: a large-scale legal dataset for judgment prediction. CoRR abs\/1807.02478. Available: http:\/\/arxiv.org\/abs\/1807.02478"},{"issue":"2","key":"500_CR27","doi-asserted-by":"publisher","first-page":"99","DOI":"10.1023\/A:1026543900054","volume":"40","author":"Y Rubner","year":"2000","unstructured":"Rubner Y, Tomasi C, Guibas LJ (2000) The earth mover\u2019s distance as a metric for image retrieval. Int J Comput Vision 40(2):99","journal-title":"Int J Comput Vision"},{"key":"500_CR28","doi-asserted-by":"publisher","unstructured":"Bayardo RJ, Agrawal R (2005) Data privacy through optimal k-anonymization. In: 21st International Conference on Data Engineering (ICDE\u201905). pp 217\u2013228. 
https:\/\/doi.org\/10.1109\/ICDE.2005.42","DOI":"10.1109\/ICDE.2005.42"}],"container-title":["Journal of Cloud Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13677-023-00500-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/article\/10.1186\/s13677-023-00500-z\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/link.springer.com\/content\/pdf\/10.1186\/s13677-023-00500-z.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2023,11,17]],"date-time":"2023-11-17T09:28:27Z","timestamp":1700213307000},"score":1,"resource":{"primary":{"URL":"https:\/\/journalofcloudcomputing.springeropen.com\/articles\/10.1186\/s13677-023-00500-z"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2023,10,14]]},"references-count":28,"journal-issue":{"issue":"1","published-online":{"date-parts":[[2023,12]]}},"alternative-id":["500"],"URL":"https:\/\/doi.org\/10.1186\/s13677-023-00500-z","relation":{"has-preprint":[{"id-type":"doi","id":"10.21203\/rs.3.rs-3067360\/v1","asserted-by":"object"}]},"ISSN":["2192-113X"],"issn-type":[{"value":"2192-113X","type":"electronic"}],"subject":[],"published":{"date-parts":[[2023,10,14]]},"assertion":[{"value":"15 June 2023","order":1,"name":"received","label":"Received","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"5 August 2023","order":2,"name":"accepted","label":"Accepted","group":{"name":"ArticleHistory","label":"Article History"}},{"value":"14 October 2023","order":3,"name":"first_online","label":"First Online","group":{"name":"ArticleHistory","label":"Article History"}},{"order":1,"name":"Ethics","group":{"name":"EthicsHeading","label":"Declarations"}},{"value":"The authors declare no competing 
interests.","order":2,"name":"Ethics","group":{"name":"EthicsHeading","label":"Competing interests"}}],"article-number":"145"}}