{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,5,12]],"date-time":"2026-05-12T13:24:15Z","timestamp":1778592255128,"version":"3.51.4"},"publisher-location":"New York, NY, USA","reference-count":32,"publisher":"ACM","license":[{"start":{"date-parts":[[2020,9,14]],"date-time":"2020-09-14T00:00:00Z","timestamp":1600041600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2020,9,14]]},"DOI":"10.1145\/3411170.3411258","type":"proceedings-article","created":{"date-parts":[[2020,9,4]],"date-time":"2020-09-04T21:26:23Z","timestamp":1599254783000},"page":"265-268","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":12,"title":["Building an Italian-Chinese Parallel Corpus for Machine Translation from the Web"],"prefix":"10.1145","author":[{"given":"Rita","family":"Tse","sequence":"first","affiliation":[{"name":"School of Applied Sciences - Macao Polytechnic Institute, Engineering Research Centre of Applied Technology on Machine Translation and Artificial Intelligence, Ministry of Education - Macao (China)"}]},{"given":"Silvia","family":"Mirri","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering - University of Bologna - Bologna (Italy)"}]},{"given":"Su-Kit","family":"Tang","sequence":"additional","affiliation":[{"name":"School of Applied Sciences - Macao Polytechnic Institute, Engineering Research Centre of Applied Technology on Machine Translation and Artificial Intelligence, Ministry of Education - Macao (China)"}]},{"given":"Giovanni","family":"Pau","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering - University of Bologna - Bologna (Italy), Computer Science Department - UCLA - Los Angeles, CA (USA)"}]},{"given":"Paola","family":"Salomoni","sequence":"additional","affiliation":[{"name":"Department of Computer Science and Engineering - University of Bologna - Bologna (Italy)"}]}],"member":"320","published-online":{"date-parts":[[2020,9,14]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"The TransBank Aligner: Cross-Sentence Alignment with Deep Neural Networks. In International Conference on Text, Speech, and Dialogue. Springer, 185--196","author":"Aghaebrahimian Ahmad","year":"2019","unstructured":"Ahmad Aghaebrahimian , Michael Ustaszewski , and Andy Stauder . 2019 . The TransBank Aligner: Cross-Sentence Alignment with Deep Neural Networks. In International Conference on Text, Speech, and Dialogue. Springer, 185--196 . Ahmad Aghaebrahimian, Michael Ustaszewski, and Andy Stauder. 2019. The TransBank Aligner: Cross-Sentence Alignment with Deep Neural Networks. In International Conference on Text, Speech, and Dialogue. Springer, 185--196."},{"key":"e_1_3_2_1_2_1","unstructured":"E Bartlett. [n.d.]. J. JW Kotrlik etal (2001).\" Organizational research: Determining appropriate sample size in survey research.\". Information Technology Learning and Performance 19 1 ([n.d.]).  E Bartlett. [n.d.]. J. JW Kotrlik et al. (2001).\" Organizational research: Determining appropriate sample size in survey research.\". Information Technology Learning and Performance 19 1 ([n.d.])."},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.3115\/981344.981366"},{"key":"e_1_3_2_1_4_1","first-page":"59","article-title":"Building a Brazilian Portuguese parallel corpus of original and simplified texts. Advances in Computational Linguistics","volume":"41","author":"Caseli Helena M","year":"2009","unstructured":"Helena M Caseli , Tiago F Pereira , Lucia Specia , Thiago AS Pardo , Caroline Gasperin , and Sandra Maria Alu\u00edsio . 2009 . Building a Brazilian Portuguese parallel corpus of original and simplified texts. Advances in Computational Linguistics , Research in Computer Science 41 (2009), 59 -- 70 . Helena M Caseli, Tiago F Pereira, Lucia Specia, Thiago AS Pardo, Caroline Gasperin, and Sandra Maria Alu\u00edsio. 2009. Building a Brazilian Portuguese parallel corpus of original and simplified texts. Advances in Computational Linguistics, Research in Computer Science 41 (2009), 59--70.","journal-title":"Research in Computer Science"},{"key":"e_1_3_2_1_5_1","volume-title":"International Conference on Human Interaction and Emerging Technologies. Springer, 688--694","author":"Casini Luca","year":"2019","unstructured":"Luca Casini , Giovanni Delnevo , Marco Roccetti , Nicol\u00f2 Zagni , and Giuseppe Cappiello . 2019 . Deep Water: Predicting water meter failures through a human-machine intelligence collaboration . In International Conference on Human Interaction and Emerging Technologies. Springer, 688--694 . Luca Casini, Giovanni Delnevo, Marco Roccetti, Nicol\u00f2 Zagni, and Giuseppe Cappiello. 2019. Deep Water: Predicting water meter failures through a human-machine intelligence collaboration. In International Conference on Human Interaction and Emerging Technologies. Springer, 688--694."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICRCICN.2016.7813653"},{"key":"e_1_3_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.1016\/j.csl.2014.11.001"},{"key":"e_1_3_2_1_8_1","volume-title":"Intelligent and good machines? The role of domain and context codification. Mobile Networks and Applications","author":"Delnevo Giovanni","year":"2019","unstructured":"Giovanni Delnevo , Marco Roccetti , and Silvia Mirri . 2019. Intelligent and good machines? The role of domain and context codification. Mobile Networks and Applications ( 2019 ), 1--9. Giovanni Delnevo, Marco Roccetti, and Silvia Mirri. 2019. Intelligent and good machines? The role of domain and context codification. Mobile Networks and Applications (2019), 1--9."},{"key":"e_1_3_2_1_9_1","volume-title":"Proceedings of Machine Translation Summit XVII","volume":"119","author":"Espl\u00e0-Gomis Miquel","year":"2019","unstructured":"Miquel Espl\u00e0-Gomis , Mikel L Forcada , Gema Ram\u00edrez-S\u00e1nchez , and Hieu Hoang . 2019 . ParaCrawl: Web-scale parallel corpora for the languages of the EU . In Proceedings of Machine Translation Summit XVII Volume 2: Translator, Project and User Tracks. 118-- 119 . Miquel Espl\u00e0-Gomis, Mikel L Forcada, Gema Ram\u00edrez-S\u00e1nchez, and Hieu Hoang. 2019. ParaCrawl: Web-scale parallel corpora for the languages of the EU. In Proceedings of Machine Translation Summit XVII Volume 2: Translator, Project and User Tracks. 118--119."},{"key":"e_1_3_2_1_10_1","volume-title":"Felipe S\u00e1nchez-Mart\u00ednez, Gema Ram\u00edrez-S\u00e1nchez, and Francis M Tyers.","author":"Forcada Mikel L","year":"2011","unstructured":"Mikel L Forcada , Mireia Ginest\u00ed-Rosell , Jacob Nordfalk , Jim O'Regan , Sergio Ortiz-Rojas , Juan Antonio P\u00e9rez-Ortiz , Felipe S\u00e1nchez-Mart\u00ednez, Gema Ram\u00edrez-S\u00e1nchez, and Francis M Tyers. 2011 . Apertium: a free\/open-source platform for rule-based machine translation. Machine translation 25, 2 (2011), 127--144. Mikel L Forcada, Mireia Ginest\u00ed-Rosell, Jacob Nordfalk, Jim O'Regan, Sergio Ortiz-Rojas, Juan Antonio P\u00e9rez-Ortiz, Felipe S\u00e1nchez-Mart\u00ednez, Gema Ram\u00edrez-S\u00e1nchez, and Francis M Tyers. 2011. Apertium: a free\/open-source platform for rule-based machine translation. Machine translation 25, 2 (2011), 127--144."},{"key":"e_1_3_2_1_11_1","volume-title":"A program for aligning sentences in bilingual corpora. Computational linguistics 19, 1","author":"Gale William A","year":"1993","unstructured":"William A Gale and Kenneth W Church . 1993. A program for aligning sentences in bilingual corpora. Computational linguistics 19, 1 ( 1993 ), 75--102. William A Gale and Kenneth W Church. 1993. A program for aligning sentences in bilingual corpora. Computational linguistics 19, 1 (1993), 75--102."},{"key":"e_1_3_2_1_12_1","volume-title":"Proceedings of the 27th International Conference on Computational Linguistics. 1442--1453","author":"Gr\u00e9goire Francis","year":"2018","unstructured":"Francis Gr\u00e9goire and Philippe Langlais . 2018 . Extracting parallel sentences with bidirectional recurrent neural networks to improve machine translation . In Proceedings of the 27th International Conference on Computational Linguistics. 1442--1453 . Francis Gr\u00e9goire and Philippe Langlais. 2018. Extracting parallel sentences with bidirectional recurrent neural networks to improve machine translation. In Proceedings of the 27th International Conference on Computational Linguistics. 1442--1453."},{"key":"e_1_3_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1007\/s11042-017-5092-0"},{"key":"e_1_3_2_1_14_1","volume-title":"Europarl: A parallel corpus for statistical machine translation. In MT summit","author":"Koehn Philipp","year":"2005","unstructured":"Philipp Koehn . 2005 . Europarl: A parallel corpus for statistical machine translation. In MT summit , Vol. 5 . Citeseer , 79--86. Philipp Koehn. 2005. Europarl: A parallel corpus for statistical machine translation. In MT summit, Vol. 5. Citeseer, 79--86."},{"key":"e_1_3_2_1_15_1","volume-title":"Statistical machine translation","author":"Koehn Philipp","unstructured":"Philipp Koehn . 2009. Statistical machine translation . Cambridge University Press . Philipp Koehn. 2009. Statistical machine translation. Cambridge University Press."},{"key":"e_1_3_2_1_16_1","volume-title":"Six challenges for neural machine translation. arXiv preprint arXiv:1706.03872","author":"Koehn Philipp","year":"2017","unstructured":"Philipp Koehn and Rebecca Knowles . 2017. Six challenges for neural machine translation. arXiv preprint arXiv:1706.03872 ( 2017 ). Philipp Koehn and Rebecca Knowles. 2017. Six challenges for neural machine translation. arXiv preprint arXiv:1706.03872 (2017)."},{"key":"e_1_3_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.46792\/fuoyejet.v1i1.26"},{"key":"e_1_3_2_1_18_1","volume-title":"Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311--318","author":"Papineni Kishore","year":"2002","unstructured":"Kishore Papineni , Salim Roukos , Todd Ward , and Wei-Jing Zhu . 2002 . BLEU: a method for automatic evaluation of machine translation . In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311--318 . Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. BLEU: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 311--318."},{"key":"e_1_3_2_1_19_1","volume-title":"Becoming a translator: An introduction to the theory and practice of translation","author":"Robinson Douglas","unstructured":"Douglas Robinson . 2019. Becoming a translator: An introduction to the theory and practice of translation . Routledge . Douglas Robinson. 2019. Becoming a translator: An introduction to the theory and practice of translation. Routledge."},{"key":"e_1_3_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1186\/s40537-019-0235-y"},{"key":"e_1_3_2_1_21_1","volume-title":"A Cautionary Tale for Machine Learning Design: why we Still Need Human-Assisted Big Data Analysis. Mobile Networks and Applications","author":"Roccetti Marco","year":"2020","unstructured":"Marco Roccetti , Giovanni Delnevo , Luca Casini , and Paola Salomoni . 2020. A Cautionary Tale for Machine Learning Design: why we Still Need Human-Assisted Big Data Analysis. Mobile Networks and Applications ( 2020 ), 1--9. Marco Roccetti, Giovanni Delnevo, Luca Casini, and Paola Salomoni. 2020. A Cautionary Tale for Machine Learning Design: why we Still Need Human-Assisted Big Data Analysis. Mobile Networks and Applications (2020), 1--9."},{"key":"e_1_3_2_1_22_1","first-page":"117","article-title":"A survey on parallel corpora alignment","volume":"2011","author":"Santos Andr\u00e9","year":"2011","unstructured":"Andr\u00e9 Santos . 2011 . A survey on parallel corpora alignment . MI-STAR 2011 (2011), 117 -- 128 . Andr\u00e9 Santos. 2011. A survey on parallel corpora alignment. MI-STAR 2011 (2011), 117--128.","journal-title":"MI-STAR"},{"key":"e_1_3_2_1_23_1","volume-title":"The Ninth Conference of the Association for Machine Translation in the Americas (AMTA","author":"Sennrich Rico","year":"2010","unstructured":"Rico Sennrich and Martin Volk . 2010 . MT-based sentence alignment for OCR-generated parallel texts . In The Ninth Conference of the Association for Machine Translation in the Americas (AMTA 2010). Rico Sennrich and Martin Volk. 2010. MT-based sentence alignment for OCR-generated parallel texts. In The Ninth Conference of the Association for Machine Translation in the Americas (AMTA 2010)."},{"key":"e_1_3_2_1_24_1","volume-title":"Proceedings of the 18th Nordic Conference of Computational Linguistics (NODALIDA","author":"Sennrich Rico","year":"2011","unstructured":"Rico Sennrich and Martin Volk . 2011 . Iterative, MT-based sentence alignment of parallel texts . In Proceedings of the 18th Nordic Conference of Computational Linguistics (NODALIDA 2011). 175--182. Rico Sennrich and Martin Volk. 2011. Iterative, MT-based sentence alignment of parallel texts. In Proceedings of the 18th Nordic Conference of Computational Linguistics (NODALIDA 2011). 175--182."},{"key":"e_1_3_2_1_25_1","volume-title":"Recent advances in natural language processing","author":"Tiedemann J\u00f6rg","unstructured":"J\u00f6rg Tiedemann . 2009. News from OPUS-A collection of multilingual parallel corpora with tools and interfaces . In Recent advances in natural language processing , Vol. 5 . 237--248. J\u00f6rg Tiedemann. 2009. News from OPUS-A collection of multilingual parallel corpora with tools and interfaces. In Recent advances in natural language processing, Vol. 5. 237--248."},{"key":"e_1_3_2_1_26_1","volume-title":"Practical Web Scraping for Data Science","author":"vanden Broucke Seppe","unstructured":"Seppe vanden Broucke and Bart Baesens . 2018. Stirring the HTML and CSS Soup . In Practical Web Scraping for Data Science . Springer , 49--77. Seppe vanden Broucke and Bart Baesens. 2018. Stirring the HTML and CSS Soup. In Practical Web Scraping for Data Science. Springer, 49--77."},{"key":"e_1_3_2_1_27_1","unstructured":"Maria Jose Varela-Salinas Ruth Burbat etal 2018. Google translate and deepL: breaking taboos in translator training. (2018).  Maria Jose Varela-Salinas Ruth Burbat et al. 2018. Google translate and deepL: breaking taboos in translator training. (2018)."},{"key":"e_1_3_2_1_28_1","unstructured":"Warren Weaver. 1949. Memorandum on Translation.  Warren Weaver. 1949. Memorandum on Translation."},{"key":"e_1_3_2_1_29_1","unstructured":"Yonghui Wu Mike Schuster Zhifeng Chen Quoc V Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao Klaus Macherey etal 2016. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144 (2016).  Yonghui Wu Mike Schuster Zhifeng Chen Quoc V Le Mohammad Norouzi Wolfgang Macherey Maxim Krikun Yuan Cao Qin Gao Klaus Macherey et al. 2016. Google's neural machine translation system: Bridging the gap between human and machine translation. arXiv preprint arXiv:1609.08144 (2016)."},{"key":"e_1_3_2_1_30_1","volume-title":"Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 166--175","author":"Yang Nan","year":"2013","unstructured":"Nan Yang , Shujie Liu , Mu Li , Ming Zhou , and Nenghai Yu . 2013 . Word alignment modeling with context dependent deep neural network . In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 166--175 . Nan Yang, Shujie Liu, Mu Li, Ming Zhou, and Nenghai Yu. 2013. Word alignment modeling with context dependent deep neural network. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 166--175."},{"key":"e_1_3_2_1_31_1","unstructured":"Danni Yu and Yicong Yu. [n.d.]. Knowledge Dissemination in Media Discourse: Analysis of Italian-Chinese\/Chinese-Italian Parallel Newspaper Corpora. In Knowledge Dissemination Etichs and Ideology in Specialised Communication: Linguistic and Discursive Perspectives Pre-conference Proceedings. 87.  Danni Yu and Yicong Yu. [n.d.]. Knowledge Dissemination in Media Discourse: Analysis of Italian-Chinese\/Chinese-Italian Parallel Newspaper Corpora. In Knowledge Dissemination Etichs and Ideology in Specialised Communication: Linguistic and Discursive Perspectives Pre-conference Proceedings. 87."},{"key":"e_1_3_2_1_32_1","series-title":"Journal of Physics: Conference Series","volume-title":"Research on Alignment in the Construction of Parallel Corpus","author":"Zong Zhaorong","year":"2003","unstructured":"Zhaorong Zong and Changchun Hong . 2019. Research on Alignment in the Construction of Parallel Corpus . In Journal of Physics: Conference Series , Vol. 1213 . IOP Publishing , 04 2003 . Zhaorong Zong and Changchun Hong. 2019. Research on Alignment in the Construction of Parallel Corpus. In Journal of Physics: Conference Series, Vol. 1213. IOP Publishing, 042003."}],"event":{"name":"GoodTechs '20: 6th EAI International Conference on Smart Objects and Technologies for Social Good","location":"Antwerp Belgium","acronym":"GoodTechs '20","sponsor":["EAI The European Alliance for Innovation"]},"container-title":["Proceedings of the 6th EAI International Conference on Smart Objects and Technologies for Social Good"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3411170.3411258","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3411170.3411258","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T21:28:12Z","timestamp":1750195692000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3411170.3411258"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2020,9,14]]},"references-count":32,"alternative-id":["10.1145\/3411170.3411258","10.1145\/3411170"],"URL":"https:\/\/doi.org\/10.1145\/3411170.3411258","relation":{},"subject":[],"published":{"date-parts":[[2020,9,14]]},"assertion":[{"value":"2020-09-14","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}