{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:19:39Z","timestamp":1750306779563,"version":"3.41.0"},"reference-count":42,"publisher":"Association for Computing Machinery (ACM)","issue":"3","license":[{"start":{"date-parts":[[2013,8,1]],"date-time":"2013-08-01T00:00:00Z","timestamp":1375315200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["ACM Transactions on Asian Language Information Processing"],"published-print":{"date-parts":[[2013,8]]},"abstract":"<jats:p>We propose a named entity (NE) recognition method in which word chunks are repeatedly decomposed and concatenated. Our method identifies word chunks with a base chunker, such as a noun phrase chunker, and then recognizes NEs from the recognized word chunk sequences. By using word chunks, we can obtain features that cannot be obtained in word-sequence-based recognition methods, such as the first word of a word chunk, the last word of a word chunk, and so on. However, each word chunk may include a part of an NE or multiple NEs. To solve this problem, we use the following operators: SHIFT for separating the first word from a word chunk, POP for separating the last word from a word chunk, JOIN for concatenating two word chunks, and REDUCE for assigning an NE label to a word chunk. We evaluate our method on a Japanese NE recognition dataset that includes about 200,000 annotations of 191 types of NEs from over 8,500 news articles. The experimental results show that the training and processing speeds of our method are faster than those of a linear-chain structured perceptron and a semi-Markov perceptron, while maintaining high accuracy.<\/jats:p>","DOI":"10.1145\/2499955.2499958","type":"journal-article","created":{"date-parts":[[2013,8,20]],"date-time":"2013-08-20T14:07:13Z","timestamp":1377007633000},"page":"1-18","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["A Named Entity Recognition Method Based on Decomposition and Concatenation of Word Chunks"],"prefix":"10.1145","volume":"12","author":[{"given":"Tomoya","family":"Iwakura","sequence":"first","affiliation":[{"name":"Fujitsu Laboratories Ltd."}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Hiroya","family":"Takamura","sequence":"additional","affiliation":[{"name":"Tokyo Institute of Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Manabu","family":"Okumura","sequence":"additional","affiliation":[{"name":"Tokyo Institute of Technology"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2013,8]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073445.1073447"},{"key":"e_1_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.5555\/218355.218367"},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118853.1118857"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/1014052.1014065"},{"key":"e_1_2_1_5_1","volume-title":"Proceedings of the International Conference on Machine Learning (ICML\u201900)","author":"Collins M.","year":"2000","unstructured":"Collins , M. 2000 . Discriminative reranking for natural language parsing . In Proceedings of the International Conference on Machine Learning (ICML\u201900) . 175--182. Collins, M. 2000. Discriminative reranking for natural language parsing. In Proceedings of the International Conference on Machine Learning (ICML\u201900). 175--182."},{"key":"e_1_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118693.1118694"},{"key":"e_1_2_1_7_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073083.1073165"},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5555\/1248547.1248566"},{"volume-title":"Proceedings of the Conference on Computational Natural Language Learning (CoNLL\u201910)","author":"Gimpel K.","key":"e_1_2_1_9_1","unstructured":"Gimpel , K. , Das , D. , and Smith , N. A . 2010. Distributed asynchronous online learning for natural language processing . In Proceedings of the Conference on Computational Natural Language Learning (CoNLL\u201910) . 213--222. Gimpel, K., Das, D., and Smith, N. A. 2010. Distributed asynchronous online learning for natural language processing. In Proceedings of the Conference on Computational Natural Language Learning (CoNLL\u201910). 213--222."},{"key":"e_1_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.3115\/992628.992709"},{"key":"e_1_2_1_11_1","unstructured":"Hashimoto T. Inui T. and Murakami K. 2008. Constructing extended named entity annotated corpora. IPSJ SIG Notes 2008 113 113--120.  Hashimoto T. Inui T. and Murakami K. 2008. Constructing extended named entity annotated corpora. IPSJ SIG Notes 2008 113 113--120."},{"key":"e_1_2_1_12_1","volume-title":"Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL\u201908)","author":"Huang L.","year":"2008","unstructured":"Huang , L. 2008 . Forest reranking: Discriminative parsing with non-local features . In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL\u201908) . 586--594. Huang, L. 2008. Forest reranking: Discriminative parsing with non-local features. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL\u201908). 586--594."},{"volume-title":"Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL\u201910)","author":"Huang L.","key":"e_1_2_1_13_1","unstructured":"Huang , L. and Sagae , K . 2010. Dynamic programming for linear-time incremental parsing . In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL\u201910) . 1077--1086. Huang, L. and Sagae, K. 2010. Dynamic programming for linear-time incremental parsing. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL\u201910). 1077--1086."},{"key":"e_1_2_1_14_1","volume-title":"In Proceedings of the IREX Workshop.","author":"IREX Committee","year":"1999","unstructured":"IREX Committee . 1999 . In Proceedings of the IREX Workshop. IREX Committee. 1999. In Proceedings of the IREX Workshop."},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.3115\/1072228.1072282"},{"key":"e_1_2_1_16_1","volume-title":"Proceedings of the Recent Advances in Natural Language Processing (RANLP\u201911)","author":"Iwakura T.","year":"2011","unstructured":"Iwakura , T. 2011 . A named entity recognition method using rules acquired from unlabeled data . In Proceedings of the Recent Advances in Natural Language Processing (RANLP\u201911) . 170--177. Iwakura, T. 2011. A named entity recognition method using rules acquired from unlabeled data. In Proceedings of the Recent Advances in Natural Language Processing (RANLP\u201911). 170--177."},{"volume-title":"Proceedings of the Joint Conference on Empirical Methods on Natural Language Processing and the Conference on Computational Natural Language Learning (EMNLP-CoNLL\u201907)","author":"Kazama J.","key":"e_1_2_1_17_1","unstructured":"Kazama , J. and Torisawa , K . 2007. Exploiting Wikipedia as external knowledge for named entity recognition . In Proceedings of the Joint Conference on Empirical Methods on Natural Language Processing and the Conference on Computational Natural Language Learning (EMNLP-CoNLL\u201907) . 698--707. Kazama, J. and Torisawa, K. 2007. Exploiting Wikipedia as external knowledge for named entity recognition. In Proceedings of the Joint Conference on Empirical Methods on Natural Language Processing and the Conference on Computational Natural Language Learning (EMNLP-CoNLL\u201907). 698--707."},{"volume-title":"Proceedings of the Conference of the North American Chapter of the Association for Computational Learning: Human Language Technologies (HLT-NAACL\u201908)","author":"Kazama J.","key":"e_1_2_1_18_1","unstructured":"Kazama , J. and Torisawa , K . 2008. Inducing gazetteers for named entity recognition by large-scale clustering of dependency relations . In Proceedings of the Conference of the North American Chapter of the Association for Computational Learning: Human Language Technologies (HLT-NAACL\u201908) . 407--415. Kazama, J. and Torisawa, K. 2008. Inducing gazetteers for named entity recognition by large-scale clustering of dependency relations. In Proceedings of the Conference of the North American Chapter of the Association for Computational Learning: Human Language Technologies (HLT-NAACL\u201908). 407--415."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118853.1118869"},{"volume-title":"Proceedings of the Conference Empirical Methods on Natural Language Processing (EMNLP\u201904)","author":"Kudo T.","key":"e_1_2_1_20_1","unstructured":"Kudo , T. , Yamamoto , K. , and Matsumoto , Y . 2004. Applying conditional random fields to Japanese morphological analysis . In Proceedings of the Conference Empirical Methods on Natural Language Processing (EMNLP\u201904) . 230--237. Kudo, T., Yamamoto, K., and Matsumoto, Y. 2004. Applying conditional random fields to Japanese morphological analysis. In Proceedings of the Conference Empirical Methods on Natural Language Processing (EMNLP\u201904). 230--237."},{"volume-title":"Proceedings of the International Conference on Language Resouces and Evaluation. 719--724","author":"Kurohashi S.","key":"e_1_2_1_21_1","unstructured":"Kurohashi , S. and Nagao , M . 1998. Building a Japanese parsed corpus while improving the parsing system . In Proceedings of the International Conference on Language Resouces and Evaluation. 719--724 . Kurohashi, S. and Nagao, M. 1998. Building a Japanese parsed corpus while improving the parsing system. In Proceedings of the International Conference on Language Resouces and Evaluation. 719--724."},{"volume-title":"Proceedings of the International Conference on Machine Learning (ICML\u201901)","author":"Lafferty J. D.","key":"e_1_2_1_22_1","unstructured":"Lafferty , J. D. , McCallum , A. , and Pereira , F. C. N. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data . In Proceedings of the International Conference on Machine Learning (ICML\u201901) . 282--289. Lafferty, J. D., McCallum, A., and Pereira, F. C. N. 2001. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the International Conference on Machine Learning (ICML\u201901). 282--289."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1007\/BF01589116"},{"key":"e_1_2_1_24_1","doi-asserted-by":"publisher","DOI":"10.5715\/jnlp.11.3_39"},{"volume-title":"Proceedings of the International Conference on Machine Learning (ICML\u201900)","author":"McCallum A.","key":"e_1_2_1_25_1","unstructured":"McCallum , A. , Freitag , D. , and Pereira , F. C. N. 2000. Maximum entropy Markov models for information extraction and segmentation . In Proceedings of the International Conference on Machine Learning (ICML\u201900) . 591--598. McCallum, A., Freitag, D., and Pereira, F. C. N. 2000. Maximum entropy Markov models for information extraction and segmentation. In Proceedings of the International Conference on Machine Learning (ICML\u201900). 591--598."},{"volume-title":"Proceedings of the Conference of the North American Chapter of the Association for Computational Learning: Human Language Technologies (HLT-NAACL\u201910)","author":"McDonald R.","key":"e_1_2_1_26_1","unstructured":"McDonald , R. , Hall , K. , and Mann , G . 2010. Distributed training strategies for the structured perceptron . In Proceedings of the Conference of the North American Chapter of the Association for Computational Learning: Human Language Technologies (HLT-NAACL\u201910) . 456--464. McDonald, R., Hall, K., and Mann, G. 2010. Distributed training strategies for the structured perceptron. In Proceedings of the Conference of the North American Chapter of the Association for Computational Learning: Human Language Technologies (HLT-NAACL\u201910). 456--464."},{"volume-title":"Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP\u201910)","author":"Mejer A.","key":"e_1_2_1_27_1","unstructured":"Mejer , A. and Crammer , K . 2010. Confidence in structured-prediction using confidence-weighted models . In Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP\u201910) . 971--981. Mejer, A. and Crammer, K. 2010. Confidence in structured-prediction using confidence-weighted models. In Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP\u201910). 971--981."},{"volume-title":"Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL\u201909)","author":"Nothman J.","key":"e_1_2_1_28_1","unstructured":"Nothman , J. , Murphy , T. , and Curran , J. R . 2009. Analysing Wikipedia and gold-standard corpora for NER training . In Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL\u201909) . 612--620. Nothman, J., Murphy, T., and Curran, J. R. 2009. Analysing Wikipedia and gold-standard corpora for NER training. In Proceedings of the Conference of the European Chapter of the Association for Computational Linguistics (EACL\u201909). 612--620."},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.3115\/1220175.1220234"},{"volume-title":"Proceedings of the 3rd Workshop on Very Large Corpora. Association for Computational Linguistics, 82--94","author":"Ramshaw L.","key":"e_1_2_1_30_1","unstructured":"Ramshaw , L. and Marcus , M . 1995. Text chunking using transformation-based learning . In Proceedings of the 3rd Workshop on Very Large Corpora. Association for Computational Linguistics, 82--94 . Ramshaw, L. and Marcus, M. 1995. Text chunking using transformation-based learning. In Proceedings of the 3rd Workshop on Very Large Corpora. Association for Computational Linguistics, 82--94."},{"volume-title":"Neurocomputing: Foundations of Research","author":"Rosenblatt F.","key":"e_1_2_1_31_1","unstructured":"Rosenblatt , F. 1958. The perceptron: A probabilistic model for information storage and organization in the brain . In Neurocomputing: Foundations of Research , MIT Press , Cambridge, MA , 89--114. Rosenblatt, F. 1958. The perceptron: A probabilistic model for information storage and organization in the brain. In Neurocomputing: Foundations of Research, MIT Press, Cambridge, MA, 89--114."},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.3115\/1117601.1117631"},{"volume-title":"Proceedings of the Neural Information Processing Systems Conference (NIPS\u201904)","author":"Sarawagi S.","key":"e_1_2_1_33_1","unstructured":"Sarawagi , S. and Cohen , W. W . 2004. Semi-Markov conditional random fields for information extraction . In Proceedings of the Neural Information Processing Systems Conference (NIPS\u201904) . Sarawagi, S. and Cohen, W. W. 2004. Semi-Markov conditional random fields for information extraction. In Proceedings of the Neural Information Processing Systems Conference (NIPS\u201904)."},{"volume-title":"Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201902)","author":"Sekine S.","key":"e_1_2_1_34_1","unstructured":"Sekine , S. , Sudo , K. , and Nobata , C . 2002. Extended named entity hierarchy . In Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201902) . Sekine, S., Sudo, K., and Nobata, C. 2002. Extended named entity hierarchy. In Proceedings of the International Conference on Language Resources and Evaluation (LREC\u201902)."},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.3115\/1073445.1073473"},{"volume-title":"Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP\u201908)","author":"Snow R.","key":"e_1_2_1_36_1","unstructured":"Snow , R. , O\u2019Connor , B. , Jurafsky , D. , and Ng , A. Y . 2008. Cheap and fast - but is it good? evaluating non-expert annotations for natural language tasks . In Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP\u201908) . 254--263. Snow, R., O\u2019Connor, B., Jurafsky, D., and Ng, A. Y. 2008. Cheap and fast - but is it good? evaluating non-expert annotations for natural language tasks. In Proceedings of the Conference on Empirical Methods on Natural Language Processing (EMNLP\u201908). 254--263."},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.3115\/977035.977059"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.3115\/1118853.1118877"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.3115\/1119176.1119195"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.3115\/1075218.1075260"},{"key":"e_1_2_1_41_1","unstructured":"Weischedel R. and Brunstein A. 2005. BBN Pronoun Coreference and Entity Type Corpus. Linguistic Data Consortium Philadelphia PA.  Weischedel R. and Brunstein A. 2005. BBN Pronoun Coreference and Entity Type Corpus. Linguistic Data Consortium Philadelphia PA."},{"key":"e_1_2_1_42_1","first-page":"13","article-title":"Shift-reduce chunking for Japanese named entity extraction","volume":"2007","author":"Yamada H.","year":"2007","unstructured":"Yamada , H. 2007 . Shift-reduce chunking for Japanese named entity extraction . IPSJ SIG Notes 2007 , 47, 13 -- 18 . Yamada, H. 2007. Shift-reduce chunking for Japanese named entity extraction. IPSJ SIG Notes 2007, 47, 13--18.","journal-title":"IPSJ SIG Notes"}],"container-title":["ACM Transactions on Asian Language Information Processing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2499955.2499958","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/2499955.2499958","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T07:34:00Z","timestamp":1750232040000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/2499955.2499958"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2013,8]]},"references-count":42,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2013,8]]}},"alternative-id":["10.1145\/2499955.2499958"],"URL":"https:\/\/doi.org\/10.1145\/2499955.2499958","relation":{},"ISSN":["1530-0226","1558-3430"],"issn-type":[{"type":"print","value":"1530-0226"},{"type":"electronic","value":"1558-3430"}],"subject":[],"published":{"date-parts":[[2013,8]]},"assertion":[{"value":"2012-05-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2012-10-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2013-08-01","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}