{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,9]],"date-time":"2025-10-09T01:08:07Z","timestamp":1759972087634,"version":"build-2065373602"},"reference-count":16,"publisher":"Wiley","issue":"6","license":[{"start":{"date-parts":[[2007,3,22]],"date-time":"2007-03-22T00:00:00Z","timestamp":1174521600000},"content-version":"vor","delay-in-days":9637,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["J. Am. Soc. Inf. Sci."],"published-print":{"date-parts":[[1980,11]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>An experimental computer program has been developed to classify documents according to the 80 sections and five major section groupings of <jats:italic>Chemical Abstracts<\/jats:italic> (CA). The program uses pattern recognition techniques supplemented by heuristics. During the \u201ctraining\u201d phase, words from pre\u2010classified documents are selected, and the probability of occurrence of each word in each section of CA is computed and stored in a reference dictionary. The \u201cclassification\u201d phase matches each word of a document title against the dictionary and assigns a section number to the document using weights derived from the probabilities in the dictionary. Heuristic techniques are used to normalize word variants such as plurals, past tenses, and gerunds in both the training phase and the classification phase. The dictionary lookup technique is supplemented by the analysis of chemical nomenclature terms into their component word roots to influence the section to which the documents are assigned. Program performance and human consistency have been evaluated by comparing the program results against the published sections of CA and by conducting an experiment with people experienced in the assignment of documents to CA sections. The program assigned approximately 78% of the documents to the correct major section groupings of CA and 67% of the correct sections or cross\u2010references at a rate of 100 documents per second.<\/jats:p>","DOI":"10.1002\/asi.4630310603","type":"journal-article","created":{"date-parts":[[2007,6,28]],"date-time":"2007-06-28T08:55:37Z","timestamp":1183020937000},"page":"396-402","source":"Crossref","is-referenced-by-count":18,"title":["The use of titles for automatic document classification"],"prefix":"10.1002","volume":"31","author":[{"given":"Karen A.","family":"Hamill","sequence":"first","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Antonio","family":"Zamora","sequence":"additional","affiliation":[],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","published-online":{"date-parts":[[2007,3,22]]},"reference":[{"key":"e_1_2_1_2_2","doi-asserted-by":"publisher","DOI":"10.1021\/cen-v049n028.p037"},{"key":"e_1_2_1_2_3","doi-asserted-by":"publisher","DOI":"10.1021\/cen-v056n049.p041"},{"key":"e_1_2_1_3_2","doi-asserted-by":"publisher","DOI":"10.1145\/321160.321165"},{"volume-title":"The SMART Retrieval System\u2010Experiments in Automatic Document Processing","year":"1971","author":"Salton G.","key":"e_1_2_1_4_2"},{"key":"e_1_2_1_5_2","doi-asserted-by":"publisher","DOI":"10.1145\/321075.321084"},{"key":"e_1_2_1_6_2","unstructured":"Kar B. G.;White L. J.. \u201cA Distance Measure for Automatic Sequential Document Classification.\u201dTechnical Report No. CISRC 75\u20107. Columbus OH: The Ohio State University;1975."},{"key":"e_1_2_1_7_2","doi-asserted-by":"crossref","unstructured":"White L. J.;Petrarca A. E.;Crawford L. G.;Brinkman B. J.;Mittal S.. \u201cCIRC II Data Base Classification.\u201dFinal Technical Report RADC\u2010TR\u201077\u2010211. New York: Rome Air Development Center;1977.","DOI":"10.21236\/ADA042268"},{"key":"e_1_2_1_8_2","unstructured":"White L. J.;Smith J. D.;Kar G.;Westbrook D. E.;Brinkman B. J.;Fisher R. A.. \u201cA Sequential Method for Automatic Document Classification.\u201dTechnical Report No. CISRC 75\u20105. Columbus OH: The Ohio State University;1975."},{"volume-title":"Sequential Methods in Pattern Recognition and Machine Learning","year":"1968","author":"Fu K. S.","key":"e_1_2_1_9_2"},{"key":"e_1_2_1_10_2","unstructured":"Hamill K. A.;Zamora A.. \u201cAn Automatic Document Classification System using Pattern Recognition Techniques.\u201dProceedings of the ASIS Annual Meeting.15:152\u2013155;1978."},{"key":"e_1_2_1_11_2","doi-asserted-by":"publisher","DOI":"10.1016\/S0019-9958(73)90310-0"},{"key":"e_1_2_1_12_2","unstructured":"Hamill K. A.. \u201cEffect of CODEN on the Performance of an Automatic Document Classification Technique.\u201d M.S. Thesis The Ohio State University 1979."},{"key":"e_1_2_1_13_2","doi-asserted-by":"publisher","DOI":"10.1145\/362686.362692"},{"key":"e_1_2_1_14_2","doi-asserted-by":"publisher","DOI":"10.1145\/362705.362709"},{"key":"e_1_2_1_15_2","doi-asserted-by":"publisher","DOI":"10.1002\/asi.4630270302"},{"volume-title":"Subject Coverage and Arrangement of Abstracts by Sections in CHEMICAL ABSTRACTS, 1975 edition","year":"1975","key":"e_1_2_1_16_2"}],"container-title":["Journal of the American Society for Information Science"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fasi.4630310603","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/asistdl.onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/asi.4630310603","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,8]],"date-time":"2025-10-08T19:12:19Z","timestamp":1759950739000},"score":1,"resource":{"primary":{"URL":"https:\/\/asistdl.onlinelibrary.wiley.com\/doi\/10.1002\/asi.4630310603"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1980,11]]},"references-count":16,"journal-issue":{"issue":"6","published-print":{"date-parts":[[1980,11]]}},"alternative-id":["10.1002\/asi.4630310603"],"URL":"https:\/\/doi.org\/10.1002\/asi.4630310603","archive":["Portico"],"relation":{},"ISSN":["0002-8231","1097-4571"],"issn-type":[{"type":"print","value":"0002-8231"},{"type":"electronic","value":"1097-4571"}],"subject":[],"published":{"date-parts":[[1980,11]]}}}