{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,11,13]],"date-time":"2025-11-13T12:13:03Z","timestamp":1763035983170,"version":"build-2065373602"},"reference-count":11,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2005,1,31]],"date-time":"2005-01-31T00:00:00Z","timestamp":1107129600000},"content-version":"vor","delay-in-days":488,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc of Assoc for Info"],"published-print":{"date-parts":[[2003,10]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>We analyzed textual properties of documents to identify predictive variables for various document qualities by means of statistical and linguistic methods. We have created a collection of 1000 documents, each document has been judged in terms of nine document qualities (accuracy, reliability, objectivity, depth, author\/producer credibility, readability, verbosity and conciseness, grammatical correctness, one\u2010sided or multiview.) Employing statistical analyses, we considered a kind of linear combination, asking (1) if it was possible to combine textual features linearly to predict document qualities; (2) what textual features had good predictive power; (3) what textual features were minimally required for prediction with a detection rate much better than the false alarm rate. 
We present several promising results, indicating that with a small number of textual features, we can predict various document qualities much better than chance.<\/jats:p>","DOI":"10.1002\/meet.1450400128","type":"journal-article","created":{"date-parts":[[2005,1,31]],"date-time":"2005-01-31T10:54:10Z","timestamp":1107168850000},"page":"221-229","source":"Crossref","is-referenced-by-count":3,"title":["Identification of effective predictive variables for document qualities"],"prefix":"10.1002","volume":"40","author":[{"given":"Kwong Bor","family":"Ng","sequence":"first","affiliation":[]},{"given":"Rong","family":"Tang","sequence":"additional","affiliation":[]},{"given":"Sharon","family":"Small","sequence":"additional","affiliation":[]},{"given":"Tomek","family":"Strzalkowski","sequence":"additional","affiliation":[]},{"given":"Paul","family":"Kantor","sequence":"additional","affiliation":[]},{"given":"Robert","family":"Rittman","sequence":"additional","affiliation":[]},{"given":"Peng","family":"Song","sequence":"additional","affiliation":[]},{"given":"Ying","family":"Sun","sequence":"additional","affiliation":[]},{"given":"Nina","family":"Wacholder","sequence":"additional","affiliation":[]}],"member":"311","published-online":{"date-parts":[[2005,1,31]]},"reference":[{"key":"e_1_2_9_2_1","unstructured":"Cunningham H. Maynard D. Bontcheva K. Tablan V. &Wilks Y.(2000). Experience of using GATE for NLP R&D. In Workshop on Using Toolsets and Architectures To Build NLP Systems at COLING\u20102000 Luxembourg."},{"key":"e_1_2_9_3_1","doi-asserted-by":"crossref","unstructured":"Day D. Aberdeen J. Hirschman L. Kozierok R. Robinson P.&Vilain M.(1997) Mixed\u2010initiative development of language processing systems. In Fifth Conference on Applied Natural Language Processing Association for Computational Linguistics. 
Retrieved December 12 2002 from http:\/\/www.mitre.org\/technology\/alembic\u2010workbench\/ANLP97\u2010bigger.html","DOI":"10.3115\/974557.974608"},{"volume-title":"Signal detection theory and ROC analysis","year":"1975","author":"Egan J. P.","key":"e_1_2_9_4_1"},{"key":"e_1_2_9_5_1","doi-asserted-by":"publisher","DOI":"10.7551\/mitpress\/7287.001.0001"},{"volume-title":"Applied Discriminant Analysis","year":"1994","author":"Huberty C. J.","key":"e_1_2_9_6_1"},{"key":"e_1_2_9_7_1","doi-asserted-by":"publisher","DOI":"10.4135\/9781412983938"},{"key":"e_1_2_9_8_1","series-title":"Sage University Paper series on quantitative applications in the social sciences, series no. 07\u2013106","volume-title":"Applied logistic regression analysis","author":"Menard Scott","year":"1995"},{"key":"e_1_2_9_9_1","doi-asserted-by":"publisher","DOI":"10.1007\/3-540-45921-9"},{"key":"e_1_2_9_10_1","doi-asserted-by":"publisher","DOI":"10.1017\/CBO9780511524882"},{"key":"e_1_2_9_11_1","unstructured":"Rong T. Ng K. B. Strzalkowski T.&Kantor P.(2003) Toward Machine Understanding of Information Quality. In ASIS 2003 Annual Meeting Proceedings (i.e. 
same volume)."},{"key":"e_1_2_9_12_1","series-title":"NIST Special Publication 500\u2013250: The Tenth Text Retrieval Conference","first-page":"1","volume-title":"Overview of TREC 2001","author":"Voorhees E.","year":"2001"}],"container-title":["Proceedings of the American Society for Information Science and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fmeet.1450400128","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/asistdl.onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/meet.1450400128","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,20]],"date-time":"2025-10-20T12:04:36Z","timestamp":1760961876000},"score":1,"resource":{"primary":{"URL":"https:\/\/asistdl.onlinelibrary.wiley.com\/doi\/10.1002\/meet.1450400128"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2003,10]]},"references-count":11,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2003,10]]}},"alternative-id":["10.1002\/meet.1450400128"],"URL":"https:\/\/doi.org\/10.1002\/meet.1450400128","archive":["Portico"],"relation":{},"ISSN":["0044-7870","1550-8390"],"issn-type":[{"type":"print","value":"0044-7870"},{"type":"electronic","value":"1550-8390"}],"subject":[],"published":{"date-parts":[[2003,10]]}}}