{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T15:54:49Z","timestamp":1672242889671},"reference-count":9,"publisher":"World Scientific Pub Co Pte Lt","issue":"04","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Int. J. Artif. Intell. Tools"],"published-print":{"date-parts":[[2004,12]]},"abstract":"<jats:p> Few tools exist that address the challenges facing researchers in the Textual Data Mining (TDM) field. Some are too specific to their application, or are prototypes not suitable for general use. More general tools often are not capable of processing large volumes of data. We have created a Textual Data Mining Infrastructure (TMI) that incorporates both existing and new capabilities in a reusable framework conducive to developing new tools and components. TMI adheres to strict guidelines that allow it to run in a wide range of processing environments \u2013 as a result, it accommodates the volume of computing and diversity of research occurring in TDM. A unique capability of TMI is support for optimization. This facilitates text mining research by automating the search for optimal parameters in text mining algorithms. In this article we describe a number of applications that use the TMI. A brief tutorial is provided on the use of TMI. We present several novel results that have not been published elsewhere. We also discuss how the TMI utilizes existing machine-learning libraries, thereby enabling researchers to continue and extend their endeavors with minimal effort. Towards that end, TMI is available on the web at . <\/jats:p>","DOI":"10.1142\/s0218213004001843","type":"journal-article","created":{"date-parts":[[2004,12,28]],"date-time":"2004-12-28T07:24:48Z","timestamp":1104218688000},"page":"829-849","source":"Crossref","is-referenced-by-count":7,"title":["A SOFTWARE INFRASTRUCTURE FOR RESEARCH IN TEXTUAL DATA MINING"],"prefix":"10.1142","volume":"13","author":[{"given":"LARS E.","family":"HOLZMAN","sequence":"first","affiliation":[{"name":"Computer Science, University of Massachusetts Amherst, MA 01002, USA"}]},{"given":"TODD A.","family":"FISHER","sequence":"additional","affiliation":[{"name":"Computer Science and Engineering, Lehigh University Bethlehem, PA 18015, USA"}]},{"given":"LEON M.","family":"GALITSKY","sequence":"additional","affiliation":[{"name":"Computer Science and Engineering, Lehigh University, Bethlehem, PA 18015, USA"}]},{"given":"APRIL","family":"KONTOSTATHIS","sequence":"additional","affiliation":[{"name":"Computer Science and Engineering, Lehigh University, Bethlehem, PA 18015, USA"}]},{"given":"WILLIAM M.","family":"POTTENGER","sequence":"additional","affiliation":[{"name":"Computer Science and Engineering, Lehigh University, Bethlehem, PA 18015, USA"}]}],"member":"219","published-online":{"date-parts":[[2011,11,21]]},"reference":[{"key":"rf1","volume-title":"A Comprehensive Survey of Text Mining","author":"Kontostathis A.","year":"2003"},{"key":"rf4","volume-title":"Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations","author":"Witten I. H.","year":"2000"},{"key":"rf6","volume-title":"Comp. I. R.","author":"Pottenger W. M.","year":"2001"},{"key":"rf8","volume-title":"Proc. Third Conf. Applied N.L. Processing","author":"Brill E.","year":"1992"},{"key":"rf10","volume-title":"Data Mining for Scientific and Engineering Applications","author":"Pottenger W. M.","year":"2001"},{"key":"rf12","doi-asserted-by":"publisher","DOI":"10.1002\/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9"},{"key":"rf15","volume-title":"Proc. Tenth IEEE Znt. Symp. High Performance Distributed Computing","author":"Kuntraruk J.","year":"2001"},{"key":"rf20","author":"Brill E.","journal-title":"Comp. Ling."},{"key":"rf21","volume-title":"Proc. Third Int. Meeting on Research in Logistics","author":"Pinto V. D.","year":"2000"}],"container-title":["International Journal on Artificial Intelligence Tools"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.worldscientific.com\/doi\/pdf\/10.1142\/S0218213004001843","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2019,8,7]],"date-time":"2019-08-07T12:52:42Z","timestamp":1565182362000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S0218213004001843"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2004,12]]},"references-count":9,"journal-issue":{"issue":"04","published-online":{"date-parts":[[2011,11,21]]},"published-print":{"date-parts":[[2004,12]]}},"alternative-id":["10.1142\/S0218213004001843"],"URL":"https:\/\/doi.org\/10.1142\/s0218213004001843","relation":{},"ISSN":["0218-2130","1793-6349"],"issn-type":[{"value":"0218-2130","type":"print"},{"value":"1793-6349","type":"electronic"}],"subject":[],"published":{"date-parts":[[2004,12]]}}}