{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,21]],"date-time":"2025-10-21T00:39:42Z","timestamp":1761007182832,"version":"build-2065373602"},"reference-count":9,"publisher":"Wiley","issue":"1","license":[{"start":{"date-parts":[[2015,4,24]],"date-time":"2015-04-24T00:00:00Z","timestamp":1429833600000},"content-version":"vor","delay-in-days":478,"URL":"http:\/\/onlinelibrary.wiley.com\/termsAndConditions#vor"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc of Assoc for Info"],"published-print":{"date-parts":[[2014,1]]},"abstract":"<jats:title>ABSTRACT<\/jats:title><jats:p>The \u201cbig data\u201d movement promises to deliver better decisions in all aspects of our lives from business to science health, and government by using computational techniques to identify patterns from large historical collections of data. Although a unified view from curation to analysis has been proposed, current research appears to have polarized into two separate groups: those curating large datasets and those developing computational methods to identify patterns in large datasets. The case study presented here demonstrates the enormous impact that parameter tuning can have on the resulting accuracy, precision, and recall of a computational model that is generated from data. It also illustrates the vastness of the parameter space that must be searched in order to produce optimal models and curated in order to avoid redundant experiments. This highlights the need for research that focuses on the gap between collection and analytics if we are to realize the potential of big data.<\/jats:p>","DOI":"10.1002\/meet.2014.14505101138","type":"journal-article","created":{"date-parts":[[2015,4,24]],"date-time":"2015-04-24T17:19:02Z","timestamp":1429895942000},"page":"1-4","source":"Crossref","is-referenced-by-count":0,"title":["Parameter tuning: Exposing the gap between data curation and effective data analytics"],"prefix":"10.1002","volume":"51","author":[{"given":"Catherine","family":"Blake","sequence":"first","affiliation":[{"name":"School of Library and Information Science University of Illinois at Urbana\u2010Champaign"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Henry A.","family":"Gabb","sequence":"additional","affiliation":[{"name":"School of Library and Information Science University of Illinois at Urbana\u2010Champaign"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","published-online":{"date-parts":[[2015,4,24]]},"reference":[{"key":"e_1_2_7_2_1","doi-asserted-by":"publisher","DOI":"10.1002\/asi.22634"},{"key":"e_1_2_7_3_1","unstructured":"Cabena P.(1998).Discovering data mining: from concept to implementation: Prentice Hall."},{"key":"e_1_2_7_4_1","doi-asserted-by":"publisher","DOI":"10.1145\/240455.240464"},{"volume-title":"The Fourth Paradigm: Data\u2010Intensive Scientific Discovery","year":"2009","author":"Hey T.","key":"e_1_2_7_5_1"},{"key":"e_1_2_7_6_1","doi-asserted-by":"publisher","DOI":"10.1023\/B:MACH.0000035473.11134.83"},{"key":"e_1_2_7_7_1","unstructured":"Manyika J. Chui M. Brown B. Bughin J. Dobbs R. Roxburgh C. &Byers A. H.(2011).Big data: The next frontier for innovation competition and productivity. InM. G.Insititute(ed.)."},{"key":"e_1_2_7_8_1","doi-asserted-by":"crossref","unstructured":"Renear A. H. Sacchi S. &Wickett K. M.(2010).Definitions of dataset in the scientific and technical literature. Paper presented at the Paper presented at the ASIST Pittsburgh PA USA.","DOI":"10.1002\/meet.14504701240"},{"key":"e_1_2_7_9_1","doi-asserted-by":"publisher","DOI":"10.1145\/1721654.1721678"},{"key":"e_1_2_7_10_1","unstructured":"Yang Y. &Pedersen J. P.(1997).A Comparative Study on Feature Selection in Text Categorization. Paper presented at the Proceedings of the Fourteenth International Conference on Machine Learning (ICML'97)."}],"container-title":["Proceedings of the American Society for Information Science and Technology"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fmeet.2014.14505101138","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/api.wiley.com\/onlinelibrary\/tdm\/v1\/articles\/10.1002%2Fmeet.2014.14505101138","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/asistdl.onlinelibrary.wiley.com\/doi\/pdf\/10.1002\/meet.2014.14505101138","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,20]],"date-time":"2025-10-20T12:00:05Z","timestamp":1760961605000},"score":1,"resource":{"primary":{"URL":"https:\/\/asistdl.onlinelibrary.wiley.com\/doi\/10.1002\/meet.2014.14505101138"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2014,1]]},"references-count":9,"journal-issue":{"issue":"1","published-print":{"date-parts":[[2014,1]]}},"alternative-id":["10.1002\/meet.2014.14505101138"],"URL":"https:\/\/doi.org\/10.1002\/meet.2014.14505101138","archive":["Portico"],"relation":{},"ISSN":["0044-7870","1550-8390"],"issn-type":[{"type":"print","value":"0044-7870"},{"type":"electronic","value":"1550-8390"}],"subject":[],"published":{"date-parts":[[2014,1]]}}}