{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,10,4]],"date-time":"2025-10-04T14:30:28Z","timestamp":1759588228626,"version":"3.38.0"},"reference-count":44,"publisher":"Walter de Gruyter GmbH","issue":"2","license":[{"start":{"date-parts":[[2011,1,1]],"date-time":"2011-01-01T00:00:00Z","timestamp":1293840000000},"content-version":"tdm","delay-in-days":0,"URL":"http:\/\/www.springer.com\/tdm"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":[],"published-print":{"date-parts":[[2011,1,1]]},"abstract":"<jats:title>Abstract<\/jats:title><jats:p>The discovery of association rules is an important and challenging data mining task. Most of the existing algorithms for finding association rules require multiple passes over the entire database, and I\/O overhead incurred is extremely high for very large databases. An obvious approach to reduce the complexity of association rule mining is sampling. In recent times, several sampling-based approaches have been developed for speeding up the process of association rule mining. A proficient progressive sampling-based approach is presented for mining association rules from large databases. At first, frequent itemsets are mined from an initial sample and subsequently, the negative border is computed from the mined frequent itemsets. Based on the support computed for the midpoint itemset in the sorted negative border, the sample size is either increased or association rules are mined from it. In this paper, we have presented an extensive analysis of the progressive sampling-based approach with different real life datasets and, in addition, the performance of the approach is evaluated with the well-known association rule mining algorithm, Apriori. The experimental results show that accuracy and computation time of the progressive sampling-based approach is effectively improved in mining of association rules from the real life datasets.<\/jats:p>","DOI":"10.2478\/s13537-011-0016-y","type":"journal-article","created":{"date-parts":[[2011,6,27]],"date-time":"2011-06-27T23:43:15Z","timestamp":1309218195000},"source":"Crossref","is-referenced-by-count":4,"title":["Analysis of the progressive sampling-based approach using real life datasets"],"prefix":"10.2478","volume":"1","author":[{"given":"Venkatapathy","family":"Umarani","sequence":"first","affiliation":[]},{"given":"Muthusamy","family":"Punithavalli","sequence":"additional","affiliation":[]}],"member":"374","reference":[{"key":"16_CR1","first-page":"77","volume":"1933","author":"V. Maojo","year":"2000","unstructured":"Maojo V., Sanandres J., A Survey of Data Mining Techniques, LECT NOTES COMPUT SC, 2000, 1933, 77\u201392","journal-title":"LECT NOTES COMPUT SC"},{"key":"16_CR2","doi-asserted-by":"crossref","unstructured":"Peng Y., Kou G., Chen Z., Shi Y., Recent trends in Data Mining (DM): Document Clustering of DM Publications, In: Chen J. (Ed.), Proceedings of International Conference on Service Systems and Service Management (9\u201311 June 2007, Chengdu, China), IEEE, 2007, 1653\u20131659","DOI":"10.1109\/ICSSSM.2006.320794"},{"key":"16_CR3","first-page":"549","volume":"5","author":"E. Ramaraj","year":"2009","unstructured":"Ramaraj E., Venkatesan, N., An efficient pattern mining analysis in health care database, JOURNAL OF THEORETICAL AND APPLIED INFORMATION TECHNOLOGY, 2009, 5, 549\u2013555","journal-title":"JOURNAL OF THEORETICAL AND APPLIED INFORMATION TECHNOLOGY"},{"key":"16_CR4","first-page":"414","volume":"32","author":"C.A. BabuRaj","year":"2006","unstructured":"BabuRaj C.A., Effective and Efficient Mining of Data in Mobile Computing, IAENG INTERNATIONAL JOURNAL OF COMPUTER SCIENCE, 2006, 32, 414\u2013418","journal-title":"IAENG INTERNATIONAL JOURNAL OF COMPUTER SCIENCE"},{"key":"16_CR5","unstructured":"Saeedmanesh M., Izadi T., Ahvar E., HDM: A Hybrid Data Mining Technique for Stock Exchange Prediction, In: Athanasios T. (Ed.), Proceedings of the International Multi-Conference of Engineers and Computer Scientists (17\u201319 March 2010, Hong Kong), Springer and America Institute of Physics, 2010, 587\u2013592"},{"key":"16_CR6","first-page":"82","volume":"15","author":"J.E. Moreno","year":"2007","unstructured":"Moreno J.E., Castillo O., Castro J.R., Mart\u00ednez L.G., Melin P., Data Mining for extraction of fuzzy IF-THEN rules using Mamdani and Takagi-Sugeno-Kang FIS, ENGINEERING LETTERS ON COMPUTER SCIENCE AND ENGINEERING, 2007, 15, 82\u201388","journal-title":"ENGINEERING LETTERS ON COMPUTER SCIENCE AND ENGINEERING"},{"key":"16_CR7","unstructured":"Ganji M.F., Abadeh M.S., Rule Based Classification System for Medical Data Mining Using Fuzzy Ant Colony Optimization, In: Athanasios T. (Ed.), Proceedings of The World Congress on Engineering and Computer Science (20\u201322 October, 2010, San Francisco, USA), Springer, 2010, 466\u2013472"},{"key":"16_CR8","unstructured":"Thotappa C., Ravindranath K., Data mining Aided Proficient Approach for Optimal Inventory Control in Supply Chain Management, In: Athanasios T. (Ed.), Proceedings of the World Congress on Engineering (30 June\u20132 July, 2010, London, U.K), Springer, 2010, 341\u2013345"},{"key":"16_CR9","unstructured":"Kannan K.S., Sekar P.S., Sathik M.M., Arumugam P., Financial Stock Market Forecast using Data Mining Techniques, In: Athanasios T. (Ed.), Proceedings of The International Multi-Conference of Engineers and Computer Scientists (17\u201319 March, 2010, Hong Kong), Springer, 2010, 555\u2013559"},{"key":"16_CR10","doi-asserted-by":"crossref","unstructured":"Vojtas P., Eckhardt A., Considering Data-Mining Techniques in User Preference Learning, In: Liu J. (Ed.), Proceedings of the IEEE\/WIC\/ACM International Conference on Web Intelligence and Intelligent Agent Technology (9\u201312 December, 2008, Sydney, Australia), IEEE, 2008, 33\u201336","DOI":"10.1109\/WIIAT.2008.53"},{"key":"16_CR11","volume-title":"Advances in Knowledge Discovery and Data Mining","author":"J. Han","year":"1996","unstructured":"Han J., Fu Y., Attribute-Oriented Induction in Data Mining, In: Fayyad U.M., Piatetsky-Shapiro G., Smyth P., Uthurusamy R. (Eds.), Advances in Knowledge Discovery and Data Mining, AAAI Press\/The MIT Press, Menlo Park, CA, 1996"},{"key":"16_CR12","volume-title":"Data Mining: Concepts and Techniques","author":"J. Han","year":"2001","unstructured":"Han J., Kamber M., Data Mining: Concepts and Techniques, Morgan Kaufman, San Francisco, CA, 2001"},{"key":"16_CR13","unstructured":"Agrawal R., Srikant R., Fast algorithms for mining association rules, In: Bocca J.B (Ed.), Proceedings of 20th International Conference on Very Large Data Bases (12\u201315 September 1994, Santiago, Chile), Morgan Kaufmann, 1994, 487\u2013499"},{"key":"16_CR14","first-page":"247","volume":"9","author":"M.J.Z. Rahman","year":"2009","unstructured":"Rahman M.J.Z., Balasubramanie P., Weighted Support Association Rule Mining using Closed Itemset Lattices in Parallel, INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2009, 9, 247\u2013253","journal-title":"INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY"},{"key":"16_CR15","doi-asserted-by":"crossref","first-page":"58","DOI":"10.1145\/360402.360421","volume":"2","author":"J. Hipp","year":"2000","unstructured":"Hipp J., G\u00fcntzer U., Nakhaeizadeh G., Algorithms for association rule mining \u2014 a general survey and comparison, ACM SIGKDD EXPLORATIONS NEWSLETTER, 2000, 2, 58\u201364","journal-title":"ACM SIGKDD EXPLORATIONS NEWSLETTER"},{"key":"16_CR16","first-page":"156","volume":"18","author":"S.C. Abou","year":"2010","unstructured":"Abou S.C., Dao T.M., Association Rules Mining Approach to Mineral Processing Control, ENGINEERING LETTERS ON COMPUTER SCIENCE AND ENGINEERING, 2010, 18, 156\u2013164","journal-title":"ENGINEERING LETTERS ON COMPUTER SCIENCE AND ENGINEERING"},{"key":"16_CR17","unstructured":"Motwani M., Rana J.L., Jain R.C., Use of Domain Knowledge for Fast Mining of Association Rules, In: Athanasios T. (Ed.), Proceedings of the International Multi-Conference of Engineers and Computer Scientists (18\u201320 March, 2009, Hong Kong), Springer, 2009, 738\u2013743"},{"key":"16_CR18","unstructured":"Sheibani R., Ebrahimzadeh A., An Algorithm for Mining Fuzzy Association Rules, In: Athanasios T. (Ed.), Proceedings of the International Multi Conference of Engineers and Computer Scientists (19\u201321 March, 2008, Hong Kong), Springer, 2008, 486\u2013490"},{"key":"16_CR19","unstructured":"Abou S.C., Dao T.M., Fuzzy Logic Controller Based on Association Rules Mining: Application to Mineral Processing, In: Athanasios T. (Ed.), Proceedings of the World Congress on Engineering and Computer Science (20\u201322 October, 2009, San Francisco, USA), Springer, 2009, 878\u2013883"},{"key":"16_CR20","doi-asserted-by":"crossref","unstructured":"Agrawal R., Imielinski T., Swami A.N., Mining association rules between sets of items in large databases, In: Buneman P. (Ed.), Proceedings of the ACM SIGMOD International Conference on Management of data (25\u201328 May, Washington, D.C., United States), ACM Press, 1993, 207\u2013216","DOI":"10.1145\/170036.170072"},{"key":"16_CR21","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1023\/B:DAMI.0000005257.93780.3b","volume":"8","author":"F. Coenen","year":"2004","unstructured":"Coenen F., Goulbourne G., Leng P., Tree Structures for Mining Association Rules, DATA MIN KNOWL DISC, 2004, 8, 25\u201351","journal-title":"DATA MIN KNOWL DISC"},{"key":"16_CR22","doi-asserted-by":"crossref","unstructured":"Han J., Efficient mining of partial periodic patterns in time series database, 15th International Conference on Data Engineering (23\u201326 March, 1999, Sydney, Australia), IEEE computer society, 1999, 106\u2013115","DOI":"10.1109\/ICDE.1999.754913"},{"key":"16_CR23","unstructured":"Thomas S., Bodagala S., Alsabti K., Ranka S., An efficient algorithm for the incremental updation of association rules in large databases, In: Heckerman D. (Ed.), Proceedings of the Knowledge Discovery and Data Mining (14\u201317 August, 1997, Newport Beach, California), AAAI Press, 1997, 263\u2013266"},{"key":"16_CR24","doi-asserted-by":"crossref","first-page":"54","DOI":"10.1007\/3-540-44794-6_5","volume":"2168","author":"F. Coenen","year":"2001","unstructured":"Coenen F., Goulbourne G., Leng P., Computing association rules using partial totals, LECT NOTES COMPUT SC, 2001, 2168, 54\u201366","journal-title":"LECT NOTES COMPUT SC"},{"key":"16_CR25","volume-title":"Sampling Techniques","author":"W.G. Cochran","year":"1977","unstructured":"Cochran W.G., Sampling Techniques, 3rd edition, John Wiley and Sons, New York, 1977","edition":"3rd edition"},{"key":"16_CR26","doi-asserted-by":"crossref","unstructured":"Parthasarathy S., Efficient Progressive Sampling for Association Rules, In: Ohsuga S. (Ed.), Proceedings of the IEEE International Conference on Data Mining (9\u201312 December 2002, Maebashi City, Japan), IEEE Computer Society, 2002, 354\u2013361","DOI":"10.1109\/ICDM.2002.1183923"},{"key":"16_CR27","doi-asserted-by":"crossref","unstructured":"Chen B., Haas P., Scheuermann P., New Two-Phase Sampling Based Algorithm for Discovering Association Rules, In: Zaki M.J. (Ed.), Proceedings of the eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (23\u201326 July, 2002, Alberta, Canada), ACM, 2002, 462\u2013468","DOI":"10.1145\/775047.775114"},{"key":"16_CR28","doi-asserted-by":"crossref","unstructured":"Zaki M.J., Parthasarathy S., Li W., Ogihara, M., Evaluation of Sampling for Data Mining of Association Rules, Proceedings of the 7th International workshop on Research Issues in Data Engineering (7\u20138 April 1997, Birmingham, UK), IEEE Computer Society, 1997, 42\u201350","DOI":"10.1109\/RIDE.1997.583696"},{"key":"16_CR29","doi-asserted-by":"crossref","first-page":"505","DOI":"10.1007\/11430919_59","volume":"3518","author":"K.T. Chuang","year":"2005","unstructured":"Chuang K.T., Chen M.S., Yang W.C., Progressive Sampling for Association Rules Based on Sampling Error Estimation, LECT NOTES COMPUT SC, 2005, 3518, 505\u2013515","journal-title":"LECT NOTES COMPUT SC"},{"key":"16_CR30","unstructured":"Estrada A., Morales E.F., NSC: A New Progressive Sampling Algorithm, Proceedings of the Workshop: Machine Learning for Scientific Data Analysis (Iberamia) (22\u201326 November, 2004, Iberamia), 2004, 335\u2013344"},{"key":"16_CR31","doi-asserted-by":"crossref","unstructured":"Umarani V., Punithavalli M., Developing Novel and Effective Approach for Association Rule Mining Using Progressive Sampling, Proceedings of Second International Conference on Computer and Electrical Engineering (28\u201330 December, 2009, Dubai, UAE), IEEE Computer Society, 2009, 610\u2013614","DOI":"10.1109\/ICCEE.2009.211"},{"key":"16_CR32","doi-asserted-by":"crossref","unstructured":"Umarani V., Punithavalli M., On Developing an Effectual Progressive Sampling-Based Approach for Association Rule Discovery, In: Wen D. (Ed.), The 2nd IEEE International Conference on Information Management and Engineering (16\u201318 April 2010, Chengdu, China), IEEE Press, 2010, 8\u201312","DOI":"10.1109\/ICIME.2010.5477643"},{"key":"16_CR33","doi-asserted-by":"crossref","first-page":"358","DOI":"10.1177\/0165551508100382","volume":"35","author":"B.A. Mahafzah","year":"2009","unstructured":"Mahafzah B.A., Al-Badarneh A.F., Zakaria M.Z., A new sampling technique for association rule mining, J INF SCI, 2009, 35, 358\u2013376","journal-title":"J INF SCI"},{"key":"16_CR34","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1007\/s11390-005-0309-5","volume":"20","author":"C.Y. Jia","year":"2005","unstructured":"Jia C.Y., Gao X.P., Multi-scaling sampling: an adaptive sampling method for discovering approximate association rules, J COMPUT SCI TECHNOL, 2005, 20, 309\u2013318","journal-title":"J COMPUT SCI TECHNOL"},{"key":"16_CR35","doi-asserted-by":"crossref","first-page":"391","DOI":"10.1007\/978-3-540-30549-1_35","volume":"3339","author":"Y. Li","year":"2005","unstructured":"Li Y., Gopalan R.P., Effective Sampling for Mining Association Rules, LECT NOTES COMPUT SC, 2005, 3339, 391\u2013401","journal-title":"LECT NOTES COMPUT SC"},{"key":"16_CR36","unstructured":"Lin T.Y., Sampling in Association Rule Mining, In: Dasarathy B. (Ed.), Data Mining and Knowledge Discovery: Theory, Tools, and Technology VI, Proceedings of SPIE (Orlando, Fl, USA), SPIE, 2004, 161\u2013167"},{"key":"16_CR37","doi-asserted-by":"crossref","unstructured":"Chakaravarthy V.T., Pandit V., Sabharwal Y., Analysis of sampling techniques for association rule mining, In: Fagin R. (Ed.), Proceedings of the 12th International Conference on Database Theory (23\u201325 March 2009, St. Petersburg, Russia), ACM Press, 2009, 276\u2013283","DOI":"10.1145\/1514894.1514927"},{"key":"16_CR38","unstructured":"Zhao Y., Zhang C., Zhang S., Efficient frequent itemsets mining by sampling, In: Li Y. (Ed.), Proceedings of the fourth International Conference on Active Media Technology (7\u20139 June, 2006, Amsterdam, The Netherlands), IOS Press, 2006, 112\u2013117"},{"key":"16_CR39","doi-asserted-by":"crossref","unstructured":"Provost F., Jensen D., Oates T., Efficient progressive sampling, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining (15\u201318 August, 1999, San Diego, California, United States), ACM, 1999, 23\u201332","DOI":"10.1145\/312129.312188"},{"key":"16_CR40","volume-title":"Studies in Computational Intelligence","author":"J.P. Huang","year":"2009","unstructured":"Huang J.P., Kuo H.C., A Filtering Approach for Mining Frequent Itemsets, In: Kocarev L. (Ed.), Studies in Computational Intelligence, Springer, Berlin\/Heidelberg, 2009"},{"key":"16_CR41","doi-asserted-by":"crossref","first-page":"35","DOI":"10.1007\/s10115-007-0111-5","volume":"17","author":"G. Liu","year":"2008","unstructured":"Liu G., Li J., Wong L., A New Concise representation of Frequent Itemsets Using Generators and A Positive Border, KNOWL INF SYST, 2008, 17, 35\u201356","journal-title":"KNOWL INF SYST"},{"key":"16_CR42","doi-asserted-by":"crossref","first-page":"472","DOI":"10.1109\/TKDE.2006.1599386","volume":"18","author":"M. Song","year":"2006","unstructured":"Song M., Rajasekaran S., A Transaction Mapping Algorithm for Frequent Itemsets Mining, IEEE T KNOWL DATA EN, 2006, 18, 472\u2013481","journal-title":"IEEE T KNOWL DATA EN"},{"key":"16_CR43","doi-asserted-by":"crossref","unstructured":"Yun U., Leggett J.J., WFIM: Weighted Frequent Itemset mining with a weight range and a minimum weight, In: Kamath C. (Ed.), Proceedings of the 5th SIAM Int. Conf. Data Mining (21\u201323 April, 2005, Newport Beach, USA), 2005, 636\u2013640","DOI":"10.1137\/1.9781611972757.76"},{"key":"16_CR44","unstructured":"Lotfi S., Sadreddini M.H., Mining Fuzzy Association Rules Using Mutual Information, In: Athanasios T. (Ed.), Proceedings of the International Multi-Conference of Engineers and Computer Scientists (18\u201320 March, 2009, Hong Kong), Springer, 2009, 684\u2013689"}],"container-title":["Open Computer Science"],"original-title":[],"link":[{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.2478\/s13537-011-0016-y.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/article\/10.2478\/s13537-011-0016-y\/fulltext.html","content-type":"text\/html","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/link.springer.com\/content\/pdf\/10.2478\/s13537-011-0016-y","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,3,6]],"date-time":"2025-03-06T19:21:57Z","timestamp":1741288917000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.degruyter.com\/document\/doi\/10.2478\/s13537-011-0016-y\/html"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2011,1,1]]},"references-count":44,"journal-issue":{"issue":"2"},"URL":"https:\/\/doi.org\/10.2478\/s13537-011-0016-y","relation":{},"ISSN":["2299-1093"],"issn-type":[{"type":"electronic","value":"2299-1093"}],"subject":[],"published":{"date-parts":[[2011,1,1]]}}}