{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,30]],"date-time":"2026-03-30T03:02:12Z","timestamp":1774839732420,"version":"3.50.1"},"reference-count":23,"publisher":"MDPI AG","issue":"14","license":[{"start":{"date-parts":[[2022,7,6]],"date-time":"2022-07-06T00:00:00Z","timestamp":1657065600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"Ministry of Research and Technology\/National Research and Innovation Agency","award":["281\/UN40.LP\/PT.01.03\/2021"],"award-info":[{"award-number":["281\/UN40.LP\/PT.01.03\/2021"]}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Sensors"],"abstract":"<jats:p>The phenomenon of big data has occurred in many fields of knowledge, one of which is astronomy. One example of a large dataset in astronomy is that of numerically integrated time series asteroid orbital elements from a time span of millions to billions of years. For example, the mean motion resonance (MMR) data of an asteroid are used to find out the duration that the asteroid was in a resonance state with a particular planet. For this reason, this research designs a computational model to obtain the mean motion resonance quickly and effectively by modifying and implementing the Symbolic Aggregate Approximation (SAX) algorithm and the motif discovery random projection algorithm on big data platforms (i.e., Apache Hadoop and Apache Spark). There are five following steps on the model: (i) saving data into the Hadoop Distributed File System (HDFS); (ii) importing files to the Resilient Distributed Datasets (RDD); (iii) preprocessing the data; (iv) calculating the motif discovery by executing the User-Defined Function (UDF) program; and (v) gathering the results from the UDF to the HDFS and the .csv file. The results indicated a very significant reduction in computational time between the use of the standalone method and the use of the big data platform. The proposed computational model obtained an average accuracy of 83%, compared with the SwiftVis software.<\/jats:p>","DOI":"10.3390\/s22145071","type":"journal-article","created":{"date-parts":[[2022,7,6]],"date-time":"2022-07-06T21:15:52Z","timestamp":1657142152000},"page":"5071","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":2,"title":["SAX and Random Projection Algorithms for the Motif Discovery of Orbital Asteroid Resonance Using Big Data Platforms"],"prefix":"10.3390","volume":"22","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-5324-8208","authenticated-orcid":false,"given":"Lala Septem","family":"Riza","sequence":"first","affiliation":[{"name":"Department of Computer Science Education, Universitas Pendidikan Indonesia, Bandung 40154, Indonesia"}]},{"given":"Muhammad Naufal","family":"Fazanadi","sequence":"additional","affiliation":[{"name":"Department of Computer Science Education, Universitas Pendidikan Indonesia, Bandung 40154, Indonesia"}]},{"given":"Judhistira Aria","family":"Utama","sequence":"additional","affiliation":[{"name":"Department of Physics Education, Universitas Pendidikan Indonesia, Bandung 40154, Indonesia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-0632-6330","authenticated-orcid":false,"given":"Khyrina Airin Fariza Abu","family":"Samah","sequence":"additional","affiliation":[{"name":"Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Cawangan Melaka Kampus Jasin, Melaka City 77300, Malaysia"}]},{"given":"Taufiq","family":"Hidayat","sequence":"additional","affiliation":[{"name":"Astronomy Research Division, Faculty of Mathematics and Natural Science, Institut Teknologi Bandung, Bandung 40132, Indonesia"}]},{"ORCID":"https:\/\/orcid.org\/0000-0003-0126-9944","authenticated-orcid":false,"given":"Shah","family":"Nazir","sequence":"additional","affiliation":[{"name":"Department of Computer Science, University of Swabi, Swabi 94640, Pakistan"}]}],"member":"1968","published-online":{"date-parts":[[2022,7,6]]},"reference":[{"key":"ref_1","doi-asserted-by":"crossref","first-page":"1","DOI":"10.4102\/jtscm.v9i1.165","article-title":"The impact of big data and business analytics on supply chain management","volume":"9","author":"Ittmann","year":"2015","journal-title":"J. Transp. Supply Chain Manag."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"137","DOI":"10.1016\/j.ijinfomgt.2014.10.007","article-title":"Beyond the hype: Big data concepts. methods. and analytics","volume":"35","author":"Gandomi","year":"2015","journal-title":"Int. J. Inf. Manag."},{"key":"ref_3","first-page":"94","article-title":"Genomic Repeat Detection Using the Knuth-Morris-Pratt Algorithm on R High-Performance-Computing Package","volume":"11","author":"Riza","year":"2019","journal-title":"Int. J. Adv. Soft Comput. Appl."},{"key":"ref_4","first-page":"441","article-title":"Parallel Exponential Smoothing Using the Bootstrap Method in R for Forecasting Asteroid\u2019s Orbital Elements","volume":"26","author":"Riza","year":"2018","journal-title":"Pertanika J. Sci. Technol."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"23","DOI":"10.1016\/j.compag.2017.09.037","article-title":"A review on the practice of big data analysis in agriculture","volume":"143","author":"Kamilaris","year":"2017","journal-title":"Comput. Electron. Agric."},{"key":"ref_6","doi-asserted-by":"crossref","unstructured":"Menichella, M., Paolicchi, P., and Farinella, P. (1996). The main belt as a source of near-Earth asteroids. Worlds in Interaction: Small Bodies and Planets of the Solar System, Springer.","DOI":"10.1007\/978-94-009-0209-1_19"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"29","DOI":"10.1016\/j.icarus.2006.04.001","article-title":"Atlas of the mean motion resonances in the Solar System","volume":"184","author":"Gallardo","year":"2006","journal-title":"Icarus"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Lin, J., Keogh, E., Lonardi, S., and Chiu, B. (2003, January 13). A Symbolic Representation of Time Series. with Implications for Streaming Algorithms. Proceedings of the 8th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, San Diego, CA, USA.","DOI":"10.1145\/882082.882086"},{"key":"ref_9","unstructured":"Jones, N.C., Pevzner, P.A., and Pevzner, P. (2004). An Introduction to Bioinformatics Algorithms, MIT Press."},{"key":"ref_10","unstructured":"White, T. (2012). Hadoop: The Definitive Guide, O\u2019Reilly Media. Inc."},{"key":"ref_11","doi-asserted-by":"crossref","first-page":"56","DOI":"10.1145\/2934664","article-title":"Apache spark: A unified engine for big data processing","volume":"59","author":"Zaharia","year":"2016","journal-title":"Commun. ACM"},{"key":"ref_12","first-page":"959","article-title":"The implementation of sax and random projection for motif discovery on the orbital elements and the resonance argument of asteroid","volume":"12","author":"Riza","year":"2021","journal-title":"Int. J. Nonlinear Anal. Appl."},{"key":"ref_13","doi-asserted-by":"crossref","unstructured":"Jiang, F., Leung, C.K., Sarumi, O.A., and Zhang, C.Y. (2016, January 15\u201318). Mining sequential patterns from uncertain big DNA in the spark framework. Proceedings of the 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Shenzhen, China.","DOI":"10.1109\/BIBM.2016.7822641"},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"783","DOI":"10.12928\/telkomnika.v18i2.14883","article-title":"Genomic repeats detection using Boyer-Moore algorithm on Apache Spark Streaming","volume":"18","author":"Riza","year":"2020","journal-title":"Telkomnika"},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"160","DOI":"10.1016\/j.ins.2020.06.014","article-title":"Big data time series forecasting based on pattern sequence similarity and its application to the electricity demand","volume":"540","author":"Troncoso","year":"2020","journal-title":"Inf. Sci."},{"key":"ref_16","doi-asserted-by":"crossref","unstructured":"Krishnan, S.P.T., and Gonzalez, J.L.U. (2015). Building Your Next Big Thing with Google Cloud Platform: A Guide for Developers and Enterprise Architects, Apress.","DOI":"10.1007\/978-1-4842-1004-8"},{"key":"ref_17","unstructured":"Zaharia, M., Chowdhury, M., Das, T., Dave, A., Ma, J., McCauly, M., and Stoica, I. (2012, January 25\u201327). Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. Proceedings of the 9th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 12), San Jose, CA, USA."},{"key":"ref_18","doi-asserted-by":"crossref","first-page":"2270","DOI":"10.1016\/j.patcog.2005.01.012","article-title":"Score normalization in multimodal biometric systems","volume":"38","author":"Jain","year":"2005","journal-title":"Pattern Recognit."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Keogh, E., Chakrabarti, K., Pazzani, M., and Mehrotra, S. (2001, January 21\u201324). Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases. Proceedings of the 2001 ACM SIGMOD International Conference on Management of Data, Santa Barbara, CA, USA.","DOI":"10.1145\/375663.375680"},{"key":"ref_20","doi-asserted-by":"crossref","first-page":"107","DOI":"10.1007\/s10618-007-0064-z","article-title":"Experiencing SAX: A novel symbolic representation of time series","volume":"15","author":"Lin","year":"2007","journal-title":"Data Min. Knowl. Discov."},{"key":"ref_21","first-page":"269","article-title":"Combinatorial approaches to finding subtle signals in DNA sequences","volume":"8","author":"Pevzner","year":"2000","journal-title":"ISMB"},{"key":"ref_22","doi-asserted-by":"crossref","first-page":"4629","DOI":"10.1093\/nar\/11.13.4629","article-title":"An efficient method for finding repeats in molecular sequences","volume":"11","author":"Martinez","year":"1983","journal-title":"Nucleic Acids Res."},{"key":"ref_23","doi-asserted-by":"crossref","unstructured":"Ashraf, F.B., Abir, A.I., Salekin, M.S., and Mottalib, M.A. (2017, January 16\u201318). RPPMD (Randomly Projected Possible Motif Discovery): An Efficient Bucketing Method for Finding DNA Planted Motif. Proceedings of the 2017 International Conference on Electrical. Computer and Communication Engineering (ECCE), Cox\u2019s Bazar, Bangladesh.","DOI":"10.1109\/ECACE.2017.7912958"}],"container-title":["Sensors"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/14\/5071\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,10]],"date-time":"2025-10-10T23:43:22Z","timestamp":1760139802000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/1424-8220\/22\/14\/5071"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,7,6]]},"references-count":23,"journal-issue":{"issue":"14","published-online":{"date-parts":[[2022,7]]}},"alternative-id":["s22145071"],"URL":"https:\/\/doi.org\/10.3390\/s22145071","relation":{},"ISSN":["1424-8220"],"issn-type":[{"value":"1424-8220","type":"electronic"}],"subject":[],"published":{"date-parts":[[2022,7,6]]}}}