{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,30]],"date-time":"2026-04-30T03:48:55Z","timestamp":1777520935944,"version":"3.51.4"},"reference-count":34,"publisher":"MDPI AG","issue":"2","license":[{"start":{"date-parts":[[2018,2,11]],"date-time":"2018-02-11T00:00:00Z","timestamp":1518307200000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"name":"NASA","award":["NNX15AM85G"],"award-info":[{"award-number":["NNX15AM85G"]}]},{"DOI":"10.13039\/501100000930","name":"NSF","doi-asserted-by":"publisher","award":["IIP-1338925"],"award-info":[{"award-number":["IIP-1338925"]}],"id":[{"id":"10.13039\/501100000930","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100000930","name":"NSF","doi-asserted-by":"publisher","award":["ICER-1540998"],"award-info":[{"award-number":["ICER-1540998"]}],"id":[{"id":"10.13039\/501100000930","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["IJGI"],"abstract":"<jats:p>Discovering and accessing geospatial data presents a significant challenge for the Earth sciences community as massive amounts of data are being produced on a daily basis. In this article, we report a smart web-based geospatial data discovery system that mines and utilizes data relevancy from metadata user behavior. Specifically, (1) the system enables semantic query expansion and suggestion to assist users in finding more relevant data; (2) machine-learned ranking is utilized to provide the optimal search ranking based on a number of identified ranking features that can reflect users\u2019 search preferences; (3) a hybrid recommendation module is designed to allow users to discover related data considering metadata attributes and user behavior; (4) an integrated graphic user interface design is developed to quickly and intuitively guide data consumers to the appropriate data resources. As a proof of concept, we focus on a well-defined domain-oceanography and use oceanographic data discovery as an example. Experiments and a search example show that the proposed system can improve the scientific community\u2019s data search experience by providing query expansion, suggestion, better search ranking, and data recommendation via a user-friendly interface.<\/jats:p>","DOI":"10.3390\/ijgi7020062","type":"journal-article","created":{"date-parts":[[2018,2,12]],"date-time":"2018-02-12T10:50:38Z","timestamp":1518432638000},"page":"62","update-policy":"https:\/\/doi.org\/10.3390\/mdpi_crossmark_policy","source":"Crossref","is-referenced-by-count":13,"title":["A Smart Web-Based Geospatial Data Discovery System with Oceanographic Data as an Example"],"prefix":"10.3390","volume":"7","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-4591-483X","authenticated-orcid":false,"given":"Yongyao","family":"Jiang","sequence":"first","affiliation":[{"name":"NSF Spatiotemporal Innovation Center and Department of Geography and GeoInformation Science, George Mason University, Fairfax, VA 22030, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0002-3205-8464","authenticated-orcid":false,"given":"Yun","family":"Li","sequence":"additional","affiliation":[{"name":"NSF Spatiotemporal Innovation Center and Department of Geography and GeoInformation Science, George Mason University, Fairfax, VA 22030, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-7768-4066","authenticated-orcid":false,"given":"Chaowei","family":"Yang","sequence":"additional","affiliation":[{"name":"NSF Spatiotemporal Innovation Center and Department of Geography and GeoInformation Science, George Mason University, Fairfax, VA 22030, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"ORCID":"https:\/\/orcid.org\/0000-0001-5231-2303","authenticated-orcid":false,"given":"Fei","family":"Hu","sequence":"additional","affiliation":[{"name":"NSF Spatiotemporal Innovation Center and Department of Geography and GeoInformation Science, George Mason University, Fairfax, VA 22030, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Edward","family":"Armstrong","sequence":"additional","affiliation":[{"name":"NASA Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA 91109, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thomas","family":"Huang","sequence":"additional","affiliation":[{"name":"NASA Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA 91109, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"David","family":"Moroni","sequence":"additional","affiliation":[{"name":"NASA Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA 91109, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Lewis","family":"McGibbney","sequence":"additional","affiliation":[{"name":"NASA Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA 91109, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Frank","family":"Greguska","sequence":"additional","affiliation":[{"name":"NASA Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA 91109, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Christopher","family":"Finch","sequence":"additional","affiliation":[{"name":"NASA Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA 91109, USA"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"1968","published-online":{"date-parts":[[2018,2,11]]},"reference":[{"key":"ref_1","unstructured":"Hartmann, D.L. (2015). Global Physical Climatology, Elsevier."},{"key":"ref_2","doi-asserted-by":"crossref","first-page":"1019","DOI":"10.1175\/2008JPO4066.1","article-title":"The effect of wind\u2013wave\u2013current interaction on air\u2013sea momentum fluxes and ocean response in tropical cyclones","volume":"39","author":"Fan","year":"2009","journal-title":"J. Phys. Oceanogr."},{"key":"ref_3","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1007\/s12145-010-0050-7","article-title":"Mercury: Reusable metadata management, data discovery and access system","volume":"3","author":"Devarakonda","year":"2010","journal-title":"Earth Sci. Inf."},{"key":"ref_4","unstructured":"NASA (2017, April 07). NASA Strategic Plan, Available online: https:\/\/www.nasa.gov\/pdf\/516579main_NASA2011StrategicPlan.pdf."},{"key":"ref_5","doi-asserted-by":"crossref","first-page":"120","DOI":"10.1016\/j.compenvurbsys.2016.10.010","article-title":"Utilizing Cloud Computing to address big geospatial data challenges","volume":"61","author":"Yang","year":"2017","journal-title":"Comput. Environ. Urban Syst."},{"key":"ref_6","doi-asserted-by":"crossref","first-page":"700","DOI":"10.1126\/science.1197869","article-title":"Climate data challenges in the 21st century","volume":"331","author":"Overpeck","year":"2011","journal-title":"Science"},{"key":"ref_7","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1080\/17538947.2012.674561","article-title":"Towards geospatial semantic search: Exploiting latent semantic relations in geospatial data","volume":"7","author":"Li","year":"2014","journal-title":"Int. J. Digit. Earth"},{"key":"ref_8","doi-asserted-by":"crossref","unstructured":"Vance, T.C., Merati, N., Yang, C., and Yuan, M. (2016). Polar CI Portal: A Cloud based polar resource discovery engine. Cloud Computing in Ocean and Atmospheric Sciences, Academic Press.","DOI":"10.1109\/OCEANS.2016.7761270"},{"key":"ref_9","doi-asserted-by":"crossref","unstructured":"Jiang, Y., Li, Y., Yang, C., Hu, F., Armstrong, E.M., Huang, T., Moroni, D., McGibbney, L.J., and Finch, C.J. (2017). Towards intelligent geospatial data discovery: A machine learning framework for search ranking. Int. J. Digit. Earth, 1\u201316.","DOI":"10.1080\/17538947.2017.1371255"},{"key":"ref_10","doi-asserted-by":"crossref","first-page":"493","DOI":"10.1287\/mksc.1110.0700","article-title":"Designing ranking systems for hotels on travel search engines by mining user-generated and crowdsourced content","volume":"31","author":"Ghose","year":"2012","journal-title":"Mark. Sci."},{"key":"ref_11","unstructured":"NRC (2012). New Research Opportunities in The earth Sciences, National Academies Press."},{"key":"ref_12","doi-asserted-by":"crossref","unstructured":"AlJadda, K., Korayem, M., Grainger, T., and Russell, C. (2014, January 27\u201330). Crowdsourced query augmentation through semantic discovery of domain-specific jargon. Proceedings of the 2014 IEEE International Conference on Big Data (Big Data), Washington, DC, USA.","DOI":"10.1109\/BigData.2014.7004310"},{"key":"ref_13","unstructured":"(2017, May 10). Semantic Web for Earth and Environmental Terminology (SWEET). Available online: https:\/\/www.researchgate.net\/publication\/250346856_Semantic_Web_for_Earth_and_Environmental_Terminology_SWEET."},{"key":"ref_14","doi-asserted-by":"crossref","first-page":"25","DOI":"10.1007\/s12145-013-0129-z","article-title":"Building a semantic based public transportation geoportal compliant with the INSPIRE transport network data theme","volume":"7","author":"Gunay","year":"2014","journal-title":"Earth Sci. Inf."},{"key":"ref_15","doi-asserted-by":"crossref","first-page":"188","DOI":"10.1002\/aris.1440380105","article-title":"Latent semantic analysis","volume":"38","author":"Dumais","year":"2004","journal-title":"Annu. Rev. Inf. Sci. Technol."},{"key":"ref_16","first-page":"601","article-title":"Latent dirichlet allocation","volume":"1","author":"Blei","year":"2002","journal-title":"Adv. Neural Inf. Process. Syst."},{"key":"ref_17","doi-asserted-by":"crossref","first-page":"398","DOI":"10.1111\/tgis.12151","article-title":"Metadata topic harmonization and semantic search for linked-data driven geoportals: A case study using ArcGIS Online","volume":"19","author":"Hu","year":"2015","journal-title":"Trans. GIS"},{"key":"ref_18","unstructured":"Gormley, C., and Tong, Z. (2015). Elasticsearch: The Definitive Guide: A Distributed Real-Time Search and Analytics Engine, O\u2019Reilly Media, Inc."},{"key":"ref_19","doi-asserted-by":"crossref","unstructured":"Martins, B., and Calado, P. (2010, January 18\u201319). Learning to rank for geographic information retrieval. Proceedings of the 6th Workshop on Geographic Information Retrieval, Zurich, Switzerland.","DOI":"10.1145\/1722080.1722107"},{"key":"ref_20","doi-asserted-by":"crossref","unstructured":"Shaw, B., Shea, J., Sinha, S., and Hogue, A. (2013, January 4\u20138). Learning to rank for spatiotemporal search. Proceedings of the Sixth ACM International Conference on Web Search and Data Mining, Rome, Italy.","DOI":"10.1145\/2433396.2433485"},{"key":"ref_21","unstructured":"(2017, May 10). Linked Data-The Story So Far. Available online: https:\/\/eprints.soton.ac.uk\/271285\/."},{"key":"ref_22","doi-asserted-by":"crossref","unstructured":"Krisnadhi, A., Hu, Y., Janowicz, K., Hitzler, P., Arko, R., Carbotte, S., Chandler, C., Cheatham, M., Fils, D., and Finin, T. (2015, January 11\u201315). The GeoLink modular oceanography ontology. Proceedings of the 14th International Semantic Web Conference, Bethlehem, PA, USA.","DOI":"10.1007\/978-3-319-25010-6_19"},{"key":"ref_23","doi-asserted-by":"crossref","first-page":"109","DOI":"10.1016\/j.knosys.2013.03.012","article-title":"Recommender systems survey","volume":"46","author":"Bobadilla","year":"2013","journal-title":"Knowl. Based Syst."},{"key":"ref_24","doi-asserted-by":"crossref","first-page":"256","DOI":"10.3390\/ijgi2020256","article-title":"From geoportals to geographic knowledge portals","volume":"2","author":"Vockner","year":"2014","journal-title":"ISPRS Int. J. Geo-Inf."},{"key":"ref_25","doi-asserted-by":"crossref","unstructured":"Jiang, Y., Li, Y., Yang, C., Armstrong, E.M., Huang, T., and Moroni, D. (2017). Reconstructing sessions from data discovery and access logs to build a semantic knowledge base for improving data discovery. ISPRS Int. J. Geo-Inf., 5.","DOI":"10.3390\/ijgi5050054"},{"key":"ref_26","doi-asserted-by":"crossref","first-page":"2310","DOI":"10.1080\/13658816.2017.1357819","article-title":"A comprehensive methodology for discovering semantic relationships among geospatial vocabularies using oceanographic data discovery as an example","volume":"31","author":"Jiang","year":"2017","journal-title":"Int. J. Geogr. Inf. Sci."},{"key":"ref_27","doi-asserted-by":"crossref","first-page":"950","DOI":"10.1126\/science.283.5404.950","article-title":"Genesis and evolution of the 1997-98 El Ni\u00f1o","volume":"283","author":"McPhaden","year":"1999","journal-title":"Science"},{"key":"ref_28","unstructured":"UCAR (2017, May 10). SST Data Sets: Overview & Comparison Table. Available online: https:\/\/climatedataguide.ucar.edu\/climate-data\/sst-data-sets-overview-comparison-table."},{"key":"ref_29","doi-asserted-by":"crossref","first-page":"21","DOI":"10.1016\/j.dsr2.2012.04.013","article-title":"Group for High Resolution Sea Surface temperature (GHRSST) analysis fields inter-comparisons. Part 1: A GHRSST multi-product ensemble (GMPE)","volume":"77","author":"Martin","year":"2012","journal-title":"Deep Sea Res. Part II"},{"key":"ref_30","unstructured":"Li, Y., Jiang, Y., Hu, F., Yang, C., Huang, T., Moroni, D., and Fench, C. (2016, January 19\u201323). Leveraging cloud computing to speedup user access log mining. Proceedings of the OCEANS 2016 MTS\/IEEE Monterey, Monterey, CA, USA."},{"key":"ref_31","doi-asserted-by":"crossref","unstructured":"Jin, B., Song, W., Zhao, K., Wei, X., Hu, F., and Jiang, Y. (2017). A high performance, spatiotemporal statistical analysis system based on a Spatiotemporal Cloud Platform. ISPRS Int. J. Geo-Inf., 6.","DOI":"10.3390\/ijgi6060165"},{"key":"ref_32","unstructured":"(2017, May 10). Mining and Utilizing Dataset Relevancy from Oceanographic Dataset (MUDROD) Metadata, Usage Metrics, and User Feedback to Improve Data Discovery and Access. Available online: http:\/\/adsabs.harvard.edu\/abs\/2015AGUFMIN51B1809J."},{"key":"ref_33","doi-asserted-by":"crossref","first-page":"78","DOI":"10.1109\/MCC.2014.22","article-title":"Streaming big data processing in datacenter clouds","volume":"1","author":"Ranjan","year":"2014","journal-title":"IEEE Cloud Comput."},{"key":"ref_34","doi-asserted-by":"crossref","unstructured":"Agichtein, E., Brill, E., and Dumais, S. (2006, January 6\u201311). Improving web search ranking by incorporating user behavior information. Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, WA, USA.","DOI":"10.1145\/1148170.1148177"}],"container-title":["ISPRS International Journal of Geo-Information"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/www.mdpi.com\/2220-9964\/7\/2\/62\/pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,10,11]],"date-time":"2025-10-11T14:54:37Z","timestamp":1760194477000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.mdpi.com\/2220-9964\/7\/2\/62"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2018,2,11]]},"references-count":34,"journal-issue":{"issue":"2","published-online":{"date-parts":[[2018,2]]}},"alternative-id":["ijgi7020062"],"URL":"https:\/\/doi.org\/10.3390\/ijgi7020062","relation":{},"ISSN":["2220-9964"],"issn-type":[{"value":"2220-9964","type":"electronic"}],"subject":[],"published":{"date-parts":[[2018,2,11]]}}}