{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,2,21]],"date-time":"2025-02-21T02:08:43Z","timestamp":1740103723421,"version":"3.37.3"},"reference-count":25,"publisher":"Wiley","license":[{"start":{"date-parts":[[2020,10,14]],"date-time":"2020-10-14T00:00:00Z","timestamp":1602633600000},"content-version":"unspecified","delay-in-days":0,"URL":"https:\/\/creativecommons.org\/licenses\/by\/4.0\/"}],"funder":[{"DOI":"10.13039\/501100001809","name":"National Natural Science Foundation of China","doi-asserted-by":"publisher","award":["61662057","N182504017"],"award-info":[{"award-number":["61662057","N182504017"]}],"id":[{"id":"10.13039\/501100001809","id-type":"DOI","asserted-by":"publisher"}]},{"DOI":"10.13039\/501100012226","name":"Fundamental Research Funds for the Central Universities","doi-asserted-by":"publisher","award":["61662057","N182504017"],"award-info":[{"award-number":["61662057","N182504017"]}],"id":[{"id":"10.13039\/501100012226","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Complexity"],"published-print":{"date-parts":[[2020,10,14]]},"abstract":"<jats:p>Cities in the big data era hold massive urban data that can yield valuable information and digitally enhanced services. Sources of urban data are generally categorized into three types: official, social, and sensorial, which come from the government and enterprises, the social networks of citizens, and sensor networks, respectively. These types typically differ significantly from each other but are consolidated for smart urban services. Given these sophisticated consolidation approaches, we argue that a new challenge is ignored in state-of-the-art urban data management: fragment complexity, in which well-integrated data has an appropriate but fragmentary schema and is difficult to query. 
Compared with a predefined, rigid schema, a fragmentary schema means that a dataset contains millions of attributes that are nonorthogonally distributed among tables, and the values of these attributes are more numerous still. For a query, the first problem encountered is locating where these attributes are stored, and traditional value-based query optimization does not help. To address this problem, we propose an index on massive attributes as an attribute-oriented optimization, namely, the attribute index. The attribute index is a secondary index for locating the files in which the target attributes are stored. It contains three parts: ATree for searching keys, DTree for locating keys among files, and ADLinks as a mapping table between ATree and DTree. In this paper, we describe the index architecture, logical structure and algorithms, implementation details, creation process, integration with the existing key-value store, and urban application scenario. Experiments show that, in comparison with B+-Tree, LSM-Tree, and AVL-Tree, the query time of ATree is 1.1x, 1.5x, and 1.2x faster, respectively. 
Finally, we integrate our proposition with HBase, namely, UrbanBase, whose query performance is 1.3x faster than the original HBase.<\/jats:p>","DOI":"10.1155\/2020\/8914757","type":"journal-article","created":{"date-parts":[[2020,10,15]],"date-time":"2020-10-15T18:21:32Z","timestamp":1602786092000},"page":"1-14","source":"Crossref","is-referenced-by-count":0,"title":["Solving the Fragment Complexity of Official, Social, and Sensorial Urban Data"],"prefix":"10.1155","volume":"2020","author":[{"ORCID":"https:\/\/orcid.org\/0000-0002-3711-500X","authenticated-orcid":true,"given":"Hui","family":"Liu","sequence":"first","affiliation":[{"name":"School of Metallurgy, Northeastern University, Shenyang, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jingqing","family":"Jiang","sequence":"additional","affiliation":[{"name":"College of Computer Science and Technology, Inner Mongolia University for the Nationalities, Tongliao, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Yaowei","family":"Hou","sequence":"additional","affiliation":[{"name":"Software College, Northeastern University, Shenyang, China"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jie","family":"Song","sequence":"additional","affiliation":[{"name":"Software College, Northeastern University, Shenyang, China"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"311","reference":[{"key":"1","doi-asserted-by":"publisher","DOI":"10.1016\/j.inffus.2019.06.016"},{"key":"2","doi-asserted-by":"publisher","DOI":"10.1155\/2018\/7202985"},{"key":"3","doi-asserted-by":"publisher","DOI":"10.1155\/2018\/9452813"},{"key":"4","doi-asserted-by":"publisher","DOI":"10.1155\/2020\/5192861"},{"key":"5","doi-asserted-by":"publisher","DOI":"10.1016\/j.eng.2016.02.003"},{"volume-title":"Big Data And Urban Informatics: Innovations And Challenges To Urban Planning And Knowledge Discovery","year":"2016","author":"P. 
Thakuriah","key":"6"},{"key":"7","doi-asserted-by":"publisher","DOI":"10.1109\/access.2019.2936941"},{"volume-title":"Advanced Parallel And Distributed Computing For Big Urban Data","year":"2020","author":"X. Andrade","key":"8"},{"key":"9","doi-asserted-by":"publisher","DOI":"10.1109\/mcom.2019.1800640"},{"key":"10","doi-asserted-by":"publisher","DOI":"10.1016\/j.future.2019.02.035"},{"key":"11","doi-asserted-by":"publisher","DOI":"10.1109\/tvcg.2013.226"},{"article-title":"Building a big data platform for smart cities: experience and lessons from santander","author":"B. Cheng","key":"12","doi-asserted-by":"crossref","DOI":"10.1109\/BigDataCongress.2015.91"},{"key":"13","article-title":"Models and practices in urban data science at scale","volume":"17","author":"B. Marco","year":"2018","journal-title":"Big Data Research"},{"key":"14","doi-asserted-by":"publisher","DOI":"10.1109\/tpds.2019.2891599"},{"key":"15","doi-asserted-by":"publisher","DOI":"10.14778\/3352063.3352134"},{"author":"X. Gao","key":"16","article-title":"Multi-dimensional index over a key-value store for semi-structured data"},{"article-title":"SPKV: a multi-dimensional index system for large scale key-value stores","author":"Q. Wang","key":"17","doi-asserted-by":"crossref","DOI":"10.1007\/978-3-319-11116-2_32"},{"volume-title":"Scalable Low-Latency Indexes for a Key-Value Store","year":"2017","author":"A. Kejriwal","key":"18"},{"author":"D\u2019silva","key":"19","article-title":"Secondary indexing techniques for key-value stores: two rings to rule them all"},{"key":"20"},{"article-title":"An extension of HBASE core which support faster scans at the expense of larger RAM consumption","year":"2019","author":"I. HBase","key":"21"},{"first-page":"97","article-title":"Join optimization in the MapReduce environment for column-wise data store","author":"M. 
Zhou","key":"22"},{"key":"23","first-page":"1","article-title":"Practical lessons from the deployment and management of a smart city internet-of-things infrastructure: the smartsantander testbed case","volume":"99","author":"P. Sotres","year":"2017","journal-title":"IEEE Access"},{"key":"24","doi-asserted-by":"publisher","DOI":"10.14778\/1920841.1920991"},{"first-page":"71","article-title":"LSM-trie: an LSM-tree-based ultra-large key-value store for small data","author":"X. Wu","key":"25"}],"container-title":["Complexity"],"original-title":[],"language":"en","link":[{"URL":"http:\/\/downloads.hindawi.com\/journals\/complexity\/2020\/8914757.pdf","content-type":"application\/pdf","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/complexity\/2020\/8914757.xml","content-type":"application\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"http:\/\/downloads.hindawi.com\/journals\/complexity\/2020\/8914757.pdf","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2020,10,15]],"date-time":"2020-10-15T18:21:39Z","timestamp":1602786099000},"score":1,"resource":{"primary":{"URL":"https:\/\/www.hindawi.com\/journals\/complexity\/2020\/8914757\/"}},"subtitle":[],"editor":[{"given":"Mohammad","family":"Swapan","sequence":"additional","affiliation":[],"role":[{"role":"editor","vocabulary":"crossref"}]}],"short-title":[],"issued":{"date-parts":[[2020,10,14]]},"references-count":25,"alternative-id":["8914757","8914757"],"URL":"https:\/\/doi.org\/10.1155\/2020\/8914757","relation":{},"ISSN":["1099-0526","1076-2787"],"issn-type":[{"type":"electronic","value":"1099-0526"},{"type":"print","value":"1076-2787"}],"subject":[],"published":{"date-parts":[[2020,10,14]]}}}