{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,3,18]],"date-time":"2026-03-18T14:11:30Z","timestamp":1773843090045,"version":"3.50.1"},"reference-count":37,"publisher":"Association for Computing Machinery (ACM)","issue":"12","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2020,8]]},"abstract":"<jats:p>In existing big data stacks, the processes of analytical processing and knowledge serving are usually separated in different systems. In Alibaba, we observed a new trend where these two processes are fused: knowledge serving incurs generation of new data, and these data are fed into the process of analytical processing which further fine tunes the knowledge base used in the serving process. Splitting this fused processing paradigm into separate systems incurs overhead such as extra data duplication, discrepant application development and expensive system maintenance.<\/jats:p>\n          <jats:p>\n            In this work, we propose Hologres, which is a cloud native service for hybrid serving and analytical processing (HSAP). Hologres decouples the computation and storage layers, allowing flexible scaling in each layer. Tables are partitioned into self-managed shards. Each shard processes its read and write requests concurrently independent of each other. Hologres leverages hybrid row\/column storage to optimize operations such as point lookup, column scan and data ingestion used in HSAP. We propose\n            <jats:italic toggle=\"yes\">Execution Context<\/jats:italic>\n            as a resource abstraction between system threads and user tasks. Execution contexts can be cooperatively scheduled with little context switching overhead. Queries are parallelized and mapped to execution contexts for concurrent execution. The scheduling framework enforces resource isolation among different queries and supports customizable schedule policy. We conducted experiments comparing Hologres with existing systems specifically designed for analytical processing and serving workloads. The results show that Hologres consistently outperforms other systems in both system throughput and end-to-end query latency.\n          <\/jats:p>","DOI":"10.14778\/3415478.3415550","type":"journal-article","created":{"date-parts":[[2020,9,14]],"date-time":"2020-09-14T18:46:46Z","timestamp":1600109206000},"page":"3272-3284","source":"Crossref","is-referenced-by-count":14,"title":["Alibaba hologres"],"prefix":"10.14778","volume":"13","author":[{"given":"Xiaowei","family":"Jiang","sequence":"first","affiliation":[{"name":"Alibaba Group"}]},{"given":"Yuejun","family":"Hu","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Yu","family":"Xiang","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Guangran","family":"Jiang","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Xiaojun","family":"Jin","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Chen","family":"Xia","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Weihua","family":"Jiang","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Jun","family":"Yu","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Haitao","family":"Wang","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Yuan","family":"Jiang","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Jihong","family":"Ma","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Li","family":"Su","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]},{"given":"Kai","family":"Zeng","sequence":"additional","affiliation":[{"name":"Alibaba Group"}]}],"member":"320","published-online":{"date-parts":[[2020,8]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"Actian vector. https:\/\/www.actian.com."},{"key":"e_1_2_1_2_1","unstructured":"Apache arrow. https:\/\/arrow.apache.org."},{"key":"e_1_2_1_3_1","unstructured":"Apache hdfs. https:\/\/hadoop.apache.org."},{"key":"e_1_2_1_4_1","unstructured":"Flink. https:\/\/flink.apache.org."},{"key":"e_1_2_1_5_1","unstructured":"Greenplum. https:\/\/greenplum.org."},{"key":"e_1_2_1_6_1","unstructured":"Hbase. https:\/\/hbase.apache.org."},{"key":"e_1_2_1_7_1","unstructured":"Hive. https:\/\/hive.apache.org."},{"key":"e_1_2_1_8_1","unstructured":"Intel avx-512 instruction set. https:\/\/www.intel.com\/content\/www\/us\/en\/architecture-and-technology\/avx-512-overview.html."},{"key":"e_1_2_1_9_1","unstructured":"Memsql. http:\/\/www.memsql.com\/."},{"key":"e_1_2_1_10_1","unstructured":"Mysql. https:\/\/www.mysql.com."},{"key":"e_1_2_1_11_1","unstructured":"Pivotal greenplum. https:\/\/gpdb.docs.pivotal.io\/6-0\/admin_guide\/workload_mgmt.html."},{"key":"e_1_2_1_12_1","unstructured":"Postgresql. https:\/\/www.postgresql.org."},{"key":"e_1_2_1_13_1","unstructured":"Rocksdb. https:\/\/github.com\/facebook\/rocksdb\/wiki."},{"key":"e_1_2_1_14_1","unstructured":"Teradata. http:\/\/www.teradata.com."},{"key":"e_1_2_1_15_1","unstructured":"Tpc-h benchmark. http:\/\/www.tpc.org\/tpch."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/1365815.1365816"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807128.1807152"},{"key":"e_1_2_1_18_1","doi-asserted-by":"publisher","DOI":"10.1145\/2491245"},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.14778\/2732219.2732223"},{"key":"e_1_2_1_20_1","doi-asserted-by":"publisher","DOI":"10.1145\/2463676.2463710"},{"issue":"1","key":"e_1_2_1_21_1","first-page":"28","article-title":"The sap hana database-an architecture overview","volume":"35","author":"F\u00e4rber F.","year":"2012","unstructured":"F. F\u00e4rber, N. May, W. Lehner, P. Gro\u00dfe, I. M\u00fcller, H. Rauhe, and J. Dees. The sap hana database-an architecture overview. IEEE Data Eng. Bull., 35(1):28--33, 2012.","journal-title":"IEEE Data Eng. Bull."},{"key":"e_1_2_1_22_1","volume-title":"Proceedings of the 2018 International Conference on Management of Data, SIGMOD 2018","author":"Im J.-F.","year":"2018","unstructured":"J.-F. Im, K. Gopalakrishna, S. Subramaniam, M. Shrivastava, A. Tumbde, X. Jiang, J. Dai, S. Lee, N. Pawar, J. Li, and et al. Pinot: Realtime olap for 530 million users. In Proceedings of the 2018 International Conference on Management of Data, SIGMOD 2018, New York, NY, USA, 2018. Association for Computing Machinery."},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2011.5767867"},{"key":"e_1_2_1_24_1","volume-title":"CIDR 2015, Seventh Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 4-7, 2015, Online Proceedings. www.cidrdb.org","author":"Kornacker M.","year":"2015","unstructured":"M. Kornacker, A. Behm, V. Bittorf, T. Bobrovytsky, C. Ching, A. Choi, J. Erickson, M. Grund, D. Hecht, M. Jacobs, I. Joshi, L. Kuff, D. Kumar, A. Leblang, N. Li, I. Pandis, H. Robinson, D. Rorke, S. Rus, J. Russell, D. Tsirogiannis, S. Wanderman-Milne, and M. Yoder. Impala: A modern, open-source SQL engine forhadoop. In CIDR 2015, Seventh Biennial Conference on Innovative Data Systems Research, Asilomar, CA, USA, January 4-7, 2015, Online Proceedings. www.cidrdb.org, 2015."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2015.7113373"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1145\/1773912.1773922"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.14778\/2367502.2367518"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.14778\/2824032.2824071"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2610507"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/2168836.2168855"},{"issue":"6","key":"e_1_2_1_31_1","first-page":"663","article-title":"Quickstep: A data platform based on the scaling-up approach","volume":"11","author":"Patel J. M.","year":"2018","unstructured":"J. M. Patel, H. Deshmukh, J. Zhu, N. Potti, Z. Zhang, M. Spehlmann, H. Memisoglu, and S. Saurabh. Quickstep: A data platform based on the scaling-up approach. PVLDB, 11(6):663--676, 2018.","journal-title":"PVLDB"},{"key":"e_1_2_1_32_1","volume-title":"Proceedings of the Fourth International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures (ADMS 2013","author":"Psaroudakis I.","year":"2013","unstructured":"I. Psaroudakis, T. Scheuer, N. May, and A. Ailamaki. Task scheduling for highly concurrent analytical and transactional main-memory workloads. In Proceedings of the Fourth International Workshop on Accelerating Data Management Systems Using Modern Processor and Storage Architectures (ADMS 2013), number CONF, 2013."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.5555\/1287369.1287407"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536222.2536233"},{"issue":"2","key":"e_1_2_1_35_1","first-page":"21","article-title":"The voltdb main memory dbms","volume":"36","author":"Stonebraker M.","year":"2013","unstructured":"M. Stonebraker and A. Weisberg. The voltdb main memory dbms. IEEE Data Eng. Bull., 36(2):21--27, 2013.","journal-title":"IEEE Data Eng. Bull."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/2588555.2595631"},{"key":"e_1_2_1_37_1","first-page":"21","article-title":"Vectorwise: Beyond column stores","volume":"35","author":"Zukowski M.","year":"2012","unstructured":"M. Zukowski and P. A. Boncz. Vectorwise: Beyond column stores. IEEE Data Eng. Bull., 35:21--27, 2012.","journal-title":"IEEE Data Eng. Bull."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3415478.3415550","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,9,17]],"date-time":"2025-09-17T02:40:49Z","timestamp":1758076849000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3415478.3415550"}},"subtitle":["a cloud-native service for hybrid serving\/analytical processing"],"short-title":[],"issued":{"date-parts":[[2020,8]]},"references-count":37,"journal-issue":{"issue":"12","published-print":{"date-parts":[[2020,8]]}},"alternative-id":["10.14778\/3415478.3415550"],"URL":"https:\/\/doi.org\/10.14778\/3415478.3415550","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2020,8]]}}}