{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,2,5]],"date-time":"2026-02-05T23:05:29Z","timestamp":1770332729688,"version":"3.49.0"},"publisher-location":"New York, NY, USA","reference-count":39,"publisher":"ACM","license":[{"start":{"date-parts":[[2017,12,5]],"date-time":"2017-12-05T00:00:00Z","timestamp":1512432000000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["CNS-1419123"],"award-info":[{"award-number":["CNS-1419123"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]},{"name":"National Science Foundation grants","award":["IIS-1447804"],"award-info":[{"award-number":["IIS-1447804"]}]},{"name":"National Science Foundation grants","award":["CNS-1513120"],"award-info":[{"award-number":["CNS-1513120"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2017,12,5]]},"DOI":"10.1145\/3148055.3148068","type":"proceedings-article","created":{"date-parts":[[2017,12,1]],"date-time":"2017-12-01T20:03:47Z","timestamp":1512158627000},"page":"1-10","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":20,"title":["Characterization of Big Data Stream Processing Pipeline"],"prefix":"10.1145","author":[{"given":"M. Haseeb","family":"Javed","sequence":"first","affiliation":[{"name":"Ohio State University, Columbus, OH, USA"}]},{"given":"Xiaoyi","family":"Lu","sequence":"additional","affiliation":[{"name":"Ohio State University, Columbus, OH, USA"}]},{"given":"Dhabaleswar K. (DK)","family":"Panda","sequence":"additional","affiliation":[{"name":"Ohio State University, Columbus, OH, USA"}]}],"member":"320","published-online":{"date-parts":[[2017,12,5]]},"reference":[{"key":"e_1_3_2_1_1_1","volume-title":"et almbox","author":"Abadi Daniel J","year":"2005","unstructured":"Daniel J Abadi , Yanif Ahmad , Magdalena Balazinska , Ugur Cetintemel , Mitch Cherniack , Jeong-Hyon Hwang , Wolfgang Lindner , Anurag Maskey , Alex Rasin , Esther Ryvkina , et almbox . 2005 . The Design of the Borealis Stream Processing Engine. Cidr , Vol. Vol. 5 . 277--289. Daniel J Abadi, Yanif Ahmad, Magdalena Balazinska, Ugur Cetintemel, Mitch Cherniack, Jeong-Hyon Hwang, Wolfgang Lindner, Anurag Maskey, Alex Rasin, Esther Ryvkina, et almbox. 2005. The Design of the Borealis Stream Processing Engine. Cidr, Vol. Vol. 5. 277--289."},{"key":"e_1_3_2_1_2_1","doi-asserted-by":"publisher","DOI":"10.1007\/s00778-003-0095-z"},{"key":"e_1_3_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536222.2536229"},{"key":"e_1_3_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.14778\/2824032.2824076"},{"key":"e_1_3_2_1_5_1","volume-title":"Stream: The stanford data stream management system. Data Stream Management","author":"Arasu Arvind","year":"2016","unstructured":"Arvind Arasu , Brian Babcock , Shivnath Babu , John Cieslewicz , Mayur Datar , Keith Ito , Rajeev Motwani , Utkarsh Srivastava , and Jennifer Widom . 2016 . Stream: The stanford data stream management system. Data Stream Management . Springer , 317--336. Arvind Arasu, Brian Babcock, Shivnath Babu, John Cieslewicz, Mayur Datar, Keith Ito, Rajeev Motwani, Utkarsh Srivastava, and Jennifer Widom. 2016. Stream: The stanford data stream management system. Data Stream Management. Springer, 317--336."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"publisher","DOI":"10.1145\/1807167.1807291"},{"key":"e_1_3_2_1_7_1","volume-title":"2015 a. Apache flink: Stream and batch processing in a single engine. Data Engineering","author":"Carbone Paris","year":"2015","unstructured":"Paris Carbone , Stephan Ewen , Seif Haridi , Asterios Katsifodimos , Volker Markl , and Kostas Tzoumas . 2015 a. Apache flink: Stream and batch processing in a single engine. Data Engineering ( 2015 ), 28. Paris Carbone, Stephan Ewen, Seif Haridi, Asterios Katsifodimos, Volker Markl, and Kostas Tzoumas. 2015 a. Apache flink: Stream and batch processing in a single engine. Data Engineering (2015), 28."},{"key":"e_1_3_2_1_8_1","volume-title":"2015 b. Lightweight Asynchronous Snapshots for Distributed Dataflows. CoRR","author":"Carbone Paris","year":"2015","unstructured":"Paris Carbone , Gyula F\u00f3ra , Stephan Ewen , Seif Haridi , and Kostas Tzoumas . 2015 b. Lightweight Asynchronous Snapshots for Distributed Dataflows. CoRR Vol. abs\/ 1506 .08603 ( 2015 ). http:\/\/arxiv.org\/abs\/1506.08603 Paris Carbone, Gyula F\u00f3ra, Stephan Ewen, Seif Haridi, and Kostas Tzoumas. 2015 b. Lightweight Asynchronous Snapshots for Distributed Dataflows. CoRR Vol. abs\/1506.08603 (2015). http:\/\/arxiv.org\/abs\/1506.08603"},{"key":"e_1_3_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.14778\/2735496.2735503"},{"key":"e_1_3_2_1_10_1","doi-asserted-by":"publisher","DOI":"10.1145\/214451.214456"},{"key":"e_1_3_2_1_11_1","volume-title":"Boyang Jerry Peng, et almbox","author":"Chintapalli Sanket","year":"2016","unstructured":"Sanket Chintapalli , Derek Dagit , Bobby Evans , Reza Farivar , Thomas Graves , Mark Holderbaugh , Zhuo Liu , Kyle Nusbaum , Kishorkumar Patil , Boyang Jerry Peng, et almbox . 2016 . Benchmarking streaming computation engines: Storm, Flink and Spark streaming Parallel and Distributed Processing Symposium Workshops, 2016 IEEE International. IEEE , 1789--1792. Sanket Chintapalli, Derek Dagit, Bobby Evans, Reza Farivar, Thomas Graves, Mark Holderbaugh, Zhuo Liu, Kyle Nusbaum, Kishorkumar Patil, Boyang Jerry Peng, et almbox. 2016. Benchmarking streaming computation engines: Storm, Flink and Spark streaming Parallel and Distributed Processing Symposium Workshops, 2016 IEEE International. IEEE, 1789--1792."},{"key":"e_1_3_2_1_13_1","unstructured":"DistributedLog. 2015. (2015). http:\/\/distributedlog.incubator.apache.org  DistributedLog. 2015. (2015). http:\/\/distributedlog.incubator.apache.org"},{"key":"e_1_3_2_1_14_1","volume-title":"https:\/\/github.com\/nathanmarz\/elephantdb","author":"DB.","year":"2011","unstructured":"Elephant DB. 2011. ( 2011 ). https:\/\/github.com\/nathanmarz\/elephantdb ElephantDB. 2011. (2011). https:\/\/github.com\/nathanmarz\/elephantdb"},{"key":"e_1_3_2_1_15_1","unstructured":"Apache Flume. 2016. Welcome to apache flume. (2016).  Apache Flume. 2016. Welcome to apache flume. (2016)."},{"key":"e_1_3_2_1_16_1","volume-title":"FUGU: Elastic Data Stream Processing with Latency Constraints. Data Engineering","author":"Heinze Thomas","year":"2015","unstructured":"Thomas Heinze , Yuanzhen Ji , Lars Roediger , Valerio Pappalardo , Andreas Meister , Zbigniew Jerzak , and Christof Fetzer . 2015 . FUGU: Elastic Data Stream Processing with Latency Constraints. Data Engineering (2015), 73. Thomas Heinze, Yuanzhen Ji, Lars Roediger, Valerio Pappalardo, Andreas Meister, Zbigniew Jerzak, and Christof Fetzer. 2015. FUGU: Elastic Data Stream Processing with Latency Constraints. Data Engineering (2015), 73."},{"key":"e_1_3_2_1_17_1","volume-title":"ICDE Workshops.","author":"Huang Shengsheng","year":"2010","unstructured":"Shengsheng Huang , Jie Huang , Yan Liu , Lan Yi , and Jinquan Dai . 2010 . Hibench: A representative and comprehensive hadoop benchmark suite Proc . ICDE Workshops. Shengsheng Huang, Jie Huang, Yan Liu, Lan Yi, and Jinquan Dai. 2010. Hibench: A representative and comprehensive hadoop benchmark suite Proc. ICDE Workshops."},{"key":"e_1_3_2_1_18_1","volume-title":"Flavio Paiva Junqueira, and Benjamin Reed","author":"Hunt Patrick","year":"2010","unstructured":"Patrick Hunt , Mahadev Konar , Flavio Paiva Junqueira, and Benjamin Reed . 2010 . ZooKeeper: Wait-free Coordination for Internet-scale Systems. USENIX annual technical conference, Vol. Vol. 8 . 9. Patrick Hunt, Mahadev Konar, Flavio Paiva Junqueira, and Benjamin Reed. 2010. ZooKeeper: Wait-free Coordination for Internet-scale Systems. USENIX annual technical conference, Vol. Vol. 8. 9."},{"key":"e_1_3_2_1_19_1","unstructured":"InfiniBand Trade Association. 2017. (2017). http:\/\/www.infinibandta.org  InfiniBand Trade Association. 2017. (2017). http:\/\/www.infinibandta.org"},{"key":"e_1_3_2_1_20_1","volume-title":"Big Data Frameworks: A Comparative Study. CoRR","author":"Inoubli Wissem","year":"2016","unstructured":"Wissem Inoubli , Sabeur Aridhi , Haithem Mezni , and Alexander Jung . 2016. Big Data Frameworks: A Comparative Study. CoRR Vol. abs\/ 1610 .09962 ( 2016 ). http:\/\/arxiv.org\/abs\/1610.09962 Wissem Inoubli, Sabeur Aridhi, Haithem Mezni, and Alexander Jung. 2016. Big Data Frameworks: A Comparative Study. CoRR Vol. abs\/1610.09962 (2016). http:\/\/arxiv.org\/abs\/1610.09962"},{"key":"e_1_3_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2011.37"},{"key":"e_1_3_2_1_22_1","volume-title":"A high-throughput, distributed messaging system. URL: kafka. apache. org as of","author":"Kafka Apache","year":"2014","unstructured":"Apache Kafka . 2014. A high-throughput, distributed messaging system. URL: kafka. apache. org as of Vol. 5 , 1 ( 2014 ). Apache Kafka. 2014. A high-throughput, distributed messaging system. URL: kafka. apache. org as of Vol. 5, 1 (2014)."},{"key":"e_1_3_2_1_23_1","volume-title":"Retrieved","author":"Kinesis Amazon","year":"2006","unstructured":"Amazon Kinesis . 2006. ( 2006 ). Retrieved October 2, 2017 from https:\/\/aws.amazon.com\/kinesis Amazon Kinesis. 2006. (2006). Retrieved October 2, 2017 from https:\/\/aws.amazon.com\/kinesis"},{"key":"e_1_3_2_1_24_1","volume-title":"Samza and the Unix philosophy of distributed data. Bulletin of the IEEE CS Technical Committee on Data Engineering","author":"Kleppmann Martin","year":"2015","unstructured":"Martin Kleppmann and Jay Kreps . 2015. Kafka , Samza and the Unix philosophy of distributed data. Bulletin of the IEEE CS Technical Committee on Data Engineering ( 2015 ). Martin Kleppmann and Jay Kreps. 2015. Kafka, Samza and the Unix philosophy of distributed data. Bulletin of the IEEE CS Technical Committee on Data Engineering (2015)."},{"key":"e_1_3_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/2723372.2742788"},{"key":"e_1_3_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICPP.2013.78"},{"key":"e_1_3_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1109\/HOTI.2014.15"},{"key":"e_1_3_2_1_28_1","volume-title":"2016 IEEE International Conference on. IEEE, 433--442","author":"Marcu Ovidiu-Cristian","year":"2016","unstructured":"Ovidiu-Cristian Marcu , Alexandru Costan , Gabriel Antoniu , and Mar\u00eda S P\u00e9rez-Hern\u00e1ndez . 2016 . Spark versus flink: Understanding performance in big data analytics frameworks Cluster Computing (CLUSTER) , 2016 IEEE International Conference on. IEEE, 433--442 . Ovidiu-Cristian Marcu, Alexandru Costan, Gabriel Antoniu, and Mar\u00eda S P\u00e9rez-Hern\u00e1ndez. 2016. Spark versus flink: Understanding performance in big data analytics frameworks Cluster Computing (CLUSTER), 2016 IEEE International Conference on. IEEE, 433--442."},{"key":"e_1_3_2_1_29_1","volume-title":"Big Data: Principles and best practices of scalable realtime data systems","author":"Marz Nathan","year":"2015","unstructured":"Nathan Marz and James Warren . 2015 . Big Data: Principles and best practices of scalable realtime data systems . Manning Publications Co. Nathan Marz and James Warren. 2015. Big Data: Principles and best practices of scalable realtime data systems. Manning Publications Co."},{"key":"e_1_3_2_1_30_1","volume-title":"Omni-Path, Ethernet\/iWARP, and RoCE.","author":"MVAPICH","year":"2017","unstructured":"MVAPICH : MPI over InfiniBand , Omni-Path, Ethernet\/iWARP, and RoCE. 2017 . (2017). http:\/\/mvapich.cse.ohio-state.edu MVAPICH: MPI over InfiniBand, Omni-Path, Ethernet\/iWARP, and RoCE. 2017. (2017). http:\/\/mvapich.cse.ohio-state.edu"},{"key":"e_1_3_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDMW.2010.172"},{"key":"e_1_3_2_1_32_1","volume-title":"2016 IEEE International Conference on. IEEE, 592--598","author":"Qian Shilei","year":"2016","unstructured":"Shilei Qian , Gang Wu , Jie Huang , and Tathagata Das . 2016 . Benchmarking modern distributed streaming platforms Industrial Technology (ICIT) , 2016 IEEE International Conference on. IEEE, 592--598 . Shilei Qian, Gang Wu, Jie Huang, and Tathagata Das. 2016. Benchmarking modern distributed streaming platforms Industrial Technology (ICIT), 2016 IEEE International Conference on. IEEE, 592--598."},{"key":"e_1_3_2_1_33_1","volume-title":"https:\/\/rocksdb.org\/","author":"DB.","year":"2012","unstructured":"Rocks DB. 2012. ( 2012 ). https:\/\/rocksdb.org\/ RocksDB. 2012. (2012). https:\/\/rocksdb.org\/"},{"key":"e_1_3_2_1_34_1","volume-title":"CSA: Streaming Engine for Internet of Things. Data Engineering","author":"Shen Zhitao","year":"2015","unstructured":"Zhitao Shen , Vikram Kumaran , Michael J Franklin , Sailesh Krishnamurthy , Amit Bhat , Madhu Kumar , Robert Lerche , and Kim Macpherson . 2015 . CSA: Streaming Engine for Internet of Things. Data Engineering (2015), 39. Zhitao Shen, Vikram Kumaran, Michael J Franklin, Sailesh Krishnamurthy, Amit Bhat, Madhu Kumar, Robert Lerche, and Kim Macpherson. 2015. CSA: Streaming Engine for Internet of Things. Data Engineering (2015), 39."},{"key":"e_1_3_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1109\/MSST.2010.5496972"},{"key":"e_1_3_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.1145\/16856.16888"},{"key":"e_1_3_2_1_37_1","unstructured":"Apache Storm. 2014. Storm distributed and fault-tolerant realtime computation. (2014). http:\/\/storm.apache.org  Apache Storm. 2014. Storm distributed and fault-tolerant realtime computation. (2014). http:\/\/storm.apache.org"},{"key":"e_1_3_2_1_38_1","volume-title":"Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation","author":"Zaharia Matei","unstructured":"Matei Zaharia , Mosharaf Chowdhury , Tathagata Das , Ankur Dave , Justin Ma , Murphy McCauley , Michael J Franklin , Scott Shenker , and Ion Stoica . 2012. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation . USENIX Association , 2--2. Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael J Franklin, Scott Shenker, and Ion Stoica. 2012. Resilient distributed datasets: A fault-tolerant abstraction for in-memory cluster computing Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation. USENIX Association, 2--2."},{"key":"e_1_3_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1145\/2517349.2522737"},{"key":"e_1_3_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDEW.2006.118"}],"event":{"name":"UCC '17: 10th International Conference on Utility and Cloud Computing","location":"Austin Texas USA","acronym":"UCC '17","sponsor":["SIGARCH ACM Special Interest Group on Computer Architecture","IEEE TCSC IEEE Technical Committee on Scalable Computing"]},"container-title":["Proceedings of the Fourth IEEE\/ACM International Conference on Big Data Computing, Applications and Technologies"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3148055.3148068","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3148055.3148068","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3148055.3148068","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T02:26:33Z","timestamp":1750213593000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3148055.3148068"}},"subtitle":["A Case Study using Flink and Kafka"],"short-title":[],"issued":{"date-parts":[[2017,12,5]]},"references-count":39,"alternative-id":["10.1145\/3148055.3148068","10.1145\/3148055"],"URL":"https:\/\/doi.org\/10.1145\/3148055.3148068","relation":{},"subject":[],"published":{"date-parts":[[2017,12,5]]},"assertion":[{"value":"2017-12-05","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}