{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T04:27:12Z","timestamp":1750220832222,"version":"3.41.0"},"publisher-location":"New York, NY, USA","reference-count":13,"publisher":"ACM","license":[{"start":{"date-parts":[[2019,12,9]],"date-time":"2019-12-09T00:00:00Z","timestamp":1575849600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["SHF-1816850 and CNS-1422119"],"award-info":[{"award-number":["SHF-1816850 and CNS-1422119"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":[],"published-print":{"date-parts":[[2019,12,9]]},"DOI":"10.1145\/3366624.3368157","type":"proceedings-article","created":{"date-parts":[[2019,11,27]],"date-time":"2019-11-27T13:23:09Z","timestamp":1574860989000},"page":"9-13","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":0,"title":["Troubleshooting distributed data analytics systems"],"prefix":"10.1145","author":[{"given":"Aidi","family":"Pi","sequence":"first","affiliation":[{"name":"University of Colorado"}]}],"member":"320","published-online":{"date-parts":[[2019,12,9]]},"reference":[{"key":"e_1_3_2_1_1_1","unstructured":"2017. Apache kafka. https:\/\/kafka.apache.org\/.  2017. Apache kafka. https:\/\/kafka.apache.org\/."},{"key":"e_1_3_2_1_2_1","unstructured":"2017. OpenTSDB. http:\/\/opentsdb.net\/\/.  2017. OpenTSDB. http:\/\/opentsdb.net\/\/."},{"key":"e_1_3_2_1_3_1","unstructured":"2017. Spark-19371. https:\/\/issues.apache.org\/jira\/browse\/SPARK-19371\/.  2017. Spark-19371. https:\/\/issues.apache.org\/jira\/browse\/SPARK-19371\/."},{"key":"e_1_3_2_1_4_1","unstructured":"2018. TPC-H. http:\/\/www.tpc.org\/tpch\/.  2018. TPC-H. http:\/\/www.tpc.org\/tpch\/."},{"key":"e_1_3_2_1_5_1","volume-title":"Spell: Streaming Parsing of System Event Logs. In &lt;u&gt","author":"Du Min","year":"2017","unstructured":"Min Du and Feifei Li . 2017 . Spell: Streaming Parsing of System Event Logs. In &lt;u&gt ;Proceeding of The IEEE International Conference on Data Mining &lt;\/u&gt;. Min Du and Feifei Li. 2017. Spell: Streaming Parsing of System Event Logs. In &lt;u&gt;Proceeding of The IEEE International Conference on Data Mining&lt;\/u&gt;."},{"key":"e_1_3_2_1_6_1","doi-asserted-by":"crossref","unstructured":"Min Du Feifei Li Guineng Zheng and Vivek Srikumar. 2017. DeepLog: Anomaly Detection and Diagnosis from SYstem Logs through Deep Learning. In &lt;u&gt;Proceeding of The ACM Conference on Computer and Communications Security&lt;\/u&gt;.  Min Du Feifei Li Guineng Zheng and Vivek Srikumar. 2017. DeepLog: Anomaly Detection and Diagnosis from SYstem Logs through Deep Learning. In &lt;u&gt;Proceeding of The ACM Conference on Computer and Communications Security&lt;\/u&gt;.","DOI":"10.1145\/3133956.3134015"},{"volume-title":"The HiBench benchmark suite: Characterization of the MapReduce-based data analysis. In &lt;u&gt","author":"Huang Shengsheng","key":"e_1_3_2_1_7_1","unstructured":"Shengsheng Huang , Jie Huang , Jinquan Dai , Tao Xie , and Bo Huang . 2010. The HiBench benchmark suite: Characterization of the MapReduce-based data analysis. In &lt;u&gt ;Proceeding of IEEE 26th International Conference on Data Engineering Workshops &lt;\/u&gt;. Shengsheng Huang, Jie Huang, Jinquan Dai, Tao Xie, and Bo Huang. 2010. The HiBench benchmark suite: Characterization of the MapReduce-based data analysis. In &lt;u&gt;Proceeding of IEEE 26th International Conference on Data Engineering Workshops&lt;\/u&gt;."},{"volume-title":"Log Clustering based Problem Identification for Online Service Systems. In &lt;u&gt","author":"Lin Qingwei","key":"e_1_3_2_1_8_1","unstructured":"Qingwei Lin , Hongyu Zhang , Jian-Guang Lou , Yu Zhang , and Xuewei Chen . 2016. Log Clustering based Problem Identification for Online Service Systems. In &lt;u&gt ;Proceeding of The IEEE\/ACM 38th International Conference on Software Engineering &lt;\/u&gt;. Qingwei Lin, Hongyu Zhang, Jian-Guang Lou, Yu Zhang, and Xuewei Chen. 2016. Log Clustering based Problem Identification for Online Service Systems. In &lt;u&gt;Proceeding of The IEEE\/ACM 38th International Conference on Software Engineering&lt;\/u&gt;."},{"key":"e_1_3_2_1_9_1","unstructured":"Aidi Pi Wei Chen Shaoqi Wang and Xiaobo Zhou. 2019. Semantic-aware Workflow Construction and Analysis for Distributed Data Analytics Systems. In &lt;u&gt;Proceeding of The 28th ACM International Symposium on High-Performance Parallel and Distributed Computing&lt;\/u&gt;.  Aidi Pi Wei Chen Shaoqi Wang and Xiaobo Zhou. 2019. Semantic-aware Workflow Construction and Analysis for Distributed Data Analytics Systems. In &lt;u&gt;Proceeding of The 28th ACM International Symposium on High-Performance Parallel and Distributed Computing&lt;\/u&gt;."},{"volume-title":"Literally. In &lt;u&gt;Proceeding of The 33rd IEEE International Parallel and Distributed Processing Symposium Workshop&lt;\/u&gt;.","author":"Pi Aidi","key":"e_1_3_2_1_10_1","unstructured":"Aidi Pi , Wei Chen , Will Zeller , and Xiaobo Zhou . 2019. It Can Understand the Logs , Literally. In &lt;u&gt;Proceeding of The 33rd IEEE International Parallel and Distributed Processing Symposium Workshop&lt;\/u&gt;. Aidi Pi, Wei Chen, Will Zeller, and Xiaobo Zhou. 2019. It Can Understand the Logs, Literally. In &lt;u&gt;Proceeding of The 33rd IEEE International Parallel and Distributed Processing Symposium Workshop&lt;\/u&gt;."},{"key":"e_1_3_2_1_11_1","unstructured":"Aidi Pi Wei Chen Xiaobo Zhou and Mike Ji. 2018. Profiling Distributed Systems in Lightweight Virtualized Environments with Logs and Resource Metrics. In &lt;u&gt;Proceeding of The 27th ACM International Symposium on High-Performance Parallel and Distributed Computing&lt;\/u&gt;.  Aidi Pi Wei Chen Xiaobo Zhou and Mike Ji. 2018. Profiling Distributed Systems in Lightweight Virtualized Environments with Logs and Resource Metrics. In &lt;u&gt;Proceeding of The 27th ACM International Symposium on High-Performance Parallel and Distributed Computing&lt;\/u&gt;."},{"key":"e_1_3_2_1_12_1","unstructured":"Xiao Yu Pallavi Joshi Jianwu Xu and Guoliang Jin. 2016. CloudSeer: Workflow monitoring of cloud infrastructures via interleaved logs. In &lt;u&gt;Proceeding of The 21th ACM International Conference on Architectural Support for Programming Languages and Operating Systems&lt;\/u&gt;.  Xiao Yu Pallavi Joshi Jianwu Xu and Guoliang Jin. 2016. CloudSeer: Workflow monitoring of cloud infrastructures via interleaved logs. In &lt;u&gt;Proceeding of The 21th ACM International Conference on Architectural Support for Programming Languages and Operating Systems&lt;\/u&gt;."},{"key":"e_1_3_2_1_13_1","unstructured":"Xu Zhao Kirk Rodrigues Yu Luo Ding Yuan and Michael Stumm. 2016. Non-intrusive performance profiling for entire software statcks based on the flow reconstruction principle. In &lt;u&gt;Proc. of The 12th USENIX Symposium on Operating Systems Design and Implementation&lt;\/u&gt;.  Xu Zhao Kirk Rodrigues Yu Luo Ding Yuan and Michael Stumm. 2016. Non-intrusive performance profiling for entire software statcks based on the flow reconstruction principle. In &lt;u&gt;Proc. of The 12th USENIX Symposium on Operating Systems Design and Implementation&lt;\/u&gt;."}],"event":{"name":"Middleware '19: 20th International Middleware Conference","sponsor":["ACM Association for Computing Machinery","USENIX Assoc USENIX Assoc","IFIP"],"location":"Davis California","acronym":"Middleware '19"},"container-title":["Proceedings of the 20th International Middleware Conference Doctoral Symposium"],"original-title":[],"link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366624.3368157","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3366624.3368157","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3366624.3368157","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,17]],"date-time":"2025-06-17T23:13:33Z","timestamp":1750202013000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3366624.3368157"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2019,12,9]]},"references-count":13,"alternative-id":["10.1145\/3366624.3368157","10.1145\/3366624"],"URL":"https:\/\/doi.org\/10.1145\/3366624.3368157","relation":{},"subject":[],"published":{"date-parts":[[2019,12,9]]},"assertion":[{"value":"2019-12-09","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}