{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T18:10:31Z","timestamp":1776103831715,"version":"3.50.1"},"reference-count":76,"publisher":"Association for Computing Machinery (ACM)","issue":"3","content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Proc. VLDB Endow."],"published-print":{"date-parts":[[2021,11]]},"abstract":"<jats:p>\n            Exploratory data science largely happens in computational notebooks with dataframe APIs, such as pandas, that support flexible means to transform, clean, and analyze data. Yet, visually exploring data in dataframes remains tedious, requiring substantial programming effort for visualization and mental effort to determine what analysis to perform next. We propose Lux, an\n            <jats:italic>always-on<\/jats:italic>\n            framework for accelerating visual insight discovery in dataframe workflows. When users print a dataframe in their notebooks, Lux recommends visualizations to provide a quick overview of the patterns and trends and suggests promising analysis directions. Lux features a high-level language for generating visualizations on demand to encourage rapid visual experimentation with data. We demonstrate that through the use of a careful design and three system optimizations, Lux adds no more than two seconds of overhead on top of pandas for over 98% of datasets in the UCI repository. We evaluate Lux in terms of usability via interviews with early adopters, finding that Lux helps fulfill the needs of data scientists for visualization support within their dataframe workflows. Lux has already been embraced by data science practitioners, with over 3.1k stars on Github.\n          <\/jats:p>","DOI":"10.14778\/3494124.3494151","type":"journal-article","created":{"date-parts":[[2022,2,5]],"date-time":"2022-02-05T00:31:46Z","timestamp":1644021106000},"page":"727-738","source":"Crossref","is-referenced-by-count":44,"title":["Lux"],"prefix":"10.14778","volume":"15","author":[{"given":"Doris Jung-Lin","family":"Lee","sequence":"first","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Dixin","family":"Tang","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Kunal","family":"Agarwal","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Thyne","family":"Boonmark","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Caitlyn","family":"Chen","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jake","family":"Kang","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Ujjaini","family":"Mukhopadhyay","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Jerry","family":"Song","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Micah","family":"Yong","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Marti A.","family":"Hearst","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]},{"given":"Aditya G.","family":"Parameswaran","sequence":"additional","affiliation":[{"name":"UC Berkeley"}],"role":[{"role":"author","vocabulary":"crossref"}]}],"member":"320","published-online":{"date-parts":[[2022,2,4]]},"reference":[{"key":"e_1_2_1_1_1","unstructured":"pandas-profiling. https:\/\/github.com\/pandas-profiling\/pandas-profiling.  pandas-profiling. https:\/\/github.com\/pandas-profiling\/pandas-profiling."},{"key":"e_1_2_1_2_1","volume-title":"http:\/\/happyplanetindex.org\/","author":"Index Happy Planet","year":"2019","unstructured":"Happy Planet Index . http:\/\/happyplanetindex.org\/ , 2019 . Happy Planet Index. http:\/\/happyplanetindex.org\/, 2019."},{"key":"e_1_2_1_3_1","volume-title":"https:\/\/www.tableau.com\/","year":"2019","unstructured":"Tableau. https:\/\/www.tableau.com\/ , 2019 . Accessed : 2019-09-11. Tableau. https:\/\/www.tableau.com\/, 2019. Accessed: 2019-09-11."},{"key":"e_1_2_1_4_1","volume-title":"WHO mission reviews COVID-19 response","author":"Afghanistan","year":"2020","unstructured":"Afghanistan : WHO mission reviews COVID-19 response . World Health Organization , 2020 . Afghanistan: WHO mission reviews COVID-19 response. World Health Organization, 2020."},{"key":"e_1_2_1_5_1","volume-title":"https:\/\/bamboolib.8080labs.com\/","year":"2020","unstructured":"bamboolib. https:\/\/bamboolib.8080labs.com\/ , 2020 . bamboolib. https:\/\/bamboolib.8080labs.com\/, 2020."},{"key":"e_1_2_1_6_1","volume-title":"WHO fighting tirelessly against the odds","year":"2020","unstructured":"COVID-19 in Pakistan : WHO fighting tirelessly against the odds . World Health Organization , 2020 . COVID-19 in Pakistan: WHO fighting tirelessly against the odds. World Health Organization, 2020."},{"key":"e_1_2_1_7_1","unstructured":"Faster data exploration in Jupyter through Lux 2020.  Faster data exploration in Jupyter through Lux 2020."},{"key":"e_1_2_1_8_1","unstructured":"LUX Exploratory Data Analysis (EDA). YouTube Dec 2020.  LUX Exploratory Data Analysis (EDA). YouTube Dec 2020."},{"key":"e_1_2_1_9_1","volume-title":"YouTube","author":"Library LUX","year":"2020","unstructured":"LUX Library : Matplotlib replacer ? YouTube , Dec 2020 . LUX Library: Matplotlib replacer? YouTube, Dec 2020."},{"key":"e_1_2_1_10_1","volume-title":"https:\/\/papermill.readthedocs.io\/","year":"2020","unstructured":"papermill 2.3.3 documentation. https:\/\/papermill.readthedocs.io\/ , 2020 . papermill 2.3.3 documentation. https:\/\/papermill.readthedocs.io\/, 2020."},{"key":"e_1_2_1_11_1","volume-title":"Interactive data visualization BI Tools","author":"Power","year":"2020","unstructured":"Power BI : Interactive data visualization BI Tools . 2020 . Power BI: Interactive data visualization BI Tools. 2020."},{"key":"e_1_2_1_12_1","unstructured":"State of Data Science and Machine Learning 2020. Kaggle 2020.  State of Data Science and Machine Learning 2020. Kaggle 2020."},{"key":"e_1_2_1_13_1","volume-title":"https:\/\/streamlit.io\/","year":"2021","unstructured":"Streamlit. https:\/\/streamlit.io\/ , 2021 . Streamlit. https:\/\/streamlit.io\/, 2021."},{"key":"e_1_2_1_14_1","unstructured":"UCI Machine Learning Repository Mar 2021.  UCI Machine Learning Repository Mar 2021."},{"key":"e_1_2_1_15_1","unstructured":"adamerose. PandasGUI. https:\/\/github.com\/adamerose\/pandasgui.  adamerose. PandasGUI. https:\/\/github.com\/adamerose\/pandasgui."},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2018.2865040"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2017.2743990"},{"key":"e_1_2_1_18_1","volume-title":"NPR","author":"Beaubien J.","year":"2020","unstructured":"J. Beaubien . Why Rwanda Is Doing Better Than Ohio When It Comes To Controlling COVID-19 . NPR , Jul 2020 . J. Beaubien. Why Rwanda Is Doing Better Than Ohio When It Comes To Controlling COVID-19. NPR, Jul 2020."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/3299869.3319876"},{"key":"e_1_2_1_20_1","unstructured":"M. Bostock V. Ogievetsky and J. Heer. D3: Data-driven documents.  M. Bostock V. Ogievetsky and J. Heer. D3: Data-driven documents."},{"key":"e_1_2_1_21_1","volume-title":"CRC Press","author":"Brooke J.","year":"1996","unstructured":"J. Brooke . \"SUS-A quick and dirty usability scale.\" Usability evaluation in industry . CRC Press , June 1996 . ISBN: 9780748404605. J. Brooke. \"SUS-A quick and dirty usability scale.\" Usability evaluation in industry. CRC Press, June 1996. ISBN: 9780748404605."},{"key":"e_1_2_1_22_1","doi-asserted-by":"publisher","DOI":"10.1561\/1900000004"},{"key":"e_1_2_1_23_1","volume-title":"Foresight: Rapid data exploration through guideposts. arXiv preprint arXiv:1709.10513","author":"Haas P. J.","year":"2017","unstructured":"\u00c7. Demiralp, P. J. Haas , S. Parthasarathy , and T. Pedapati . Foresight: Rapid data exploration through guideposts. arXiv preprint arXiv:1709.10513 , 2017 . \u00c7. Demiralp, P. J. Haas, S. Parthasarathy, and T. Pedapati. Foresight: Rapid data exploration through guideposts. arXiv preprint arXiv:1709.10513, 2017."},{"key":"e_1_2_1_24_1","volume-title":"Kaggle","year":"2020","unstructured":"Dgomonov. Data Exploration on NYC Airbnb . Kaggle , Aug 2020 . Dgomonov. Data Exploration on NYC Airbnb. Kaggle, Aug 2020."},{"key":"e_1_2_1_25_1","volume-title":"Medium","author":"Duvva P.","year":"2021","unstructured":"P. Duvva . Speed up EDA With the Intelligent Lux . Medium , Mar 2021 . P. Duvva. Speed up EDA With the Intelligent Lux. Medium, Mar 2021."},{"key":"e_1_2_1_26_1","unstructured":"Fbdesignpro. sweetviz. https:\/\/github.com\/fbdesignpro\/sweetviz.  Fbdesignpro. sweetviz. https:\/\/github.com\/fbdesignpro\/sweetviz."},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.5555\/645927.672356"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2010.164"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1109\/TKDE.2005.16"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1038\/s41562-021-01079-8"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300358"},{"key":"e_1_2_1_32_1","doi-asserted-by":"publisher","DOI":"10.1145\/3209900.3209910"},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1109\/MCSE.2007.55"},{"key":"e_1_2_1_34_1","volume-title":"Collaborative data science","author":"P. T. Inc.","year":"2015","unstructured":"P. T. Inc. Collaborative data science , 2015 . P. T. Inc. Collaborative data science, 2015."},{"key":"e_1_2_1_35_1","volume-title":"CIDR","author":"Jindal A.","year":"2021","unstructured":"A. Jindal , K. V. Emani , M. Daum , O. Poppe , B. Haynes , A. Pavlenko , A. Gupta , K. Ramachandra , C. Curino , A. C. M\u00fcller , W. Wu , and H. Patel . Magpie: Python at speed and scale using cloud backends . In CIDR , February 2021 . A. Jindal, K. V. Emani, M. Daum, O. Poppe, B. Haynes, A. Pavlenko, A. Gupta, K. Ramachandra, C. Curino, A. C. M\u00fcller, W. Wu, and H. Patel. Magpie: Python at speed and scale using cloud backends. In CIDR, February 2021."},{"key":"e_1_2_1_36_1","doi-asserted-by":"publisher","DOI":"10.14778\/2824032.2824103"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939502.2939514"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2012.219"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.14778\/2735479.2735485"},{"key":"e_1_2_1_40_1","volume-title":"Kaggle","year":"2020","unstructured":"Kkanda. Analyzing UCI Crime and Communities Dataset . Kaggle , Mar 2020 . Kkanda. Analyzing UCI Crime and Communities Dataset. Kaggle, Mar 2020."},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1109\/VL\/HCC50065.2020.9127202"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1145\/3301275.3302307"},{"issue":"3","key":"e_1_2_1_43_1","first-page":"3","article-title":"The Case for a Visual Discovery Assistant","volume":"41","author":"Lee D. J.-L.","year":"2018","unstructured":"D. J.-L. Lee and A. Parameswaran . The Case for a Visual Discovery Assistant : A Holistic Solution for Accelerating Visual Data Exploration. IEEE Bulletin of Technical Committee on Data Engineering , 41 ( 3 ): 3 -- 14 , 2018 . D. J.-L. Lee and A. Parameswaran. The Case for a Visual Discovery Assistant: A Holistic Solution for Accelerating Visual Data Exploration. IEEE Bulletin of Technical Committee on Data Engineering, 41(3):3--14, 2018.","journal-title":"A Holistic Solution for Accelerating Visual Data Exploration. IEEE Bulletin of Technical Committee on Data Engineering"},{"key":"e_1_2_1_44_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2021.3085751"},{"key":"e_1_2_1_45_1","doi-asserted-by":"publisher","DOI":"10.14778\/3494124.3494151"},{"key":"e_1_2_1_46_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376880"},{"key":"e_1_2_1_47_1","volume-title":"PVLDB","author":"Macke S.","year":"2017","unstructured":"S. Macke , Y. Zhang , S. Huang , and A. Parameswaran . Fastmatch: Adaptive algorithms for rapid discovery of relevant histogram visualizations . PVLDB , 2017 . S. Macke, Y. Zhang, S. Huang, and A. Parameswaran. Fastmatch: Adaptive algorithms for rapid discovery of relevant histogram visualizations. PVLDB, 2017."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.5555\/3522802.3522950"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2007.70594"},{"key":"e_1_2_1_50_1","doi-asserted-by":"publisher","DOI":"10.1145\/3173574.3173962"},{"key":"e_1_2_1_51_1","volume-title":"Formalizing visualization design knowledge as constraints: Actionable and extensible models in draco","author":"Moritz D.","year":"2019","unstructured":"D. Moritz , C. Wang , G. L. Nelson , H. Lin , A. M. Smith , B. Howe , and J. Heer . Formalizing visualization design knowledge as constraints: Actionable and extensible models in draco . IEEE transactions on visualization and computer graphics, 25(1):438--448, 2019 . D. Moritz, C. Wang, G. L. Nelson, H. Lin, A. M. Smith, B. Howe, and J. Heer. Formalizing visualization design knowledge as constraints: Actionable and extensible models in draco. IEEE transactions on visualization and computer graphics, 25(1):438--448, 2019."},{"key":"e_1_2_1_52_1","volume-title":"Medium","author":"Pandey P.","year":"2021","unstructured":"P. Pandey . Intelligent visual data discovery with lux: A python library . Medium , Mar 2021 . P. Pandey. Intelligent visual data discovery with lux: A python library. Medium, Mar 2021."},{"key":"e_1_2_1_53_1","doi-asserted-by":"publisher","DOI":"10.14778\/3407790.3407807"},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/3274419"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.5555\/645925.671500"},{"key":"e_1_2_1_56_1","first-page":"307","volume-title":"Proc of the 26th Intl Conference on Very Large","author":"Sarawagi S.","year":"2000","unstructured":"S. Sarawagi . User-adaptive exploration of multidimensional data . Proc of the 26th Intl Conference on Very Large , pages 307 -- 316 , 2000 . S. Sarawagi. User-adaptive exploration of multidimensional data. Proc of the 26th Intl Conference on Very Large, pages 307--316, 2000."},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.5555\/645338.650401"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2016.2599030"},{"key":"e_1_2_1_59_1","volume-title":"Reactive vega: A streaming dataflow architecture for declarative interactive visualization","author":"Satyanarayan A.","year":"2016","unstructured":"A. Satyanarayan , R. Russell , J. Hoffswell , and J. Heer . Reactive vega: A streaming dataflow architecture for declarative interactive visualization . IEEE transactions on visualization and computer graphics, 22(1):659--668, 2016 . A. Satyanarayan, R. Russell, J. Hoffswell, and J. Heer. Reactive vega: A streaming dataflow architecture for declarative interactive visualization. IEEE transactions on visualization and computer graphics, 22(1):659--668, 2016."},{"key":"e_1_2_1_60_1","unstructured":"sfu-db. dataprep. https:\/\/github.com\/sfu-db\/dataprep.  sfu-db. dataprep. https:\/\/github.com\/sfu-db\/dataprep."},{"key":"e_1_2_1_61_1","doi-asserted-by":"publisher","DOI":"10.14778\/3025111.3025126"},{"key":"e_1_2_1_62_1","doi-asserted-by":"publisher","DOI":"10.1109\/2945.981851"},{"key":"e_1_2_1_63_1","doi-asserted-by":"publisher","DOI":"10.1109\/2945.981851"},{"key":"e_1_2_1_64_1","doi-asserted-by":"publisher","DOI":"10.14778\/3342263.3342278"},{"key":"e_1_2_1_65_1","volume-title":"Feb.","author":"The","year":"2020","unstructured":"The pandas development team. pandas-dev\/pandas: Pandas , Feb. 2020 . The pandas development team. pandas-dev\/pandas: Pandas, Feb. 2020."},{"key":"e_1_2_1_66_1","doi-asserted-by":"publisher","DOI":"10.21105\/joss.01057"},{"key":"e_1_2_1_67_1","doi-asserted-by":"publisher","DOI":"10.1145\/3092931.3092937"},{"key":"e_1_2_1_68_1","doi-asserted-by":"publisher","DOI":"10.14778\/2831360.2831371"},{"key":"e_1_2_1_69_1","doi-asserted-by":"publisher","DOI":"10.1145\/3371117"},{"key":"e_1_2_1_70_1","doi-asserted-by":"publisher","DOI":"10.5555\/2967391"},{"key":"e_1_2_1_71_1","volume-title":"Medium","author":"Wijaya C. Y.","year":"2021","unstructured":"C. Y. Wijaya . Quick recommendation-based data exploration with lux . Medium , Feb 2021 . C. Y. Wijaya. Quick recommendation-based data exploration with lux. Medium, Feb 2021."},{"key":"e_1_2_1_72_1","doi-asserted-by":"publisher","DOI":"10.1145\/2939502.2939506"},{"key":"e_1_2_1_73_1","volume-title":"Visualizing dataflow graphs of deep learning models in tensorflow","author":"Wongsuphasawat K.","year":"2017","unstructured":"K. Wongsuphasawat , D. Smilkov , J. Wexler , J. Wilson , D. Mane , D. Fritz , D. Krishnan , F. B. Vi\u00e9gas , and M. Wattenberg . Visualizing dataflow graphs of deep learning models in tensorflow . IEEE transactions on visualization and computer graphics, 24(1):1--12, 2017 . K. Wongsuphasawat, D. Smilkov, J. Wexler, J. Wilson, D. Mane, D. Fritz, D. Krishnan, F. B. Vi\u00e9gas, and M. Wattenberg. Visualizing dataflow graphs of deep learning models in tensorflow. IEEE transactions on visualization and computer graphics, 24(1):1--12, 2017."},{"key":"e_1_2_1_74_1","doi-asserted-by":"publisher","DOI":"10.14778\/2536354.2536356"},{"key":"e_1_2_1_75_1","doi-asserted-by":"publisher","DOI":"10.14778\/3297753.3297763"},{"key":"e_1_2_1_76_1","volume-title":"Enhancing the interactivity of dataframe queries by leveraging think time. CoRR, abs\/2103.02145","author":"Xin D.","year":"2021","unstructured":"D. Xin , D. Petersohn , D. Tang , Y. Wu , J. E. Gonzalez , J. M. Hellerstein , A. D. Joseph , and A. G. Parameswaran . Enhancing the interactivity of dataframe queries by leveraging think time. CoRR, abs\/2103.02145 , 2021 . D. Xin, D. Petersohn, D. Tang, Y. Wu, J. E. Gonzalez, J. M. Hellerstein, A. D. Joseph, and A. G. Parameswaran. Enhancing the interactivity of dataframe queries by leveraging think time. CoRR, abs\/2103.02145, 2021."}],"container-title":["Proceedings of the VLDB Endowment"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.14778\/3494124.3494151","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2022,12,28]],"date-time":"2022-12-28T11:33:20Z","timestamp":1672227200000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.14778\/3494124.3494151"}},"subtitle":["always-on visualization recommendations for exploratory dataframe workflows"],"short-title":[],"issued":{"date-parts":[[2021,11]]},"references-count":76,"journal-issue":{"issue":"3","published-print":{"date-parts":[[2021,11]]}},"alternative-id":["10.14778\/3494124.3494151"],"URL":"https:\/\/doi.org\/10.14778\/3494124.3494151","relation":{},"ISSN":["2150-8097"],"issn-type":[{"value":"2150-8097","type":"print"}],"subject":[],"published":{"date-parts":[[2021,11]]}}}