{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2026,4,13]],"date-time":"2026-04-13T19:29:38Z","timestamp":1776108578723,"version":"3.50.1"},"reference-count":65,"publisher":"Association for Computing Machinery (ACM)","issue":"CSCW2","license":[{"start":{"date-parts":[[2021,10,18]],"date-time":"2021-10-18T00:00:00Z","timestamp":1634515200000},"content-version":"vor","delay-in-days":5,"URL":"http:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"DOI":"10.13039\/100000001","name":"National Science Foundation","doi-asserted-by":"publisher","award":["IIS-1845900"],"award-info":[{"award-number":["IIS-1845900"]}],"id":[{"id":"10.13039\/100000001","id-type":"DOI","asserted-by":"publisher"}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["Proc. ACM Hum.-Comput. Interact."],"published-print":{"date-parts":[[2021,10,13]]},"abstract":"<jats:p>Data scientists often collaborate with clients to analyze data to meet a client's needs. What does the end-to-end workflow of a data scientist's collaboration with clients look like throughout the lifetime of a project? To investigate this question, we interviewed ten data scientists (5 female, 4 male, 1 non-binary) in diverse roles across industry and academia. We discovered that they work with clients in a six-stage outer-loop workflow, which involves 1) laying groundwork by building trust before a project begins, 2) orienting to the constraints of the client's environment, 3) collaboratively framing the problem, 4) bridging the gap between data science and domain expertise, 5) the inner loop of technical data analysis work, 6) counseling to help clients emotionally cope with analysis results. 
This novel outer-loop workflow contributes to CSCW by expanding the notion of what collaboration means in data science beyond the widely-known inner-loop technical workflow stages of acquiring, cleaning, analyzing, modeling, and visualizing data. We conclude by discussing the implications of our findings for data science education, parallels to design work, and unmet needs for tool development.<\/jats:p>","DOI":"10.1145\/3476052","type":"journal-article","created":{"date-parts":[[2021,10,19]],"date-time":"2021-10-19T02:39:17Z","timestamp":1634611157000},"page":"1-28","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":29,"title":["Orienting, Framing, Bridging, Magic, and Counseling: How Data Scientists Navigate the Outer Loop of Client Collaborations in Industry and Academia"],"prefix":"10.1145","volume":"5","author":[{"given":"Sean","family":"Kross","sequence":"first","affiliation":[{"name":"University of California, San Diego, La Jolla, CA, USA"}]},{"given":"Philip","family":"Guo","sequence":"additional","affiliation":[{"name":"University of California, San Diego, La Jolla, CA, USA"}]}],"member":"320","published-online":{"date-parts":[[2021,10,18]]},"reference":[{"key":"e_1_2_1_1_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2018.2865040"},{"key":"e_1_2_1_2_1","unstructured":"Alex Ball. [n.d.]. Review of Data Management Lifecycle Models."},{"key":"e_1_2_1_3_1","doi-asserted-by":"publisher","DOI":"10.1145\/2702123.2702366"},{"key":"e_1_2_1_4_1","doi-asserted-by":"publisher","DOI":"10.1080\/00031305.1974.10479092"},{"key":"e_1_2_1_5_1","doi-asserted-by":"publisher","DOI":"10.1145\/2998181.2998265"},{"key":"e_1_2_1_6_1","volume-title":"Brennan","author":"Clark Herbert H.","year":"1991","unstructured":"Herbert H. Clark and Susan E. Brennan. 1991. Grounding in Communication. In Perspectives on Socially Shared Cognition, L.B. Resnick, J.M. Levine, and S.D. Teasley (Eds.). 
American Psychological Association, 127--149."},{"key":"e_1_2_1_7_1","volume-title":"Strauss","author":"Corbin Juliet M.","year":"2008","unstructured":"Juliet M. Corbin and Anselm L. Strauss. 2008. Basics of qualitative research: techniques and procedures for developing grounded theory. SAGE Publications, Inc."},{"key":"e_1_2_1_8_1","doi-asserted-by":"publisher","DOI":"10.5040\/9781474293884"},{"key":"e_1_2_1_9_1","doi-asserted-by":"publisher","DOI":"10.1017\/9781316480748.021"},{"key":"e_1_2_1_10_1","unstructured":"James Densmore. 2017. There are two types of data scientists -- and two types of problems to solve. https:\/\/medium.com\/@jamesdensmore\/there-are-two-types-of-data-scientists-and-two-types-of-problems-to-solve-a149a0148e64. Accessed: 2020-10-10."},{"key":"e_1_2_1_11_1","unstructured":"Conor Dewey. 2018. An Ode to the Type A Data Scientist. Towards Data Science -- https:\/\/towardsdatascience.com\/ode-to-the-type-a-data-scientist-78d11456019. Accessed: 2020-10-10."},{"key":"e_1_2_1_12_1","doi-asserted-by":"publisher","DOI":"10.1080\/10618600.2017.1384734"},{"key":"e_1_2_1_13_1","doi-asserted-by":"publisher","DOI":"10.1145\/500286.500297"},{"key":"e_1_2_1_14_1","doi-asserted-by":"publisher","DOI":"10.1145\/3313831.3376442"},{"key":"e_1_2_1_15_1","doi-asserted-by":"publisher","DOI":"10.1002\/sim.4780010103"},{"key":"e_1_2_1_16_1","doi-asserted-by":"publisher","DOI":"10.1145\/2998626.2998635"},{"key":"e_1_2_1_17_1","doi-asserted-by":"publisher","DOI":"10.1037\/1076-8998.5.1.95"},{"key":"e_1_2_1_18_1","unstructured":"Philip J. Guo. 2012. Software Tools to Facilitate Research Programming. Ph.D. Dissertation. Stanford University."},{"key":"e_1_2_1_19_1","doi-asserted-by":"publisher","DOI":"10.1145\/2047196.2047205"},{"key":"e_1_2_1_20_1","unstructured":"Bob Hayes. 2020. Who Does the Machine Learning and Data Science Work? customer think -- https:\/\/customerthink.com\/who-does-the-machine-learning-and-data-science-work\/. 
Accessed: 2021-01-10."},{"key":"e_1_2_1_21_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818048.2820067"},{"key":"e_1_2_1_22_1","volume-title":"Peng","author":"Hicks Stephanie C.","year":"2019","unstructured":"Stephanie C. Hicks and Roger D. Peng. 2019. Elements and Principles for Characterizing Variation between Data Analyses. arXiv:1903.07639 [stat.AP]"},{"key":"e_1_2_1_23_1","doi-asserted-by":"publisher","DOI":"10.1109\/VLHCC.2016.7739680"},{"key":"e_1_2_1_24_1","volume-title":"The Managed Heart: Commercialization of Human Feeling (1 ed.)","author":"Hochschild Arlie Russell","unstructured":"Arlie Russell Hochschild. 2012. The Managed Heart: Commercialization of Human Feeling (1 ed.). University of California Press."},{"key":"e_1_2_1_25_1","doi-asserted-by":"publisher","DOI":"10.1145\/3134688"},{"key":"e_1_2_1_26_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10606-012-9184-0"},{"key":"e_1_2_1_27_1","doi-asserted-by":"publisher","DOI":"10.1145\/1978942.1979444"},{"key":"e_1_2_1_28_1","doi-asserted-by":"publisher","DOI":"10.1109\/TVCG.2012.219"},{"key":"e_1_2_1_29_1","doi-asserted-by":"publisher","DOI":"10.1145\/2884781.2884783"},{"key":"e_1_2_1_30_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300493"},{"key":"e_1_2_1_31_1","doi-asserted-by":"publisher","DOI":"10.1080\/00031305.2019.1668849"},{"key":"e_1_2_1_32_1","volume-title":"Guo","author":"Lau Sam","year":"2020","unstructured":"Sam Lau, Ian Drosos, Julia M. Markel, and Philip J. Guo. 2020. The Design Space of Computational Notebooks: An Analysis of 60 Systems in Academia and Industry. 
In Proceedings of the IEEE Symposium on Visual Languages and Human-Centric Computing (VL\/HCC) (VL\/HCC '20)."},{"key":"e_1_2_1_33_1","doi-asserted-by":"publisher","DOI":"10.1007\/s10606-006-9025-0"},{"key":"e_1_2_1_34_1","doi-asserted-by":"publisher","DOI":"10.1145\/2487294.2487311"},{"key":"e_1_2_1_35_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.v18:10"},{"key":"e_1_2_1_36_1","first-page":"57","article-title":"The Impertinent Questioner: The Scientist's Guide to the Statistician's Mind","volume":"46","author":"Lurie William","year":"1958","unstructured":"William Lurie. 1958. The Impertinent Questioner: The Scientist's Guide to the Statistician's Mind. American Scientist 46, 1 (1958), 57--61.","journal-title":"American Scientist"},{"key":"e_1_2_1_37_1","doi-asserted-by":"publisher","DOI":"10.1145\/3361118"},{"key":"e_1_2_1_38_1","doi-asserted-by":"publisher","DOI":"10.1145\/1639950.1640025"},{"key":"e_1_2_1_39_1","doi-asserted-by":"publisher","DOI":"10.1109\/ICDE.2017.112"},{"key":"e_1_2_1_40_1","doi-asserted-by":"publisher","DOI":"10.1145\/2470654.2466248"},{"key":"e_1_2_1_41_1","doi-asserted-by":"publisher","DOI":"10.1145\/3290605.3300356"},{"key":"e_1_2_1_42_1","doi-asserted-by":"publisher","DOI":"10.1002\/cpe.v18:10"},{"key":"e_1_2_1_43_1","doi-asserted-by":"publisher","DOI":"10.1207\/S15327051HCI1523_4"},{"key":"e_1_2_1_44_1","volume-title":"Scientific Collaboration on the Internet","author":"Olson Gary M.","unstructured":"Gary M. Olson, Ann Zimmerman, and Nathan Bos. 2008. Scientific Collaboration on the Internet. The MIT Press."},{"key":"e_1_2_1_45_1","unstructured":"Roger Peng. 2019. How Data Scientists Think - A Mini Case Study. Simply Stats blog -- https:\/\/simplystatistics.org\/2019\/01\/09\/how-data-scientists-think-a-mini-case-study\/. Accessed: 2020-10-10."},{"key":"e_1_2_1_46_1","unstructured":"Roger Peng. 2019. The Tentpoles of Data Science. 
Simply Stats blog -- https:\/\/simplystatistics.org\/2019\/01\/18\/the-tentpoles-of-data-science\/. Accessed: 2020-10-10."},{"key":"e_1_2_1_47_1","unstructured":"Roger Peng and Hilary Parker. 2018. Not So Standard Deviations podcast episodes on Design Thinking (Episodes 63--69). https:\/\/nssdeviations.com\/63-book-club-part-1. Accessed: 2020-10-10."},{"key":"e_1_2_1_48_1","doi-asserted-by":"publisher","DOI":"10.1126\/science.1213847"},{"key":"e_1_2_1_49_1","doi-asserted-by":"publisher","DOI":"10.1145\/2702123.2702298"},{"key":"e_1_2_1_50_1","unstructured":"ProjectPro. 2020. Type A Data Scientist vs. Type B Data Scientist. https:\/\/www.dezyre.com\/article\/type-a-data-scientist-vs-type-b-data-scientist\/194. Accessed: 2020-10-10."},{"key":"e_1_2_1_51_1","doi-asserted-by":"publisher","DOI":"10.1145\/2818048.2820026"},{"key":"e_1_2_1_52_1","doi-asserted-by":"publisher","DOI":"10.1111\/cgf.12391"},{"key":"e_1_2_1_53_1","volume-title":"Saturation in qualitative research: exploring its conceptualization and operationalization. Quality & quantity 52, 4","author":"Saunders Benjamin","year":"2018","unstructured":"Benjamin Saunders, Julius Sim, Tom Kingstone, Shula Baker, Jackie Waterfield, Bernadette Bartlam, Heather Burroughs, and Clare Jinks. 2018. Saturation in qualitative research: exploring its conceptualization and operationalization. 
Quality & Quantity 52, 4 (2018)."},{"key":"e_1_2_1_54_1","doi-asserted-by":"publisher","DOI":"10.1145\/1376616.1376747"},{"key":"e_1_2_1_55_1","doi-asserted-by":"publisher","DOI":"10.1145\/2744195"},{"key":"e_1_2_1_56_1","doi-asserted-by":"publisher","DOI":"10.1145\/382151.382978"},{"key":"e_1_2_1_57_1","doi-asserted-by":"publisher","DOI":"10.1371\/journal.pcbi.1008770"},{"key":"e_1_2_1_58_1","doi-asserted-by":"publisher","DOI":"10.5555\/1241934.1241935"},{"key":"e_1_2_1_59_1","doi-asserted-by":"publisher","DOI":"10.1145\/3357292.3357295"},{"key":"e_1_2_1_60_1","doi-asserted-by":"publisher","DOI":"10.1145\/3359313"},{"key":"e_1_2_1_61_1","volume-title":"Tidy, Transform, Visualize, and Model Data","author":"Wickham Hadley","unstructured":"Hadley Wickham and Garrett Grolemund. 2017. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data (1st ed.). O'Reilly Media, Inc.","edition":"1"},{"key":"e_1_2_1_62_1","unstructured":"Karlijn Willems. 2017. Data Scientist vs. Data Engineer. https:\/\/www.datacamp.com\/community\/blog\/data-scientist-vs-data-engineer. Accessed: 2020-10-10."},{"key":"e_1_2_1_63_1","unstructured":"Kanit Wongsuphasawat, Yang Liu, and Jeffrey Heer. 2019. Goals, Process, and Challenges of Exploratory Data Analysis: An Interview Study. arXiv:1911.00568 [cs.HC]"},{"key":"e_1_2_1_64_1","volume-title":"Voyager: Exploratory Analysis via Faceted Browsing of Visualization Recommendations","author":"Wongsuphasawat Kanit","year":"2016","unstructured":"Kanit Wongsuphasawat, Dominik Moritz, Anushka Anand, Jock Mackinlay, Bill Howe, and Jeffrey Heer. 2016. Voyager: Exploratory Analysis via Faceted Browsing of Visualization Recommendations. IEEE Trans. Visualization & Comp. Graphics (Proc. InfoVis) (2016). 
http:\/\/idl.cs.washington.edu\/papers\/voyager"},{"key":"e_1_2_1_65_1","doi-asserted-by":"publisher","DOI":"10.1145\/3392826"}],"container-title":["Proceedings of the ACM on Human-Computer Interaction"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3476052","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3476052","content-type":"application\/pdf","content-version":"vor","intended-application":"syndication"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3476052","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,7,14]],"date-time":"2025-07-14T04:58:59Z","timestamp":1752469139000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3476052"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2021,10,13]]},"references-count":65,"journal-issue":{"issue":"CSCW2","published-print":{"date-parts":[[2021,10,13]]}},"alternative-id":["10.1145\/3476052"],"URL":"https:\/\/doi.org\/10.1145\/3476052","relation":{},"ISSN":["2573-0142"],"issn-type":[{"value":"2573-0142","type":"electronic"}],"subject":[],"published":{"date-parts":[[2021,10,13]]},"assertion":[{"value":"2021-10-18","order":3,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}