2023
Population Modeling with Machine Learning can Enhance Measures of Mental Health - Open-Data Replication, Ty Easley, Ruiqi Chen, Kayla Hannon, Rosie Dutt, and Janine Bijsterbosch
Attending to the Cultures of Data Science Work, Lindsay Poirier
Evaluation of EDISON's Data Science Competency Framework Through a Comparative Literature Analysis, Karl R. B. Schmitt, Linda Clark, Katherine M. Kinnaird, Ruth E. H. Wertz, and Björn Sandstede
2022
An Educator’s Perspective of the Tidyverse, Mine Çetinkaya-Rundel, Johanna Hardin, Benjamin Baumer, Amelia McNamara, Nicholas J. Horton, and Colin W. Rundel
Mental Health in the UK Biobank: A Roadmap to Self-Report Measures and Neuroimaging Correlates, Rosie K. Dutt, Kayla Hannon, Ty O. Easley, Joseph C. Griffis, Wei Zhang, and Janine D. Bijsterbosch
Implementing GitHub Actions Continuous Integration to Reduce Error Rates in Ecological Data Collection, Albert Y. Kim, Valentine Herrmann, Ross Barreto, Brianna Calkins, Erika Gonzalez-Akre, Daniel J. Johnson, Jennifer A. Jordan, Lukas Magee, Ian R. McGregor, Nicolle Montero, Karl Novak, Teagan Rogers, Jessica Shue, and Kristina J. Anderson-Teixeira
Accountable Data: The Politics and Pragmatics of Disclosure Datasets, Lindsay Poirier
2021
Modern Data Science with R: Second Edition, Benjamin Baumer, Daniel T. Kaplan, and Nicholas J. Horton
Infer: An R Package for Tidyverse-Friendly Statistical Inference, Simon P. Couch, Andrew P. Bray, Chester Ismay, Evgeni Chasnovski, B. Baumer, and Mine Cetinkaya-Rundel
The Data Science Corps Wrangle-Analyze- Visualize Program: Building Data Acumen for Undergraduate Students, Nicholas J. Horton, Benjamin Baumer, Andrew Zieffler, and Valerie Barr
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse, Chester Ismay and Albert Y. Kim
Moving Ethnography: Infrastructuring Doubletakes and Switchbacks in Experimental Collaborative Methods, Aalok Khandekar, Brandon Costelloe-Kuehn, Lindsay Poirier, Alli Morgan, Alison Kenner, Kim Fortun, and Mike Fortun
The Forestecology R Package for Fitting and Assessing Neighborhood Models of the Effect of Interspecific Competition on the Growth of Trees, Albert Y. Kim, David N. Allen, and Simon P. Couch
Automatic Hierarchy Expansion for Improved Structure and Chord Evaluation, Katherine M. Kinnaird and Brian McFee
Facilitating Team-Based Data Science: Lessons Learned from the DSC-WAV Project, Chelsey Legacy, Andrew Zieffler, Benjamin S. Baumer, Valerie Barr, and Nicholas J. Horton
Reading Datasets: Strategies for Interpreting the Politics of Data Signification, Lindsay Poirier
2020
A Permutation Test and Spatial Cross-Validation Approach to Assess Models of Interspecific Competition Between Trees, David Allen and Albert Y. Kim
Teaching Introductory Statistics with DataCamp, Benjamin Baumer, Andrew P. Bray, Mine Çetinkaya-Rundel, and Johanna S. Hardin
Integrating Data Science Ethics into an Undergraduate Major, Benjamin Baumer, Randi L. Garcia, Albert Y. Kim, Katherine M. Kinnaird, and Miles Q. Ott
Creating Optimal Conditions for Reproducible Data Analysis in R with ‘Fertile’, Audrey M. Bertin and Benjamin Baumer
The Influence of Peer and Parental Norms on First-Generation College Students’ Binge Drinking Trajectories, Graham T. DiGuiseppi, Jordan P. Davis, Matthew K. Meisel, Melissa A. Clark, Mya L. Roberson, Miles Q. Ott, and Nancy P. Barnett
Slack for (A)synchronous Course Communication, Albert Y. Kim, R. Jordan Crouser, and Benjamin Baumer
“Playing the Whole Game”: A Data Collection and Analysis Exercise With Google Calendar, Albert Y. Kim and Johanna Hardin
Teaching Computational Machine Learning (without Statistics), Katherine M. Kinnaird
Identification and Description of Potentially Influential Social Network Members using the Strategic Player Approach, Miles Q. Ott, Sara G. Balestrieri, Graham DiGuiseppi, Melissa A. Clark, Michael Bernstein, Sarah Helseth, and Nancy P. Barnett
SuPP & MaPP: Adaptable Structure-Based Representations For Mir Tasks, Claire Savard; Erin H. Bugbee; Melissa R, McGuirl; and Katherine M. Kinnaird
2019
Enrollment and Assessment of a First-Year College Class Social Network for a Controlled Trial of the Indirect Effect of a Brief Motivational Intervention, Nancy P. Barnett, Melissa A. Clark, Shannon R. Kenney, Graham DiGuiseppi, Matthew K. Meisel, Sara Balestrieri, Miles Q. Ott, and John Light
A Grammar for Reproducible and Painless Extract-Transform-Load Operations on Medium Data, Benjamin S. Baumer
The Impact of College Athletic Success on Donations and Applicant Quality, Benjamin Baumer and Andrew Zimbalist
Resampledata: Data sets for mathematical statistics with re- sampling in r, Laura Chihara, Tim Hesterberg, and Albert Y. Kim
Do Misperceptions of Peer Drinking Influence Personal Drinking Behavior? Results From a Complete Social Network of First-Year College Students, Melissa J. Cox, Angelo M. DiBello, Matthew K. Meisel, Miles Q. Ott, Shannon R. Kenney, Melissa A. Clark, and Nancy P. Barnett
Reduced Bias for Respondent Driven Sampling: Accounting for Non-Uniform Edge Sampling Probabilities in People Who Inject Drugs in Mauritius, Miles Q. Ott, Krista J. Gile, Matthew T. Harrison, Lisa G. Johnston, and Joseph W. Hogan
Fixed Choice Design and Augmented Fixed Choice Design for Network Data with Missing Observations, Miles Q. Ott, Matthew T. Harrison, Krista J. Gile, Nancy P. Barnett, and Joseph W. Hogan
Classification as Catachresis: Double Binds of Representing Difference with Semiotic Infrastructure, Lindsay Poirier
Data Sharing at Scale: A Heuristic for Affirming Data Cultures, Lindsay Poirier and Brandon Costelloe-Kuehn
ΔSCOPE: A New Method to Quantify 3D Biological Structures and Identify Differences in Zebrafish Forebrain Development, Morgan S. Schwartz, Jake Schnabl, Mackenzie P.H. Litz, Benjamin Baumer, and Michael Barresi
2018
U.S. College Students’ Social Network Characteristics and Perceived Social Exclusion: A Comparison Between Drinkers and Nondrinkers Based on PastMonth Alcohol Use, Sara G. Balestrieri, Graham T. DiGuiseppi, Matthew Meisel, Melissa A. Clark, Miles Q. Ott, and Nancy P. Barnett
SpatialEpi: Methods and Data for Spatial Epidemiology, Cici Chen, Albert Y. Kim, Michelle Ross, and Jon Wakefield
Relationships Between Social Network Characteristics, Alcohol Use, and Alcohol-Related Consequences in a Large Network of First-Year College Students: How Do Peer Drinking Norms Fit In?, Graham T. DiGuiseppi, Matthew K. Meisel, Sara G. Balestrieri, Miles Q. Ott, Melissa A. Clark, and Nancy P. Barnett
Resistance to Peer influence Moderates the Relationship Between Perceived (But Not Actual) Peer Norms and Binge Drinking in a College Student Social Network, Graham T. DiGuiseppi, Matthew K. Meisel, Sara G. Balestrieri, Miles Q. Ott, Melissa J. Cox, Melissa A. Clark, and Nancy P. Barnett
The fivethirtyeight R package: ‘Tame Data’ Principles for Introductory Statistics and Data Science Courses, Albert Y. Kim, Chester Ismay, and Jennifer Chunn
An Event- and Network-Level Analysis of College Students’ Maximum Drinking Day, Matthew K. Meisel, Angelo M. DiBello, Sara G. Balestrieri, Miles Q. Ott, Graham T. DiGuiseppi, Melissa A. Clark, and Nancy P. Barnett
Strategic Players for Identifying Optimal Social Network Intervention Subjects, Miles Q. Ott, John M. Light, Melissa A. Clark, and Nancy P. Barnett
A Comparative Analysis of Preservation Techniques for the Optimal Molecular Detection of Hookworm DNA in a Human Fecal Specimen, Marina Papaiakovou, Nils Pilotte, Benjamin Baumer, Jessica Grant, Kristjana Asbjornsdottir, Fabien Schaer, Yan Hu, Raffi Aroian, Judd Walson, and Steven A. Williams
2017
Lessons from Between the White Lines for Isolated Data Scientists, Benjamin Baumer
Advance Care Planning as a Shared Endeavor: Completion of ACP Documents in a Multidisciplinary Cancer Program, Melissa A. Clark, Miles Q. Ott, Michelle L. Rogers, Mary C. Politi, Susan C. Miller, Laura Moynihan, Katina Robison, Ashley Stuckey, and Don Dizon
Curriculum Guidelines for Undergraduate Programs in Data Science, Richard D. De Veaux, Mahesh Agarwal, Maia Averett, Benjamin Baumer, Andrew Bray, Thomas C. Bressoud, Lance Bryant, Lei Z. Cheng, Amanda Francis, Robert Gould, Albert Y. Kim, Matt Kretchmar, Qin Lu, Ann Moskol, Deborah Nolan, Roberto Pelayo, Sean Raleigh, Ricky J. Sethi, Mutiara Sondjaja, Neelesh Tiruviluamala, Paul X. Uhlig, Talitha M. Washington, Curtis L. Wesley, David White, and Ping Ye
Alcohol Perceptions and Behavior in a Residential Peer Social Network, Shannon R. Kenney, Miles Q. Ott, Matthew Meisel, and Nancy P. Barnett
OkCupid Data for Introductory Statistics and Data Science Courses, Albert Y. Kim and Adriana Escobedo-Land
Greater Data Science at Baccalaureate Institutions, Amelia McNamara, Nicholas J. Horton, and Benjamin S. Baumer
Devious Design: Digital Infrastructure Challenges for Experimental Ethnography, Lindsay Poirier
2016
Changing of the Guards: Strip Cover with Duty Cycling∗, Amotz Bar-Noy, Benjamin Baumer, and Dror Rawitz
The Smallest Non-Autograph, Benjamin Baumer, Yijin Wei, and Gary S. Bloom
Pushback: Critical Data Designers and Pollution Politics, Kim Fortun, Lindsay Poirier, Alli Morgan, Brandon Costelloe-Kuehn, and Mike Fortun
A Bayesian Method for Cluster Detection with Application to Five Cancer Sites in Puget Sound, Albert Y. Kim and Jon Wakefield
strategicplayers: Strategic Players. R package version 1.0., Miles Q. Ott
Unequal Edge Inclusion Probabilities in Link-Tracing Network Sampling With Implications for Respondent-Driven Sampling, Miles Q. Ott and Krista J. Gile
Bayesian Peer Calibration with Application to Alcohol Use, Miles Q. Ott, Joseph W. Hogan, Krista J. Gile, Crystal Linkletter, and Nancy P. Barnett
2015
Average Case Network Lifetime on an Interval with Adjustable Sensing Ranges, Amotz Bar-Noy and Benjamin Baumer
Set It and Forget It: Approximating the Set Once Strip Cover Problem, Amotz Bar-Noy, Benjamin Baumer, and Dror Rawitz
A Data Science Course for Undergraduates: Thinking with Data, Benjamin Baumer
OpenWAR: An Open Source System for Evaluating Overall Player Performance in Major League Baseball, Benjamin S. Baumer, Shane T. Jensen, and Gregory J. Matthews
Data Science in Statistics Curricula: Preparing Students to “Think with Data”, J. Hardin, R. Hoerl, Nicholas J. Horton, D. Nolan, B. Baumer, O. Hall-Holt, P. Murrell, R. Peng, P. Roback, D. Temple Lang, and M. D. Ward
2014
Quantifying Market Inefficiencies in the Baseball Players’ Market, Benjamin Baumer and Andrew Zimbalist
2013
As Strong as the Weakest Link: Mining Diverse Cliques in Weighted Graphs, Petko Bogdanov, Ben Baumer, Prithwish Basu, Amotz Bar-Noy, and Ambuj K. Singh
Repeated Changes in Reported Sexual Orientation Identity Linked to Substance Use Behaviors in Youth, Miles Q. Ott, David Wypij, Heather L. Corliss, Margaret Rosario, Sari L. Reisner, Allegra R. Gordon, and S. Bryn Austin
A Bayesian Model for Cluster Detection, Jonathan Wakefield and Albert Y. Kim
2012
Maximizing Network Lifetime on the Line with Adjustable Sensing Ranges, Amotz Bar-Noy and Ben Baumer
Parsing the Relationship Between Baserunning and Batting Abilities Within Lineups, Ben S. Baumer, James Piette, and Brad Null
2011
Age-Gaps in Sexual Partnerships: Seeing Beyond ‘Sugar Daddies’, Miles Q. Ott, Till Bärnighausen, Frank Tanser, Mark N. Lurie, and Marie-Louise Newell
Stability and Change in Self-Reported Sexual Orientation Identity in Young People: Application of Mobility Metrics, Miles Q. Ott, Heather L. Corliss, David Wypij, Margaret Rosario, and S. Bryn Austin
2009
Using Labeled Data to Evaluate Change Detectors in a Multivariate Streaming Environment, Albert Y. Kim, Caren Marzban, Donald B. Percival, and Werner Stuetzle
2007
Lessons Learned from the 1918-1919 Influenza Pandemic in Minneapolis and St. Paul, Minnesota, Miles Q. Ott, Shelly F. Shaw, Richard N. Danila, and Ruth Lynfield