2024
Examining Information Systems Use to Facilitate the Workplace Accommodation Process, Shiya Cao
2023
Editor’s note: On Fairness in Sports Analytics, Benjamin Baumer
Data Science Transfer Pathways from Associate's to Bachelor's Programs, Benjamin S. Baumer and Nicholas J. Horton
Data Science Corps Wrangle-Analyze-Visualize (DSC-WAV) project, Benjamin S. Baumer, Nicholas J. Horton, Ethan Meyers, and Andrea Dustin
Big Ideas in Sports Analytics and Statistical Tools for their Investigation, Benjamin S. Baumer, Gregory J. Matthews, and Quang Nguyen
Psychometric Properties of a Combined Go/No-Go and Continuous Performance Task Across Childhood, Caron A.C. Clark, Kaitlyn Cook, Rui Wang, Michael Rueschman, Jerilynn Radcliffe, Susan Redline, and H. Gerry Taylor
Population Modeling with Machine Learning can Enhance Measures of Mental Health - Open-Data Replication, Ty Easley, Ruiqi Chen, Kayla Hannon, Rosie Dutt, and Janine Bijsterbosch
repytah: An Open-Source Python Package for Building Aligned Hierarchies for Sequential Data, Chenhui Jia, Lizette Carpenter, Thu Tran, Amanda Y. Liu, Sasha Yeutseyva, Mariun Tapal, Yingke Wang, Zoie Kexin Zhou, Jordan Moody, Denise Nava, Eleanor Donaher, Lillian Yusha Jiang, Ben Bruncati, and Katherine M. Kinnaird
Attending to the Cultures of Data Science Work, Lindsay Poirier
Adenotonsillectomy for Snoring and Mild Sleep Apnea in Children: A Randomized Clinical Trial, Susan Redline, Kaitlyn Cook, Ronald D. Chervin, Stacey Ishman, Cristina M. Baldassari, Ron B. Mitchell, Ignacio E. Tapia, Raouf Amin, Fauziya Hassan, Sally Ibrahim, Kristie Ross, Lisa M. Elden, Erin M. Kirkham, David Zopf, Jay Shah, Todd Otteson, Kamal Naqvi, Judith Owens, Lisa Young, Susan Furth, Heidi Connolly, Caron A.C. Clark, Jessie P. Bakker, Susan Garetz, Jerilynn Radcliffe, H. Gerry Taylor, Carol L. Rosen, and Rui Wang
Evaluation of EDISON's Data Science Competency Framework Through a Comparative Literature Analysis, Karl R. B. Schmitt, Linda Clark, Katherine M. Kinnaird, Ruth E. H. Wertz, and Björn Sandstede
Comparison of Caregiver- and Child-Reported Quality of Life in Children With Sleep-Disordered Breathing, Phoebe Kuo Yu, Kaitlyn Cook, Jiayan Liu, Raouf S. Amin, Craig Derkay, Lisa M. Elden, Susan L. Garetz, Alisha S. George, Sally Ibrahim, Stacey L. Ishman, Erin M. Kirkham, S. Kamal Naqvi, Jerilynn Radcliffe, Kristie R. Ross, Gopi B. Shah, Ignacio E. Tapia, H. Gerry Taylor, David A. Zopf, Susan Redline, and Cristina M. Baldassari
2022
Symposium Focuses on Opportunities for Massachusetts Community Colleges, Benjamin Baumer, Nicholas J. Horton, Ethan Meyers, and Andrea Dustin
Prospects for Lattice QFTs on Curved Riemann Manifolds, Richard C. Brower, Casey E. Berger, George T. Fleming, Andrew D. Gasbarro, Evan K. Owen, Timothy G. Raben, Chung I. Tan, and Evan S. Weinberg
An Educator’s Perspective of the Tidyverse, Mine Çetinkaya-Rundel, Johanna Hardin, Benjamin Baumer, Amelia McNamara, Nicholas J. Horton, and Colin W. Rundel
A Multistate Competing Risks Framework for Preconception Prediction of Pregnancy Outcomes, Kaitlyn Cook, Neil J. Perkins, Enrique Schisterman, and Sebastien Haneuse
Mental Health in the UK Biobank: A Roadmap to Self-Report Measures and Neuroimaging Correlates, Rosie K. Dutt, Kayla Hannon, Ty O. Easley, Joseph C. Griffis, Wei Zhang, and Janine D. Bijsterbosch
Implementing GitHub Actions Continuous Integration to Reduce Error Rates in Ecological Data Collection, Albert Y. Kim, Valentine Herrmann, Ross Barreto, Brianna Calkins, Erika Gonzalez-Akre, Daniel J. Johnson, Jennifer A. Jordan, Lukas Magee, Ian R. McGregor, Nicolle Montero, Karl Novak, Teagan Rogers, Jessica Shue, and Kristina J. Anderson-Teixeira
Quantum Counter-Terms for Lattice Field Theory on Curved Manifolds, Evan K. Owen, Casey E. Berger, Richard C. Brower, George T. Fleming, Andrew D. Gasbarro, and Timothy G. Raben
Accountable Data: The Politics and Pragmatics of Disclosure Datasets, Lindsay Poirier
Comparison of Caregiver- and Child-Reported Quality of Life in Children With Sleep-Disordered Breathing, Phoebe Kuo Yu, Kaitlyn Cook, Jiayan Liu, Raouf S. Amin, Craig Derkay, Lisa M. Elden, Susan L. Garetz, Alisha S. George, Sally Ibrahim, Stacey L. Ishman, Erin M. Kirkham, S. Kamal Naqvi, Jerilynn Radcliffe, Kristie R. Ross, Gopi B. Shah, Ignacio E. Tapia, H. Gerry Taylor, David A. Zopf, Susan Redline, and Cristina M. Baldassari
2021
Modern Data Science with R: Second Edition, Benjamin Baumer, Daniel T. Kaplan, and Nicholas J. Horton
Complex Langevin and Other Approaches to the Sign Problem in Quantum Many-Body Physics, Casey E. Berger, L. Rammelmüller, A. C. Loheac, F. Ehmann, J. Braun, and J. E. Drut
Estimation of Conditional Power for Cluster-Randomized Trials with Interval-Censored Endpoints, Kaitlyn Cook and Rui Wang
Infer: An R Package for Tidyverse-Friendly Statistical Inference, Simon P. Couch, Andrew P. Bray, Chester Ismay, Evgeni Chasnovski, B. Baumer, and Mine Çetinkaya-Rundel
The Data Science Corps Wrangle-Analyze-Visualize Program: Building Data Acumen for Undergraduate Students, Nicholas J. Horton, Benjamin Baumer, Andrew Zieffler, and Valerie Barr
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse, Chester Ismay and Albert Y. Kim
Moving Ethnography: Infrastructuring Doubletakes and Switchbacks in Experimental Collaborative Methods, Aalok Khandekar, Brandon Costelloe-Kuehn, Lindsay Poirier, Alli Morgan, Alison Kenner, Kim Fortun, and Mike Fortun
The Forestecology R Package for Fitting and Assessing Neighborhood Models of the Effect of Interspecific Competition on the Growth of Trees, Albert Y. Kim, David N. Allen, and Simon P. Couch
Automatic Hierarchy Expansion for Improved Structure and Chord Evaluation, Katherine M. Kinnaird and Brian McFee
Facilitating Team-Based Data Science: Lessons Learned from the DSC-WAV Project, Chelsey Legacy, Andrew Zieffler, Benjamin S. Baumer, Valerie Barr, and Nicholas J. Horton
An Integrated Magneto-Electrochemical Device for the Rapid Profiling of Tumour Extracellular Vesicles from Blood Plasma, Jongmin Park, Jun Seok Park, Chen Han Huang, Ala Jo, Kaitlyn Cook, Rui Wang, Hsing Ying Lin, Jan Van Deun, Huiyan Li, Jouha Min, Lan Wang, Ghilsuk Yoon, Bob S. Carter, Leonora Balaj, Gyu Seog Choi, Cesar M. Castro, Ralph Weissleder, and Hakho Lee
Reading Datasets: Strategies for Interpreting the Politics of Data Signification, Lindsay Poirier
2020
Quantitative Associations Between Health Insurance and Stage of Melanoma at Diagnosis among Nonelderly Adults in the United States, Boya Abudu, Kaitlyn A. Cook, Jeffrey E. Gershenwald, Philip R. Cohen, and Alan C. Geller
A Permutation Test and Spatial Cross-Validation Approach to Assess Models of Interspecific Competition Between Trees, David Allen and Albert Y. Kim
Teaching Introductory Statistics with DataCamp, Benjamin Baumer, Andrew P. Bray, Mine Çetinkaya-Rundel, and Johanna S. Hardin
Integrating Data Science Ethics into an Undergraduate Major, Benjamin Baumer, Randi L. Garcia, Albert Y. Kim, Katherine M. Kinnaird, and Miles Q. Ott
Thermodynamics of Rotating Quantum Matter in the Virial Expansion, Casey E. Berger, K. J. Morrell, and J. E. Drut
Creating Optimal Conditions for Reproducible Data Analysis in R with ‘Fertile’, Audrey M. Bertin and Benjamin Baumer
The Influence of Peer and Parental Norms on First-Generation College Students’ Binge Drinking Trajectories, Graham T. DiGuiseppi, Jordan P. Davis, Matthew K. Meisel, Melissa A. Clark, Mya L. Roberson, Miles Q. Ott, and Nancy P. Barnett
Slack for (A)synchronous Course Communication, Albert Y. Kim, R. Jordan Crouser, and Benjamin Baumer
“Playing the Whole Game”: A Data Collection and Analysis Exercise With Google Calendar, Albert Y. Kim and Johanna Hardin
Teaching Computational Machine Learning (without Statistics), Katherine M. Kinnaird
Identification and Description of Potentially Influential Social Network Members using the Strategic Player Approach, Miles Q. Ott, Sara G. Balestrieri, Graham DiGuiseppi, Melissa A. Clark, Michael Bernstein, Sarah Helseth, and Nancy P. Barnett
SuPP & MaPP: Adaptable Structure-Based Representations for MIR Tasks, Claire Savard, Erin H. Bugbee, Melissa R. McGuirl, and Katherine M. Kinnaird
2019
Enrollment and Assessment of a First-Year College Class Social Network for a Controlled Trial of the Indirect Effect of a Brief Motivational Intervention, Nancy P. Barnett, Melissa A. Clark, Shannon R. Kenney, Graham DiGuiseppi, Matthew K. Meisel, Sara Balestrieri, Miles Q. Ott, and John Light
A Grammar for Reproducible and Painless Extract-Transform-Load Operations on Medium Data, Benjamin S. Baumer
The Impact of College Athletic Success on Donations and Applicant Quality, Benjamin Baumer and Andrew Zimbalist
Resampledata: Data Sets for Mathematical Statistics with Resampling in R, Laura Chihara, Tim Hesterberg, and Albert Y. Kim
Do Misperceptions of Peer Drinking Influence Personal Drinking Behavior? Results From a Complete Social Network of First-Year College Students, Melissa J. Cox, Angelo M. DiBello, Matthew K. Meisel, Miles Q. Ott, Shannon R. Kenney, Melissa A. Clark, and Nancy P. Barnett
Third- and Fourth-Order Virial Coefficients of Harmonically Trapped Fermions in a Semiclassical Approximation, K. J. Morrell, Casey E. Berger, and J. E. Drut
Reduced Bias for Respondent Driven Sampling: Accounting for Non-Uniform Edge Sampling Probabilities in People Who Inject Drugs in Mauritius, Miles Q. Ott, Krista J. Gile, Matthew T. Harrison, Lisa G. Johnston, and Joseph W. Hogan
Fixed Choice Design and Augmented Fixed Choice Design for Network Data with Missing Observations, Miles Q. Ott, Matthew T. Harrison, Krista J. Gile, Nancy P. Barnett, and Joseph W. Hogan
Classification as Catachresis: Double Binds of Representing Difference with Semiotic Infrastructure, Lindsay Poirier
Data Sharing at Scale: A Heuristic for Affirming Data Cultures, Lindsay Poirier and Brandon Costelloe-Kuehn
ΔSCOPE: A New Method to Quantify 3D Biological Structures and Identify Differences in Zebrafish Forebrain Development, Morgan S. Schwartz, Jake Schnabl, Mackenzie P.H. Litz, Benjamin Baumer, and Michael Barresi
2018
U.S. College Students’ Social Network Characteristics and Perceived Social Exclusion: A Comparison Between Drinkers and Nondrinkers Based on Past-Month Alcohol Use, Sara G. Balestrieri, Graham T. DiGuiseppi, Matthew Meisel, Melissa A. Clark, Miles Q. Ott, and Nancy P. Barnett
Interacting Bosons at Finite Angular Momentum via Complex Langevin, Casey E. Berger and Joaquín E. Drut
SpatialEpi: Methods and Data for Spatial Epidemiology, Cici Chen, Albert Y. Kim, Michelle Ross, and Jon Wakefield
Relationships Between Social Network Characteristics, Alcohol Use, and Alcohol-Related Consequences in a Large Network of First-Year College Students: How Do Peer Drinking Norms Fit In?, Graham T. DiGuiseppi, Matthew K. Meisel, Sara G. Balestrieri, Miles Q. Ott, Melissa A. Clark, and Nancy P. Barnett
Resistance to Peer Influence Moderates the Relationship Between Perceived (But Not Actual) Peer Norms and Binge Drinking in a College Student Social Network, Graham T. DiGuiseppi, Matthew K. Meisel, Sara G. Balestrieri, Miles Q. Ott, Melissa J. Cox, Melissa A. Clark, and Nancy P. Barnett
The fivethirtyeight R package: ‘Tame Data’ Principles for Introductory Statistics and Data Science Courses, Albert Y. Kim, Chester Ismay, and Jennifer Chunn
An Event- and Network-Level Analysis of College Students’ Maximum Drinking Day, Matthew K. Meisel, Angelo M. DiBello, Sara G. Balestrieri, Miles Q. Ott, Graham T. DiGuiseppi, Melissa A. Clark, and Nancy P. Barnett
Strategic Players for Identifying Optimal Social Network Intervention Subjects, Miles Q. Ott, John M. Light, Melissa A. Clark, and Nancy P. Barnett
A Comparative Analysis of Preservation Techniques for the Optimal Molecular Detection of Hookworm DNA in a Human Fecal Specimen, Marina Papaiakovou, Nils Pilotte, Benjamin Baumer, Jessica Grant, Kristjana Asbjornsdottir, Fabien Schaer, Yan Hu, Raffi Aroian, Judd Walson, and Steven A. Williams
2017
Lessons from Between the White Lines for Isolated Data Scientists, Benjamin Baumer
Advance Care Planning as a Shared Endeavor: Completion of ACP Documents in a Multidisciplinary Cancer Program, Melissa A. Clark, Miles Q. Ott, Michelle L. Rogers, Mary C. Politi, Susan C. Miller, Laura Moynihan, Katina Robison, Ashley Stuckey, and Don Dizon
Curriculum Guidelines for Undergraduate Programs in Data Science, Richard D. De Veaux, Mahesh Agarwal, Maia Averett, Benjamin Baumer, Andrew Bray, Thomas C. Bressoud, Lance Bryant, Lei Z. Cheng, Amanda Francis, Robert Gould, Albert Y. Kim, Matt Kretchmar, Qin Lu, Ann Moskol, Deborah Nolan, Roberto Pelayo, Sean Raleigh, Ricky J. Sethi, Mutiara Sondjaja, Neelesh Tiruviluamala, Paul X. Uhlig, Talitha M. Washington, Curtis L. Wesley, David White, and Ping Ye
Alcohol Perceptions and Behavior in a Residential Peer Social Network, Shannon R. Kenney, Miles Q. Ott, Matthew Meisel, and Nancy P. Barnett
OkCupid Data for Introductory Statistics and Data Science Courses, Albert Y. Kim and Adriana Escobedo-Land
Greater Data Science at Baccalaureate Institutions, Amelia McNamara, Nicholas J. Horton, and Benjamin S. Baumer
Devious Design: Digital Infrastructure Challenges for Experimental Ethnography, Lindsay Poirier
2016
Changing of the Guards: Strip Cover with Duty Cycling, Amotz Bar-Noy, Benjamin Baumer, and Dror Rawitz
The Smallest Non-Autograph, Benjamin Baumer, Yijin Wei, and Gary S. Bloom
Hard-Wall and Non-Uniform Lattice Monte Carlo Approaches to One-Dimensional Fermi Gases in a Harmonic Trap, Casey E. Berger, Joaquín E. Drut, and William J. Porter
A Bayesian Framework for the Classification of Microbial Gene Activity States, Craig Disselkoen, Brian Greco, Kaitlyn Cook, Kristin Koch, Reginald Lerebours, Chase Viss, Joshua Cape, Elizabeth Held, Yonatan Ashenafi, Karen Fischer, Allyson Acosta, Mark Cunningham, Aaron A. Best, Matthew DeJongh, and Nathan Tintle
Pushback: Critical Data Designers and Pollution Politics, Kim Fortun, Lindsay Poirier, Alli Morgan, Brandon Costelloe-Kuehn, and Mike Fortun
A General Method for Combining Different Family-Based Rare-Variant Tests of Association to Improve Power and Robustness of a Wide Range of Genetic Architectures, Alden Green, Kaitlyn Cook, Kelsey Grinde, Alessandra Valcarcel, and Nathan Tintle
A Bayesian Method for Cluster Detection with Application to Five Cancer Sites in Puget Sound, Albert Y. Kim and Jon Wakefield
Harmonically Trapped Fermions in Two Dimensions: Ground-State Energy and Contact of SU(2) and SU(4) Systems via a Nonuniform Lattice Monte Carlo Method, Zhihuan Luo, Casey E. Berger, and Joaquín E. Drut
strategicplayers: Strategic Players. R package version 1.0, Miles Q. Ott
Unequal Edge Inclusion Probabilities in Link-Tracing Network Sampling With Implications for Respondent-Driven Sampling, Miles Q. Ott and Krista J. Gile
Bayesian Peer Calibration with Application to Alcohol Use, Miles Q. Ott, Joseph W. Hogan, Krista J. Gile, Crystal Linkletter, and Nancy P. Barnett
A Multistep Approach to Single Nucleotide Polymorphism-Set Analysis: An Evaluation of Power and Type I Error of Gene-Based Tests of Association after Pathway-Based Association Tests, Alessandra Valcarcel, Kelsey Grinde, Kaitlyn Cook, Alden Green, and Nathan Tintle
2015
Average Case Network Lifetime on an Interval with Adjustable Sensing Ranges, Amotz Bar-Noy and Benjamin Baumer
Set It and Forget It: Approximating the Set Once Strip Cover Problem, Amotz Bar-Noy, Benjamin Baumer, and Dror Rawitz
A Data Science Course for Undergraduates: Thinking with Data, Benjamin Baumer
OpenWAR: An Open Source System for Evaluating Overall Player Performance in Major League Baseball, Benjamin S. Baumer, Shane T. Jensen, and Gregory J. Matthews
Energy, Contact, and Density Profiles of One-Dimensional Fermions in a Harmonic Trap via Nonuniform-Lattice Monte Carlo Calculations, Casey E. Berger, E. R. Anderson, and J. E. Drut
Data Science in Statistics Curricula: Preparing Students to “Think with Data”, J. Hardin, R. Hoerl, Nicholas J. Horton, D. Nolan, B. Baumer, O. Hall-Holt, P. Murrell, R. Peng, P. Roback, D. Temple Lang, and M. D. Ward
2014
Quantifying Market Inefficiencies in the Baseball Players’ Market, Benjamin Baumer and Andrew Zimbalist
Evaluating the Impact of Genotype Errors on Rare Variant Tests of Association, Kaitlyn Cook, Alejandra Benitez, Casey Fu, and Nathan Tintle
2013
As Strong as the Weakest Link: Mining Diverse Cliques in Weighted Graphs, Petko Bogdanov, Ben Baumer, Prithwish Basu, Amotz Bar-Noy, and Ambuj K. Singh
Repeated Changes in Reported Sexual Orientation Identity Linked to Substance Use Behaviors in Youth, Miles Q. Ott, David Wypij, Heather L. Corliss, Margaret Rosario, Sari L. Reisner, Allegra R. Gordon, and S. Bryn Austin
A Bayesian Model for Cluster Detection, Jonathan Wakefield and Albert Y. Kim
2012
Maximizing Network Lifetime on the Line with Adjustable Sensing Ranges, Amotz Bar-Noy and Ben Baumer
Parsing the Relationship Between Baserunning and Batting Abilities Within Lineups, Ben S. Baumer, James Piette, and Brad Null
2011
Age-Gaps in Sexual Partnerships: Seeing Beyond ‘Sugar Daddies’, Miles Q. Ott, Till Bärnighausen, Frank Tanser, Mark N. Lurie, and Marie-Louise Newell
Stability and Change in Self-Reported Sexual Orientation Identity in Young People: Application of Mobility Metrics, Miles Q. Ott, Heather L. Corliss, David Wypij, Margaret Rosario, and S. Bryn Austin
2009
Using Labeled Data to Evaluate Change Detectors in a Multivariate Streaming Environment, Albert Y. Kim, Caren Marzban, Donald B. Percival, and Werner Stuetzle