Publications from the Data Science for Social Good Fellowship Program

Peer-Reviewed Publications

Predictive Fairness to Reduce Misdemeanor Recidivism Through Social Service Interventions.
K. Rodolfa; E. Salomon; L. Haynes; I. Mendieta; J. Larson; R. Ghani. Proceedings of the ACM Conference on Fairness, Accountability, and Transparency (ACM FAT*) 2020.

An Experience-Centered Approach to Training Effective Data Scientists.
Kit T Rodolfa, Adolfo De Unanue, Matt Gee, and Rayid Ghani. Big Data Journal. 2019.

Using Machine Learning to Help Vulnerable Tenants in New York City. Teng Ye, Rebecca Johnson, Samantha Fu, Jerica Copeny, Bridgit Donnelly, Alex Freeman, Mirian Lima, Joe Walsh, and Rayid Ghani. Proceedings of the 2nd ACM SIGCAS Conference on Computing and Sustainable Societies (COMPASS ’19). ACM, New York, NY, USA, 248-258.

Improving Government Response to Citizen Requests Online.
Garren Gaut, Andrea Navarette, Laila Wahedi, Paul van der Boor, Adolfo de Unánue, Jorge Díaz, Eduardo Clark, and Rayid Ghani. Proceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies (COMPASS), 2018.

Reducing Incarceration through Prioritized Interventions.
Matt Bauman, Kate Boxer, Tzu-Yun Lin, Erika Salomon, Hareem Naveed, Lauren Haynes, Joe Walsh, Jen Helsby, Steve Yoder, Robert Sullivan, Chris Schneweis, and Rayid Ghani. Proceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies (COMPASS), 2018.

Using Machine Learning to Assess the Risk of and Prevent Water Main Breaks.
Avishek Kumar, Syed Ali Asad Rizvi, Benjamin Brooks, R. Ali Vanderveld, Kevin H. Wilson, Chad Kenney, Sam Edelstein, Adria Finch, Andrew Maxwell, Joe Zuckerbraun, and Rayid Ghani. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2018.

Deploying Machine Learning Models for Public Policy: A Framework.
Klaus Ackermann, Joe Walsh, Adolfo De Unánue, Hareem Naveed, Andrea Navarrete Rivera, Sun-Joo Lee, Jason Bennett, Michael Defoe, Crystal Cody, Matt Morley, Lauren Haynes, and Rayid Ghani. Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2018.

Machine Learning for Social Services: A Case Study of Prenatal Case Management in Illinois.
Ian Pan, Laura Nolan, Rashida Brown, Paul van der Boor, Romana Khan, Rayid Ghani, Dan Harris. American Journal of Public Health, 2017.

Early Intervention Systems: Predicting Adverse Interactions Between Police and the Public.
Jennifer Helsby, Samuel Carton, Kenneth Joseph, Ayesha Mahmoud, Youngsoo Park, Andrea Navarrete, Klaus Ackermann, Joe Walsh, Lauren Haynes, Crystal Cody, Major Estella Patterson, and Rayid Ghani. Criminal Justice Policy Review, 2017.

Status of Breastfeeding and Child Immunization Outcomes in Clients of the Nurse–Family Partnership.
William Thorland, Dustin Currie, Emily R. Wiegand, Joe Walsh, and Nick Mader. Maternal and Child Health Journal, 2017.

Detecting Fraud, Corruption, and Collusion in International Development Contracts: The Design of a Proof-of-Concept Automated System.
Emily Grace, Ankit Rai, Elissa Redmiles, and Rayid Ghani. IEEE International Conference on Big Data, 2016.

Identifying Police Officers at Risk of Adverse Events.
Samuel Carton, Jennifer Helsby, Kenneth Joseph, Ayesha Mahmoud, Youngsoo Park, Joe Walsh, Crystal Cody, CPT Estella Patterson, Lauren Haynes, Rayid Ghani. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.

The Legislative Influence Detector: Finding Text Reuse in State Legislation.
Matthew Burgess, Eugenia Giraudy, Julian Katz-Samuels, Joe Walsh, Derek Willis, Lauren Haynes, Rayid Ghani. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.

Designing Policy Recommendations to Reduce Home Abandonment in Mexico.
Klaus Ackermann, Eduardo Blancas Reyes, Sue He, Thomas Anderson Keller, Paul van der Boor, Romana Khan. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.

Identifying Earmarks in Congressional Bills.
Ellery Wulczyn, Madian Khabsa, Vrushank Vora, Matthew Heston, Joe Walsh, Chris Berry, Rayid Ghani. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.

Early detection of properties at risk of blight using spatiotemporal data.
Eduardo Blancas Reyes, Jennifer Helsby, Katharina Rasch, Talia Kaufmann, Paul van der Boor, Lauren Haynes and Rayid Ghani. Data for Policy International Conference, 2016.

Building Better Early Intervention Systems.
Crystal Cody, Estella Patterson, Kerr Putney, Jennifer Helsby, Joe Walsh, Lauren Haynes, Rayid Ghani, Samuel Carton, Kenneth Joseph, Ayesha Mahmoud, and Youngsoo Park. Police Chief Magazine, 2016.

A Machine Learning Framework to Identify Students at Risk of Adverse Academic Outcomes.
Himabindu Lakkaraju, Everaldo Aguiar, David Miller, Nasir Bhanpuri, Rayid Ghani, Kecia Addison. Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015.

Predictive Modeling for Public Health: Preventing Childhood Lead Poisoning.
Eric Potash, Joe Brew, Alexander Loewi, Subhabrata Majumdar, Andrew Reece, Joe Walsh, Eric Rozier, Emile Jorgenson, Raed Mansour, Rayid Ghani. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015.

Mining Administrative Data to Spur Urban Revitalization.
Ben Green, Alejandra Caro, Matthew Conway, Robert Manduca, Tom Plagge, Abby Miller. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015.

Early Prediction of Code Blue Using Electronic Medical Records.
Sriram Somanchi, Samrachana Adhikari, Allen Lin, Elena Eneva, and Rayid Ghani. Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015.

Who, When, and Why: A Machine Learning Approach to Prioritizing Students at Risk of not Graduating High School on Time.
Everaldo Aguiar, Himabindu Lakkaraju, Nasir Bhanpuri, David Miller, Ben Yuhas, Kecia Addison, Shihching Liu, Marilyn Powell, and Rayid Ghani. 5th International Learning Analytics and Knowledge (LAK) Conference 2015.

Early Code Blue Prediction Using Patient Medical Records.
Sriram Somanchi, Samrachana Adhikari, Allen Lin, Elena Eneva, and Rayid Ghani. Workshop on Machine Learning for Clinical Data Analysis and Healthcare – held with NIPS 2013.

Tweedr: Mining Twitter to Inform Disaster Response.
Z. Ashktorab, C. Brown, M. Nandi, A. Culotta.  The 11th International Conference on Information Systems for Crisis Response and Management (ISCRAM), 2014.

Conference Presentations:

Identifying Police Officers at Risk of Adverse Events. Sam Carton, Jen Helsby, Kenny Joseph, Mahmud, A., Youngsoo Park, Joe Walsh, Lauren Haynes, and Rayid Ghani (2016).  Population Association of America conference, Washington, DC, April 2016.

Detecting Text Reuse in State Legislative Bills.  Joe Walsh, Matthew Burgess, Eugenia Giraudy, Julian Katz-Samuels, Derek Willis, Rayid Ghani. Joint Statistical Meetings, 2016.

Predictive Modeling for Public Health: Preventing Childhood Lead Poisoning. Eric Potash, Joe Brew, Alexander Loewi, Subhabrata Majumdar, Andrew Reece, Joe Walsh, Eric Rozier, Emile Jorgensen, Raed Mansour, Rayid Ghani. Data for Good Exchange, 2015.

Blight reduction and neighborhood revitalization for the city of Cincinnati. Data for Good Exchange, 2015.

Identifying Earmarks in Congressional Bills. Data for Good Exchange, 2015. (D4GX Award Winner)

Reducing Home Abandonment in Mexico. Data for Good Exchange, 2015.

A Machine Learning Framework for Predicting College Persistence. Data for Good Exchange, 2015.

Predicting Adverse Births in Illinois to Improve Resource Allocation for the Better Birth Outcomes Program. Data for Good Exchange, 2015.

A Data-Driven Framework for Identifying High School Students at Risk of Not Graduating on Time. Data for Good Exchange, 2015.