Publications from the Data Science for Social Good Fellowship Program

Peer-Reviewed Publications

Reducing Incarceration Through Prioritized Interventions. Erika Salomon, Matthew J. Bauman, Tzu-Yun Lin, Kate Boxer, Hareem Naveed, Lauren Haynes, Joe Walsh, Jen Helsby, Steve Yoder, Robert Sullivan, Chris Schneweis, Rayid Ghani. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2017. [Under review]

Status of Breastfeeding and Child Immunization Outcomes in Clients of the Nurse–Family Partnership. William Thorland, Dustin Currie, Emily R. Wiegand, Joe Walsh, and Nick Mader. Maternal and Child Health Journal, 2017.

Detecting Fraud, Corruption, and Collusion in International Development Contracts: The Design of a Proof-of-Concept Automated System. Emily Grace, Ankit Rai, Elissa Redmiles, and Rayid Ghani. IEEE International Conference on Big Data, 2016.

Identifying Police Officers at Risk of Adverse Events. Samuel Carton, Jennifer Helsby, Kenneth Joseph, Ayesha Mahmoud, Youngsoo Park, Joe Walsh, Crystal Cody, CPT Estella Patterson, Lauren Haynes, Rayid Ghani. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.

The Legislative Influence Detector: Finding Text Reuse in State Legislation. Matthew Burgess, Eugenia Giraudy, Julian Katz-Samuels, Joe Walsh, Derek Willis, Lauren Haynes, Rayid Ghani. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.

Designing Policy Recommendations to Reduce Home Abandonment in Mexico. Klaus Ackermann, Eduardo Blancas Reyes, Sue He, Thomas Anderson Keller, Paul van der Boor, Romana Khan. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.

Identifying Earmarks in Congressional Bills. Ellery Wulczyn, Madian Khabsa, Vrushank Vora, Matthew Heston, Joe Walsh, Chris Berry, Rayid Ghani. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.

Early detection of properties at risk of blight using spatiotemporal data. Eduardo Blancas Reyes, Jennifer Helsby, Katharina Rasch, Talia Kaufmann, Paul van der Boor, Lauren Haynes and Rayid Ghani. Data for Policy International Conference, 2016.

Building Better Early Intervention Systems. Crystal Cody, Estella Patterson, Kerr Putney, Jennifer Helsby, Joe Walsh, Rayid Ghani, Samuel Carton, Kenneth Joseph, Ayesha Mahmoud, and Youngsoo Park. Police Chief Magazine, 2016.

A Machine Learning Framework to Identify Students at Risk of Adverse Academic Outcomes. Himabindu Lakkaraju, Everaldo Aguiar, David Miller, Nasir Bhanpuri, Rayid Ghani, Kecia Addison. Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015.

Predictive Modeling for Public Health: Preventing Childhood Lead Poisoning.
Eric Potash, Joe Brew, Alexander Loewi, Subhabrata Majumdar, Andrew Reece, Joe Walsh, Eric Rozier, Emile Jorgensen, Raed Mansour, Rayid Ghani. Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015.

Mining Administrative Data to Spur Urban Revitalization.
Ben Green, Alejandra Caro, Matthew Conway, Robert Manduca, Tom Plagge, Abby Miller. Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015.

Early Prediction of Code Blue Using Electronic Medical Records.
Sriram Somanchi, Samrachana Adhikari, Allen Lin, Elena Eneva, and Rayid Ghani. Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015.

Who, When, and Why: A Machine Learning Approach to Prioritizing Students at Risk of not Graduating High School on Time.
Everaldo Aguiar, Himabindu Lakkaraju, Nasir Bhanpuri, David Miller, Ben Yuhas, Kecia Addison, Shihching Liu, Marilyn Powell, and Rayid Ghani. 5th International Learning Analytics and Knowledge (LAK) Conference, 2015.

Early Code Blue Prediction Using Patient Medical Records.
Sriram Somanchi, Samrachana Adhikari, Allen Lin, Elena Eneva, and Rayid Ghani. Workshop on Machine Learning for Clinical Data Analysis and Healthcare – held with NIPS 2013.

Tweedr: Mining Twitter to Inform Disaster Response.
Z. Ashktorab, C. Brown, M. Nandi, A. Culotta. The 11th International Conference on Information Systems for Crisis Response and Management (ISCRAM), 2014.

Conference Presentations

Identifying Police Officers at Risk of Adverse Events. Sam Carton, Jen Helsby, Kenny Joseph, Ayesha Mahmoud, Youngsoo Park, Joe Walsh, Lauren Haynes, and Rayid Ghani. Population Association of America conference, Washington, DC, April 2016.

Detecting Text Reuse in State Legislative Bills. Joe Walsh, Matthew Burgess, Eugenia Giraudy, Julian Katz-Samuels, Derek Willis, Rayid Ghani. Joint Statistical Meetings, 2016.

Predictive Modeling for Public Health: Preventing Childhood Lead Poisoning. Eric Potash, Joe Brew, Alexander Loewi, Subhabrata Majumdar, Andrew Reece, Joe Walsh, Eric Rozier, Emile Jorgensen, Raed Mansour, Rayid Ghani. Data for Good Exchange, 2015.

Blight reduction and neighborhood revitalization for the city of Cincinnati. Data for Good Exchange, 2015.

Identifying Earmarks in Congressional Bills. Data for Good Exchange, 2015.

Reducing Home Abandonment in Mexico. Data for Good Exchange, 2015.

A Machine Learning Framework for Predicting College Persistence. Data for Good Exchange, 2015.

Predicting Adverse Births in Illinois to Improve Resource Allocation for the Better Birth Outcomes Program. Data for Good Exchange, 2015.

A Data-Driven Framework for Identifying High School Students at Risk of Not Graduating on Time. Data for Good Exchange, 2015.