Fall 2023 Symposium
The Data Science Discovery Symposium was held on December 5th at the Hearst Mining Building! Photos from our event can be found here.
Fall 2023 Award Winners

Cross-Regional Lead Service Line Prediction using Machine Learning
Data Science Insights Award
Jasmine Andrade, Pranav Walimbe, Andrew Kerr, Akash Iyer, Melad, Freddie Pham

The Data Initiative for Environmental and Climate Action in California's TK–12 Schools
Team Collaboration Award
Avani Gireesha, Cole Hesterberg, Isaac Gonzalez, Jessica Hsu, Michelle Lin

Project AEI
Data Visualization Award
Kai Koerber, Joseph Schull, Mizuho Li, Melad Sabagh, Danielle Wong, Xiaole Guo, Manu John, Theophilus Pedapolu, Hanqi Xiong, Henry Cen, Arnav Dalal

Data Source Pattern Studies for High Energy Physics Data Analysis
Cloud Computing Award
Malavikha Sudarshan

Data Science for Localized Climate Prediction Product
Ribbon of Excellence
Hailee Schuele, Irene Widiaman, Eesha Thaker, Yongzhen Shi James, Farouk Ghandour, Danielle Ma

Natural Hazard Recovery Team
Social Impact Award
Shuqi Chen
Discovery Projects
ChatBots: What can they be really used for? | UC Berkeley Department of Economics Professor DeLong [Full]
Assessing Creative Agency in the Big Ideas Course: From Imagination to Innovation | Creative Discovery (Division of Undergraduate Education)
Natural Language Processing: Technologies to Inform Practice in Science (TIPS) | Technology-Enhanced Learning in Science [Full]
Characterizing the Landscape of Student Learning Outcomes in Undergraduate Programs via a Web-scraping Text Analysis Approach | UC Berkeley Center for Teaching & Learning [Full]
Analyzing Students' Submitted Projects | Technovation
Creating Directed Learning Pathways With Machine Learning | University of California, Berkeley / Learning Bytes LLC
The Next Step in Public Education: Factors influencing elementary student success and family engagement | Rocketship Public Schools [Full]
The Data Initiative for Environmental and Climate Action in California's TK–12 Schools | Ten Strands [Full]
Random turn games governed by stakes | UC Berkeley Department of Mathematics [Full]
Achievement Gaps in Berkeley Public Schools | Berkeley Schools Accountability BUSD-LPAC [Full]
Datahub | UC Berkeley Datahub [Full]
The Costs and Benefits of Buying American | Haas School of Business [Full]
Can We Build Entrepreneurs and Better Negotiators? A Field Experiment in Uganda (Machine Learning and Big Data) | Haas School of Business [Full]
Credit Scoring Algorithms using Alternative Data Sources and Machine-Learning | Berkeley Haas, Institute for Business and Social Impact (IBSI) [Full]
Changes in the Global Economy | UC Berkeley Department of Sociology [Full]
Enhancing Severe Heatwave Forecasting Using Theory and Data Science | UC Berkeley Department of Earth and Planetary Science [Full]
Sub-seasonal wildfire burned area prediction with machine learning | Lawrence Berkeley National Laboratory [Full]
Discovery of Undocumented Oil and Gas Wells | Lawrence Berkeley National Laboratory
comfortDAT - an open-source data analysis toolkit for thermal comfort research | Center for the Built Environment at UC Berkeley
Building Machine Learning Models for Arctic Ice Restoration | Climformatics Inc
Data Visualization of Biological and Ecological Data | UCSC
Life Cycle Assessment of Building Structures | Forell | Elsesser [Full]
ML for Biodiversity Monitoring | UC Berkeley Dept. of Environmental Science, Policy and Management [Full]
Deep learning and language | UC Berkeley Department of Linguistics
Computationally evaluating design rationale | Co-Design Lab
Search Query Suggestions | Ithaka (JSTOR) [Full]
Missing Object Library | Art Practice and Berkeley Center for New Media
AI Machine Translation for Cuneiform | FactGrid Cuneiform Project [Full]
Automated Quality Control and Analysis of Histopathology Images using deep learning | Merck & Co.
LLM for internal employee use | Andreessen Horowitz (a16z) [Full]
AI-powered development of novel CRISPR-based therapeutics | Scribe Therapeutics [Full]
Mobility Analytics | 99P Labs / Honda [Full]
Data Analyst in a Box | 99P Labs / Honda Research Institute [Full]
GMO acceptance of bioengineered brewer's yeast | Berkeley Yeast [Full]
Lead Identification | Berkeley Yeast [Full]
TranscribeGlass | TranscribeGlass software internship
Transparency in Segmentation Algorithms | Gauge
Image-To-Text for an Accessible Web | Gauge [Full]
Traffic signal conditions tested by a robot | Kiwibot [Full]
Violawalkhome: a mobile safety app | Violawalkhome/UC Berkeley/Stanford University [Full]
Identifying threats in the Web3 ecosystem | AnChain.ai [Full]
Candidate Feedback Report | HUELLA
DSG Interns - Various Projects | Data for Social Good Foundation [Full]
Proving that Public/Private Partnerships Promote Digital Equity | Human-I-T [Full]
Moment Price Estimator | Dapper Labs [Full]
Pagefelt AI Mentor | Pagefelt
Alecto AI | Alecto AI [Full]
Product Development AI | Rivian [Full]
RecSys - Merlin Models | Allbirds [Full]
Personalized Contextual Product Reviews | Allbirds [Full]
Enterprise Customer Content "Taste" | Coursera [Full]
User Metadata Standardization | Coursera [Full]
Impact of localizing/translating the discovery experience | Coursera [Full]
Monetization journey of existing free users | Coursera [Full]
Data-backed global pricing strategy | Coursera [Full]
Advanced Methods of Time Series Data Analysis | Applied Materials [Full]
Demonstrating Reproducibility in Data Science | Applied Materials [Full]
Sustainability Data Framework | Arm [Full]
Strain History Database | Triplebar
Using machine learning to elucidate the genetic underpinnings of multicellularity | UC Berkeley & HHMI [Full]
The Unnatural Translation Database: Streamlining data entry through automation | NSF Center for Genetically Encoded Materials (C-GEM)
Enabling drug discovery with AI-powered quantum chemistry | Prescient Design/Genentech [Full]
Identifying the epigenetic and expression profile of regulatory T cells expressing an Ezh2 gain-of-function mutation TBD | UC Berkeley Department of Molecular Cell Biology
Chemoproteomic Analysis of Methionine Redox Sites | UC Berkeley Chemical Biology Lab
LookIt studies: a platform for developmental psychology research | UC Berkeley Department of Psychology
Developing CNN model for segmenting electron microscopy images | Department of Nutritional Sciences and Toxicology [Full]
Evolution of retinal cell types | UC Berkeley Shekhar Lab
A computational suite for the RNA universe | Innovative Genomics Institute [Full]
The polymorph evolution of protein two-dimensional crystals | University of Washington
Machine-learning tools for measuring spectral lines | National Institute of Standards and Technology [Full]
Surface Layer Scheme Exploration | NOAA Global System Lab (GSL)
Data on Recovery after Natural Hazard Events | National Institute of Standards and Technology [Full]
Fiber Reinforced Polymer (FRP) Retrofitted Shear Wall Relational Database Development | National Institute of Standards and Technology
Dataset for Automation of Quantum Dot Devices Control | National Institute of Standards and Technology [Full]
Predicting future water main breaks with pressure data | East Bay Municipal Utility District [Full]
SALT Research Group Data Post-Processing App Suite | UC Berkeley Nuclear Engineering
Classification of Dislocation Cores in HCP and BCC Metals | Chrzan Research Group, MSE
Data Access Pattern of Distributed Disk Cache system | Lawrence Berkeley National Laboratory [Full]
Network performance prediction and anomaly detection | Lawrence Berkeley National Laboratory [Full]
Graph Neural Network for Online Particle Identification at the Large Hadron Collider | Lawrence Berkeley National Laboratory [Full]
Detecting illegal sand mining with deep learning and remote sensing | I School
Machine learning to localize particles | UC Berkeley FLOW Lab [Full]
Flexible and Scalable Earthquake Forecasting | The Miller Institute and the Berkeley Seismology Lab
ChatCHW for community health workers | Haas School of Business [Full]
Gamifying cognitive training for emotion regulation | CALM Program
Analyzing Healthcare Prices | UC Berkeley Petris Center [Full]
Exploring Visualization Options with a Django Database | American Heart Association
Trend analysis of birth, acquired, age related disabilities and factors that impact such trends | Voice of Specially Abled People [Full]
EpiNu: Community Nutrition Security in DR Congo | Blum Center for Developing Economies
Homelessness and Mental Illness in California - towards a new scale of outcomes | Healthy Brains Global Initiative [Full]
Recidivism Coding Project: Investigating Potential Changes to Federal Recidivism Tracking | Department of Justice Federal Bureau of Prisons [Full]
Mapping Cultural Institutions in the US | Center for Law, Energy & the Environment (CLEE)
Data on Recovery after Natural Hazard Events | National Institute of Standards and Technology
Predicting lead in tap water | California State Water Resources Control Board [Full]
Project Sidewalk | Project Sidewalk [Full]
Archive of Urban Futures | UC Berkeley Geography Department / Moms 4 Housing
LMC (Language Model Uncertainty) | Moore Accuracy Lab
Adoption of new technologies, women's safety, and trust in formal institutions | UC Berkeley, Stanford University [Full]
The Demise of Urban Renewal and the Invention of Land Banking | UC Berkeley Department of Sociology
DaanMatch Knowledge Graph | DaanMatch
NGO Insights Dashboard | DaanMatch
Our Way of Life | UC Berkeley School of Social Welfare
Political Party Endorsements of Minority Rights | UC Berkeley Department of Political Science
Community Science Measuring Lead Hazards in Newark, New Jersey | Liberatory Infrastructures Lab
Evidence-Based Parent Education: The Protective Factors Survey | Aspiranet [Full]
Global Hunger Data & Predictive Model | Bread for the World [Full]
Eviction Research Network | Eviction Research Network
Interrupting Lethal Structures: "No Cop City in San Pablo!" | Berkeley Underground Scholars [Full]
OpenSidewalks: responsible data science for equitable civic analytics | Taskar Center for Accessible Technology
The Opportunities and Risks and its Implications in Clinical Research and Training Development | The Center for Information Technology Research in the Interest of Society and the Banatao Institute (CITRIS)
Application Timeline

Priority Deadline: Friday 8/18
Final Deadline: Friday 8/25