The project focuses on generating realistic data sets for renewable generation integration. We use publicly available time-series data from generation companies to characterize real resource availability. Classically the ratings of generators are obtained from manufacturer data sheets and federal forms. However, these sources do not represent properly the technical constraints that generators are subject to during the year. In this project we will use collected data from 400+ generators in Texas to develop a data set used to study renewable energy futures in the state considering technical constraints. The project requires knowledge of least-squares and the capability to learn piece-wise least squares, knowledge of optimization and mix integer programming is a plus but not a requirement. The project will be developed in the programming language Julia.