Sophomore Year Reflection

When I present at the Campus Symposium in two years, I imagine my presentation showcasing the importance of using synthetic data generation for software development across all fields of study. I would structure my presentation by first, defining synthetic data generation while generally outlining its potential uses. Then, I would dedicate a section talking about the different ways that synthetic data is generated. After, I would connect my chosen courses to the different aspects of data generation: Database systems would allow me to discuss one of my experiences with data generation, which would introduce the perspective of the functionality of synthetic data generation for small-scale programs like the one I created. Introduction to sociology would allow me to discuss the perspective of unrepresentative data sets, which is a problem that synthetic data generation offers a potential solution for. Lastly, entertainment software design would allow me to discuss the uses of synthetic data in entertainment software design, bringing that perspective into the conversation. After discussing my courses, I would talk about my experience generating synthetic data as a part of my global-local engagement. 

Overall, I want to be able to tie my experiences together and give the audience a clear picture of the benefits of synthetic data generation in software development across all fields. Additionally, I want my presentation to create awareness of the ethical and societal implications of synthetic data. I want listeners to begin to ask questions like: Who gets represented in data? How do we currently overcome biased and misrepresented data sets? Do we? How can we ensure synthetic data doesn’t reinforce existing biases? By the time I present, I’ll have gained more experience through classes, research, and internships, and I will use that knowledge to encourage others to think critically about how we can use synthetic data generation to improve the development of software tools.