Lecture 25: The Bigger Picture#
Test#
Lab 12 / Project 3#
Schedule#
Python beyond data analysis#
We’ve been focusing on using Python and pandas for data analysis. What else is Python used for?
Data engineering#
Automation / recurring processes
Copying/moving/processing/publishing data, especially Big Data
ETL
Monitoring/alerting
Web development#
Building web sites that are interactive (more than just content)
Forms
Presenting data
Workflows, such as:
Signing up for things
Paying for things
Back ends / APIs
Machine learning#
Statistics, but fancy
Building models
Finding patterns
Recommendations
Detection
When people say “artificial intelligence,” they usually mean “machine learning.”
Source, with more thorough explanation
The process#
High-level
Create a model
Gather a bunch of data for training
If supervised machine learning, label it (give it the right answers)
Segment into training and test data
Train the model against the training dataset (have it identify patterns)
Test the model against the test dataset
Run against new data
If reinforcement learning, model refines itself
LLMs#
You have a head start: The fundamentals are applicable anywhere you’re using code.
Advanced Computing for Policy#
Ask Me Anything (AMA)#
Thanks to the TAs!
Thank you!#
Keep in touch.