Data Science for Economists
2026-04-01
WHO WE ARE
name / program / coding background
Course overview
| Week | Module |
|---|---|
| Apr 15 | Getting Started – Reproducibility, Git, Docker, IDE setup |
| Apr 22 | Toolkit – Shell basics, R fundamentals, Quarto |
| Apr 29 | Large Structured Data – Millions of rows: data.table, Parquet, duckplyr |
| May 06 | Web Scraping & APIs – HTML parsing, APIs, online prices |
| May 13 | Text as Data – Tokenization, bag-of-words, policy uncertainty |
| May 20 | Spatial & Satellite Data – CRS, nightlights, satellite imagery in R |
| May 27 | TBD |
| Jun 03 | Time as Data – Event studies, diff-in-diff, causal inference |
| Jun 10 | Machine Learning – Model selection, regularization, causal forests |
| Jun 17 | Large Language Models – LLM APIs, structured output, training a mini-LLM |
| Jun 24 | No class |
| Jul 01 | AI-Assisted Research – CLAUDE.md, agents, skills, LLM workflows |

