Session 1
PMAP 8521: Program evaluation
Andrew Young School of Policy Studies
Data science and public service
Data science and public service
Evidence, evaluation, and causation
Data science and public service
Evidence, evaluation, and causation
Class details
"To responsibly unleash the power of data to benefit all Americans"
11.4 million people + 23 days
Data shared between agencies - arresting officer checks database, sees if person has been cycling, sees if they need mental health services
Two jails in Florida closed after this
How do you use all this data
to make the world better?
Analyze it and
uncover insights!
(and take this class!)
Collecting and analyzing data from a representative sample in order to make inferences about a whole population
Turning raw data into
understanding, insight,
and knowledge
Turning raw data into
understanding, insight,
and knowledge
Collect
Analyze
Communicate
Collect
Analyze
Communicate
Measuring the effect of
social programs on society
Data science
(data + statistics +
communication)
Causal inference
(DAGs + econometrics)
Polio shots administered blindly with placebo - kids didn't know, doctor didn't know (double blind) - effect found
Why not just use those who didn't get a shot as the control group? Why placebo? Kids who opted out were poor (which actually had a lower incidence rate), so you can't compare them with the rich (which had higher incidence rate)
All drugs get the safety and efficacy requirement and double blind RCTs after that
Apply evidence to clinical
treatment decisions
Apply evidence to clinical
treatment decisions
Move away from clinical judgment
and "craft knowledge"
Apply evidence to clinical
treatment decisions
Move away from clinical judgment
and "craft knowledge"
Is this good?
No, this isn’t the greatest. Push for evidence-based practice can crowd out clinical judgment, which is valuable – we can’t ignore it or dismiss it
Doctors have seen things in the past. Lady Sybil in Downton Abbey and preeclampsia - family doctor knew more about the condition and Sybil—he had intuition and experience—but he was ignored
RAND health insurance study
RAND health insurance study
Oregon Medicaid expansion
RAND health insurance study
Oregon Medicaid expansion
HUD's Moving to Opportunity
RAND health insurance study
Oregon Medicaid expansion
HUD's Moving to Opportunity
Tennessee STAR
Jameel Poverty Action Lab (J-PAL)
Jameel Poverty Action Lab (J-PAL)
Campbell Collaboration
Should we have evidence for
every policy or program?
Should we have evidence for
every policy or program?
No!
Should we have evidence for
every policy or program?
No!
Science vs. art/craft/intuition
https://twitter.com/epiellie/status/1017622949799571456?s=12
In the 1970s, experts were saying that reducing the drinking age would be fine - 26 states did - created a natural quasi-experiment for researchers - Cook then found that highway fatalities increase by 10% for 18–20-year-olds. That wouldn't have been possible without legislators inadvertently performing the experiment, paving the way for better science
So knowledge flows between the two sides - use innovation from administrators as the basis for better science
Where does program evaluation
fit with all this?
Where does program evaluation
fit with all this?
It's a method for collecting evidence
for policies and programs
Needs assessment
Design and
theory assessment
Process evaluation
and monitoring
Impact evaluation
Efficiency evaluation (CBA)
Impact evaluation!
Correlation does not
imply causation
Correlation does not
imply causation
Except when it does
Correlation does not
imply causation
Except when it does
Even if it doesn't,
this phrase is useless
and kills discussion
How do we figure out correlation?
How do we figure out causation?
How do we figure out correlation?
Math and statistics
How do we figure out causation?
How do we figure out correlation?
Math and statistics
How do we figure out causation?
Philosophy. No math.
X causes Y if…
X causes Y if…
…we intervene and change X
without changing anything else…
X causes Y if…
…we intervene and change X
without changing anything else…
…and Y changes
"A variable X is a cause of a variable Y if Y in any way relies on X for its value.… X is a cause of Y if Y listens to X and decides its value in response to what it hears"
(Pearl, Glymour, and Jewell 2016, 5–6)
"A variable X is a cause of a variable Y if Y in any way relies on X for its value.… X is a cause of Y if Y listens to X and decides its value in response to what it hears"
(Pearl, Glymour, and Jewell 2016, 5–6)
Y doesn't necessarily listen only to X
"A variable X is a cause of a variable Y if Y in any way relies on X for its value.… X is a cause of Y if Y listens to X and decides its value in response to what it hears"
(Pearl, Glymour, and Jewell 2016, 5–6)
Y doesn't necessarily listen only to X
A light switch causes a light to go on, but not
if the bulb is burned out (no Y despite X), or if
the light was already on (Y without X)
Lighting fireworks causes noise
Lighting fireworks causes noise
Rooster crows cause the sunrise
Lighting fireworks causes noise
Rooster crows cause the sunrise
Getting an MPA/MPP increases your earnings
Lighting fireworks causes noise
Rooster crows cause the sunrise
Getting an MPA/MPP increases your earnings
Colds go away a few days
after you take vitamin C
Causation =
Correlation + time order +
nonspuriousness
Causation =
Correlation + time order +
nonspuriousness
How do you know if you have it right?
Causation =
Correlation + time order +
nonspuriousness
How do you know if you have it right?
You need a philosophical model
Causation =
Correlation + time order +
nonspuriousness
How do you know if you have it right?
You need a philosophical model
That's what this class is for!
Directed acyclic graphs (DAGs)
Graphical model of the process that generates the data
Maps your philosophical model
Fancy math ("do-calculus") tells you what to control for to isolate and identify causation
From "Master the Tidyverse" by RStudio
strike_damages_month <- bird_strikes %>% group_by(Month) %>% summarize(total_damages = sum(Cost, na.rm = TRUE), average_damages = mean(Cost, na.rm = TRUE))ggplot(data = strike_damages_month, mapping = aes(x = Month, y = total_damages)) + geom_col() + scale_y_continuous(labels = dollar) + labs(x = "Month", y = "Total damages", title = "Really expensive collisions happen in the fall?", subtitle = "Don't fly in August or October?", caption = "Source: FAA Wildlife Strike Database")
“There is no way of knowing nothing about a subject to knowing something about a subject without going through a period of much frustration and suckiness.”
“Push through. You’ll suck less.”
Hadley Wickham, author of ggplot2
No!
No!
You don't need to be a mechanic to drive a car safely
No!
You don't need to be a mechanic to drive a car safely
You don't need to be a computer scientist
or developer to use R safely
Words of wisdom from @allison_horst to her data analysis class this quarter: You don't need to be a mechanic to drive a car safely, and you don't need to be a computer scientist or developer to use R safely. #rstats #tidytuesday
https://twitter.com/ameliaritger/status/1214682596182904832?s=12
I always teach my students: there’s three levels of skill: driver, mechanic, engineer. You’re here to get your drivers license. But being able to change a tire can be helpful. Don’t worry not every one has to be an engineer. (I talk about R in social sciences)
You can do this.
Become an expert with R
Become an expert with R
Speak and do causation
Become an expert with R
Speak and do causation
Design rigorous evaluations
Become an expert with R
Speak and do causation
Design rigorous evaluations
Change the world with data
Math skills
Basic algebra
Math skills
Basic algebra
Computer science skills
None
Math skills
Basic algebra
Computer science skills
None
Statistical skills
Regression, differences in means, and statistical significance
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
o | Tile View: Overview of Slides |
Esc | Back to slideshow |
Session 1
PMAP 8521: Program evaluation
Andrew Young School of Policy Studies
Data science and public service
Data science and public service
Evidence, evaluation, and causation
Data science and public service
Evidence, evaluation, and causation
Class details
"To responsibly unleash the power of data to benefit all Americans"
11.4 million people + 23 days
Data shared between agencies - arresting officer checks database, sees if person has been cycling, sees if they need mental health services
Two jails in Florida closed after this
How do you use all this data
to make the world better?
Analyze it and
uncover insights!
(and take this class!)
Collecting and analyzing data from a representative sample in order to make inferences about a whole population
Turning raw data into
understanding, insight,
and knowledge
Turning raw data into
understanding, insight,
and knowledge
Collect
Analyze
Communicate
Collect
Analyze
Communicate
Measuring the effect of
social programs on society
Data science
(data + statistics +
communication)
Causal inference
(DAGs + econometrics)
Polio shots administered blindly with placebo - kids didn't know, doctor didn't know (double blind) - effect found
Why not just use those who didn't get a shot as the control group? Why placebo? Kids who opted out were poor (which actually had a lower incidence rate), so you can't compare them with the rich (which had higher incidence rate)
All drugs get the safety and efficacy requirement and double blind RCTs after that
Apply evidence to clinical
treatment decisions
Apply evidence to clinical
treatment decisions
Move away from clinical judgment
and "craft knowledge"
Apply evidence to clinical
treatment decisions
Move away from clinical judgment
and "craft knowledge"
Is this good?
No, this isn’t the greatest. Push for evidence-based practice can crowd out clinical judgment, which is valuable – we can’t ignore it or dismiss it
Doctors have seen things in the past. Lady Sybil in Downton Abbey and preeclampsia - family doctor knew more about the condition and Sybil—he had intuition and experience—but he was ignored
RAND health insurance study
RAND health insurance study
Oregon Medicaid expansion
RAND health insurance study
Oregon Medicaid expansion
HUD's Moving to Opportunity
RAND health insurance study
Oregon Medicaid expansion
HUD's Moving to Opportunity
Tennessee STAR
Jameel Poverty Action Lab (J-PAL)
Jameel Poverty Action Lab (J-PAL)
Campbell Collaboration
Should we have evidence for
every policy or program?
Should we have evidence for
every policy or program?
No!
Should we have evidence for
every policy or program?
No!
Science vs. art/craft/intuition
https://twitter.com/epiellie/status/1017622949799571456?s=12
In the 1970s, experts were saying that reducing the drinking age would be fine - 26 states did - created a natural quasi-experiment for researchers - Cook then found that highway fatalities increase by 10% for 18–20-year-olds. That wouldn't have been possible without legislators inadvertently performing the experiment, paving the way for better science
So knowledge flows between the two sides - use innovation from administrators as the basis for better science
Where does program evaluation
fit with all this?
Where does program evaluation
fit with all this?
It's a method for collecting evidence
for policies and programs
Needs assessment
Design and
theory assessment
Process evaluation
and monitoring
Impact evaluation
Efficiency evaluation (CBA)
Impact evaluation!
Correlation does not
imply causation
Correlation does not
imply causation
Except when it does
Correlation does not
imply causation
Except when it does
Even if it doesn't,
this phrase is useless
and kills discussion
How do we figure out correlation?
How do we figure out causation?
How do we figure out correlation?
Math and statistics
How do we figure out causation?
How do we figure out correlation?
Math and statistics
How do we figure out causation?
Philosophy. No math.
X causes Y if…
X causes Y if…
…we intervene and change X
without changing anything else…
X causes Y if…
…we intervene and change X
without changing anything else…
…and Y changes
"A variable X is a cause of a variable Y if Y in any way relies on X for its value.… X is a cause of Y if Y listens to X and decides its value in response to what it hears"
(Pearl, Glymour, and Jewell 2016, 5–6)
"A variable X is a cause of a variable Y if Y in any way relies on X for its value.… X is a cause of Y if Y listens to X and decides its value in response to what it hears"
(Pearl, Glymour, and Jewell 2016, 5–6)
Y doesn't necessarily listen only to X
"A variable X is a cause of a variable Y if Y in any way relies on X for its value.… X is a cause of Y if Y listens to X and decides its value in response to what it hears"
(Pearl, Glymour, and Jewell 2016, 5–6)
Y doesn't necessarily listen only to X
A light switch causes a light to go on, but not
if the bulb is burned out (no Y despite X), or if
the light was already on (Y without X)
Lighting fireworks causes noise
Lighting fireworks causes noise
Rooster crows cause the sunrise
Lighting fireworks causes noise
Rooster crows cause the sunrise
Getting an MPA/MPP increases your earnings
Lighting fireworks causes noise
Rooster crows cause the sunrise
Getting an MPA/MPP increases your earnings
Colds go away a few days
after you take vitamin C
Causation =
Correlation + time order +
nonspuriousness
Causation =
Correlation + time order +
nonspuriousness
How do you know if you have it right?
Causation =
Correlation + time order +
nonspuriousness
How do you know if you have it right?
You need a philosophical model
Causation =
Correlation + time order +
nonspuriousness
How do you know if you have it right?
You need a philosophical model
That's what this class is for!
Directed acyclic graphs (DAGs)
Graphical model of the process that generates the data
Maps your philosophical model
Fancy math ("do-calculus") tells you what to control for to isolate and identify causation
From "Master the Tidyverse" by RStudio
strike_damages_month <- bird_strikes %>% group_by(Month) %>% summarize(total_damages = sum(Cost, na.rm = TRUE), average_damages = mean(Cost, na.rm = TRUE))ggplot(data = strike_damages_month, mapping = aes(x = Month, y = total_damages)) + geom_col() + scale_y_continuous(labels = dollar) + labs(x = "Month", y = "Total damages", title = "Really expensive collisions happen in the fall?", subtitle = "Don't fly in August or October?", caption = "Source: FAA Wildlife Strike Database")
“There is no way of knowing nothing about a subject to knowing something about a subject without going through a period of much frustration and suckiness.”
“Push through. You’ll suck less.”
Hadley Wickham, author of ggplot2
No!
No!
You don't need to be a mechanic to drive a car safely
No!
You don't need to be a mechanic to drive a car safely
You don't need to be a computer scientist
or developer to use R safely
Words of wisdom from @allison_horst to her data analysis class this quarter: You don't need to be a mechanic to drive a car safely, and you don't need to be a computer scientist or developer to use R safely. #rstats #tidytuesday
https://twitter.com/ameliaritger/status/1214682596182904832?s=12
I always teach my students: there’s three levels of skill: driver, mechanic, engineer. You’re here to get your drivers license. But being able to change a tire can be helpful. Don’t worry not every one has to be an engineer. (I talk about R in social sciences)
You can do this.
Become an expert with R
Become an expert with R
Speak and do causation
Become an expert with R
Speak and do causation
Design rigorous evaluations
Become an expert with R
Speak and do causation
Design rigorous evaluations
Change the world with data
Math skills
Basic algebra
Math skills
Basic algebra
Computer science skills
None
Math skills
Basic algebra
Computer science skills
None
Statistical skills
Regression, differences in means, and statistical significance