Data Science for Studying Language and the Mind
Model reliability: when you estimate the parameters, there is some uncertainty on them
y ~ x
How certain can we be about the parameter estimates we obtained?
# A tibble: 2 × 2 term estimate <chr> <dbl> 1 intercept 1.75 2 x 0.733
But… why is there uncertainty around the parameter estimates at all?
We are interested in the model parameters that best describe the population from which the sample was drawn (not a given sample)
We can obtain confidence intervals around parameter estimates for models in the same we we did for point estimates like the mean: bootstrapping
Get confidence interval