The speed and simplicity of Stata
for the most basic modeling applications is amazing. However, for many of us who have switched to R
, the flexibility, the community, and the fact that R
is open source makes it, at least, a powerful complement.
These exercises focus on some of the most commonly used commands in Stata and how we can reproduce them in R.
Solutions are available here. Note that the flexibility and the vast number of packages for R
means there are often many perfectly valid ways to reach our ends.
Exercise 1
Install and load the AER
and foreign
packages. Load the PSID1982
data to your R
environment. Furthermore, save a copy of it in .dta
format to your hard drive so you can open it in Stata also and compare commands.
Exercise 2
Now that the data is loaded in both R
and Stata
, print summary statistics equivalent to Stata
‘s summarize
, describe
and list in 1/6
.
Exercise 3
Fit the following linear model and print a summary of the estimated parameters:
ln(wage) = α + β1education + β2experience + β3experience2 + β4 female
Exercise 4
Add a dummy for African American and test
whether the coefficients on the experience variables are jointly statistically significant from zero.
Exercise 5
Translate: twoway scatter lwage experience
.
Exercise 6
Make a histogram of log(wages): hist lwage
.
Exercise 7
drop south
from your data (frame) object.
Exercise 8
Find the equivalent of mean(wage) if married == 1 & gender == 2
, that is the mean wage for not married females.
Exercise 9
Make a two by two frequency table: tabulate occupation union
.
Exercise 10
Estimate a logistic regression with married
as an independent variable and education
and experience
as dependent variables. Estimate the marginal effect of an increase in education at the mean (margins, dydx(education) atmeans
).
Leave a Reply