A Machine Learning Approach to Improving Occupational Income Scores
These files are the replication files for "A Machine Learning Approach to Improving Occupational Income Scores" by Martin Saavedra and Tate Twinam. Abstract: Historical studies of labor markets frequently lack data on individual income. The occupational income score (OCCSCORE) is often used as an alternative measure of labor market outcomes. We consider the consequences of using OCCSCORE when researchers are interested in earnings regressions. We estimate race and gender earnings gaps in modern decennial Censuses as well as the 1915 Iowa State Census. Using OCCSCORE biases results towards zero and can result in gaps of the wrong sign. We use a machine learning approach to construct a new adjusted score based on industry, occupation, and demographics. The new income score provides estimates closer to earnings regressions. Lastly, we consider the consequences for estimates of intergenerational mobility elasticities.
Twinam, Tate; Saavedra, Martin (2018), "A Machine Learning Approach to Improving Occupational Income Scores", ICPSR, doi: 10.3886/e111103v1