






Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
An introduction to the Heckman correction technique, a method used to obtain unbiased estimates when dealing with selective samples. The technique is particularly relevant for studies involving mobile phone owners connected to a Mobile Network Operator (MNO), where the sample is not randomly selected. the problem of selectivity bias, the assumptions required for the Heckman correction, and the derivation of the bias. It also presents a practical procedure (heckit) for implementing the correction.
What you will learn
Typology: Summaries
1 / 11
This page cannot be seen from the preview
Don't miss anything!
INS Romania
We have a population of N individuals and we want to fit a linear model to this population: but we have only a sample of only n individuals not randomly selected Running a regression using this sample would give biased estimates. The link with our project – these n individuals are mobile phone owners connected to one MNO; One possible solution : Heckman correction.
1 1 1
Two things are important here:
Preliminary assumptions: ( x,y 2 ) are always observed; y 1 is observed only when y 2 =1 ( sample selection effect ); ( ε, v ) is independent of x and it has zero mean (E ( ε, v ) =0) ; v ~ N(0,1); , correlation between residuals ( it is a key assumption ) : it says that the errors are linearly related and specifies a parameter that control the degree to which the sample selection biases estimation of β 1 (i.e., will introduce the selectivity bias). E ( | ) 0
This means that follows a truncated normal distribution and we can use a well know result: where z follows a standard normal distribution, a is a constant, is the standard normal pdf and the standard normal cumulative distribution function λ is called the inverse Mills ratio. 1 ( ) ( ) ( | ) a a E z z a
( ) ( )
x x x x E v v x
^
We obtained the parametric expression of the expected value of y 1 conditional on observable x and selection selectivity ( y 2
The last term in the above equation gives the selectivity correction. Plugging the term into the initial equation we could get an unbiased estimation of β 1 (as well as ).
1 2 1 1
( x )
This regression will give an unbiased estimate of β 1 (as well as ). In this last regression equation the dependent variables are x 1 and. Support for practical implementation: R package sampleSelection ; Stata heckman function; Eviews implementation; ( x )
This technique seems suitable to produce unbiased estimates in presence of selectivity which is the case of mobile phone subscribers (a mobile phone subscriber would have y 2 =1 in our presentation); Investigate if this technique could be applied (and where) to our project; At a first glance eq. (1) from David’s internal document could be a place where we can apply this correction technique since our estimations are only for the persons that have a phone registered to the MNO under consideration.