Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Exam Solutions Econometrics, Exams of Econometrics and Mathematical Economics

Past paper solutions for econometrics

Typology: Exams

2020/2021

Uploaded on 04/05/2023

violetxx
violetxx 🇬🇧

1 / 18

Toggle sidebar

This page cannot be seen from the preview

Don't miss anything!

bg1
EC220 Introduction to Econometrics
Exam Solutions
IRDAP 2020
Section A
(Answer all questions. This section carries 1/3 of the overall mark.)
Question 1
[33.34 marks]
Charitable giving is an increasingly important component of the economy. Yet, relatively little is known
about what motivates people to give to charities. Fundraisers are interested in assessing the effec-
tiveness of door-to-door fundraising campaigns, which are normally expensive and time-consuming.
A consultancy firm in London has access to a dataset on various characteristics (home address, age,
gender, occupation, whether the household donates before, ...) of 170,000 households.
In summer 2020, the firm sends ask-for-donation flyers to 100,000 households. The flyer only contains
the exact time of a door-to-door charity solicitation in the neighbourhood. The firm codes Flyeri= 1
if household iis sent a flyer, and 0 otherwise. The team then visits the 100,000 households, but are
only able to speak with 60,000 of them. The firm codes Visitedi= 1 if the door-to-door solicitor
could speak to household i, and 0 otherwise. After the flyer campaign, the firm records in the variable
Donatei= 1 if household idonates to a charity, and 0 otherwise.
(a) The consultancy firm claims that the 60,000 successful visits should constitute a random sam-
ple from the intended population of 100,000 households who receive flyers. Critically discuss
the claim. Could the team statistically test whether or not the claim holds? If yes, carefully de-
scribe one suitable test and necessary assumptions. If not, carefully explain why. [5 marks]
Solution : Students can discuss both sides of the story. The key aspect is the definition of a
random sample: the reason for the team being unable to reach the households door-to-door
should not be related with the intention to donate (i.e., the households should not actively try
to avoid the fundraisers). One story for why it might hold is that the team could not reach a
household because the time is not suitable for the household schedule for example, a visit to
relative, or another random outgoing event. Clearly this reason may not be plausible because
40,000 out of 100,000 miss the appointed time, indicating some systematic absence. That
is, the households would systematically avoid the fundraiser to avoid the door-to-door ask for
donation and thereby giving, e.g. because the visits are during the day and would therefore not
reach the working population which may have a different willingness to donate from the non-
working population. [3 marks for a good discussion of the randomness or selection story (or
both)]
We can statistically test this claim by running a balance test of observable characteristics be-
tween the successfully visited 60,000 households and the remaining 40,000. We do so by com-
paring the averages of household characteristics available in the data between the 60,000 and
the 40,000. [1 point] If we observe statistically significant differences in some averages be-
tween the two groups, the claim does not hold statistically. Even if we observe statistically
insignificant differences for all of the averages, to conclude that the claim holds, we must as-
sume that no unobservable characteristics could explain intention to donate. [1 point] Clearly
this is a very strong claim to make.
Note: No marks should be awarded for a discussion of how the 100,000 households receiv-
ing flyers may not be a random sample the question is about the 60,000 households among
flyer-receiving households who are visited.
©LSE ST 2020/EC220R IRDAP Page 1 of 18
pf3
pf4
pf5
pf8
pf9
pfa
pfd
pfe
pff
pf12

Partial preview of the text

Download Exam Solutions Econometrics and more Exams Econometrics and Mathematical Economics in PDF only on Docsity!

EC220 Introduction to Econometrics

Exam Solutions

IRDAP 2020

Section A

(Answer all questions. This section carries 1/3 of the overall mark.)

Question 1

[33.34 marks]

Charitable giving is an increasingly important component of the economy. Yet, relatively little is known

about what motivates people to give to charities. Fundraisers are interested in assessing the effec-

tiveness of door-to-door fundraising campaigns, which are normally expensive and time-consuming.

A consultancy firm in London has access to a dataset on various characteristics (home address, age,

gender, occupation, whether the household donates before, ...) of 170,000 households.

In summer 2020, the firm sends ask-for-donation flyers to 100,000 households. The flyer only contains

the exact time of a door-to-door charity solicitation in the neighbourhood. The firm codes Flyer i

if household i is sent a flyer, and 0 otherwise. The team then visits the 100,000 households, but are

only able to speak with 60,000 of them. The firm codes Visited i

= 1 if the door-to-door solicitor

could speak to household i, and 0 otherwise. After the flyer campaign, the firm records in the variable

Donatei = 1 if household i donates to a charity, and 0 otherwise.

(a) The consultancy firm claims that the 60,000 successful visits should constitute a random sam-

ple from the intended population of 100,000 households who receive flyers. Critically discuss

the claim. Could the team statistically test whether or not the claim holds? If yes, carefully de-

scribe one suitable test and necessary assumptions. If not, carefully explain why. [5 marks]

Solution : Students can discuss both sides of the story. The key aspect is the definition of a

random sample: the reason for the team being unable to reach the households door-to-door

should not be related with the intention to donate (i.e., the households should not actively try

to avoid the fundraisers). One story for why it might hold is that the team could not reach a

household because the time is not suitable for the household schedule for example, a visit to

relative, or another random outgoing event. Clearly this reason may not be plausible because

40,000 out of 100,000 miss the appointed time, indicating some systematic absence. That

is, the households would systematically avoid the fundraiser to avoid the door-to-door ask for

donation and thereby giving, e.g. because the visits are during the day and would therefore not

reach the working population which may have a different willingness to donate from the non-

working population. [3 marks for a good discussion of the randomness or selection story (or

both)]

We can statistically test this claim by running a balance test of observable characteristics be-

tween the successfully visited 60,000 households and the remaining 40,000. We do so by com-

paring the averages of household characteristics available in the data between the 60,000 and

the 40,000. [1 point] If we observe statistically significant differences in some averages be-

tween the two groups, the claim does not hold statistically. Even if we observe statistically

insignificant differences for all of the averages, to conclude that the claim holds, we must as-

sume that no unobservable characteristics could explain intention to donate. [1 point] Clearly

this is a very strong claim to make.

Note: No marks should be awarded for a discussion of how the 100,000 households receiv-

ing flyers may not be a random sample – the question is about the 60,000 households among

flyer-receiving households who are visited.

Give [1 out of 2 points] in the second part if students suggest that randomness cannot be tested.

No marks for suggesting a regression of donation on visiting – clearly we do not have informa-

tion on donation for unvisited households.

(b) The firm claims that the 100,000 households were randomly selected from the original 170,

addresses. Assume that the claim is true. The firm investigates the causal effect of the ask-

for-donation flyer campaign on giving behaviour in London.

(i) Could the firm use the available information to answer the causal question? If yes, carefully

describe and interpret the estimation regression that you would run. If not, explain why and

describe the additional information and assumptions you would require. [3.34 marks]

Solution : Yes we can. Using the full sample on 170,000 households [0.34 points] , we can

run the following regression:

Donate i

= α + βFlyer i

  • γX i

  • e i

(1)

[2 points; 1.5 if students include V isited i

or the interaction of V isited i

and F lyer i

in the

regression] where β is the coefficient of interest. We can also include a set of good neutral

controls of variables that could help explain intention to donate to improve the precision of

the causal estimate. [0.5 point]

The equation would help because the treatment variable Flyer i

is effectively randomised

so that there will no selection bias between those who receive the flyer and those who do

not. [0.5 point]

Note: A good discussion of random sampling in this question deserves [1 point].

A good discussion of a regression on V isited i

instead of F lyer i

deserves [1 point].

Do not penalise for using the probability to donate as an outcome variable.

(ii) A critic suggests pensioners are more likely to donate when they receive an ask-for-donation

flyer. Let Pensioneri = 1 if household i

s head is a pensioner, 0 otherwise. Could you test

this claim? If yes, carefully describe how. If no, explain why. [4 marks]

Solution : There are three steps. First, we could create a new interaction Pensioner i

×

Flyer i

and include the interaction in the equation above (together with Pensioner i

).

Donate i

= α 0

  • β 0

Flyer i

  • β 1

Pensioner i

  • β 2

Pensioner i

× Flyer i

  • γ 0

X

i

  • u i

[1.5 points]

(2)

Second, β 2

is the differential effect of receiving a flyer for pensioners and we would like to

test whether this is statistically significant and positive. [1 point]

Third, we can use a t-test to test this hypothesis [0.5 points; z-test also acceptable]. The

hypotheses are H 0 : β 2 = 0 vs. H 1 : β 2 > 0. [0.5 points; two-sided test acceptable] A

rejection of the null hypothesis will provide support for the claim. [0.5 points]

Note: Do not penalise if students include V isited i

as additional regressor here.

If students do not include the interaction but only F lyer i

and P ensioner i

, a very good

discussion can be awarded [2 out of 4 points] for the whole question.

(iii) The firm now would like to estimate the causal effect of a door-to-door solicitation visit

Dependent Variable: Amount of Donations i

Regressor (1) (2) (3)

Photographs Included i

(7.222)

(4.102)

(5.193)

Female-headedi

(0.151)

(0.061)

Pensioner i

(0.152)

Constant

(12.244)

(6.112)

(9.189)

Observations 100 100 100

(i) What are the average donations from all the households in the sample after the campaign?

Carefully explain your answer. If you cannot derive the answer, clearly indicate any further

information or assumptions necessary for your calculation. [2 marks]

Solution : The average donation from households without the photographs is the con-

stant: x without

= £ 80. 523. [0.5 point]

The average donation from households with the photographs is the constant plus the coef-

ficient for Photographs Included i

= 1: x with

= £ 80 .523 + 15.751 = £ 96. 274. [0.5 point]

The average donation from all households is:

x = 0. 4 × x without

    1. 6 × x with

(4)

= 0. 4 × £ 80 .523 + 0. 6 × £ 96. 274 (5)

= 32.2092 + 57.7644 = £ 89. 9736 [1 point] (6)

Note: The precise decimals are not important. A result around 89.97 or 90 pounds suffices.

Wrong calculations that include the constant are awarded [0.5 points].

(ii) A critic cites prior research that pensioners are just more likely to donate than working

households, all else equal. The critic suggests that the team must include the variable

Pensioner i

, which indicates whether household i

s head is a pensioner, in the estimation to

avoid omitted variable bias. Critically evaluate the critic

′ s suggestion. [5 marks]

Solution : If the photographs were properly randomised to include in the flyers sent to

households and the sample size is large enough (relying on the Central Limit Theorem),

the treatment should be randomised and any characteristics of the households would NOT

constitute an omitted variable. While being pensioners might be correlated with the out-

come, it should not be correlated with the treatment or the expectation of receiving the

treatment due to the randomisation. [2] points] Therefore, we do not have to include the

variable as the point estimate for the effect of Photographs to change when we add the

control for Pensioner as there would be no OVB. However, since Pensioner could explain

some of the residual error, it will help reduce the standard error of our estimates. The team

could include the variable in to improve the precision. [2 points]. This is especially impor-

tant in our case because our sample size is quite small (only 100) and the variance formula

indicates that we should include good neutral controls to improve precision. [1 point]

Note: Students should discuss the sample size to get the full mark.

If students refer to a regression involving the interaction between treatment and being a

pensioner and argue that this interaction suffers from OVB, award [up to 4 out of 5 points]

for an excellent discussion.

Even though besides the point, an excellent discussion of OVB can be awarded [up to 2 out

of 5 points] here.

(iii) Another critic interprets Column (2) as a causal evidence for women being more altruistic

than men. His rationale is that gender is assigned randomly at birth, Column (2) captures

the causal effect of having a female household head on the amount of donations. Carefully

explain whether the critic is right or wrong? [3 marks]

Solution : It is true that gender is randomised at birth, but it is not random that some

households have a female head. [1.5 point] As such, the interpretation of the coefficient

for Female cannot be causal, and could be biased. Households with a female head might

be systematically different from a male-headed female in a way that is correlated with their

intention to donate. [1.5 points].

Note: Students arguing that the critic is wrong for a different reason should be awarded

partial marks.

(iv) Carefully explain why the coefficients for Photographs Included i

are different in Column (1),

Column (2), and Column (3)? Clearly state any assumptions you make. [4 marks]

Solution : Because the treatment is randomised, there should be no OVB in the estima-

tions. [1 point] The difference in the coefficients is not due to OVB as such, but instead due

to sample variability of the estimates. [1 point] Indeed, the difference is small in magnitude

and the difference between the coefficients is cleary not statistically significant (SEs far

greater than difference). [1 point] The fact that the sample size is relatively small means

we should expect some sample variability [1 point].

Note: Even though besides the point, an excellent discussion of OVB or measurement error

can be awarded [up to 1.5 out of 4 points] here.

[1 point; LLN needs to be mentioned] Therefore, by the property of plim,

plim(

β) = β +

plim

1

n

n

i=

x i

u i

plim

1

n

n

i=

x

2

i

= β +

E[x i

u i

]

E[x

2

i

]

= β.

[2 points; deduct 1 point if plim is applied to sums, not averages] where the last equality follows

from

E[x i

u i

] = E[E[x i

u i

|x i

]] = E[x i

E[u i

|x i

]] = 0.

[1 point for showing this explicitly] [The first equality follows from the law of iterated expecta-

tions, the second equality follows from the property of the conditional expectation, and the last

equality follows from SLR.4.] Therefore,

β is consistent for β.

(c) Under SLR.1-5, derive E[

β

2 |X], where X = (x 1

,... , x n

). [4 marks]

Solution : Note that the conditional variance V ar(

β|X) is written as

V ar(

β|X) = E[(

β − E[

β|X])

2

|X]

= E[

β

2

|X] − (E[

β|X])

2

,

where the first equality follows from the definition of the conditional variance, and the second

equality follows from a direct calculation and property of the conditional expectation. Thus,

E[

β

2

|X] can be expressed as

(∗∗) E[

β

2

|X] = V ar(

β|X) + (E[

β|X])

2

.

Thus, we first compute E[

β|X] and V ar(

β|X), and then obtain E[

β

2 |X] by (**).

By (*) and taking the conditional expectation,

E[

β|X] = E

[

β +

n

i=

x i

u i

∑ n

i=

x

2

i

X

]

= β +

E

[∑

n

i=

xiui

X

]

n

i=

x

2

i

= β +

n

i=

x i

E[u i

|X]

n

i=

x

2

i

= β +

n

i=

x i

E[u i

|x i

]

n

i=

x

2

i

= β,

where the first equality follows from (*), the second and third equalities follow from the property

of conditional expectation, the fourth equality follows from SLR.2, and the fifth equality follows

from SLR.4. [1.5 points] Therefore,

E[

β|X] = β.

Also, the conditional variance V ar(

β|X) is obtained as follows:

V ar(

β|X) = V ar

β +

n

i=

x i

u i

∑ n

i=

x

2

i

X

V ar

n

i=

x i

u i

X

n

i=

x

2

i

2

n

i=

x

2

i

V ar(ui|X)

n

i=

x

2

i

2

n

i=

x

2

i

V ar(u i

|x i

n

i=

x

2

i

2

σ

2

n

i=

x

2

i

where the first equality follows from (*), the second and third equalities follow from the property

of conditional variance, the fourth equality follows from SLR.2, and the fifth equality follows from

SLR.5. [1.5 points]

Combining these results with (**), we obtain

E[

β

2

|X] =

σ

2

n

i=

x

2

i

  • β

2

.

[1 point; students need to have derived () here]**

(d) Now suppose another random sample of size n, {(y

(a)

i

, x

(a)

i

) : i = 1,... , n}, is available.

[Here “(a)” is a superscript to signify another sample.] Suppose {(y

(a)

i

, x

(a)

i

) : i = 1,... , n}

is independent of the original sample {(y i

, x i

) : i = 1,... , n}, and we impose the following

assumptions.

SLR.1a The population model is y

(a) = β

(a) x

(a)

  • u

(a)

. [This model may be different from the

one in SLR.1.]

SLR.2a We have a random sample of size n, {(y

(a)

i

, x

(a)

i

) : i = 1,... , n}, following the popula-

tion model in SLR.1a.

SLR.3a The sample outcomes on {x

(a)

i

: i = 1,... , n} are not all the same value.

SLR.4a The error term u

(a) satisfies E(u

(a) |x

(a) ) = 0 for any value of x

(a) .

SLR.5a The error term u

(a) satisfies V ar(u

(a) |x

(a) ) = (σ

(a) )

2 for any value of x

(a) (homoskedas-

ticity).

Let

β

(a)

be the OLS estimator for the regression from y

(a)

on x

(a)

, that is

β

(a)

=

n

i=

x

(a)

i

y

(a)

i

n

i=

(x

(a)

i

2

Show that E[

β −

β

(a) ] = β − β

(a) under SLR.1-4 and SLR.1a-4a. [3.33 marks]

Solution : Note that

E[

β −

β

(a)

] = E[

β] − E[

β

(a)

].

From the derivation in (c), we obtain E[

β|X] = β under SLR.1-4. Therefore,

E[

β] = E[E[

β|X]] = E[β] = β,

β 6 = β

(a)

. [Hint: In this setup, an unbiased estimator of σ

2 is obtained as

s

2

1

2 n− 2

n

i=

2

i

n

i=

(ˆu

(a)

i

2

where uˆ

(a)

i

= y

(a)

i

β

(a) x

(a)

i

.] [4 marks]

Solution : Based on the results derived in (d) and (e) combined with SLR.1-6, SLR.1a-6a, and

σ = σ

(a) , we obtain

β −

β

(a) − E[

β −

β

(a) ]

V ar(

β −

β

(a) |X, X

(a) )

β −

β

(a) − (β − β

(a) )

σ

2

1 ∑ n

i=

x

2

i

1

∑ n

i=

(x

(a)

i

)

2

∼ N (0, 1).

Under the null H 0

: β − β

(a) (and SLR.1-6, SLR.1a-6a, and σ = σ

(a) ), we obtain

t =

β −

β

(a)

s

2

1 ∑ n

i=

x

2

i

1

∑ n

i=

(x

(a)

i

)

2

∼ t 2 n− 2

[3 points; 1 for test statistic, 1 for correct distribution, 1 for required assumptions] Let

t 2 n− 2 , 1 −α/ 2

be the (1 − α/2)-th quantile of the t 2 n− 2

distribution. The testing procedure for

H

0

against H 1

with significance level α is

Reject H 0

if |t| > t 2 n− 2 , 1 −α/ 2

Do not reject H 0 if |t| ≤ t 2 n− 2 , 1 −α/ 2

[1 point]

[Alternative answer: We merge the two samples, and define the multiple regression model

(M ) y˜ i

= β x˜ i

  • β

(a)

˜x

(a)

i

  • ˜u i

for i = 1,... , 2 n, where

˜yi =

y i

for i = 1,... , n,

y

(a)

i−n

for i = n + 1,... , 2 n

˜x i

x i

for i = 1,... , n,

0 for i = n + 1,... , 2 n

˜x

(a)

i

0 for i = 1,... , n,

x

(a)

i

for i = n + 1,... , 2 n

Note that SLR.1-6, SLR.1a-6a, and σ = σ

(a)

guarantee the assumptions MLR.1-6 for this re-

gression model (M ). Therefore, we can test H 0

: β = β

(a) by the t-test based on (M ). The

t-statistic will be identical.] [Points to be awarded in accordance with the first approach]

Question 3

(a) Answer the following questions. [11.33 marks]

(i) Consider two scalar random variables x and u, where E(u) = 0. Compare three concepts:

(1) E(u|x) = 0, (2) Cov(x, u) = 0, and (3) x and u are independent. [5.33 marks]

Solution : First, (1) implies (2). To see that, if (1) is true, then

Cov(x, u) = E(xu) − E(x)E(u)

= E(xu)

= E(E[xu|x])

= E(xE[u|x])

where the first equality follows from the property of covariance, the second equality follows

from the assumption E(u) = 0, the third equality follows from the law of iterated expecta-

tions, the fourth equality follows from the property of the conditional expectation, and the

last equality follows from (1).

Second, on the other hand, (2) does not necessarily imply (1). This can be seen that (1)

implies

E[a(x)u] = E[E[a(x)u|x]] = E[a(x)E[u|x]] = 0,

for any function a(·), by the same argument above using the law of iterated expectation.

However, (2) only guarantees E[a(x)u] = 0 for the case of a(x) = x. [2 points]

Now we argue that (3) is even stronger than (1). To see that (3) implies (1), note that (3)

guarantees

E(u|x) = E(u) = 0,

where the first equality uses (3) and the second equality follows from the assumption

E(u) = 0. On the other hand, (1) does not necessarily imply (3). For example, even if

E(u|x) does not depend on x (and takes zero for all x), the conditional variance V ar(u|x)

may depend on x. In this case, u and x are not independent. [2 points]

In sum, (1) is stronger than (2), and (3) is stronger than (1) in the sense that

but (1) ; (3) and (2) ; (1). [1.33 points]

Note: Be generous in marking here. Few students will prove this as formally as the solution

states. Award marks if the conceptual idea of the different concepts and their hierarchy

comes across.

(ii) Consider the regression model

y = β 0 + β 1 x 1 + β 2 x 2 + u, E(u|x 1 , x 2 ) = 0.

Suppose that the error term is heteroskedastic, (i.e., V ar(u|x 1

, x 2

) varies with x 1

and x 2

).

(ii-1) Explain how to test the null hypothesis H

a

0

: β 1

= β 2

against the one-sided alternative

hypothesis H

a

1

: β 1 > β 2. [3 marks]

(ii-2) Suppose we want to test the null hypothesis H

b

0

: β 1

= β 2

= 0. Write down a test for

this hypothesis under homoskedasticity. Then explain the problem of this testing procedure

under the current setup. [3 marks]

(b) [11 marks]

It is postulated that a reasonable demand-supply model for the wine industry in Australia, under

market clearing assumption, would be given by

Q

t

= α 0

  • α 1

P

w

t

  • α 2

P

b

t

  • α 3

Y

t

  • α 4

A

t

  • u 1 t

demand

Q

t

= β 0

  • β 1

P

w

t

  • β 2

S

t

  • u 2 t

supply

where Q t

= real per capita consumption of wine, P

w

t

= price of wine relative to CPI, P

b

t

= price

of beer relative to CPI, Y t

= real per capita disposable income, A t

= real per capital advertising

expenditure, and S t

= storage cost at time t. CPI is the Consumer Price Index at time t. The

endogenous variables in this model are Q and P

w

, and the exogenous variables are P

b

, Y, A

and S. The variance of u 1 t

and u 2 t

are, respectively σ

2

1

, and σ

2

2

, and Cov(u 1 t

, u 2 t

) = σ 12

The errors do not exhibit any correlation over time.

(i) Derive the reduced form for P

w

t

.

[2 marks]

Solution : 1.5 points Upon equating demand and supply, we get

β 0

  • β 1

P

w

t

  • β 2

S

t

  • u 2 t

= α 0

  • α 1

P

w

t

  • α 2

P

b

t

  • α 3

Y

t

  • α 4

A

t

  • u 1 t

(β 1 − α 1 )P

w

t

= (α 0 − β 0 ) + α 2 P

b

t

  • α 3 Yt + α 4 At − β 2 St + u 1 t − u 2 t

P

w

t

α 0

− β 0

β 1

− α 1

α 2

β 1

− α 1

P

b

t

α 3

β 1

− α 1

Y

t

α 4

β 1

− α 1

A

t

β 2

β 1

− α 1

S

t

u 1 t

− u 2 t

β 1

− α 1

so

P

w

t

= π 0

  • π 1

P

b

t

  • π 2

Y

t

  • π 3

A

t

  • π 4

S

t

  • v t

, where v t

u 1 t

− u 2 t

β 1

− α 1

0.5 point For the 3rd equation we point out that β 1

= α 1

, (opposite signs because they

represent the slope of demand and supply function).

(ii) The OLS estimation of the demand function, based on annual data from 1955-1975 (T =

20), gave the following results (all variables are in logs and figures in parentheses are t-

ratios).

Q

t

(− 6 .04)

(4.0)

P

w

t

(− 0 .45)

P

b

t

(4.5)

Y

t

(− 1 .3)

A

t

All the coefficients except that of Y have the wrong signs. The coefficient of P

w (price

elasticity of demand, α 1

) not only has the wrong sign but also appears significant.

Explain why the OLS parameter estimator may give rise to these counter-intuitive results.

You are expected to use your results in answer (a) to support your answer.

[3 marks]

Solution : 2 points The resulting parameter estimates are biased and inconsistent, so

the parameter estimates, even for large sample (which this one clearly is not) are not likely

to be close to the true parameters. The inconsistency arises from the simultaneity where

prices and quantity are jointly determined.

1 point Using the result in b)i, we can show this explicitly:

Cov(P

w

t

, u 1 t

) = Cov(π 0

  • π 1

P

b

t

  • π 2

Y

t

  • π 2

A

t

  • π 3

S

t

  • v t

, u 1 t

= Cov(vt, u 1 t) as we assume P

b

t

, Yt, At and St are exogenous

= Cov(

u 1 t

− u 2 t

β 1

− α 1

, u 1 t

σ

2

1

− σ 12

β 1

− α 1

(iii) The supply equation is overidentified. Clearly explain this terminology. What distinguishes

overidentification from exact identification and underidentification? Provide one set of as-

sumptions that would render the supply equation exactly identified.

[3 marks]

Solution : 2 points The supply equation is overidentified because there are more instru-

ments than we need to deal with the endogeneity of P

w

t

in the supply equation. Using the

results in b)i, we can consider P

b

t

, Y

t

, A

t

, and S t

( relevance ). All of these variables have

been assumed to be exogenous ( valid ). We cannot use S t

as it affect the supply directly

( exclusion ). The remaining instrument, leave us with two more instruments than we need.

1 point If we only have 1 instrument to deal with the endogeneity of P

w

t

(e.g., say α 2 = α 4 =

0 ) we would have exact identification. (0.5 points if they argue we have exact identification

if two instruments show up in the supply equation)

(iv) Discuss how you should estimate the supply equation in light of the overidentification. Dis-

cuss the benefit of using overidentification conditions. [3 marks]

Solution : 2 points The student would need to describe the 2SLS procedure here.

Step 1: Estimate the reduced form (given in b)i) by OLS and obtain fitted values

P

w

t

= ˆπ 0

  • ˆπ 1

P

b

t

  • ˆπ 2

Y

t

  • ˆπ 3

A

t

  • ˆπ 4

S

t

Step 2: Estimate the following regression by OLS using the obove fitted values:

Q

t

= β 0

  • β 1

P

w

t

  • β 2

S

t

  • e 2 t

1 point The efficiency (precision) of our estimates will be better when utilizing the overiden-

tification condition rather than restricting ourselves to using only a single instrument (exact

identification). 2SLS really is an optimal IV estimator, which uses the best combination of

all instruments to ensure better precision of our parameter estimates.

2 points We should now show that, e.g., E (v t

v t− 1

) 6 = 0. Using the definition of v t

we have

E (vtvt− 1 ) = E ((ρcrimet− 1 + et) (ρcrimet− 2 + et− 1 ))

= ρ

2

E (crime t− 1

crime t− 2

) + E (e t

e t− 1

ρE (crime t− 2

e t

) + ρE (crime t− 1

e t− 1

1 point Clear explanations: The second term is zero because e t

does not exhibit autocor-

relation, the third term is zero as E(e t

|crime t− 1

, clearup t

, clearup t− 1

, ...) = 0, the final

term is non-zero because our equation (4.1) shows that crimet is a function of et, similarly

the first term is non-zero because ρ 6 = 0 revealing (using (4.1) again) that crime t− 1

is be a

function of crimet− 2 (lagging (4.1) one period).

(iii) Let us assume that the true model is displayed in (4.2), where v t

exhibits autocorrelation of

unknown form that displays weak dependence, and E(v t

|clearup t

, clearup t− 1

, clearup t− 2

  1. You are asked to test whether the long run effect of clear-up rates on the crime rate is

significant. Discuss how you can obtain the standard error of the long run effect required

to conduct the test. [5 marks]

Solution : 1 point The long run effect in this model is given by δ = β 1

  • β 2

  • β 3

1.5 points We can use the asymptotic t-test to test the hypothesis H 0

: δ = 0 against

H

A

: δ 6 = 0. The test statistic is

δ

SE

δ

a

∼ N (0, 1) under H 0

(t distribution acceptable)

and at the 5% level of significance we should reject if

ˆ δ

SE(

ˆ δ)

0.5 points Because of the presence of autocorrelation in the errors, we do need to

use HAC robust standard errors.

2 point To obtain the standard errors with standard regression package it would be

convenient to reparametrize the model beforehand:

crimet = β 0 +δclearupt +δ 1 (clearupt− 1 − clearupt)+δ 2 (clearupt− 2 − clearupt)+vt.

Alternatively, we could use the fact that

SE

β 1

β 2

β 3

SE (

β 1

2

  • SE (

β 2

2

  • SE (

β 3

2

  • 2

Cov(

β 1

β 2

Cov(

β 1

β 3

Cov(

β 2

β 3

the latter would therefore require us to obtain the estimated covariances.

(b) [8 marks]

Stevenson and Wolfers (2008) amongst others have analysed happiness using data collected

in the General Social Survey. Here we are interested in explaining the binary variable vhappy, a

dummy variable that denotes whether an individual considers him/herself "very happy" or not

(1 = yes, 0 = no). The following socio-demographic variables are considered: occattend and re-

gattend (which are dummy variables indicating whether the individual occasionally or regularly

attends church, where the excluded dummy indicates that the individual never attends church),

income (family income in ’000US$), unemp10 (dummy indicating whether the individuals has

been unemployed in the last 10 years), and educ (years of education completed). A random

sample of observations from US are available.

Advised that there are benefits to using the Probit model over the Linear Probability Model (you

are not asked to discuss this), you obtain the following results:

(i) Discuss how you can obtain the predicted probability of an individual who regularly attends

church, whose lincome equals 5, has not been unemployed in the last 10 years and has 13

years of education.

[3 marks]

Solution : The predicted probability of this individual is given by

Φ (− 1 .218 + .256 +. 198 ∗ 5 +. 0218 ∗ 13) = Φ (.311) [2 points] =. 6217 [1 point]

The predicted probability is 62.2% (or simply 0.62).

(ii) You want to test the joint significance of the church attendance variables occattend and

regattend. How would you conduct this test, and what additional information would you

require to implement it? Given the results presented in the table, what do you expect the

outcome of this test to be? Briefly explain your answer. [5 marks]

Solution : We are asked to test H 0 : β occattend

= β regattend

= 0 against H A

: at least

one of the coefficients is non-zero.

1 point We would want to run another probit regression where these two variables

are left out.

2 points Our LR will then compare the log-likelihood of the original model with the