










Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
Past paper solutions for econometrics
Typology: Exams
1 / 18
This page cannot be seen from the preview
Don't miss anything!
(Answer all questions. This section carries 1/3 of the overall mark.)
[33.34 marks]
Charitable giving is an increasingly important component of the economy. Yet, relatively little is known
about what motivates people to give to charities. Fundraisers are interested in assessing the effec-
tiveness of door-to-door fundraising campaigns, which are normally expensive and time-consuming.
A consultancy firm in London has access to a dataset on various characteristics (home address, age,
gender, occupation, whether the household donates before, ...) of 170,000 households.
In summer 2020, the firm sends ask-for-donation flyers to 100,000 households. The flyer only contains
the exact time of a door-to-door charity solicitation in the neighbourhood. The firm codes Flyer i
if household i is sent a flyer, and 0 otherwise. The team then visits the 100,000 households, but are
only able to speak with 60,000 of them. The firm codes Visited i
= 1 if the door-to-door solicitor
could speak to household i, and 0 otherwise. After the flyer campaign, the firm records in the variable
Donatei = 1 if household i donates to a charity, and 0 otherwise.
(a) The consultancy firm claims that the 60,000 successful visits should constitute a random sam-
ple from the intended population of 100,000 households who receive flyers. Critically discuss
the claim. Could the team statistically test whether or not the claim holds? If yes, carefully de-
scribe one suitable test and necessary assumptions. If not, carefully explain why. [5 marks]
Solution : Students can discuss both sides of the story. The key aspect is the definition of a
random sample: the reason for the team being unable to reach the households door-to-door
should not be related with the intention to donate (i.e., the households should not actively try
to avoid the fundraisers). One story for why it might hold is that the team could not reach a
household because the time is not suitable for the household schedule for example, a visit to
relative, or another random outgoing event. Clearly this reason may not be plausible because
40,000 out of 100,000 miss the appointed time, indicating some systematic absence. That
is, the households would systematically avoid the fundraiser to avoid the door-to-door ask for
donation and thereby giving, e.g. because the visits are during the day and would therefore not
reach the working population which may have a different willingness to donate from the non-
working population. [3 marks for a good discussion of the randomness or selection story (or
both)]
We can statistically test this claim by running a balance test of observable characteristics be-
tween the successfully visited 60,000 households and the remaining 40,000. We do so by com-
paring the averages of household characteristics available in the data between the 60,000 and
the 40,000. [1 point] If we observe statistically significant differences in some averages be-
tween the two groups, the claim does not hold statistically. Even if we observe statistically
insignificant differences for all of the averages, to conclude that the claim holds, we must as-
sume that no unobservable characteristics could explain intention to donate. [1 point] Clearly
this is a very strong claim to make.
Note: No marks should be awarded for a discussion of how the 100,000 households receiv-
ing flyers may not be a random sample – the question is about the 60,000 households among
flyer-receiving households who are visited.
Give [1 out of 2 points] in the second part if students suggest that randomness cannot be tested.
No marks for suggesting a regression of donation on visiting – clearly we do not have informa-
tion on donation for unvisited households.
(b) The firm claims that the 100,000 households were randomly selected from the original 170,
addresses. Assume that the claim is true. The firm investigates the causal effect of the ask-
for-donation flyer campaign on giving behaviour in London.
(i) Could the firm use the available information to answer the causal question? If yes, carefully
describe and interpret the estimation regression that you would run. If not, explain why and
describe the additional information and assumptions you would require. [3.34 marks]
Solution : Yes we can. Using the full sample on 170,000 households [0.34 points] , we can
run the following regression:
Donate i
= α + βFlyer i
γX i
e i
(1)
[2 points; 1.5 if students include V isited i
or the interaction of V isited i
and F lyer i
in the
regression] where β is the coefficient of interest. We can also include a set of good neutral
controls of variables that could help explain intention to donate to improve the precision of
the causal estimate. [0.5 point]
The equation would help because the treatment variable Flyer i
is effectively randomised
so that there will no selection bias between those who receive the flyer and those who do
not. [0.5 point]
Note: A good discussion of random sampling in this question deserves [1 point].
A good discussion of a regression on V isited i
instead of F lyer i
deserves [1 point].
Do not penalise for using the probability to donate as an outcome variable.
(ii) A critic suggests pensioners are more likely to donate when they receive an ask-for-donation
flyer. Let Pensioneri = 1 if household i
′
s head is a pensioner, 0 otherwise. Could you test
this claim? If yes, carefully describe how. If no, explain why. [4 marks]
Solution : There are three steps. First, we could create a new interaction Pensioner i
Flyer i
and include the interaction in the equation above (together with Pensioner i
).
Donate i
= α 0
Flyer i
Pensioner i
Pensioner i
× Flyer i
i
[1.5 points]
(2)
Second, β 2
is the differential effect of receiving a flyer for pensioners and we would like to
test whether this is statistically significant and positive. [1 point]
Third, we can use a t-test to test this hypothesis [0.5 points; z-test also acceptable]. The
hypotheses are H 0 : β 2 = 0 vs. H 1 : β 2 > 0. [0.5 points; two-sided test acceptable] A
rejection of the null hypothesis will provide support for the claim. [0.5 points]
Note: Do not penalise if students include V isited i
as additional regressor here.
If students do not include the interaction but only F lyer i
and P ensioner i
, a very good
discussion can be awarded [2 out of 4 points] for the whole question.
(iii) The firm now would like to estimate the causal effect of a door-to-door solicitation visit
Dependent Variable: Amount of Donations i
Regressor (1) (2) (3)
Photographs Included i
(7.222)
(4.102)
(5.193)
Female-headedi
(0.151)
(0.061)
Pensioner i
(0.152)
Constant
(12.244)
(6.112)
(9.189)
Observations 100 100 100
(i) What are the average donations from all the households in the sample after the campaign?
Carefully explain your answer. If you cannot derive the answer, clearly indicate any further
information or assumptions necessary for your calculation. [2 marks]
Solution : The average donation from households without the photographs is the con-
stant: x without
= £ 80. 523. [0.5 point]
The average donation from households with the photographs is the constant plus the coef-
ficient for Photographs Included i
= 1: x with
= £ 80 .523 + 15.751 = £ 96. 274. [0.5 point]
The average donation from all households is:
x = 0. 4 × x without
(4)
= 32.2092 + 57.7644 = £ 89. 9736 [1 point] (6)
Note: The precise decimals are not important. A result around 89.97 or 90 pounds suffices.
Wrong calculations that include the constant are awarded [0.5 points].
(ii) A critic cites prior research that pensioners are just more likely to donate than working
households, all else equal. The critic suggests that the team must include the variable
Pensioner i
, which indicates whether household i
′
s head is a pensioner, in the estimation to
avoid omitted variable bias. Critically evaluate the critic
′ s suggestion. [5 marks]
Solution : If the photographs were properly randomised to include in the flyers sent to
households and the sample size is large enough (relying on the Central Limit Theorem),
the treatment should be randomised and any characteristics of the households would NOT
constitute an omitted variable. While being pensioners might be correlated with the out-
come, it should not be correlated with the treatment or the expectation of receiving the
treatment due to the randomisation. [2] points] Therefore, we do not have to include the
variable as the point estimate for the effect of Photographs to change when we add the
control for Pensioner as there would be no OVB. However, since Pensioner could explain
some of the residual error, it will help reduce the standard error of our estimates. The team
could include the variable in to improve the precision. [2 points]. This is especially impor-
tant in our case because our sample size is quite small (only 100) and the variance formula
indicates that we should include good neutral controls to improve precision. [1 point]
Note: Students should discuss the sample size to get the full mark.
If students refer to a regression involving the interaction between treatment and being a
pensioner and argue that this interaction suffers from OVB, award [up to 4 out of 5 points]
for an excellent discussion.
Even though besides the point, an excellent discussion of OVB can be awarded [up to 2 out
of 5 points] here.
(iii) Another critic interprets Column (2) as a causal evidence for women being more altruistic
than men. His rationale is that gender is assigned randomly at birth, Column (2) captures
the causal effect of having a female household head on the amount of donations. Carefully
explain whether the critic is right or wrong? [3 marks]
Solution : It is true that gender is randomised at birth, but it is not random that some
households have a female head. [1.5 point] As such, the interpretation of the coefficient
for Female cannot be causal, and could be biased. Households with a female head might
be systematically different from a male-headed female in a way that is correlated with their
intention to donate. [1.5 points].
Note: Students arguing that the critic is wrong for a different reason should be awarded
partial marks.
(iv) Carefully explain why the coefficients for Photographs Included i
are different in Column (1),
Column (2), and Column (3)? Clearly state any assumptions you make. [4 marks]
Solution : Because the treatment is randomised, there should be no OVB in the estima-
tions. [1 point] The difference in the coefficients is not due to OVB as such, but instead due
to sample variability of the estimates. [1 point] Indeed, the difference is small in magnitude
and the difference between the coefficients is cleary not statistically significant (SEs far
greater than difference). [1 point] The fact that the sample size is relatively small means
we should expect some sample variability [1 point].
Note: Even though besides the point, an excellent discussion of OVB or measurement error
can be awarded [up to 1.5 out of 4 points] here.
[1 point; LLN needs to be mentioned] Therefore, by the property of plim,
plim(
β) = β +
plim
1
n
n
i=
x i
u i
plim
1
n
n
i=
x
2
i
= β +
E[x i
u i
E[x
2
i
= β.
[2 points; deduct 1 point if plim is applied to sums, not averages] where the last equality follows
from
E[x i
u i
] = E[E[x i
u i
|x i
]] = E[x i
E[u i
|x i
[1 point for showing this explicitly] [The first equality follows from the law of iterated expecta-
tions, the second equality follows from the property of the conditional expectation, and the last
equality follows from SLR.4.] Therefore,
β is consistent for β.
(c) Under SLR.1-5, derive E[
β
2 |X], where X = (x 1
,... , x n
). [4 marks]
Solution : Note that the conditional variance V ar(
β|X) is written as
V ar(
β|X) = E[(
β − E[
β|X])
2
|X]
β
2
|X] − (E[
β|X])
2
,
where the first equality follows from the definition of the conditional variance, and the second
equality follows from a direct calculation and property of the conditional expectation. Thus,
β
2
|X] can be expressed as
β
2
|X] = V ar(
β|X) + (E[
β|X])
2
.
Thus, we first compute E[
β|X] and V ar(
β|X), and then obtain E[
β
2 |X] by (**).
By (*) and taking the conditional expectation,
β|X] = E
β +
n
i=
x i
u i
∑ n
i=
x
2
i
= β +
n
i=
xiui
n
i=
x
2
i
= β +
n
i=
x i
E[u i
n
i=
x
2
i
= β +
n
i=
x i
E[u i
|x i
n
i=
x
2
i
= β,
where the first equality follows from (*), the second and third equalities follow from the property
of conditional expectation, the fourth equality follows from SLR.2, and the fifth equality follows
from SLR.4. [1.5 points] Therefore,
β|X] = β.
Also, the conditional variance V ar(
β|X) is obtained as follows:
V ar(
β|X) = V ar
β +
n
i=
x i
u i
∑ n
i=
x
2
i
V ar
n
i=
x i
u i
n
i=
x
2
i
2
n
i=
x
2
i
V ar(ui|X)
n
i=
x
2
i
2
n
i=
x
2
i
V ar(u i
|x i
n
i=
x
2
i
2
σ
2
n
i=
x
2
i
where the first equality follows from (*), the second and third equalities follow from the property
of conditional variance, the fourth equality follows from SLR.2, and the fifth equality follows from
SLR.5. [1.5 points]
Combining these results with (**), we obtain
β
2
|X] =
σ
2
n
i=
x
2
i
2
.
[1 point; students need to have derived () here]**
(d) Now suppose another random sample of size n, {(y
(a)
i
, x
(a)
i
) : i = 1,... , n}, is available.
[Here “(a)” is a superscript to signify another sample.] Suppose {(y
(a)
i
, x
(a)
i
) : i = 1,... , n}
is independent of the original sample {(y i
, x i
) : i = 1,... , n}, and we impose the following
assumptions.
SLR.1a The population model is y
(a) = β
(a) x
(a)
(a)
. [This model may be different from the
one in SLR.1.]
SLR.2a We have a random sample of size n, {(y
(a)
i
, x
(a)
i
) : i = 1,... , n}, following the popula-
tion model in SLR.1a.
SLR.3a The sample outcomes on {x
(a)
i
: i = 1,... , n} are not all the same value.
SLR.4a The error term u
(a) satisfies E(u
(a) |x
(a) ) = 0 for any value of x
(a) .
SLR.5a The error term u
(a) satisfies V ar(u
(a) |x
(a) ) = (σ
(a) )
2 for any value of x
(a) (homoskedas-
ticity).
Let
β
(a)
be the OLS estimator for the regression from y
(a)
on x
(a)
, that is
β
(a)
=
n
i=
x
(a)
i
y
(a)
i
n
i=
(x
(a)
i
2
Show that E[
β −
β
(a) ] = β − β
(a) under SLR.1-4 and SLR.1a-4a. [3.33 marks]
Solution : Note that
β −
β
(a)
] = E[
β] − E[
β
(a)
].
From the derivation in (c), we obtain E[
β|X] = β under SLR.1-4. Therefore,
β] = E[E[
β|X]] = E[β] = β,
β 6 = β
(a)
. [Hint: In this setup, an unbiased estimator of σ
2 is obtained as
s
1
2 n− 2
n
i=
uˆ
2
i
n
i=
(ˆu
(a)
i
2
where uˆ
(a)
i
= y
(a)
i
β
(a) x
(a)
i
.] [4 marks]
Solution : Based on the results derived in (d) and (e) combined with SLR.1-6, SLR.1a-6a, and
σ = σ
(a) , we obtain
β −
β
(a) − E[
β −
β
(a) ]
V ar(
β −
β
(a) |X, X
(a) )
β −
β
(a) − (β − β
(a) )
σ
2
1 ∑ n
i=
x
2
i
1
∑ n
i=
(x
(a)
i
)
2
Under the null H 0
: β − β
(a) (and SLR.1-6, SLR.1a-6a, and σ = σ
(a) ), we obtain
t =
β −
β
(a)
s
2
1 ∑ n
i=
x
2
i
1
∑ n
i=
(x
(a)
i
)
2
∼ t 2 n− 2
[3 points; 1 for test statistic, 1 for correct distribution, 1 for required assumptions] Let
t 2 n− 2 , 1 −α/ 2
be the (1 − α/2)-th quantile of the t 2 n− 2
distribution. The testing procedure for
0
against H 1
with significance level α is
Reject H 0
if |t| > t 2 n− 2 , 1 −α/ 2
Do not reject H 0 if |t| ≤ t 2 n− 2 , 1 −α/ 2
[1 point]
[Alternative answer: We merge the two samples, and define the multiple regression model
(M ) y˜ i
= β x˜ i
(a)
˜x
(a)
i
for i = 1,... , 2 n, where
˜yi =
y i
for i = 1,... , n,
y
(a)
i−n
for i = n + 1,... , 2 n
˜x i
x i
for i = 1,... , n,
0 for i = n + 1,... , 2 n
˜x
(a)
i
0 for i = 1,... , n,
x
(a)
i
for i = n + 1,... , 2 n
Note that SLR.1-6, SLR.1a-6a, and σ = σ
(a)
guarantee the assumptions MLR.1-6 for this re-
gression model (M ). Therefore, we can test H 0
: β = β
(a) by the t-test based on (M ). The
t-statistic will be identical.] [Points to be awarded in accordance with the first approach]
(a) Answer the following questions. [11.33 marks]
(i) Consider two scalar random variables x and u, where E(u) = 0. Compare three concepts:
(1) E(u|x) = 0, (2) Cov(x, u) = 0, and (3) x and u are independent. [5.33 marks]
Solution : First, (1) implies (2). To see that, if (1) is true, then
Cov(x, u) = E(xu) − E(x)E(u)
= E(xu)
= E(E[xu|x])
= E(xE[u|x])
where the first equality follows from the property of covariance, the second equality follows
from the assumption E(u) = 0, the third equality follows from the law of iterated expecta-
tions, the fourth equality follows from the property of the conditional expectation, and the
last equality follows from (1).
Second, on the other hand, (2) does not necessarily imply (1). This can be seen that (1)
implies
E[a(x)u] = E[E[a(x)u|x]] = E[a(x)E[u|x]] = 0,
for any function a(·), by the same argument above using the law of iterated expectation.
However, (2) only guarantees E[a(x)u] = 0 for the case of a(x) = x. [2 points]
Now we argue that (3) is even stronger than (1). To see that (3) implies (1), note that (3)
guarantees
E(u|x) = E(u) = 0,
where the first equality uses (3) and the second equality follows from the assumption
E(u) = 0. On the other hand, (1) does not necessarily imply (3). For example, even if
E(u|x) does not depend on x (and takes zero for all x), the conditional variance V ar(u|x)
may depend on x. In this case, u and x are not independent. [2 points]
In sum, (1) is stronger than (2), and (3) is stronger than (1) in the sense that
but (1) ; (3) and (2) ; (1). [1.33 points]
Note: Be generous in marking here. Few students will prove this as formally as the solution
states. Award marks if the conceptual idea of the different concepts and their hierarchy
comes across.
(ii) Consider the regression model
y = β 0 + β 1 x 1 + β 2 x 2 + u, E(u|x 1 , x 2 ) = 0.
Suppose that the error term is heteroskedastic, (i.e., V ar(u|x 1
, x 2
) varies with x 1
and x 2
).
(ii-1) Explain how to test the null hypothesis H
a
0
: β 1
= β 2
against the one-sided alternative
hypothesis H
a
1
: β 1 > β 2. [3 marks]
(ii-2) Suppose we want to test the null hypothesis H
b
0
: β 1
= β 2
= 0. Write down a test for
this hypothesis under homoskedasticity. Then explain the problem of this testing procedure
under the current setup. [3 marks]
(b) [11 marks]
It is postulated that a reasonable demand-supply model for the wine industry in Australia, under
market clearing assumption, would be given by
t
= α 0
w
t
b
t
t
t
demand
t
= β 0
w
t
t
supply
where Q t
= real per capita consumption of wine, P
w
t
= price of wine relative to CPI, P
b
t
= price
of beer relative to CPI, Y t
= real per capita disposable income, A t
= real per capital advertising
expenditure, and S t
= storage cost at time t. CPI is the Consumer Price Index at time t. The
endogenous variables in this model are Q and P
w
, and the exogenous variables are P
b
, Y, A
and S. The variance of u 1 t
and u 2 t
are, respectively σ
2
1
, and σ
2
2
, and Cov(u 1 t
, u 2 t
) = σ 12
The errors do not exhibit any correlation over time.
(i) Derive the reduced form for P
w
t
.
[2 marks]
Solution : 1.5 points Upon equating demand and supply, we get
β 0
w
t
t
= α 0
w
t
b
t
t
t
(β 1 − α 1 )P
w
t
= (α 0 − β 0 ) + α 2 P
b
t
w
t
α 0
− β 0
β 1
− α 1
α 2
β 1
− α 1
b
t
α 3
β 1
− α 1
t
α 4
β 1
− α 1
t
β 2
β 1
− α 1
t
u 1 t
− u 2 t
β 1
− α 1
so
w
t
= π 0
b
t
t
t
t
, where v t
u 1 t
− u 2 t
β 1
− α 1
0.5 point For the 3rd equation we point out that β 1
= α 1
, (opposite signs because they
represent the slope of demand and supply function).
(ii) The OLS estimation of the demand function, based on annual data from 1955-1975 (T =
20), gave the following results (all variables are in logs and figures in parentheses are t-
ratios).
t
(− 6 .04)
(4.0)
w
t
(− 0 .45)
b
t
(4.5)
t
(− 1 .3)
t
All the coefficients except that of Y have the wrong signs. The coefficient of P
w (price
elasticity of demand, α 1
) not only has the wrong sign but also appears significant.
Explain why the OLS parameter estimator may give rise to these counter-intuitive results.
You are expected to use your results in answer (a) to support your answer.
[3 marks]
Solution : 2 points The resulting parameter estimates are biased and inconsistent, so
the parameter estimates, even for large sample (which this one clearly is not) are not likely
to be close to the true parameters. The inconsistency arises from the simultaneity where
prices and quantity are jointly determined.
1 point Using the result in b)i, we can show this explicitly:
Cov(P
w
t
, u 1 t
) = Cov(π 0
b
t
t
t
t
, u 1 t
= Cov(vt, u 1 t) as we assume P
b
t
, Yt, At and St are exogenous
= Cov(
u 1 t
− u 2 t
β 1
− α 1
, u 1 t
σ
2
1
− σ 12
β 1
− α 1
(iii) The supply equation is overidentified. Clearly explain this terminology. What distinguishes
overidentification from exact identification and underidentification? Provide one set of as-
sumptions that would render the supply equation exactly identified.
[3 marks]
Solution : 2 points The supply equation is overidentified because there are more instru-
ments than we need to deal with the endogeneity of P
w
t
in the supply equation. Using the
results in b)i, we can consider P
b
t
t
t
, and S t
( relevance ). All of these variables have
been assumed to be exogenous ( valid ). We cannot use S t
as it affect the supply directly
( exclusion ). The remaining instrument, leave us with two more instruments than we need.
1 point If we only have 1 instrument to deal with the endogeneity of P
w
t
(e.g., say α 2 = α 4 =
0 ) we would have exact identification. (0.5 points if they argue we have exact identification
if two instruments show up in the supply equation)
(iv) Discuss how you should estimate the supply equation in light of the overidentification. Dis-
cuss the benefit of using overidentification conditions. [3 marks]
Solution : 2 points The student would need to describe the 2SLS procedure here.
Step 1: Estimate the reduced form (given in b)i) by OLS and obtain fitted values
w
t
= ˆπ 0
b
t
t
t
t
Step 2: Estimate the following regression by OLS using the obove fitted values:
t
= β 0
w
t
t
1 point The efficiency (precision) of our estimates will be better when utilizing the overiden-
tification condition rather than restricting ourselves to using only a single instrument (exact
identification). 2SLS really is an optimal IV estimator, which uses the best combination of
all instruments to ensure better precision of our parameter estimates.
2 points We should now show that, e.g., E (v t
v t− 1
) 6 = 0. Using the definition of v t
we have
E (vtvt− 1 ) = E ((ρcrimet− 1 + et) (ρcrimet− 2 + et− 1 ))
= ρ
2
E (crime t− 1
crime t− 2
) + E (e t
e t− 1
ρE (crime t− 2
e t
) + ρE (crime t− 1
e t− 1
1 point Clear explanations: The second term is zero because e t
does not exhibit autocor-
relation, the third term is zero as E(e t
|crime t− 1
, clearup t
, clearup t− 1
, ...) = 0, the final
term is non-zero because our equation (4.1) shows that crimet is a function of et, similarly
the first term is non-zero because ρ 6 = 0 revealing (using (4.1) again) that crime t− 1
is be a
function of crimet− 2 (lagging (4.1) one period).
(iii) Let us assume that the true model is displayed in (4.2), where v t
exhibits autocorrelation of
unknown form that displays weak dependence, and E(v t
|clearup t
, clearup t− 1
, clearup t− 2
significant. Discuss how you can obtain the standard error of the long run effect required
to conduct the test. [5 marks]
Solution : 1 point The long run effect in this model is given by δ = β 1
β 2
β 3
1.5 points We can use the asymptotic t-test to test the hypothesis H 0
: δ = 0 against
A
: δ 6 = 0. The test statistic is
δ
δ
a
∼ N (0, 1) under H 0
(t distribution acceptable)
and at the 5% level of significance we should reject if
ˆ δ
ˆ δ)
0.5 points Because of the presence of autocorrelation in the errors, we do need to
use HAC robust standard errors.
2 point To obtain the standard errors with standard regression package it would be
convenient to reparametrize the model beforehand:
crimet = β 0 +δclearupt +δ 1 (clearupt− 1 − clearupt)+δ 2 (clearupt− 2 − clearupt)+vt.
Alternatively, we could use the fact that
SE
β 1
β 2
β 3
β 1
2
β 2
2
β 3
2
Cov(
β 1
β 2
Cov(
β 1
β 3
Cov(
β 2
β 3
the latter would therefore require us to obtain the estimated covariances.
(b) [8 marks]
Stevenson and Wolfers (2008) amongst others have analysed happiness using data collected
in the General Social Survey. Here we are interested in explaining the binary variable vhappy, a
dummy variable that denotes whether an individual considers him/herself "very happy" or not
(1 = yes, 0 = no). The following socio-demographic variables are considered: occattend and re-
gattend (which are dummy variables indicating whether the individual occasionally or regularly
attends church, where the excluded dummy indicates that the individual never attends church),
income (family income in ’000US$), unemp10 (dummy indicating whether the individuals has
been unemployed in the last 10 years), and educ (years of education completed). A random
sample of observations from US are available.
Advised that there are benefits to using the Probit model over the Linear Probability Model (you
are not asked to discuss this), you obtain the following results:
(i) Discuss how you can obtain the predicted probability of an individual who regularly attends
church, whose lincome equals 5, has not been unemployed in the last 10 years and has 13
years of education.
[3 marks]
Solution : The predicted probability of this individual is given by
Φ (− 1 .218 + .256 +. 198 ∗ 5 +. 0218 ∗ 13) = Φ (.311) [2 points] =. 6217 [1 point]
The predicted probability is 62.2% (or simply 0.62).
(ii) You want to test the joint significance of the church attendance variables occattend and
regattend. How would you conduct this test, and what additional information would you
require to implement it? Given the results presented in the table, what do you expect the
outcome of this test to be? Briefly explain your answer. [5 marks]
Solution : We are asked to test H 0 : β occattend
= β regattend
= 0 against H A
: at least
one of the coefficients is non-zero.
1 point We would want to run another probit regression where these two variables
are left out.
2 points Our LR will then compare the log-likelihood of the original model with the