









Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
Economic statistics is mainly concerned with collecting, processing, and presenting economic data in the form of charts and tables.
Typology: Lecture notes
1 / 15
This page cannot be seen from the preview
Don't miss anything!
1.1 Definition
Econometrics is a mixture of economics, mathematics and statistics as illustrated in Figure 1.1.1.
Figure 1.1.
Economics, or economic theory, makes statements or hypotheses that are mostly qualitative in nature. Only econometrics gives empirical or numerical content to most economic theory.
The main concern of mathematical economics is to express economic theory in mathematical form (equations) without regard to measurability or empirical verification of the theory. Econometrics, as noted previously, is mainly interested in the empirical verification of economic theory.
Economic statistics is mainly concerned with collecting, processing, and presenting economic data in the form of charts and tables. The data thus collected constitute the raw data for econometric work. But the economic statistician does not go any further, not being concerned with using the collected data to test economic theories.
Although mathematical statistics provides many tools to analyze the data, the econometrician often needs special methods in view of the unique nature of most economic data, namely, that the data are not generated as the result of a controlled experiment. The econometrician generally depends on data that cannot be controlled directly.
In econometrics the modeler is often faced with observational as opposed to experimental data.
That is, in the social sciences, the data that one generally encounters are non-experimental in nature, that is, not subject to the control of the researcher. This lack of control often creates special problems for the researcher in pinning down the exact causes affecting a particular situation.
1.2 Categories
General- and narrow-defined econometrics The former is a general designation of all econometrics methods such as regression analysis, input-output analysis, time series analysis, semi- and non-parametric analysis and so on. The latter is just the usual-said classical econometric by using the regression analysis.
Theoretical and applied econometrics Theoretical econometrics is concerned with the development of appropriate methods for measuring economic relationships specified by econometric models. It must spell out the assumptions of this method, its properties, and what happens to these properties when one or more of the assumptions of the method are not fulfilled. In applied econometrics we use the tools of theoretical econometrics to study some special field(s) of economics and business, such as the production function, investment function, demand and supply functions, portfolio theory, etc.
1.3 Application
The application of econometrics includes four parts: structure analysis, economic forecasting or predicting, policy comments, verifying and developing economic theory.
Structure analysis
It studies the relations between economic variables and different from the usual-said industrial structure, consumption structure, invest structure and so on. That is, it studies the influence of the change of one or some variables on the others or the whole economic system by using the elastic analysis, multiplier analysis and comparative static analysis.
Economic forecasting
Econometrics can be used to either analyze the historic data or forecasting the future. Econometric model is developed from the application in economic forecasting.
Case: GHI Financial Services Limited It is late February 1998, GHI Financial Services limited, a British company, has an offer to buy a call option on the US dollar expiring on 6 March. The following information is available:
Table 1.1.2 Measures of forecasting accuracy Measure Simple 50-day 25-day Double ARIMA MAE 0.569 0.393 0.430 0.379 0. MSE 0.0045 0.0020 0.0020 0.0019 0. R^2 0.18^ 0.18^ 0.08^ 0.36^ 0.
DA 0.56 0.44 0.33 0.44 0.
On this occasion, therefore, the model generating the most accurate forecasts has led to the right decision.
Policy comments
Policy comment is to choose the better policy among many different policies to implement, or study the influence of different policies on economic targets. Because the economic policy cannot be experimented in advance, the econometric models that reveal the relations of variables in economic system could be used as an economic policy lab to comment the influence of different policies on economic targets.
There are three approaches to do so, in which the economic target is used as the dependent variable of the economic model and the economic policy as the explanatory variable.
Verifying and developing economic theory
There are two functions of econometric model. One is to verify the economic theory; that is, constructing the model using certain economic theory and then applied the historic sample data to fit the model; if the fit is good, the theory is confirmed. Another is to discover and develop economic theory. You can use the economic data to fit many kinds of econometric model you can imagine. The relations revealed by the econometric model with the best fit are just the law that the economic behavior follows.
Broadly speaking, traditional econometric methodology proceeds along the following lines:
To illustrate the preceding steps, let us consider the well-known Keynesian theory of consumption.
Keynes stated: The fundamental psychological law is that men [women] are disposed, as a rule and on average, to increase their consumption as their income increases, but not as much as the increase in their income.
A mathematical economist might suggest the following form of the Keynesian consumption function:
y = β 1 + β 2 x 0 < β 2 < 1 1.2.
where y = consumption expenditure and x = income, and where β1 and β2, known as the parameters of the model, are, respectively, the intercept and slope coefficients.
To allow for the inexact relationships between economic variables, the econometrician would modify the deterministic consumption function (1.2.1) as follows:
y = β 1 + β 2 x + ε 1.2.
where^ ε^ , known as the disturbance, or error, term, is a random (stochastic) variable that has well-defined probabilistic properties. The disturbance term ε may well represent all those factors that affect consumption but are not taken into account explicitly.
Using this technique and the given data, we obtain the following estimates of β1 and β2, namely, ā184.08 and 0.7064. Thus, the estimated consumption function is:
The estimated consumption function (i.e., regression line) is shown in above Figure.
Four tests before forecast:
As noted earlier, Keynes expected the MPC to be positive but less than 1. In our example we found the MPC to be about 0.70.
But before we accept this finding as confirmation of Keynesian consumption theory, we must enquire whether this estimate is sufficiently below unity to convince us that this is not a chance occurrence or peculiarity of the particular data we have used. In other words, is 0.70 statistically less than 1? If it is, it may support Keynesā theory. We have to develop suitable criteria to find out whether the estimates are in accord with the expectations of the theory that is being tested. Such confirmation or refutation of economic theories on the basis of sample evidence is based on a branch of statistical theory known as statistical inference. Throughout this course we shall see how this inference process is actually conducted.
If the chosen model does not refute the hypothesis or theory under consideration, we may use it to predict the future value(s) of the dependent variable y on the basis of known or expected future value(s) of the explanatory, or predictor, variable x.
To illustrate, suppose we want to predict the mean consumption expenditure for 1997. The GDP value for 1997 was 7269.8 billion dollars. Putting this GDP figure on the right-hand side of (1.2.3), we obtain:
If the regression results given in (1.2.3) seem reasonable, simple arithmetic will show that
which gives x = 7197, approximately. That is, an income level of about 7197 (billion) dollars, given an MPC of about 0.70, will produce an expenditure of about 4900 billion dollars.
3.1 Why is called Regression?
The term regression was introduced by Francis Galton. In a famous paper, Galton found that, although there was a tendency for tall parents to have tall children and for short parents to have short children, the average height of children born of parents of a given height tended to move or āregressā toward the average height in the population as a whole.
Definition: The modern interpretation of regression is, however, quite different. Broadly speaking, we may say Regression analysis is concerned with the study of the dependence of one variable, the dependent variable, on one or more other variables, the explanatory variables, with a view to estimating and/or predicting the (population) mean or average value of the former in terms of the known or fixed (in repeated sampling) values of the latter.
Examples
Figure 1.3.
Figure 1.3.
The techniques of regression analysis discussed in this course are specially designed to study such dependence among variables.
3.2 Concepts
Statistical versus deterministic relationships
In regression analysis we are concerned with what is known as the statistical, not deterministic, dependence among variables. In statistical relationships among variables we essentially deal with random or stochastic variables, that is, variables that have probability distributions. In functional or deterministic dependency, on the other hand, we also deal with variables, but these variables are not random or stochastic.
Correlation versus causation
Correlation is the pure mathematical relationships between two variables. The judgment of whether or not to have correlation between two variables is only dependent on the data. Causation indicates the dependence of variables in behavior mechanism; the variable as the result is determined by the variable as the causation. The variables with causation are bound to have mathematical correlation, but not vice versa.
Regression versus causation
Although regression analysis deals with the dependence of one variable on other variables, it does not necessarily imply causation. In the words of Kendall and Stuart, āA statistical relationship, however strong and however suggestive, can never establish causal connection: our ideas of causation must come from outside statistics, ultimately from some theory or otherā.
Regression versus correlation
Closely related to but conceptually very much different from regression analysis is correlation analysis, where the primary objective is to measure the strength or degree of linear association between two variables. The correlation coefficient, which we shall study in detail in Chapter 3, measures this strength of (linear) association. In regression analysis, as already noted, we are not primarily interested in such a measure. Instead, we try to estimate or predict the average value of one variable on the basis of the fixed values of other variables.
Regression and correlation have some fundamental differences that are worth mentioning. In regression analysis there is an asymmetry in the way the dependent and explanatory variables are treated. The dependent variable is assumed to be statistical, random, or stochastic, that is, to have a probability distribution. The explanatory variables, on the other hand, are assumed to have fixed values (in repeated sampling). In correlation analysis, on the other hand, we treat any (two) variables symmetrically; there is no
Time Series Data
A time series is a set of observations on the values that a variable takes at different times.
Such data may be collected at regular time intervals, such as daily (e.g., stock prices, weather reports), weekly (e.g., money supply figures), monthly [e.g., the unemployment rate, the Consumer Price Index (CPI)], quarterly (e.g., GDP), annually (e.g., government budgets), quinquennially, that is, every 5 years (e.g., the census of manufactures), or decennially (e.g., the census of population).
The data shown in Table 1.2.1 are an example of time series data.
Although time series data are used heavily in econometric studies, they present special problems for econometricians such as autocorrelation, non-stationarity etc.
Table 1.4.
Cross-Section Data
Cross-section data are data on one or more variables collected at the same point in time, such as the census of population conducted by the Census Bureau every 10 years (the latest being in year 2000), the surveys of consumer expenditures conducted by the University of Michigan, and the opinion polls by Gallup and umpteen other organizations.
A concrete example of cross-sectional data is given in Table 1.4.1 This table gives data on egg production and egg prices for the 50 states in the union for 1990 and 1991. For each year the data on the 50 states are cross-sectional data. Thus, in Table 1.4.1 we have two cross-sectional samples.
Just as time series data create their own special problems, cross-sectional data too have their own problems, specifically the problem of heterogeneity.
Pooled Data
In pooled, or combined, data are elements of both time series and cross-section data.
The data in Table 1.4.1 are an example of pooled data.
Panel, Longitudinal, or Micropanel Data
This is a special type of pooled data in which the same cross-sectional unit (say, a family or a firm) is surveyed over time.
4.2 Categories of Variables
The variables that we will generally encounter fall into four broad categories: ratio scale, interval scale, ordinal scale, and nominal scale.
Ratio Scale
A ratio scale variable satisfies three properties. For a variable X, taking two values, X1 and X2, the ratio X1/X2 and the distance (X2 ā X1) are meaningful quantities. Also, there is a natural ordering (ascending or descending) of the values along the scale. Therefore, comparisons such as X2 ⤠X1 or X2 ā„ X1 are meaningful.
Most economic variables such as GDP, income, price, cost etc. belong to this category.