Science topic

Cointegration - Science topic

Explore the latest questions and answers in Cointegration, and find Cointegration experts.
Questions related to Cointegration
  • asked a question related to Cointegration
Question
3 answers
What are the pre-estimation tests or cautions for Dynamic Panel Data Analysis? For example, in Pooled OLS we run the Unit root Test, Cointegration Test, VIF test etc.
Relevant answer
Answer
the pre-estimation condition is the presence of endogeneity in the data set
  • asked a question related to Cointegration
Question
4 answers
Hello everyone
Relevant answer
Answer
Johansen test of cointegration is all about the determination of the existence of long run relationship amongst variables captured in an econometric model. The condition is that if there exist alleast one cointegration at both trace and max. eigen test. However if there exist atleast one cointegrating equation at at any of the test, we can accept the existence of a long run relationship
  • asked a question related to Cointegration
Question
3 answers
I have selected price series which they represent five markets {Geographically separated].
All these series are stationary at level.
to test the cointegration, can i apply Johanson Co integration.
I believe that I can apply because all the prices series are stationary at same order.
Relevant answer
Answer
No, Johansen cointegration is used when series are non-stationary at levels but stationary at first differences. If your price series are already stationary at levels, cointegration analysis is not applicable.
  • asked a question related to Cointegration
Question
3 answers
Dear all, does anyone has the estimation code for Xiao (2009) quantile cointegration test.
Relevant answer
Answer
Quantile cointegration estimates assess the relationship between time series at different quantiles rather than just the mean. This approach helps identify how cointegration varies across different points of the conditional distribution, offering a more nuanced view of long-term relationships in the presence of non-constant volatility or asymmetric effects.
  • asked a question related to Cointegration
Question
11 answers
Hello everyone, I am conducting
Relevant answer
Answer
Chuck A Arize The eigenvectors or equivalently the characteristic roots of the Companion Matrix of a VECM determine its "stability".
If the roots are outside or on the unit circle ( alternatively if the eigenvectors are inside or on the unit circle then the process is non-stationary but "stable". By stable I mean that it converges to an equilibrium after a shock.
If the roots are inside or on the unit circle ( alternatively if the eigenvectors are outside or on the unit circle then the process is explosive.
It is possible that there is a form of overshooting in one of the equations in a VECM (positive coefficient on the ecm). If the system is "stable" the other equations (including the "short-term" effects) may still restore the new equilibrium.
Your suggestion regarding "incorrect signs" appears on many internet forums. It is not correct.
  • asked a question related to Cointegration
Question
5 answers
My thesis includes variables of inflation rate (monthly), interest rate (weekly), exchange rate (daily), and crude oil production (monthly), as well as imports and exports (annually). The period is from 2018 to 2024. I will use many pre-tests such as unit root tests, cointegration tests, etc. Before starting, I am worried about the missing data. My missing data is in the exchange rate and interest rate. I need guidance. Since I want to use panel data, is interpolation useful for time series data? Will it provide accurate results? Please help and guide me. Thank you.
Relevant answer
Answer
To replace missing data in a time series using EViews, you can follow these steps:
1. **Open your time series data in EViews**: Load the data into EViews and make sure the variable with the missing data is selected.
2. **Identify and mark the missing values**: In EViews, you can identify missing values by looking for the "NA" or "?" symbols in the data. You can also use the "View" menu and select "View Data" to see the data in a spreadsheet-like format, where missing values will be highlighted.
3. **Choose a method to replace the missing values**: EViews provides several options to replace missing data, including:
- **Mean/Median Substitution**: Replace the missing values with the mean or median of the non-missing values.
- **Linear Interpolation**: Replace the missing values by linearly interpolating between the nearest non-missing values.
- **Time Series Modeling**: Use time series models, such as ARIMA or Exponential Smoothing, to estimate the missing values based on the patterns in the available data.
4. **Replace the missing values**: Once you've chosen a method, you can use the "Fill Missing" command in EViews to replace the missing value.
5. **Verify the results**: After replacing the missing values, review the data to ensure that the substitutions are appropriate and do not introduce significant biases or distortions.
  • asked a question related to Cointegration
Question
4 answers
Kao's panel cointegration tests , is there anyone willing to explain me the eviews-9 output for the Kao's panel cointegration tests?
Relevant answer
Answer
To apply the Kao Residual Cointegration Test in EViews:
  1. Open the workfile and create a group of the variables.
  2. Open the group window, go to View > Cointegration Test....
  3. Select Kao Residual Cointegration Test and configure settings.
  4. Run the test and interpret results.
  • asked a question related to Cointegration
Question
7 answers
I have estimated a VECM in Stata with rank =2 on five variables after confirming that all variables are I(1) and there are 2 CI rank using the Johansen test. Now I want to check CI relations stability using the recursive procedure proposed by Hansen and Johansen (1992, 1999). I do not know how to go about this in Stata or EViews or R.
Relevant answer
Answer
Yes, you can check the stability of the cointegration relation using the Hasza and Johansen recursive method in EViews.
  • asked a question related to Cointegration
Question
12 answers
I am working on a 21 sample size data time series, and after applying ADF test one of my variables are integrated of order 2 I(2), two variables are integrated of order 1 I(1) and two are integrated at level I(0). Which cointegration approach should I apply before testing for causality between the variables?
Relevant answer
Answer
When variables are integrated in different orders (e.g., some are integrated of order 1, I(1), and others are integrated of order 0, I(0)), the appropriate cointegration approach to use is the ARDL (Autoregressive Distributed Lag) model.
ARDL models can accommodate variables with different orders of integration by including both level and first-difference terms in the regression equation. This approach allows for testing cointegration among variables with mixed integration orders and provides robust results under various scenarios.
In summary, ARDL models are suitable when you have a mix of integrated and non-integrated variables and want to test for cointegration relationships among them.
  • asked a question related to Cointegration
Question
4 answers
I have used the ARDL model to model data over ten years. The results of the bounds test indicate that there is a cointegration relationship at the 5% and 10% levels. However, the ARDL and ECM results are not satisfactory. What is your suggestion for me? Should I change the model?
Relevant answer
Answer
please check cointegration test and if all your variables are cointegrated simply use VECM.
  • asked a question related to Cointegration
Question
5 answers
Hi everybody!
Please, I´d thank you to help me with a doubt regarding forecasting based on a VECM.
I estimated a VECM model with 3 variables, there is just one cointegrating relation, and there is no problem with this estimation.
Now I want to forecast, and for that goal I have (from information of an international organization) the predicted values of 2 of the 3 variables included in my VECM. Therefore, what I want now is to predict that one remaining variable, using the estimation of the VECM, and the given external forecasts of the 2 other variables.
I think there must be the way to do it, but I don´t realize about it yet. I´m using STATA.
Any suggetions about this?
In advance, thank you all so much!!
Relevant answer
Answer
You can forecast one variable of a Vector Error Correction Model (VECM) using external forecasts of the other two variables by incorporating these forecasts into the VECM equation and solving for the variable of interest.
  • asked a question related to Cointegration
Question
3 answers
Hi!
I want to use the ADL model for my data analysis. However, after performing a stationary test, dependent and 6/8 independent variables are stationary only in differences. The other two are stationary in levels.
Is the cointegration test always necessary?
If so, I found on the Internet that I can only use the Pesaran Bounds test because I have a mix of I(0) and I(1) variables. Is it true? I am not sure.
And how do you perform that test?
Thanks a lot for your suggestions.
Relevant answer
Answer
After performing a stationary test for your data analysis using the ADL model, you have found that the dependent variable and 6 out of 8 independent variables are stationary only in their differences, while the other two are stationary in levels. In this scenario, you can proceed with modeling your data using an Autoregressive Distributed Lag (ADL) model.
Adam Tomko Moges Mengstu Kassaw Srk Haqbin Benjamine Gaspar Miku The ADL model is suitable for situations where variables exhibit different stationarity properties, such as some being stationary in levels and others in differences.The ADL model allows for the inclusion of lagged values of both the dependent and independent variables, accommodating the mixed stationarity properties of your variables. By incorporating lagged values of the variables that are stationary in differences, you can capture the short-term dynamics and relationships in your data. At the same time, including the variables that are stationary in levels enables you to account for the long-term equilibrium relationships.
This approach aligns with the flexibility of the ADL model, which can handle variables with diverse stationarity characteristics, making it a suitable choice for your data analysis scenario. By appropriately specifying the model with the lagged terms of the variables based on their stationarity properties, you can effectively capture the dynamics and relationships within your dataset. Adam Tomko
  • asked a question related to Cointegration
Question
6 answers
I am working in panel data. Could anyone help me the stata command for Durbin-Hausman cointegration test, proposed by Westerlund (2008)? Because I got a mixed order of integration.
Relevant answer
Answer
there is option in the upper menu you can run from there no need for the command
  • asked a question related to Cointegration
Question
3 answers
The variables under study are of mixed orders of integration i.e., I(0) and I(1). I want to estimate the long-run relationship between variables therefore it is recommended to apply pooled mean group panel ARDL test. So, it is necessary to apply pedroni cointegration test before?
Relevant answer
Answer
Yes, it is possible to skip it.
  • asked a question related to Cointegration
Question
1 answer
1. My research involved 10 explanatory variables. After performing the CIPS panel unit root test, I found 4 variables stationary at level 1, 3 variables stationary at level I (1), and 2 variables stationary at level I (2). What should I do next? Do I perform a cointegration test?
2. I run both the Westerlund cointegration and Pedroni cointegration tests in Stata and EViews, but Stata shows "No more than six covariates can be specified." and with Eviews I can't run the test with more than 7 variables. Then what should I do?
Relevant answer
Answer
Since your dependent variable (CO2) is integrating at level (1) and all other variables are have a mixed order of integration, it is not advisable to use Pedroni and Westerlund cointegration tests. Because these two cointegration tests can only be used when DV is integrating at 1st difference and all other independent variables may be of I(0) or I(1) order. So, you can go for PMG-ARDL which is applicable in your case.
Second thing is that, some of your variables are integrating at 2nd difference. So, it better to drop those variables and use only use those variables that are integrating at most at I(1). You can refer to this article
  • asked a question related to Cointegration
Question
1 answer
I am running a moderation analysis and have 2 IVs 1 moderator and 5 control variables. I want to check cointegration through westerlund cointegration test as my model suffers from cross section dependence. But every time I run, Stata give error i.e. variables should not exceed 7.
Kindly suggest the remedy
Relevant answer
Answer
Do you find any solutions? My model has 11 explanatory variables, I also faced same situation.
  • asked a question related to Cointegration
Question
1 answer
I want to run Westerlund and Edgerton 2008 cointegration test with structural breaks.
Relevant answer
Answer
I Do not have, but you can use a free version of GAUSS available for students called GAUSS Light 1. It has a maximum matrix size of 10,000 elements, no multi-threading support, and no debugger. You can purchase pre-written, customizable GAUSS Application Modules separately for an additional fee.
  • asked a question related to Cointegration
Question
5 answers
If the data has passed stationary tests and does not exhibit unit root problems, then the assumptions of many time series models have been met. In such cases, there might not be a need to test for cointegration, as cointegration is often explored when dealing with non-stationary time series. Am I wrong?
Relevant answer
Answer
Cointegration analysis requires data to be integrated of at least order one (be nonstationary). If data are stationary (integrated of order zero) there is no need to apply cointegration analysis because the data cannot be cointegrated. This is because cointegration means that the linear combination of variables has an order of integration below that of the variables themselves. For example, if Y~I(0) and X~I(0) their linear combination cannot have an order of integration below 0 and they therefore cannot be cointegrated. Nevertheless, the model involving only stationary variables may still exhibit autocorrelation and need a dynamic specification (including lags of variables) that has a static long run solution (essentially obtained by removing time subscripts). However, this long run (equilibrium) solution will not be a cointegrating equation.
  • asked a question related to Cointegration
Question
1 answer
dear sir/ma'am,
I face a problem in testing the data while using stata. i can't fine the commands for CSARDL and Westerlund panel cointegration test. kindly tell me the commands of these tests and provide a solution for following problem i'm facing.
Relevant answer
Answer
Try this;
xtcointtest westerlund Y X1 X2 X3
xtcointtest westerlund Y X1 X2 X3 , trend
  • asked a question related to Cointegration
Question
11 answers
My variables are:
1. exchange rate i(1) (nominal exchange rate)
2. interest rate i(3) (2 year bond yields)
but they are not cointegrated. Any suggestions are welcome, even if I have to use different data to represent my variables.
Relevant answer
Answer
A time series is I(d) if it is
  1. non-stationary
  2. becomes stationary when differenced d times
  3. is not stationary when differenced d-1 times
Two I(d) time series Xt and Yt are cointegrated C(d, r) if there exists a linear combination of Xt and Yt such that Zt = Xt + a Yt is I(r) where r<d.
Two I(1) time series Xt and Yt are cointegrated if a linear combination of Xt and Yt exists, such that Zt = Xt + a Yt is I(0) stationary.
Note that a non-stationary time series is not necessarily integrated of any order. For example, it is often found that real GDP is nonstationary. Log(real GDP) is also nonstationary. The first difference of Log(real GDP) is usually found to be stationary and therefore Log(real GDP) is I(1). If this is so real GDP is not integrated of any order. However, applying unit root tests may imply that it is integrated of high order. Such a result is spurious.
An I(3) variable can not be an explanatory variable for an I(1). An I(1) variable can not be an explanatory variable for an I(3) variable. An I(3) variable can not be cointegrated with an I(1) variable.
If these points are not clear to you and you wish to work with such series you should attend a set of relevant time series lectures or consult a time series book such as Enders (2014), Applied Econometric Time Series, Wiley.
I would think that economic theory and common sense might imply that the log of the exchange rate might be I(1). (Asset pricing theory ?) I can not think of any reason why your interest rate is I(3). (I would not use the log of the interest rate?). Your result is almost certainly spurious. Are there any institutional or other factors that might cause discontinuities in your series? Is the 2-year bond market deep?
There are a lot of variables missing in your analysis. I suspect that the difference between the domestic and foreign bond rates may be important. The differences between domestic and foreign inflation may also be important. There may also be other domestic or external factors that may be important. Consider the following example. If foreign bond rates increase and domestic rates match them there should be no pressure on the exchange rate.
  • asked a question related to Cointegration
Question
4 answers
Hello everyone. I am currently doing my thesis on 'The impact of trade openness on economic growth: evidence from china' . I have 1 dependent variable and 5 independent variables. All the variables are I(1) so I have proceeded with Johansen integration test which revealed 5 cointegrating equations. I would like some guidance on how to proceed. There are almost no studies where 5 cointegrating equations are found. Most find 1 cointegrating equation. Please find attached my raw data. All variables except population growth were log transformed before performing the tests.
Relevant answer
Answer
When there are multiple cointegrating equations, you can use vector error correction models (VECMs) to analyze the relationship between the variables. VECMs include both short-run and long-run dynamics and can be used to analyze the relationship between multiple variables1
  • asked a question related to Cointegration
Question
9 answers
I am a final year student and I would be glad to get as many helps as I could. I have utlized a lot of Youtube vidoes to learn about the ARDL model estimation but I am now confused. This is because different videos provided different approaches, so I am not longer sure of the step by step to doing it.
For instance, Crunch econometrix said to estimate the ARDL model, then afterwards, do the bounds test to check for cointegration. If cointegration exists, then I would need to estimate an ECM model using least squares by manually entering the equation that would contain the ECM into Eviews. On the other hand,if cointegration doesn't exist, then a short-run ARDL using the differenced variables would be estimated using Least squares( I also would manually enter the equation into eviews, but differenced and with lags). I don't know how correct this procedure is especially since other videos I watched didn't have this.
But I am bothered given the fact that nothing is said of the coefficients obtained for the variables and their lags after estimating the first ARDL (I think it is also called the unrestricted error correction model, right?), especially as regards how to interpret them or what their relevant is to the study. I use EViews 12 which provides the coefficient diagnostics. The "Long run form and bounds test" provides the conditional error correction model. what are the relevance of the estimates provides for the coefficients and their lags and how do you interpret them?
Then there is the "error correction form" feauture that provides the ECM regression,which also reveals some estimations for the difference of the variables (with only lags). What is the relevance of these estimates and how do you interpret them?
Relevant answer
Answer
To run an ARDL model from start to finish, you can follow these steps:
  1. Collect the data you need for your model.
  2. Open the Eviews program.
  3. Load the data you want to use for your model.
  4. Click on the “Quick” menu and select “Estimate Equation”.
  5. In the Equation Specification window, select the variables you want to include in your model.
  6. Click on the “ARDL” button to specify that you want to run an ARDL model.
  7. Specify the order of integration for each variable in your model.
  8. Click on the “OK” button to run the model.
In summary, to run an ARDL model from start to finish, you can collect the data you need for your model, open the Eviews program, load the data you want to use for your model, click on the “Quick” menu and select “Estimate Equation”, select the variables you want to include in your model, click on the “ARDL” button to specify that you want to run an ARDL model, specify the order of integration for each variable in your model, and click on the “OK” button to run the model.
  • asked a question related to Cointegration
Question
4 answers
I have weekly price data of pineapple for the months of June and July for 12 years. I want to know the spatial market integration. Please advise
Relevant answer
Answer
Yes, it is possible to do cointegration analysis for agricultural commodities that are available only for 2/3 months in a year. Cointegration is a statistical method used to test the long-term relationship between two or more time series1. It is commonly used in finance and economics to analyze the relationship between commodity prices and other variables such as supply and demand, inflation rates, exchange rates, etc.
In fact, a study conducted by Allen et al. confirmed the cointegration relationships among agricultural commodities (including corn, wheat, and sugar), ETC
  • asked a question related to Cointegration
Question
6 answers
We know that correlation and cointegration are two different things. The question that I want to put here and share with you is whether this is true even when we consider the wavelet concept?
Relevant answer
Answer
It is merely a scale-localized version of the usual cross-correlation between two signals. In cross-correlation, you determine the similarity between two sequences by shifting one relative to the other, multiplying the shifted sequences element by element, and summing the result.
Wavelet cannot be used to test for cointegration
  • asked a question related to Cointegration
Question
7 answers
Hi All,
I am working with time series data which has a dependent variable I(2) { Stationary at second order difference} and other independent variables are stationary at I(1).
Also, there exists cointegration between these variables.
I want to know whether I can run the VECM process with I(2) Dependent variable and I(1) Independent variables.?
Please advise possible options that I can follow.
Thank you.
Relevant answer
Answer
Thank you for your responses. I reduced my sample data points and tried again. Because my dataset included data from the covid 19 period. So I removed covid 19 affected period from my dataset and run the unit root again. Then all the log forms of variables were stationary at level 1 (I1).
Thank you both of you very much.
Cheers!!!
  • asked a question related to Cointegration
Question
2 answers
In many articles I saw that ARDL model can be used when there is only 1 cointegration relationship. Therefore, to check the number of cointegration relationships I used Johansen cointegration test and found there is only 1 cointegration relationship. But in theory, ARDL model uses bound F statistic test to check whether there are cointegration relationships exist. How do we identify number of cointegration relationships using bound test. Is it unnecessary to use bound test if I already used the Johansen cointegration test?
Relevant answer
Answer
The ARDL bounds test assumes at most one long run levels relationship. It does not test for multiple long run relations. The Johansen test can test for multiple cointegrating relationships. The difference is that the Johansen procedure assumes all levels variables are I(1) whereas the ARDL procedure assumes the dependent levels variable is I(1) and the independent variables can be I(1) or I(0). Using both tests may be useful if there is uncertainty over the orders of integration of the levels variables and the number of long run relations. If you are certain all levels variables are I(1) you only need to use Johansen. If you are certain there is one long run levels relationship and uncertain over the independent variables' orders of integration you only need to use the ARDL test.
  • asked a question related to Cointegration
Question
7 answers
Does anyone know if Westerlund's test for panel cointegration is available in R?
Thanks,
Aurélien
Relevant answer
Answer
I am using xtwest stata command it has limitation can not use more than 6 variables, in a model.
  • asked a question related to Cointegration
Question
7 answers
If all my dependent and independent variables are stationary at first difference NOT at level, SHOULD I REPLACE (ardl y x1 x2) with (ardl d.y d.x1 d.x2)? when there is no cointegration and I want to estimate a short-run ardl model or it doesn't matter?, since my diagnostic tests are good enough to rely on when running (ardl y x1 x2)OR these diagnostic tests are invalid as well?
Thanks in advance.
  • asked a question related to Cointegration
Question
3 answers
I want to check how important some variables are when they make a cointegration. But I did not know if we can know what the cointegration vector exactly is. And can we get more information from the vector?
Relevant answer
Answer
A vector of I(1) variables yt is said to be cointegrated if there exist at vector βi such that βiyt is trend stationary. If there exist r such linearly independent vectors βi,i= 1,...,r, then yt is said to be cointegrated with cointegrating rank r.
  • asked a question related to Cointegration
Question
6 answers
Dear colleagues,
I have a query related to the commands of continuously updated full modified (CUP-FM) and continuously updated bias-corrected (CUP-BC) estimators in the Stata.
Thank you.
Relevant answer
Answer
I am looking for the stata commands for CUP-FM and CUP-BC estimation like you. Have you had them? If so, can you share with me?
Thank you
Zubair Ashraf
  • asked a question related to Cointegration
Question
3 answers
I have few time series which are cointegrated. First I differenced the non-stationary series and fitted VAR after making them stationary. Later I fitted vector error correction model separately and obtained the long run and short run relationship. As I know taking difference removes the long run relationship of variables. So does that mean I can use VAR to explain the short run relationship even the series are cointegrated? If so can I compare the results of VAR model and short run results of VECM model?
Relevant answer
Answer
The comparison between the two models may provide insights into how the variables respond to short-term shocks and how they adjust back to their long-run equilibrium relationships I think.
  • asked a question related to Cointegration
Question
4 answers
Dears,
I have two variables and all of them are integrated at order zero I(0) meaning they are stationary. Yet i want to apply VAR model, can i do that? if yes, could you please provide some reference? Also, Can it happen that johansen Cointegration test says my variable are ciontegrated wile they are stationary at level? Based on the results of johansen cointegration test can I apply VECM?
Kind regards,
Relevant answer
Answer
Hello Adam Roble ,
As your variables are stationary at level I(0), then you should go for VAR model directly after selecting the lag length. VECM is not ok for your dataset.
You can use Johansen cointegration test when all variables are stationary at first difference I(1). When Johansen test result shows cointegration among variables only then you can run VECM model.
And you can also conduct Granger Causality test to find out if the lag values of independent variables affect the endogenous variable or not.
  • asked a question related to Cointegration
Question
2 answers
some of my variables are integrated at 1st difference and some are 2nd difference. which method should I accept? ARDL or Cointegration or VECM. Please answer.
Relevant answer
Answer
If series are of same order of integration, we use cointegration otherwise ARDL. Second difference is rare. According to many econometricians, 2nd difference is to be avoided.
So far your variables are concerned, 3 rates (percentages), 1 index (ratio), Govt effectiveness (dummy or berometric var) and 2 $ values (GDP/Capita and remittance).
Ratios are done max at first difference, you better look into the data.
take natural log (ln) of the $ values, it will serve the purpose.
  • asked a question related to Cointegration
Question
4 answers
I am looking for the 2nd Generation panel cointegration R code. particularly Westerlund (2007). Any idea ?
Relevant answer
Answer
Joao Carlos Souza Marques There is nothing about 2nd generation panel cointegration mentioned.
  • asked a question related to Cointegration
Question
4 answers
I hope you all are doing good. My question is regarding long-run estimation. I am informed that when Johansen results show more than one cointegrating equations, we cannot use FMOLS as technique to estimate long-run relationship. I would, therefore, like to know what estimation procedure/model we should follow when we have more than one cointegrating equation. Thank you.
Relevant answer
Answer
This is a difficult problem. I will try an explain some of the points using a 4-variable system with 2 cointegrating vectors.
1. Say you have 4 variables (x1, x2, x3 and x4) and the Johansen test imp[lies that you have 2 cointegrating relationships.
2. The Johansen methodology then estimates two cointegrating vectors which I will write as
ecm1 = a11 x1 + a12 x2 + a13 x3 + a14 x4, and
ecm2 = a21 x1 + a22 x2 + a23 x3 + a24 x4.
The problem is that any two independent combinations of these two ecm's is also a set of cointegrating vectors. The problem is that econometrics can not choose the appropriate two ecm's from the infinity of ecm's available.
3. At this stage you need to understand the concept of identification. If you are not familiar with identification read the section in your econometrics textbook on identification in simultaneous equations.
4. An example of identification in the ecm's at 2 above would be to specify in ecm 1 that the coefficient of x4 be zero, and the coefficient of x1 be 1. In ecm 2 you might insist that the coefficient of x1 be zero and that of x2 be 1. This gives you two ecm's
ecm12 = x1 +b12x2 + b13 x3
ecm22 = x2 +b22x2 + b24 x4
The identification restrictions are obtained by economic argument and do not require any tests. Your software manual should explain how to impose the restrictions on the system.
5. You can now test what is known as overidentifying restrictions (for example that b24 = 0 if this is hypothesized by your economic theory).
The Johansen test and estimation methodology is based on some very complicated linear algebra and gives exact algebraic solutions to the maximum likelihood problem. I would generally recommend Juselius (2006), The Cointegrated VAR Model: Methodology and Applications, OUP and Kilian & Lütkepohl (2017), Structural Vector Autoregressive Analysis, OUP as covering a lot of this material. These are graduate-level texts. If you master this you will have made great progress.
It is also possible to estimate models such as these using numerical methods to solve the maximum likelihood problem rather than the exact solutions proposed by Johansen. I presume that this is the MECM method proposed by Othmane Lamzihri. There is coverage of this methodology in Martin, Hurn & Harris Econometric Modelling with Time Series, CUP. This is an easier approach to VECM and MECM estimation but it is not as comprehensive. It contains a lot of sample programs.
Both methodologies should give the same results when used on the same specification. The Johansen methodology requires fewer computer resources. The Maximum likelihood Method requires some programming. However, It may be easier to implement some model restrictions in MECM.
  • asked a question related to Cointegration
Question
4 answers
After conducting ARDL and bound Test using Eviews 10. the variables were not cointegrated since the F-statistic was smaller than I(0), However, and looking at the long run relationship, the variables were statistically significant. In this case, what decision must be taken?
Relevant answer
Answer
You say
looking at the long-run relationship, the variables were statistically significant.
How do you come to this conclusion? I suspect that Eviews has calculated t-statistics for these variables. These t-statistics do not have a standard t-distribution. P-values for these statistics calculated in this way are not valid indicators of significance.
My reading of your result is that there is no cointegration. Perhaps you should consider your underlying model
  • asked a question related to Cointegration
Question
4 answers
Can I run the ARDL method even if my residuals do not follow normal distribution. I have already taken the log of the variables and they follow normal distribution but not when I estimate the ARDL model?
Relevant answer
Answer
In small samples the distribution of the residuals of a model need to be normally distributed for hypothesis tests on the coefficients to follow the standard distributions. In large samples the Central Limit Theorem applies and the coefficients follow conventional distributions despite the residuals not following a normal distribution. However, the tests for a long run levels relationship in an ARDL model are nonstandard. Nevertheless, my understanding is that in large samples the ARDL model can have nonnormally distributed residuals and the simulated nonstandard critical values will be valid. However, this will not be the case in small samples. Others may have more information on whether the nonstandard critical values are valid in large samples when the ARDL model's residuals are non-normal.
  • asked a question related to Cointegration
Question
2 answers
Using E-Views 9, I ran the ARDL test, resulting in an R-Squared value in the initial ARDL model output and an R-Squared value under the Bounds test. so, what is the difference between these two R squared values?
Relevant answer
Answer
The difference between R squared in the ARDL model and R squared in the bounds test is that R squared in the ARDL model is about the rubostness of the overall model result and not about the robustness of the bounds test result only. This means R squared in the ARDL model will tell you how much the variability in Y or Ys (your dependent variable/variables) is/are explained by the ARDL model given Xs variables (your independent variables). On the other hand, R squared in the bounds test is specifically about the bounds test result within the ARDL model and not about the robustness of the overall model result. You need an ARDL model first to have a bounds test but not a bounds test to have the ARDL model. I hope it helps you.
  • asked a question related to Cointegration
Question
12 answers
Dear colleagues,
I read about the lag length criteria, but I'm still unsure if I understand it well because I have different results.
How to decide which lag length is optimal? I applied it to my 29 years' GDP and energy consumption dataset using 4, 8, and 9 lags, and I got different results on different selections.
Is there a general rule of the number I should begin the analysis with?
Many thanks
Relevant answer
Answer
I presume that you have real GDP and your energy consumption is in volume terms and that you are working with the log of both variables.
I would suggest that you go to your textbook and see if you can replicate the results of a lag selection example in the test book. It is difficult to give you a diagnosis without your data and your program. Otherwise, try a different econometric package (e.g. https://gretl.sourceforge.net/ which is free and user-friendly).
Treating 4 as maximum lags your criteria must be estimated using observations 5 to 29 as independent and 1 to 4 as lagged variables only, Treating 8 as maximum lags your criteria must be estimated using observations 9 to 29 as independent and 1 to 8 as lagged variables only, etc.
In ADF tests the maximum lag is often taken as
[the integer part of 12*(T/100)^0.25] which is 8 for 29 observations. See Hayashi, Econometrics page 592 and references therein. Many econometric packages used to use this as a maximum lag for determining lag length in this and other occasions. I would hope that with 29 observations it would specify a smaller number of lags as appropriate.
How can you justify economically that eight lags could possibly be valid? to add an extra lag to carry out this test. If you do not have cointegration then you use standard Granger tests and VAR in first differences. You will need strong economic arguments to justify the assumption that there are no other variables that are impacting both GDP and energy consumption. If there are such variables your test will not be valid.
How can you justify economically that eight lags could possibly be valid? There is something wrong with your model. The eight lags may be adjusting for autocorrelation caused by misspecification or an omitted variable. Unless there is something wrong with your calculations you need to rethink your model.
  • asked a question related to Cointegration
Question
3 answers
Hi! I have a number of variables which are non-stationary in the long-run (1900-2016), but stationary during some sub-sets of that period (1900-1930, 1946-1973, 1980-2016). Can I use cointegration tests with those variables (target dependent variable is always non-stationary) on the sub-sets (shorter periods)?
  • asked a question related to Cointegration
Question
1 answer
I have 30 days observations on the fluctuations of airline prices of various departure days. Can I use cointegration tests, (mainly used to check the cointegration of long run effects) to establish the co-integrated movement evident from time series plot
Relevant answer
Answer
Yes, you can use it.
  • asked a question related to Cointegration
Question
2 answers
Good day Researchers!
I am running a Panel data and I just finished with the unit root tests using Fisher(ADF and PP) and LLC. My dependent variables are not stationary at level but stationary at first difference for the Fisher, while in LLC one of the dependent variable is stationary at level and difference.
While my independent variables results is mixed. Some are stationary at first difference, while some are stationary at both but one of the independent variable is not stationary at both level and difference.
Should I still run a Cointegration test?
Relevant answer
Answer
co integration analysis it is important to do unit root analysis to identify the stationary and order of integration. Using Augmented Dickey Fuller tests and KPSS Test. The results reveal that all the series are non stationary at levels as test statistics are greater than the critical values. However by taking the first
difference all the series are found to be non-stationary. then by taking the second order differential all series are found be Stationary. Thus the null hypothesis is rejected and the log values of future, spot prices are found to be stationary at Second difference and are integrated in the order of two.
  • asked a question related to Cointegration
Question
4 answers
I have panel data (T=10, N=26) where all variables are integrated I(1) with cross-section dependence. I applied Westerlund test and found no cointegration. So I proceeded with Pvar (Panel var) estimation. However, I want to confirm the robustness of my analysis by applying another estimation technique. Any advice?
Relevant answer
Answer
You can try AMG and CCMG for robustness.
  • asked a question related to Cointegration
Question
4 answers
I understand that if they are cointegrated, the results of any regressions in between them won't be spurious, therefore you can run the Granger test in levels. Am I wrong?
Relevant answer
Answer
Yes. You may run Granger causality test at level to find out the causal relationship.
  • asked a question related to Cointegration
Question
3 answers
Hi everyone,
I am running ARDL equations for the connection between imports (dependent variable), domestic growth and the real exchange rate (explaining variables) of several countries. Now, to test for cointegration, ARDL requires the bounds test. However, I am asking myself if the following is possible: run the ARDL, then run it in its ECM form. If the error correction coefficient is significantly negative (between -1 and 0), we get confirmation that the ARDL is cointegrated and fine. Hence, it would not be a test for cointegration before running any regression but running the ECM to test for cointegration. Is this valid?
Relevant answer
Answer
Thanks for these links!
  • asked a question related to Cointegration
Question
3 answers
Currently I am studying VAR methodologies in the hope of constructing a model for a future project, and have a fairly limited understanding of the necessary criteria to be met to generate robust results. My readings of recent literature make few mentions of residual diagnostics, specifically the joint normality of the residuals.
I have found through trial and error that a small number of exogenous spike ( blip ) dummy variables at key dates such as financial crises or policy changes, through visual inspection of residuals, have corrected the non-normality issue, I have found almost no evidence of similar studies doing the same, which leads me to wonder whether such measures are in fact misspecifications.
Tests for co-integration, in my case the Johansen test, give warnings (Eviews 12) against adding any exogenous variables, so as not to invalidate critical values. However, lag selection criteria for a VAR model differs when correcting residual non-normality with exogenous dummies. My current understanding of creating a VEC model is that, as a preliminary measure, both lag length selection and co-integration tests are performed on the model in levels, assuming all series are I(1) processes. Therefore, my question is whether one should:
1) Perform a cointegration test without dummies and select the lag length with dummies.
2) Abandon the dummies altogether, and subsequently violate the normality assumption.
3) Perform both Lag length selection and co-integration testing on the VAR in levels, then add dummies to the VECM.
My intention is to follow the common empirical approach in analysing both IFR and VDC of the VECM model (assuming there is cointegration), should this have any bearing on the matter. My understanding is that normality impacts only the validity of hypothesis testing however, in my reading, I have found no evidence to suggest that IRF and VDC standard errors are robust to non-normality.
many thanks,
Andrew Slaven
(Undergraduate Student at Aberystwyth University)
Relevant answer
Answer
I don't think that there is an easy answer to your question. The problems you discuss are covered in Juselius (2006), The Cointegrated VAR Model, Oxford. When I last used Eviews their Johansen routines would not have covered all the routines in this book. That was some time ago and things may have changed. Cats in Rats (estima.com) or the equivalent in Oxmetrics were better. For a more modern survey, you might look at Killian and LUtkepohl, Structural Vector Autoregressive Analysis, Oxford. These are graduate-level tests and are not easygoing.
  • asked a question related to Cointegration
Question
3 answers
Quantile cointegration test
Relevant answer
Answer
You can try Kuriyama (2016) cu sum quantile cointegration test.. Check the link for supplementary materials
  • asked a question related to Cointegration
Question
2 answers
how to determine the number of cointegrating equations from the bound test of ardl?
Relevant answer
Answer
Dear Baylie,
Interesting query. I have always utilized ARDL bound test to assess presence/absence of cointegration relationships among the identified variables (single cointegration).
  • asked a question related to Cointegration
Question
10 answers
I am using 3-time series data and after testing for unit root for each variable, the results show that at a level all of them are non-stationary the first difference leads to stationary. Therefore, three variables are integrated in order (1). So, I applied the cointegration test, the results I got for both the Trace and Max tests are below:
  • None: fail to reject the null hypothesis
  • at most 1: fail to reject the null hypothesis
  • at most 2: reject the null hypothesis
How can I make sense of these? I understood that in the "none" case, there is no cointegration. However, I am still confused at the "at most 1" and "at most 2" cases. Please help me with this. Thank you!
Relevant answer
Answer
Ateeb Akhter Shah Syed This is a tough question. First, you must understand the concepts of identification in a structural economic system of equations. This is covered well in many introductory and intermediate textbooks. Before you do any unit root tests or cointegration analysis you should consider what your theory and common sense say about the integration state of your variables and ask what cointegration equations you expect to find in your analysis. You then look to see how you can identify the system using theory and common sense.
When you have completed your examination of the economics of the system You can then use the Johansen methodology to estimate and test your model. Now say your econometric analysis implies that there is more than one cointegration vector. The trouble with Johansen is that it only estimates the cointegration space of the cointegration vectors. For example, if you have three cointegrating vectors then any three independent linear combinations of the original cointegrating vectors are also valid cointegration vectors. You must use your economic theory from your preliminary analysis of the model to select three unique cointegrating vectors. For a brief explanation have a look at Martin, Hurn, and Harris (2013), Econometric Modelling with Time Series, Cambridge page 682.
Some but not all time-series packages have the facilities to manage a lot of these calculations. CATS in RATS (https://www.estima.com/) or CATS in Oxmetrics. These are based on the Juselius book. Check your own software and see how far it goes.
If you need to do advanced work in this field you will need to understand what is in the Juselius book and in recent research in this field (see Killian and Lutkepohl (2017), Structural Vector Autoregressive Analysis, Cambridge). If you need to do less, consult your tutor, supervisor, or lecturer to see what he thinks is appropriate for your existing knowledge of economics, econometrics, and computing. This is a difficult topic and you could spend a lot of time learning how to do it.
  • asked a question related to Cointegration
Question
5 answers
Information provided:
1) There is no cointegration between both the variables therefore VECM is no longer applicable.
ii) The I(2) variable consists of zero therefore log differencing is ruled out too.
Relevant answer
Answer
Dear Waseem Baig,
You may explore ARDL and non-linear cointegration techniques.
  • asked a question related to Cointegration
Question
3 answers
I am examining 4 time series. I have evaluated these using a cointegration test. Unfortunately, I lack the technical understanding to interpret these results correctly.
The results can be found on sheet 4 "Test auf Kointegration"
I would be very grateful if someone could help me!
Relevant answer
Answer
Sorry for the typo- the corrected version
Cointegration is primarily used to identify possible causality among the variables. It rules out spurious relationships. Direction and strength can be evaluated from the coefficient estimation model.
  • asked a question related to Cointegration
Question
2 answers
I am using a GARCH-DCC MODEL to examine the co-movements of the price variation between the financial markets and the commodity markets. The model output gives me a sum of the ARCH and GARCH coefficients bigger than 1 that would mean that the variance is exponentially increasing over time. Not a desirable phenomenon. i'm getting this in a DCC-GARCH(1,1) but i also get this when i try higher orders. My variables are cointegrated so I made a VECM model and then used the residuals to estimate the DCC-GARCH MODEL.
I have daily data with 700 observations, you can see my output bellow.
What can i do to fix this?
PS: the problem isn't the size of my sample, i also get this result when i use a big sample
Relevant answer
Answer
Hi, thank you for replying.
I thought that could be the problem but when i test the model with my full sample with 4165 observations i get the same problem. I attached the file bellow with the full sample
  • asked a question related to Cointegration
Question
5 answers
I am using an ARDL model however I am having some difficulties interpreting the results. I found out that there is a cointegration in the long run. I provided pictures below.
Relevant answer
Answer
Mr a. D.
The ECT(-1)os always the lagged value of your dependent variable.
Regards
  • asked a question related to Cointegration
Question
3 answers
I use the bitcoin price and other block chains data to do the Johansen Cointegration test. The result show that they have two Cointegration. Then, I construct the VECM, but I don't know how to interpret the results. What is the "ect2" means?
  • asked a question related to Cointegration
Question
6 answers
Consider a cointegration model with two independent variables. I am interested in measuring the substitution effect of one independent variable over the other independent variable?
Relevant answer
Answer
Dear Brajesh ji
please see this article Augmented Reality is Eating the Real-world! The substitution...hope it helps
  • asked a question related to Cointegration
Question
3 answers
Hello Researchers, I have a set of 5 variables with log of 4 variables and a bank Rate in % for 49 data points. All the data points are I(1) integrated. I have been trying to get the Max Lag length and Rank on Stata. So far, I have tried (lag, rank) (1,1) (2,2) (4,2) (5,4) and (6,3) combinations. And in each of these results the coefficient in the cointegrating equation of short run comes out as non negative.
I have found all the optimal points in terms of lag and rank and yet the conclusion is not strong.
Any other method that needs to be taken forward to model my dataset? Would ARDL Help?
Relevant answer
Answer
You can go to the Robust approach, you may use ARDL as well as VECM and accept the model which provides more accurate results.
  • asked a question related to Cointegration
Question
6 answers
Is cointegration among predictors with a binary dependent variable a meaningful alternative to demonstrating stationarity of the predictors
Relevant answer
Answer
Emmanuel Gabreyohannes Can you provide a literature justification that the predictors need not be stationary in logit and probit models?
  • asked a question related to Cointegration
Question
5 answers
Hi everyone...my question is that ...
How to put one variable as dependent variable in Johansen cointegration test, if all variables in the model can be endogenous. And how to identify the cointegrating relationship among them, because if I have (for example) 5 variables in the model and I am putting each variable as a dependent variable then each will give a different cointegrating relationship..so how to identify the exact relationship among these 5 variables and how to select the dependent variable.
Relevant answer
Answer
Dear Devesh ji,
Selection of DV and resulting model is primarily a researcher's choice which is subject to following :
#You get a negative ECM (in your case)
#The necessary prerequisites are fulfilled.
#The short run and long run makes sense to what you are trying to convey through your research.
#Validate the underlying relationships of DV and IVs through different modeling techniques.
With best regards
  • asked a question related to Cointegration
Question
6 answers
I want to check the causality among variables, and also wanted to proceed with VECM. As per my knowledge, the steps to apply VECM is as follow
1. Check stationary (variables must he integrated of order 1)
2. Check cointegration (Johenson if more than 2 variables, Engle-Granger if 2 variables)
3. If cointegration exist, go for VECM (also known as RESTRICTED VAR), If do not exist, go for VAR (UNRESTRICTED VAR), we can also check the SR granger causalty if cointegration doesn't exist
But
What's about the criteria of granger causality. As per my knowledge, to apply granger causalty, variables must be stationary.
So, the question is
1. Can we apply granger causality if variables are stationary at first difference (e.g., 1(1)) or at mix order (e.g., I (1), I (1)). Or we can apply it only if all the variables are stationary at level?
Relevant answer
Answer
First some comments on your statements on VECN
As per my knowledge, the steps to apply VECM is as follow
1. Check stationary (variables must he integrated of order 1)
The Johansen methodology can be extended to include I(0) and I(2) variables. This is not covered in introductory textbooks or in many econometric packages. See Juselius (2006) The cointegrated VAR model for details.
2. Check cointegration (Johenson if more than 2 variables, Engle-Granger if 2 variables)
Johansen can be used if there are one or more cointegrating vectors. Engle-Granger can be used when there is only one cointegrating vector. Engle-Granger can be used when there are more than two variables. Johansen may be better if you have two variables. If one of the variables is not weakly exogenous a single equation will miss part of the adjustment process.
Before you do any VECM econometrics it is important that you have an econometric model that includes indications of the degree of integration of your variables and indicates the nature of any cointegration relationships that you might find. You will need this to enable the identification of your cointegration relationships. Applying VECM recipes to a data set and then trying to provide economic explanations of your result will usually lead to spurious results.
If your data are I(0) Granger causality can be tested with the standard VAR methodology
If your data are I(1) I would recommend that you consult Lutkepohl (2003), New Introduction to Multiple Time Series Analysis, Springer, pages 316-321. Granger causality is also applicable but is a little more complicated.
Be warned also that the t-statistic on an error correction term does not have a t-distribution, To test that such a t statistic is significant you can construct a Likelihood ratio test comparing the likelihood of the unrestricted model (relevant ecm terms included) with that of the unrestricted model (relevant ecm terms excluded). Significant ecm terms imply Granger causality through the error correction term.
  • asked a question related to Cointegration
Question
7 answers
I have 5 variables (1 dependent variable and 4 independent variables), and the 4 is stationary at level and only 1 is stationary at first different. However, I'm required to conduct the Granger Causality test. Is there any alternative test or solution that can solve my issue?
Thanks
Relevant answer
Answer
Without knowing the nature of your variables and economic model it is difficult to answer your question. the following points should help
  1. If it suits your underlying economics I would consider differencing the 1 I(1) variable. then you have 5 I(0) variables that could be used in a VAR and then you can do the required Granger causality test on the VAR.
  2. In some cases, it might be better to integrate some or all of the I(0) variables so that they become I(1) and use a cointegration analysis. (For example equity index returns (first difference of the logs of are often I(0) integrating them -reversing the first differencing - gives the log of the equity index price which will be I(1)). If there is cointegration you can then run Granger Causality tests on the resulting VECM
  3. If your dependent variable is I(0) your explanatory variables must also be I(0). The explanatory variables can be I(0) variables, first differences of I(1) variables (stationary) or cointegrated combinations of I(1) variables which are cointegrated
  4. If your dependent variable is I(1) the explanatory variable(s) must be I(1) and cointegrated with the dependent variable. Otherwise, the regression is spurious.
  • asked a question related to Cointegration
Question
9 answers
Hello,
My question is about the ARDL model: If we have checked the presence of all assumptions to apply an ARDL model (such as integration order mix of I(0) and I(1), one cointegration relationship,... etc.) when it comes to fitting the model, we must use the stationary time series or the original time series?
Regards
Relevant answer
Answer
As long as the series are not I(2) you can use them in the ARDL model, this means you can employ I(0) or I(I) orcombination of the two.
best,
G K Zestos
  • asked a question related to Cointegration
Question
6 answers
Dear researchers,
I am interested in studying the long-run relationship between the panel datasets. However, most of the empirical works used the PMG/ARDL technique to test the long-run parameters of the variables and the error correction term of the model. But they failed to confirm the long-run relationship by using appropriate panel cointegration techniques (Eg. Padroni/ Kao test). In the meantime, some studies employed the panel cointegration test before applying the PMG/ARDL technique. They argue that the PMG/ARDL technique is only used to resolve the heterogeneous problem. Thus, I want to know that panel cointegration is needed to apply before PMG/ARDL technique.
Relevant answer
Answer
Dear Ahamed.
For a robust result, you should carry out a cointegration test to verify if the variables are cointegrated before moving on to PMG/ARDL techniques.
Regards
  • asked a question related to Cointegration
Question
4 answers
In my model, I had four I(1) variables and after running JJ test, I got the result that there are 3 co-integration equation.
Now, I want to run ECM Model, but I am not sure what to put in the Co-integration rank?
Please help
Relevant answer
Answer
Dear Puneet.
Why not consider using ARDL as recommended @Mwoya Byaro.
Regards
  • asked a question related to Cointegration
Question
12 answers
Hi everyone,
When there is non-stationarity in time series data due to structural breaks( more than 2 structrural breaks )
1) then how to conduct Cointegration analysis and
2) will use of VECM model still be meaningful in this case?(if found cointegrated)
Thank you
Relevant answer
Answer
If you are using time-series, you can identify the break years first. For this, you can use the Maki cointegration. After that you can apply the ARDL method where you can externally incorporate the break years to capture the effect of SB in your final model.
  • asked a question related to Cointegration
Question
5 answers
Hi everyone.
The ARDL can be used when variables are of different order of intégration not greater than one!! if one variable is I(2) for instance, the ARDL is no longer suitable.
Does anyoneby the way knows if variables are integrated I(d) with different orders with d>1 which method to use?
Relevant answer
Answer
Dear Devesh Kumar Pant I think that it would be a great idea to replace the variable that is integrated to order 2 or higher and to find a substitute of that variable if that is not that excruciating.
  • asked a question related to Cointegration
Question
6 answers
Good day scholars,
I'm working on the impact of some macroeconomics variables on stock prices. I have done the differences of all variables and they are all stationary at First difference and also test for the arch and Garch effect for the stock prices. I'm lost on how to do the cointegration test, I'm I supposed to do cointegration of the variables directly or the one I difference?
Relevant answer
Answer
Hi, you can find help from this source:
  • asked a question related to Cointegration
Question
5 answers
Hello all,
There is some confusion regarding the steps to be followed to fit a cointegrated SVAR model? Mainly, when we fit a VAR model and find a significant cointegration among variables, a VECM model will be the best model; after which we can perform an SVAR model that considers the cointegration component!
I found the study by Loría et al. (2010), but the steps were not clear!
I need a detailed tutorial to follow in fitting a co-integrated SVAR with the Eviews, Stata, or R programs
Best wishes
Relevant answer
Answer
Dear Chellai.
Using Eviews.
* Import your data into Eviews
* Then select Group Statistics
*Then go to Quick
* Then click Johanson Cointegration test
* From the series list, list out your variables accordingly.
* Click OK
* From the Johanson cointegration test icon, adjust the lag intervals.
Then click okay for your result.
There is no specific icon for SVAR cointegration test.
Regards
  • asked a question related to Cointegration
Question
5 answers
Dear Scholars,
I have 2 variables. Both are non stationary. My dependent variable is stationary at 2nd difference and independent variable is stationary at 1st difference. I am using panel data set, the number of country is 5 and 18 years. How can I solve this problem, which model is better for this situation?
Best Regards
Relevant answer
Dear Zaki Farhana
I suggest that you first transform the series into log form and then check the degree of cointegration.
  • asked a question related to Cointegration
Question
8 answers
I have estimated a VECM model in EViews and using Johansen test I obtained that there were cointegration vectors. The output for short-run equation of VECOM contains two cointegration vectors, I was just wondering what is the speed of adjustment for them? Please find my EViews output below. Thank you so much for your time!
Relevant answer
Answer
I agree with all comments made above by John C Frain
Let me add an obvious point he needs to check. The equation has no significant error correction term, even though he claims he has two vectors. In no case did the dynamics translate into the steady-state. Take out the trend variable and ensure that appropriate outliers have been handled with dummies vars, including seasonality, and avoid extreme models (i.e., without intercept and inclusion trend variable, Johansen 5).
  • asked a question related to Cointegration
Question
13 answers
I am asking this question regarding my undergrad thesis. I am new to econometrics, so apologoes in advance. I wanted to analyse the impact of gender inequality in education on economic growth for India. I went through Klasen and Lamana (2009), Ali (2015), and Chuadhry (2005) and a few other papers to see if I could make a model for India.
I decided on the following variables as I think they would be suitable:
  1. GDP per capita, PPP (constant 2017 international $)
  2. School enrollment, primary (gross), gender parity index (GPI)
  3. School enrollment, secondary(gross), gender parity index (GPI)
  4. Ratio of female to male labor force participation rate (%) (modeled ILO estimate)
  5. Working age populatiom (% of total population)
  6. Fertility rate, total (births per woman)
  7. Population growth (annual %)
I have data for these variables from 1993-2018. My question is how should I proceed from here? From what I have read, if I am to run a regression model, i need to prove whether the time series are stationary and then do the regression. If not, i would have to look for cointegration. I plan on doing the analysis in either Python or EViews as I am comfortable in them. Please also suggest what other variables that can be used, what potential problems that I might run into and what steps could be taken to solve them.
I ran the exact model as specified in Klasen and Lamana (2009) and would like someone could help me understand the results that I got. In the Klasen paper I could understand uptill Table3. I am attaching the jupyter notebook.
Regards,
Dhruv Sinha
Relevant answer
Answer
  1. Granger Causality: If ecm1 is significant in the gdp equation then there is Granger causality through ecm1. The t-statistic on ecm1 in the equation for gdp does not follow a t-distribution.
  2. To test significance you must estimate the system with the relevant coefficients constrained to zero and do a likelihood ratio test. Not doing so is a regular mistake and is rarely recognized as one. This used to require programming in stata. I have not got access to modern versions of stata or eviews
  3. Granger causality is in effect the error in the difference in forecasts when all variables (X, Y) are taken into account and when Y is excluded. If the difference in the errors is not significant then Y does not Granger cause X. X and Y may be multivariate. If (X, Y) are stationary then Granger causality may be implemented using a simple joint zero test on coefficients in the VAR. It is not as straightforward when the series are non-stationary. You can have Granger causality either through the ecm adjustment terms or the short-run coefficients on the first difference terms. There is also the Toda-Yamamoto test which is relatively easy to implement but may not be that efficient (you need efficiency with your small sample)
  4. Are you using the log of the gdp variable? If one of your education variables increases by say .01, do you expect it to have the same additive effect on your gdp variable regardless of the level of the gdp variable? On the contrary, if you expect the effect on the GDP variable to increase (or decrease) by a fraction of the gdp variable then you should log the gdp variable before your analysis. This might improve your normality tests.
  5. Are there any big jumps in your education variables (possibly caused by policy changes)? Such exceptional values could cause your education variables to fail the test.
  6. Are you happy with the identification that stata assumes? Have you looked at the economic meaning of these long-term relationships? Perhaps some rescaling might make them more meaningful.
  7. I am still concerned about the economic basis of the analysis. Unless this is sound you can not make causal conclusions. I admit that I am not familiar with the economy and social conditions in your country. You are in a much better position to judge. The problem is that there are many other factors that give rise to changes in your GDP variable. If these also give rise to changes in your education variables your analysis will include the effect of these other factors in the estimate of the effect of the education ratios. If this is so the residuals may also be affected and this is why you may fail the specification tests.
  • asked a question related to Cointegration
Question
6 answers
My research has the dependent variable that cannot have data prior to 1995 as the existence of the variable commences since that year only. So I now have 22 annual time series data. I am testing for the determinants against 6 other variables which are strongly related to the dependent variable. My test results indicate the following:
Unit root test (DF-GLS) - stationary
ARDL F-bounds test - Cointegration exists at a 1 per cent significance level
ARDL long-run cointegration - All explanatory variables are statistically significant
ECM - Cointegration across all variables shows statistical significance
Diagnostics tests - The results validate the null hypothesis assumptions of no autocorrelation (Serial Correlation LM Test - Breusch-Godfrey), the existence of homoscedasticity (Heteroscedasticity Test: Breusch-Pagan-Godfrey), normal distribution of the residuals (Normality Test: Jarque-Bera), and a correct model specification (Ramsey RESET).
Parameter stability - CUSUM and CUSUMSQ plots are within the critical lines of 5 per cent.
Can I go ahead with this econometric analysis or is there still a possibility of a spurious regression with such data? Please share your thoughts and suggestions. Are there are any studies on similar lines i.e. small sample data for ARDL estimation that I can refer to?
Relevant answer
Answer
For classic frequentist analysis (i.e, panel model & time series); depends on the hypothesis.
There is no rules of thumbs casted on stone, that a certain number of independent variables should be the minimum in exploring the links between dependent variables and independent variables. But having 22 observations are enough and have high degree of freedom to conduct the study. However, avoid including too many variables in the same model (i.e 7 variables) may cause multicollinearity.
You can also use Bayesian estimate, in which the small samples sizes can be simulated to larger sample by MCMC....
  • asked a question related to Cointegration
Question
6 answers
Recently, I'm doing an analysis using VECM. I have three variables with an optimal lag length of 3 and cointegration of three. Since the lag length can't be the same as the number of cointegration, how to fix this?
Relevant answer
Answer
There is no relationship between lag length and number of cointegration but you need to check that there is no diagnostic problems with the selected lag length such as autocorrelation problem, multicollinearity problem, and heteroskedasticity problem.
  • asked a question related to Cointegration
Question
4 answers
title - did covid-19 effect in import, export and GDP
data - time series
country - only one country
dependent variable - GDP
independent variable - import, export
dummy - pandemic crisis
Then I use ADF test to check the stationarity of dependent and independent variable. With using the dependent variable and independent variables I check the Johansen Cointegration then I use Vector error correction model then last I used Granger Causality.
Question - in which part I need to include my dummy variables to see did covid-19 effect in gdp, import and export.
Relevant answer
Answer
It is not clear, what you want to show by your analysis. Do you try to find an explanation for GDP (for forecasting or?) and trying to get rid of the disturbing effect of Covid or is the main goal to estimate the Covid-effect on the economy? For me it seems that you are not interested in an economic problem, but only want to get some exercise in econometric methods and programs. Your "model" (i.e. equation to estimate) seems to be
GDP = Function(Exports, Imports, Covid19-Dummy), where, I suppose, that Function is linear, but the variables are transformed (ln, differences, exp etc.?). Whatever these transformations, this is a rather strange model. It is neither demand nor supply side, and I cannot see any other theoretical base.
As GDP is - by definition - the sum of internal demand (consumption, investment, change of stocks)+exports-imports, there is a severe endogeneity problem (see John's contribution) - especially as imports can be assumed as highly dependent on GDP (via income and consumption or because they are inputs for the domestic production).
  • asked a question related to Cointegration
Question
4 answers
Hi, is it possible to conduct panel cointegration test with 1 variable in panel and 1 variable in time series? If yes, can we do it on Eviews or STATA? Thanks.
Relevant answer
Following your response on the research interest through your research topic, then, it is imperative to know the basic purpose of cointegration test, regardless of whether panel cointegration or time cointegration.
Cointegration is use to test for joint stability or comovement of variables. Hence, the variable must be more than one. Second, the two variables must be in same order of dataset- whether time series or panel dataset.
Since your study objective consider multiple countries, then, convert the oil prices in time series to the stack/panel series. Then both panel variables can now under panel cointegration test.