Jurgen A. Doornik’s research while affiliated with University of Oxford and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (118)


Detecting Breaks in Trends by Trend-indicator Saturation
  • Preprint

January 2025

·

6 Reads

·

Jurgen Doornik

·

David F. Hendry

·

Felix Pretis


Forecasting the UK top 1% income share in a shifting world

May 2024

·

29 Reads

·

1 Citation

UK top income shares have varied hugely over the past two centuries, ranging from more than 30% to less than 7% of pre‐tax national income allocated to the top 1 percentile. We build a congruent dynamic linear regression model of the top 1% income share allowing for economic, political and social factors. Saturation estimation is used to model outliers and trend breaks, proxying underlying structural changes driving income inequality in the UK. We use the model to forecast the top 1% income share over the last 15 years, and compare to a range of forecast devices. Despite a well‐specified constant parameter model conditioning on significant explanatory variables, the best performing forecasts are obtained from a random walk and a smoothed random walk. These results are explained by the presence of shifts in the income share over the forecast period, resulting in forecasts from equilibrium correction models converging to the wrong equilibrium. Our best prediction for 2026 based on the most recent data from 2021 (a 5‐year ahead projection) is that the pre‐tax top 1% income share will remain at the most recent realized value of 12.7%, but there is a large degree of uncertainty, with a 95% confidence band ranging from 10% to 15.7%.



Figure 1. (a) Top 1% income shares in the UK since 1918; (b) annual changes in log monthly UK real GDP since 2007; (c) thousand-year changes in atmospheric CO 2 in parts per million (ppm) over Ice Ages, and last 250 years; (d) UK daily new confirmed cases of COVID-19 to August 2021.
Figure 2. (a) UK annual productivity y t − l t over 1860-2017; (b) Office of Budget Responsibility (OBR) five-year-ahead forecasts of UK productivity.
Figure 3. (a) Actual and fitted values for (y − l) t from TIS; (b) scaled residuals; (c) residual density and histogram; (d) trend adjustment of the estimated model given the retained trend indicators.
Figure 4. (a) 1-year ahead pseudo forecasts for (y − l) t from model (11); (b) model (12); and (c) 1-year ahead forecasts from Cardt.
Figure 7. Recursive forecasts of smoothed daily reductions in CO 2 during 2020: global emissions (left), international aviation (right). Three forecasting methods: AR(1) (first row); robust AR(1) (middle row); Cardt (bottom row). Data from carbon monitor.

+2

Forecasting Facing Economic Shifts, Climate Change and Evolving Pandemics
  • Article
  • Full-text available

December 2021

·

190 Reads

·

7 Citations

Econometrics

By its emissions of greenhouse gases, economic activity is the source of climate change which affects pandemics that in turn can impact badly on economies. Across the three highly interacting disciplines in our title, time-series observations are measured at vastly different data frequencies: very low frequency at 1000-year intervals for paleoclimate, through annual, monthly to intra-daily for current climate; weekly and daily for pandemic data; annual, quarterly and monthly for economic data, and seconds or nano-seconds in finance. Nevertheless, there are important commonalities to economic, climate and pandemic time series. First, time series in all three disciplines are subject to non-stationarities from evolving stochastic trends and sudden distributional shifts, as well as data revisions and changes to data measurement systems. Next, all three have imperfect and incomplete knowledge of their data generating processes from changing human behaviour, so must search for reasonable empirical modeling approximations. Finally, all three need forecasts of likely future outcomes to plan and adapt as events unfold, albeit again over very different horizons. We consider how these features shape the formulation and selection of forecasting models to tackle their common data features yet distinct problems.

Download

One, three, and seven days ahead forecast evaluation May-December 2020; LAC6 (Brazil, Mexico, Colombia, Argentina, Peru, Chile)
Modeling and forecasting the COVID‐19 pandemic time‐series data

August 2021

·

92 Reads

·

18 Citations

Objective: We analyze the number of recorded cases and deaths of COVID-19 in many parts of the world, with the aim to understand the complexities of the data, and produce regular forecasts. Methods: The SARS-CoV-2 virus that causes COVID-19 has affected societies in all corners of the globe but with vastly differing experiences across countries. Health-care and economic systems vary significantly across countries, as do policy responses, including testing, intermittent lockdowns, quarantine, contact tracing, mask wearing, and social distancing. Despite these challenges, the reported data can be used in many ways to help inform policy. We describe how to decompose the reported time series of confirmed cases and deaths into a trend, seasonal, and irregular component using machine learning methods. Results: This decomposition enables statistical computation of measures of the mortality ratio and reproduction number for any country, and we conduct a counterfactual exercise assuming that the United States had a summer outcome in 2020 similar to that of the European Union. The decomposition is also used to produce forecasts of cases and deaths, and we undertake a forecast comparison which highlights the importance of seasonality in the data and the difficulties of forecasting too far into the future. Conclusion: Our adaptive data-based methods and purely statistical forecasts provide a useful complement to the output from epidemiological models.


Selecting a Model for Forecasting

June 2021

·

272 Reads

·

21 Citations

Econometrics

We investigate forecasting in models that condition on variables for which future values are unknown. We consider the role of the significance level because it guides the binary decisions whether to include or exclude variables. The analysis is extended by allowing for a structural break, either in the first forecast period or just before. Theoretical results are derived for a three-variable static model, but generalized to include dynamics and many more variables in the simulation experiment. The results show that the trade-off for selecting variables in forecasting models in a stationary world, namely that variables should be retained if their noncentralities exceed unity, still applies in settings with structural breaks. This provides support for model selection at looser than conventional settings, albeit with many additional features explaining the forecast performance, and with the caveat that retaining irrelevant variables that are subject to location shifts can worsen forecast performance.


Robust Discovery of Regression Models

June 2021

·

59 Reads

·

42 Citations

Econometrics and Statistics

Successful modeling of observational data requires jointly discovering the determinants of the underlying process and the observations from which it can be reliably estimated, given the near impossibility of pre-specifying both. To do so requires avoiding many potential problems, including substantive omitted variables; unmodeled non-stationarity and misspecified dynamics in time series; non-linearity; and inappropriate conditioning assumptions, as well as incorrect distributional shape combined with contaminated observations from outliers and shifts. The aim is to discover robust, parsimonious representations that retain the relevant information, are well specified, encompass alternative models, and evaluate the validity of the study. An approach is proposed that provides robustness in many directions. It is demonstrated how to handle apparent outliers due to alternative distributional assumptions; and discriminate between outliers and large observations arising from non-linear responses. Two empirical applications, utilizing datasets popularized in previous applications, show substantive improvements from the proposed approach to robust model discovery.


THE VALUE OF ROBUST STATISTICAL FORECASTS IN THE COVID-19 PANDEMIC

April 2021

·

35 Reads

·

16 Citations

National Institute Economic Review

The Covid-19 pandemic has put forecasting under the spotlight, pitting epidemiological models against extrapolative time-series devices. We have been producing real-time short-term forecasts of confirmed cases and deaths using robust statistical models since 20 March 2020. The forecasts are adaptive to abrupt structural change, a major feature of the pandemic data due to data measurement errors, definitional and testing changes, policy interventions, technological advances and rapidly changing trends. The pandemic has also led to abrupt structural change in macroeconomic outcomes. Using the same methods, we forecast aggregate UK unemployment over the pandemic. The forecasts rapidly adapt to the employment policies implemented when the UK entered the first lockdown. The difference between our statistical and theory based forecasts provides a measure of the effect of furlough policies on stabilising unemployment, establishing useful scenarios had furlough policies not been implemented.


Forecasting Principles from Experience with Forecasting Competitions

February 2021

·

553 Reads

·

22 Citations

Forecasting

Economic forecasting is difficult, largely because of the many sources of nonstationarity influencing observational time series. Forecasting competitions aim to improve the practice of economic forecasting by providing very large data sets on which the efficacy of forecasting methods can be evaluated. We consider the general principles that seem to be the foundation for successful forecasting, and show how these are relevant for methods that did well in the M4 competition. We establish some general properties of the M4 data set, which we use to improve the basic benchmark methods, as well as the Card method that we created for our submission to that competition. A data generation process is proposed that captures the salient features of the annual data in M4.


Citations (81)


... Ericsson (2011b) proposes a systematic structure for discussing and developing such extensions. See Ericsson (2012), Castle, Clements, and Hendry (2013), Hendry and Pretis (2013), Castle, Doornik, Hendry, and Pretis (2015), Pretis, Schneider, Smerdon, and Hendry (2016), Ericsson, (2016b), Walker, Pretis, Powell-Smith, and Goldacre (2019), Castle, Doornik, and Hendry (2020), Guerard, (2022), Stechemesser et al. (2024), You (2024), and Castle, Doornik, and Hendry (2024) for details, discussion, and applications across a wide range of topics, including asset demand, GDP, income inequality, anthropogenic CO 2 , volcanic eruptions, medical practices, and COVID-19. ...

Reference:

Improving empirical models and forecasts with saturation-based machine learning
Forecasting the UK top 1% income share in a shifting world
  • Citing Article
  • May 2024

... There is an extensive literature that provides solutions to in-sample shifts, including selecting observations that exclude the break (see Pesaran and Timmermann, 2007;Pesaran et al., 2013); fitting to expanding or rolling windows (see Giacomini and Rossi, 2009;Inoue et al., 2017); testing for and conditioning on breaks (see Bai and Perron, 1998); applying model selection using break indicators (see Castle et al., 2023); and switching to predictors that are robust following shifts to avoid forecast failure (see e.g., Hendry, 2006;Castle et al., 2015a;Martinez et al., 2022). For unanticipated forecast period shifts, namely events that occur after forecasts are made, little can be done, like the sudden onset of COVID-19. ...

Improving models and forecasts after equilibrium-mean shifts
  • Citing Article
  • October 2023

International Journal of Forecasting

... Pretis et al. (2015) used TIS and SIS to assess climate models. Castle et al. (2021) utilized TIS and SIS in identifying shifts in trends within a long-term UK production function. Ghouse et al. (2022) utilized the IIS technique to identify the structural breaks in the returns and volatility of commercial banks in Pakistan. ...

Forecasting Facing Economic Shifts, Climate Change and Evolving Pandemics

Econometrics

... This is not to say that quantitative research cannot be used for counterfactual analysis. Time series analysis is considered the statistical analog to qualitative counterfactual analysis, and forecasting-an important function of time series modeling-is designed to answer "what if" questions about the future, rather than historical events that have already happened (Brandt and Freeman 2006;Doornik, Castle, and Hendry 2021). More recently developed quantitative modeling techniques like matching (Henderson and Chattfield 2011;Iacus, King, and Porro 2012), natural experiments (Robinson, McNulty, and Krasno 2009), synthetic controls (Bove and Nistico 2014;Costalli, Moretti, and Pischedda 2017), and Bayesian predictions (Brandt and Freeman 2006;Eggers and Lauderdale 2016) have also been used to address counterfactual problems. ...

Modeling and forecasting the COVID‐19 pandemic time‐series data

... While there are important regularities in both climate and macroeconomic data, and many of those that are informative about the future are embodied in empirical systems, sudden unanticipated changes are not rare and can be large, leading to forecast failure. Recent examples with substantial impacts on climate-related economic variables include the 'Financial Crisis', Sars-Cov-2 pandemic and Russia's invasion of Ukraine (see Castle et al., 2021aCastle et al., , 2021b, for principles of forecasting applicable to non-stationary processes and model selection when forecasting). Figure 14.2 focuses on more recent changes in CO 2 . ...

Selecting a Model for Forecasting

Econometrics

... e work of Stuart et al. (2021) shows that the FWS system has helped retain jobs in the United Kingdom and should be implemented as part of companies' human resources policies for workforce retention. Castle et al. (2021), using forecasting techniques, demonstrate that furlough policies have stabilized unemployment in the United Kingdom. Pope et al. (2020) show, through statistical analysis, that job retention schemes have mitigated the negative effects of the pandemic on the labour market in the case of the United Kingdom, although the extension of these schemes varied across sectors. ...

THE VALUE OF ROBUST STATISTICAL FORECASTS IN THE COVID-19 PANDEMIC
  • Citing Article
  • April 2021

National Institute Economic Review

... In regression analysis, one of the critical assumptions is that the residuals are independent of each other, meaning there is no autocorrelation. This assumption is particularly important in time-series data or datasets where the observations have a natural order (Castle et al., 2023;Gujarati and Porter, 2009). Autocorrelation occurs when residuals from one observation are systematically related to residuals from another, violating the independence assumption. ...

Robust Discovery of Regression Models
  • Citing Article
  • June 2021

Econometrics and Statistics

... Overall, as highlighted by Castle, Doornik, and Hendry (2021), it is clear that the estimates of the sum of the AR(p) coefficients cluster near unity most of the time, pointing to high persistence in the data or near-unit root behaviour. ...

Forecasting Principles from Experience with Forecasting Competitions

Forecasting

... It is becoming increasingly apparent that econometric techniques are particularly suited to analysing this data: see, for example, Li and Linton (2020), Manski and Molinari (2020) and the review by Dolton (2021). Much of the research using these techniques has focused on short-term forecasting of cases, hospital admissions and deaths, with notable examples being Doornik et al. (2020), Doornik et al. (2021) and Harvey et al. (2021). It has also been directed at generalising, to stochastic settings, compartmental epidemiological models, such as the well-known "susceptible (S), infected (I) and recovered or deceased (R)", or SIR, model, as in Korolev (2020) and Pesaran and Yang (2021). ...

Statistical Short-Term Forecasting Of The COVID-19 Pandemic
  • Citing Article
  • December 2020

Clinical Immunolgy & Immunotherapy

... Ref. [38] suggested using penalized regression techniques for macroeconomic forecasting, but ref. [14,39,40] argued in favor of factor-based models. Additionally, Autometrics has been proposed as an effective approach by [41]. However, Big Data can be distinguished by [42], specifically in three categories. ...

Modelling non-stationary ‘Big Data’
  • Citing Article
  • September 2020

International Journal of Forecasting