Content uploaded by Kesten Green
Author content
All content in this area was uploaded by Kesten Green on May 10, 2024
Content may be subject to copyright.
Predictive validity and reliability
of causal temperature models
Effect of variable choice
Kesten C. Green
A talk to the
Climate Science & Economics Group
8th of May 2024
J. Scott Armstrong
& Kesten C. Green
FOREWORD BY
VERNON SMITH
AFTERWORD BY
TERENCE KEALEY
Useful Knowledge
SCIENTIFIC
METHOD
THE
A Guide to Finding
Background
•IPCC rejects a substantive contribution of the Sun to increasing
temperatures since the 1950s based on “attribution studies”.
•Connolly, Soon, Connolly, et al.’s (2023) “Challenges in the
Detection and Attribution of Northern Hemisphere Surface
Temperat ure Trends Since 1850”
•Compared the statistical fits (adjusted-R2s) of causal models of 5
measures of NH Surface Temperature (ST, ℃)
•275 models estimated using putative causal variable (Wm-2)
•one of 28 Solar measures (including IPCC’s one), or none
•IPCC’s Volcanic variable, or not
•IPCC’s “All Anthropogenic” variable, or not
•Statistical fit figures were consistent with temperatures being:
“mostly human-caused, mostly natural, or some combination of
both”
But do any of the models provide useful forecasts?
•Predictive validity:
•Which, if any, models provide forecasts that are more
accurate than those from a simple benchmark? (i.e., have
smaller errors)
•Forecasts in this study are derived using “actual,” rather
than forecast, values of causal variables
•Statistical fit (adjusted-R2) does NOT answer that question
•Reliability:
•Which, if any, models provide forecasts that are
independent of the subset of data used for estimation?
•What would one expect of the accuracy (errors) of the
forecasts of a valid model as more data is added to the
estimation sample?
Why Northern Hemisphere?
•68 percent of Earth’s
land surface is in NH
•More weather stations
with longer history
https://svs.gsfc.nasa.gov/4961
IPCC’s Anthropogenic & Volcanic variables
•Anthro:
•Putative human impact on Earth’s
temperature
•Composite of 11 proposed anthropogenic
influences
•Mainly CO2 emissions in atmosphere
•Volcanic
•Effect of eruptions via atmosphere
Total sol ar insolation (TSI) var iables
•IPCC Solar:
•IPCC’s AR6 uses Matthes, et al. (2017) TSI estimate
•A low-variability estimate
•Two high-variability estimates were chosen for this study from
Connolly, et al.’s ( 2 0 2 3 ) 2 7 a l t e r n a t i v e s t o I P C C s o l a r
•Solar B2000 (Bard, et al., 2000)
•11th largest range
•1st smallest correlation with IPCC Solar (0.39)
•Solar H1993 (Hoyt & Schatten, 1993)
•2nd largest range
•4th smallest correlation with IPCC Solar (0.62)
•Endorsed in IPCC’s AR4, but dropped for AR6
Eight models tested in this study a,b
a With Willie Soon
b Models that include the Anthro variable use the IPCC’s preferred data and
formulations. Those that do not include the Anthro variable, are herein described as
“Independent”.
Model
name
Causal variables Forecast variable
AVL Anthro
Volcanic
-NH All Land Annual Average
Temp era tur e An om al y
AVSL
Anthro
Volcanic
Solar IPCC
S
B
VL
Solar B2000
Volcanic
-
S
H
VL
Solar H1993
Volcanic
-
AVR Anthro
Volcanic
-
NH Rural Land Annual Average
Temp era tur e An om al y
AVSR
Anthro
Volcanic
Solar IPCC
S
B
VR
Solar B2000
Volcanic
-
S
H
VR
Solar H1993
Volcanic
-
Measuring errors relative to a benchmark
𝐶𝑢𝑚𝑅𝐴𝐸(𝑜𝑟(𝑅𝑒𝑙𝑀𝐴𝐸 =∑!"#
$𝑒!
%
∑!"#
$𝑒!
∗
𝑈𝑀𝐵𝑅𝐴𝐸
( =( 𝑀𝐵𝑅𝐴𝐸
1−𝑀𝐵𝑅𝐴𝐸,
𝑤ℎ𝑒𝑟𝑒(𝑀𝐵𝑅𝐴𝐸 =1
𝑛8
!"#
$𝑒!
𝑒!
%+ 𝑒!
∗(
𝑎𝑛𝑑(𝑒!
∗(𝑖𝑠(𝑡ℎ𝑒(𝑖𝑡ℎ(𝑒𝑟𝑟𝑜𝑟(𝑓𝑟𝑜𝑚(𝑎(𝑏𝑒𝑛𝑐ℎ𝑚𝑎𝑟𝑘(𝑚𝑜𝑑𝑒𝑙(
An appropriate benchmark model of temperatures
Why?
•Simplicity
•Green & Armstrong (2015)
•Conservatism
•Armstrong, Green, & Graefe (2015)
•Realism
•Apparent trends in Earth temperatures reverse on all time scales
•Causal variables difficult if not impossible to forecast accurately
•Prior evidence
•Green, Armstrong, & Soon (2009) benchmark of mean historical
No change, or no trend
= Median of estimation sample temperatures
Absolute Errors of NH Temperature Forecasts to 2018 (
℃
)
All land Rural land
Es#ma#on period: 1850 to 1899
0
1
2
3
4
5
6
1900 1910 1920 1930 1940 1950 1960 1970 1980 1990 2000 2010
Anthro, Volcanic
Anthro, Volcanic, IPCC Solar
Median historical temperature
B2000 Solar, Volcanic
H1993 Solar, Volcanic
0
1
2
3
4
5
6
1900 1910 1920 1930 1940 1950 1960 1970 1980 1990 2000 2010
Absolute Errors of NH Temperature Forecasts to 2018 (
℃
)
All land Rural land
Es#ma#on period: 1850 to 1949
0
1
2
3
4
5
6
1950 1960 1970 1980 1990 2000 2010
0
1
2
3
4
5
6
1950 1960 1970 1980 1990 2000 2010
Absolute Errors of NH Temperature Forecasts to 2018 (
℃
)
All land Rural land
Es#ma#on period: 1850 to 1969
0
1
2
3
4
5
6
1970 1980 1990 2000 2010
0
1
2
3
4
5
6
1970 1980 1990 2000 2010
Absolute Errors of NH Temperature Forecasts to 2018 (
℃
)
All land Rural land
Es#ma#on period: 1850 to 1999
0
1
2
3
4
5
6
2000 2005 2010 2015
0
1
2
3
4
5
6
2000 2005 2010 2015
Predictive validity vs statistical fit of models
•Average correlation -0.26
•Sign-reversed Pearson’s r of UMBRAE vs adjusted-R2
•i.e., !
𝑅!↑$ ∴ $𝐴𝑐𝑐𝑢𝑟𝑎𝑐𝑦$ ↓
•Large and negative r for 6/8 combinations of
estimation period and temperature series (All Land;
Rural Land)
•Only for All Land models estimated with largest
sample (1850 – 1999) (i.e., 1/8) was r large and positive
Reliability of models
Median absolute errors of NH temperature forecasts for 2000 to 2018 (
℃
)
0
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
℃
Number of observations used for estimating models
Yea r o f la st obs er va ti on ad de d to es ti ma ti on sa mp le
Median histori cal temperature
Anthro, Volcanic
Anthro, Volcanic, IPCC Solar
B2000 Solar, Volcanic
H1993 Solar, Volcanic
All Land temperature
models
Rural Land temperature models
50 100 150|50 100 150
1899 1949 1999|1899 1949 1999
Reliability of models 2: Effect of sampling strategy
Forecast errors etc. from models estimated with 1850-1933 vs odd years data (
℃
)
From estimating models using 84
consecutive to using 84 odd years
data…
Reductions in MdAE (IQR)
IPCC: 57%+ (75%+).
Independent: Little or none.
Changes in model coefficients*
IPCC: IPCC Solar small –ve to small
+ve; Anthro large, halved.
Independent: H1993 and B2000 Solar
nearly doubled, >4 x IPCC Solar.
*Estimated using standardised data.
IPCC models of All Land; Independent
models of Rural Land
Reliability of models 3: Effect of sampling strategy
Parameters of models estimated with standardised data 1850-1933 vs odd years
Your conclusions on…
1. Predictive validity of IPCC vs Independent vs
Benchmark models?
2. Relationship between statistical fit (in-sample) and
predictive validity (out of sample)?
3. Reliability of IPCC vs Independent vs Benchmark
models?
4. Practical usefulness of IPCC vs Independent vs
Benchmark models?
References
Armstrong, J.S., Green, K.C. & Graefe, A. (2015). Golden rule of forecasting: Be conservative.
Journal of Business Research, 68, 1717-1731. https://doi.org/10.1016/j.jbusres.2015.03.031
Bard, E., Raisbeck, G., Yiou, F. & Jouzel, J. (2000). Solar irradiance during the last 1200 years
based on cosmogenic nuclides. Tellus B, 52, 985-992. http://dx.doi.org/10.1034/j.1600-
0889.2000.d01-7.x
Connolly, R., Soon, W., Connolly, M., Baliunas, S., Berglund, J., Butler, C.J., Cionco, R.G., Elias,
A.G., Fedorov, V.M., Harde, H., Henry, G.W., Hoyt, D.V., Humlum, O., Legates, D.R.,
Scafetta, N., Solheim, J.-E., Szarka, L., Velasco Herrera, V.M., Yan, H. & Zhang, W. (2023).
Challenges in the Detection and Attribution of Northern Hemisphere Surface Temperature
Trends Since 1850. Research in Astronomy and Astrophysics, 23, 105015.
https://iopscience.iop.org/article/10.1088/1674-4527/acf18e
Green, K.C. & Armstrong, J.S. (2015). Simple versus complex forecasting: The evidence. Journal
of Business Research, 68, 1678-1685. https://doi.org/10.1016/j.jbusres.2015.03.026
Green, K.C., Armstrong, J.S. & Soon W. (2009). Validity of climate change forecasting for public
policy decision making. International Journal of Forecasting, 25, 826-832.
https://doi.org/10.1016/j.ijforecast.2009.05.011
Hoyt, D.V. & Schatten, K.H. (1993). A discussion of plausible solar irradiance variations, 1700-
1992. Journal of Geophysical Research, 98, 18895-18906.
https://doi.org/10.1029/93JA01944
Matthes, K. et al. (2017). Solar forcing for CMIP6 (v3.2). Geoscientific Model Development, 10,
2247-2302. https://doi.org/10.5194/gmd-10-2247-2017