ArticlePDF Available

Evaluating the Performance of Past Climate Model Projections

Wiley
Geophysical Research Letters
Authors:
  • NASA Goddard Institute for Space Studies,

Abstract and Figures

Plain Language Summary Climate models provide an important way to understand future changes in the Earth's climate. In this paper we undertake a thorough evaluation of the performance of various climate models published between the early 1970s and the late 2000s. Specifically, we look at how well models project global warming in the years after they were published by comparing them to observed temperature changes. Model projections rely on two things to accurately match observations: accurate modeling of climate physics and accurate assumptions around future emissions of CO2 and other factors affecting the climate. The best physics‐based model will still be inaccurate if it is driven by future changes in emissions that differ from reality. To account for this, we look at how the relationship between temperature and atmospheric CO2 (and other climate drivers) differs between models and observations. We find that climate models published over the past five decades were generally quite accurate in predicting global warming in the years after publication, particularly when accounting for differences between modeled and actual changes in atmospheric CO2 and other climate drivers. This research should help resolve public confusion around the performance of past climate modeling efforts and increases our confidence that models are accurately projecting global warming.
This content is subject to copyright. Terms and conditions apply.
Evaluating the Performance of Past Climate
Model Projections
Zeke Hausfather
1
, Henri F. Drake
2,3
, Tristan Abbott
3
, and Gavin A. Schmidt
4
1
Energy and Resources Group, University of California, Berkeley, CA, USA,
2
Massachusetts Institute of
Technology/Woods Hole Oceanographic Institution Joint Program in Oceanography, Woods Hole, MA, USA,
3
Department of Earth, Atmospheric, and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA,
USA,
4
NASA Goddard Institute for Space Studies, Broadway, NY, USA
Abstract Retrospectively comparing future model projections to observations provides a robust and
independent test of model skill. Here we analyze the performance of climate models published between
1970 and 2007 in projecting future global mean surface temperature (GMST) changes. Models are compared
to observations based on both the change in GMST over time and the change in GMST over the change in
external forcing. The latter approach accounts for mismatches in model forcings, a potential source of
error in model projections independent of the accuracy of model physics. We nd that climate models
published over the past ve decades were skillful in predicting subsequent GMST changes, with most models
examined showing warming consistent with observations, particularly when mismatches between
modelprojected and observationally estimated forcings were taken into account.
Plain Language Summary Climate models provide an important way to understand future
changes in the Earth's climate. In this paper we undertake a thorough evaluation of the performance of
various climate models published between the early 1970s and the late 2000s. Specically, we look at how
well models project global warming in the years after they were published by comparing them to observed
temperature changes. Model projections rely on two things to accurately match observations: accurate
modeling of climate physics and accurate assumptions around future emissions of CO
2
and other factors
affecting the climate. The best physicsbased model will still be inaccurate if it is driven by future changes in
emissions that differ from reality. To account for this, we look at how the relationship between temperature
and atmospheric CO
2
(and other climate drivers) differs between models and observations. We nd that
climate models published over the past ve decades were generally quite accurate in predicting global
warming in the years after publication, particularly when accounting for differences between modeled and
actual changes in atmospheric CO
2
and other climate drivers. This research should help resolve public
confusion around the performance of past climate modeling efforts and increases our condence that
models are accurately projecting global warming.
1. Introduction
Physicsbased models provide an important tool to assess changes in the Earth's climate due to external
forcing and internal variability (e.g., Arrhenius, 1896; IPCC, 2013). However, evaluating the performance
of these models can be challenging. While models are commonly evaluated by comparing hindcastsof
prior climate variables to historical observations, the development of hindcast simulations is not always
independent from the tuning of parameters that govern unresolved physics (Gettelman et al., 2019;
Mauritsen et al., 2019; Schmidt et al., 2017). There has been relatively little work evaluating the
performance of climate model projections over their future projection period (referred to hereafter as
model projections), as much of the research tends to focus on the latest generation of modeling results
(Eyring et al., 2019).
Many different sets of climate projections have been produced over the past several decades. The rst time
series projections of future temperatures were computed using simple energy balance models in the early
1970s, most of which were solely constrained by a projected external forcing time series (originally, CO
2
con-
centrations) and an estimate of equilibrium climate sensitivity from singlecolumn radiativeconvective
equilibrium models (e.g., Manabe & Wetherald, 1967) or general circulation models (e.g., Manabe &
Wetherald, 1975). Simple energy balance models have since been gradually sidelined in favor of
©2019. American Geophysical Union.
All Rights Reserved.
RESEARCH LETTER
10.1029/2019GL085378
Key Points:
Evaluation of uninitialized
multidecadal climate model future
projection performance provides a
concrete test of model skill
The quasilinear relationship
between model/observed forcings
and temperature change is used to
control for errors in projected
forcing
Model simulations published
between 1970 and 2007 were skillful
in projecting future global mean
surface warming
Supporting Information:
Supporting Information S1
Correspondence to:
Z. Hausfather,
hausfath@gmail.com
Citation:
Hausfather, Z., Drake, H. F., Abbott, T.,
& Schmidt, G. A. (2020). Evaluating the
performance of past climate model
projections. Geophysical Research
Letters,47, e2019GL085378. https://doi.
org/10.1029/2019GL085378
Received 16 SEP 2019
Accepted 26 NOV 2019
Accepted article online 4 DEC 2019
HAUSFATHER ET AL. 1of10
increasingly high resolution and comprehensive general circulation models, which were rst published in
the late 1980s (e.g., Hansen et al., 1988; IPCC, 2013; Stouffer et al., 1989).
Climate model projections are usefully thought about as predictions conditional upon a specic forcing sce-
nario. We consider these to be projections of possible future outcomes when the intent was to use a realistic
forcing scenario and where the realized forcings were qualitatively similar to the projection forcings.
Evaluating model projections against observations subsequent to model development provides a test of
model skill, and successful projections can concretely add condence in the process of making projections
for the future. However, evaluating future projection performance requires a sufcient period of time post-
publication for the forced signal present in the model projections to be differentiable from the noise of nat-
ural variability (Hansen et al., 1988; Hawkins & Sutton, 2012).
Researchers have previously evaluated prior model projections from the Hansen et al. (1988) National
Aeronautics and Space Administration Goddard Institute for Space Studies model (Hargreaves, 2010;
Rahmstorf et al., 2007), the Stouffer et al. (1989) Geophysical Fluid Dynamics Laboratory model (Stouffer
& Manabe, 2017), the IPCC First Assessment Report (FARIPCC, 1990; Frame & Stone, 2012), and the
IPCC Third and Fourth Assessment reports (IPCC, 2001; IPCC, 2007; Rahmstorf et al., 2012). However,
todate there has been no systematic review of the performance of past climate models, despite the availabil-
ity of warming projections starting in 1970.
This paper analyzes projections of global mean surface temperature (GMST) change, one of the most visible
climate model outputs, from several generations of past models. GMST plays a large role in determining cli-
mate impacts, is tied directly to internationalagreedupon mitigation targets, and is one of the climate vari-
ables that has the most accurate and longest observational records. GMST is also the output most commonly
available for many early climate models run in the 1970s and 1980s.
Two primary factors inuence the longterm performance of model GMST projections: (1) the accuracy of
the model physics, including the sensitivity of the climate to external forcings and the resolution or parame-
terization of various physical processes such as heat uptake by the deep ocean and (2) the accuracy of pro-
jected changes in external forcing due to greenhouse gases and aerosols, as well as natural forcing such as
solar or volcanic forcing.
While climate models should be evaluated based on the accuracy of model physics formulations, climate
modelers cannot be expected to accurately project future emissions and associated changes in external for-
cings, which depend on human behavior, technological change, and economic and population growth.
Climate modelers often bypass the task of deterministically predicting future emissions by instead projecting
a range of forcing trajectories representative of several plausible futures bracketed by marginally plausible
extremes. For example, Hansen et al., 1988 consider a lowemissions extreme Scenario C with more drastic
curtailment of emissions than has generally been imagined,a highemissions extreme Scenario A wherein
emissions must eventually be on the high side of reality,as well as a middleground Scenario B, which is
perhaps the most plausible of the three.More recently, the Representative Concentration Pathways (RCPs)
used in CMIP5 and the IPCC AR5 report similarly includes a number of plausible scenarios bracketed by a
lowemissions extreme Scenario RCP2.6 and a highemissions extreme Scenario RCP8.5 (van Vuuren et al.,
2011). Thus, an evaluation of model projection performance should focus on the relationship between the
model forcings and temperature change, rather than simply assessing how well projected temperatures com-
pare to observations, particularly in cases where projected forcings differ substantially from our best esti-
mate of the subsequently observed forcings.
This approachcomparing the relationship between forcing and temperatures in both model projections
and observationscan effectively assess the performance of the model physics while accounting for poten-
tial mismatches in projected forcing that climate modelers did not address at the time. In this paper we apply
both a conventional assessment of the change in temperature over time and a novel assessment of the
response of temperature to the change in forcing to assess the performance of future projections by past cli-
mate models compared to observations.
Climate modeling efforts have advanced substantially since the rst modern singlecolumn (Manabe &
Strickler, 1964) and general circulation models (Manabe et al., 1965) of Earth's climate were published in
the mid1960s, resulting in continually improving model hindcast skill (Knutti et al., 2013; Reichler &
10.1029/2019GL085378
Geophysical Research Letters
HAUSFATHER ET AL. 2of10
Kim, 2008). While these improvements have rendered virtually all of the models described here operation-
ally obsolete, they remain valuable tools as they are in a unique position to have their projections evaluated
by virtue of their decadeslong postpublication projection periods.
2. Methods
We conducted a literature search to identify papers published prior to the early1990s that include climate
model outputs containing both a time series of projected future GMST (with a minimum of two points in
time) and future forcings (including both a publication date and future projected atmospheric CO
2
concen-
trations, at a minimum). Eleven papers with 14 distinct projections were identied that t these criteria.
Starting in the mid1990s, climate modeling efforts were primarily undertaken in conjunction with the
IPCC process (and later, the Coupled Model Intercomparison Projects, CMIPs), and model projections were
taken from models featured in the IPCC FAR (1990), Second Assessment Report (SARIPCC, 1996), Third
Assessment Report (TARIPCC, 2001), and Fourth Assessment Report (AR4IPCC, 2007).
The specic models projections evaluated were Manabe, 1970 (hereafter Ma70), Mitchell, 1970 (Mi70),
Benson, 1970 (B70), Rasool & Schneider, 1971 (RS71), Sawyer, 1972 (S72), Broecker, 1975 (B75), Nordhaus,
1977 (N77), Schneider & Thompson, 1981 (ST81), Hansen et al., 1981 (H81), Hansen et al., 1988 (H88), and
Manabe & Stouffer, 1993 (MS93). The energy balance model projections featured in the main text of the
FAR, SAR, and TAR were examined, while the CMIP3 multimodel mean (and spread) was examined for
the AR4 (multimodel means were not used as the primary IPCC projections featured in the main text prior
to the AR4). Details about how each individual model projection was digitized and analyzed as well as assess-
ments of individual models included in the rst three IPCCreports can be found in the supporting information.
The AR4 projection was excluded from the main analysis in the paper as both the observational uncertain-
ties and model projection uncertainties are too large over the short 20072017 period to draw many useful
conclusions, and its inclusion makes the gures difcult to read. However, analyses including the AR4 pro-
jection can be found in the supporting information.
We assessed model projections over the period between the date the model projection was published and the
end of 2017 or when the model projection ended in cases where model runs did not extend through 2017. An
end date of 2017 was chosen for the analysis because the ensemble of observational estimates of radiative
forcings we used only extends through that date.
Five different observational temperature time series were used in this analysisNational Aeronautics and
Space Administration GISTEMP (Lenssen et al., 2019), National Oceanic and Atmospheric
Administration GlobalTemp (Vose et al., 2012), Hadley/UEA HadCRUT4 (Morice et al., 2012), Berkeley
Earth (Rohde et al., 2013), and Cowtan and Way (Cowtan & Way, 2014). The observational temperature
records used do not present a completely liketolike comparison with models, as models provide surface
air temperature (SAT) elds while observations are based on SAT elds over land and sea surface tempera-
ture (SST) elds over the ocean. This means that the trends in the models used here are likely biased high
compared to observations, as model blended eld trends are about 7% (±5%) lower than model global
SAT elds over the 19702017 period (Cowtan et al., 2015; Richardson et al., 2016). However, the absence
of SST elds from the models analyzed here prevents a comparison of blended SAT/SST
against observations.
We compared observations to climate model projections over the model projection period using two
approaches: change in temperature versus time and change in temperature versus change in radiative for-
cing (implied TCR). We use an implied TCR metric to provide a meaningful modelobservation compar-
ison even in the presence of forcing differences. Implied TCR is calculated by regressing temperature
change against radiative forcing for both models and observations, and multiplying the resulting values by
the forcing associated with doubled atmospheric CO
2
concentrations, F
2x
, (following Otto et al., 2013):
TCRimplied ¼F2xΔT=ΔFanthro
We express implied TCR with units of temperature using a xed value of F
2x
= 3.7 W/m
2
(Vial et al.,
2013). ΔF
anthro
includes only anthropogenic forcings and excludes volcanic and solar changes to avoid
10.1029/2019GL085378
Geophysical Research Letters
HAUSFATHER ET AL. 3of10
introducing sharp interannual changes in forcing that would complicate the interpretation of TCR over
shorter time periods. For the observational record, ΔF
anthro
is based on a 1,000member ensemble of obser-
vationally informed forcing estimates (Dessler & Forster, 2018). Model forcings are recomputed from pub-
lished formulas and tables when possible and otherwise digitized from published gures (see supporting
information section S2 for details). Instantaneous forcings rather than effective or efcacyadjusted forcing
are used, as those are all that is available for some early models (Hansen et al., 2005; Marvel et al., 2016; see
supporting information section S1.0). Details on the approach used to calculate implied TCR can be found in
supporting information section S1.2.
Comparing models and observations via implied TCR assumes a linear relationship between forcing and
warming, an approach that has been widely used in prior analyses (Gregory et al., 2004; Otto et al., 2013).
If forcing varies sufciently slowly in time and deep ocean temperatures remain approximately constant,
then a linear relationship is expected to hold with a constant of proportionality that depends on the strength
of radiative feedbacks and ocean heat uptake (Held et al., 2010). In this regime, our implied TCR metric pro-
vides information about model physics and is unaffected by the time rate of change of forcing; moreover, pre-
vious studies have suggested that the temperature response to twentieth century anthropogenic forcing falls
within this regime (Gregory & Forster, 2008; Gregory & Mitchell, 1997; Held et al., 2010).
However, sudden increases or decreases such as those associated with volcanic eruptions will not engender
an equivalent immediate temperature response. For this reason, only anthropogenic forcings were used in
estimating TCR
implied
, as all models evaluated lacked additional volcanic events during their projection per-
iods with the exception of Scenarios B and C of H88. Similarly, thermal inertia in the climate system can
affect the relationship between temperature and external forcing if forcing increases sufciently rapidly
(Geoffroy et al., 2012). Scenarios where forcing is rapidly increasing will, all things being equal, tend to be
further away from an equilibrium state than scenarios with more gradual increase after a given period of
time (Rohrschneider et al., 2019) and thus have a lower implied TCR. With a few exceptions (e.g., RS71,
H88 Scenarios A and C), however, most models evaluated had a rate of external forcing increase in the pro-
jection period within 1.3 times of the mean estimate of observational forcings and thus likely fall into the
regime where implied TCR depends largely on radiative feedbacks and ocean heat uptake.
In this analysis we refer to model projections as consistent or inconsistent with observations based on a com-
parison of the differences between the two. Specically, if the 95% condence interval in the differences
Figure 1. Rate of external forcing increase (in watts per meter squared per decade) in models and observations over model projection periods
10.1029/2019GL085378
Geophysical Research Letters
HAUSFATHER ET AL. 4of10
between the modeled and observed metrics includes 0, the two are deemed consistent; otherwise, they are
inconsistent (Hausfather et al., 2017). Additionally, we follow the approach of Hargreaves (2010) in
calculating a skill score for each model for both temperature versus time and implied TCR metrics. This
skill score is based on the rootmeansquare errors of the model projection trend versus observations
compared to a zerochange nullhypothesis projection. See supporting information section S1.3 for details
on calculating consistency and skill scores.
3. Results
A direct comparison of projected and observed temperature change during each historical model's projection
period can provide an effective test of model skill, provided that model projection forcings are reasonably in
line with the ensemble of observationally informed estimates of radiative forcings. In about 9 of the 17 model
projections examined, the projected forcings were within the uncertainty envelope of observational forcing
ensemble. However, the remaining eight modelsRS71, H81 Scenario 1, H88 Scenarios A, B, and C, FAR,
MS93, and TARhad projected forcings signicantly stronger or weaker than observed (Figure 1). For the
latter, an analysis comparing the implied TCR between models and observations may provide a more accu-
rate assessment of model performance.
Comparisons between climate models and observations over model projection periods are shown in Figure 2
for both temperature versus time and implied TCR metrics (differences between models and observations
are shown in Figure S2). Overall the majority of model projections considered were consistent with observa-
tions under both metrics. Using the temperature versus time metric, 10 of the 17 model projections show
results consistent with observations. Of the remaining seven model projections, four project more warming
than observedN77, ST81, and H88 Scenarios A and Bwhile three project less warming than observed
RS71, H81 Scenario 2a, and H88 Scenario C.
Figure 2. Comparison of trends in temperature versus time (top panel) and implied TCR (bottom panel) between observations and models over the model projec-
tion periods displayed at the bottom of the gure. Figure S1 shows a variant of this gure with the AR4 projections included
10.1029/2019GL085378
Geophysical Research Letters
HAUSFATHER ET AL. 5of10
When mismatches between projected and observed forcings are taken into account, a better performance is
seen. Using the implied TCR metric, 14 of the 17 model projections were consistent with observations; of the
three that were not, Mi70 and H88 Scenario C showed higher implied TCR than observations, while RS71
showed lower implied TCR (Schneider, 1975; see supporting information Text S2 for a discussion of the
anomalously lowequilibrium climate sensitivity (ECS) model used in RS71).
A number of model projections were inconsistent with observations on a temperature versus time basis but
are consistent once mismatches between modeled and observed forcings are taken into account. For exam-
ple, whileN77 and ST81 projected more warming than observed, their implied TCRs are consistent with
observations despite forcings withinthough on the high end ofthe ensemble range of observational esti-
mates. Similarly, while H81 Scenario 2a projects less warming than observed, its implied TCR is consistent
with observations.
A number of 1970sera models (Ma70, Mi70, B70, B75, and N77) show implied TCR on the high end of the
observational ensemblebased range. This is likely due to their assumption that the atmosphere equilibrates
instantly with external forcing, which omits the role of transient ocean heat uptake (Hansen et al., 1985).
However, despite this high implied TCR, a number of the models (e.g., Ma70, Mi70, B70, and B75) still
end up providing temperature projections inline with observations as their forcings were on the lower
end of observations due to the absence of any nonCO
2
forcing agents in their projections.
In principle, the same underlying model should show consistent results for modestly different forcing sce-
narios under the implied TCR metric. However, the inconsistency of the H88 Scenario C is illustrative of
Figure 3. Hansen et al., 1988 projections compared with observations on a temperature versus time basis (top) and temperature versus external forcing (bottom).
The dashed gray line in the top panel represent the start of the projection period. The transparent blue lines in the lower panel represent 500 random samples of
the 5,000 combinations of the ve temperature observation products and the 1,000 ensemble members of estimated forcings (the full ensemble is subsampled
for visual clarity). The dashed blue lines show the 95% condence intervals for the 5,000member ensemble (see supporting information Text S1.4 for details).
Anomalies for both temperature and forcing are shown relative to a 19581987 preprojection baseline.
10.1029/2019GL085378
Geophysical Research Letters
HAUSFATHER ET AL. 6of10
the limitations of the implied TCR metric when the model forcings differ
dramatically from observations, as Scenario C has roughly constant for-
cings after the year 2000.
The H88 model provides a helpful illustration of the utility of an approach
that can account for mismatches between modeled and observed forcings.
H88 was featured prominently in congressional testimony, and the recent
thirtieth anniversary of the event in 2018 focused considerable attention
on the accuracy of the projection (Borenstein & Foster, 2018; United
States. Cong. Senate, 1988). H88's most plausibleScenario B overesti-
mated warming experienced subsequent to publication by around 54%
(Figure 3). However, much of this mismatch was due to overestimating
future external forcingparticularly from CH
4
and halocarbons
(Figure S3). When H88 Scenario B is evaluated based on the relationship
between projected temperatures and projected forcings, the results are
consistent with observations (Figures 2 and 3).
Skill score median estimates and uncertainties for both temperature ver-
sus time and implied TCR metrics are shown in Table 1 (see supporting
information Text S1.3). A skill score of one represents perfect agreement
between a model projection and observations, while a skill score of less
than 0 represents worse performance than a nochange null
hypothesis projection.
The average of the median skill scores across all the model projections
evaluated is 0.69 for the temperature versus time metric. Only three projections (RS71, H88 Scenario A,
and H88 Scenario B) had skill scores below 0.5, while H81 Scenario 1 had the highest skill score of any
model0.93. Using the implied TCR metric, the average projection skill of the models was also 0.69.
Models with implied TCR skill scores below 0.5 include Mi70, RS71, and H88 Scenario C, while MS93 had
the highest skill score at 0.87. H88 Scenarios A and B and the IPCC FAR all performed substantially better
under an implied TCR metric, reecting the role of misspecied future forcings in their hightemperature
projections. It is important to note that the skill score uncertainties for very short future projection peri-
odsas in the case of the TAR and AR4are quite large and should be treated with caution due to the com-
bination of shortterm temperature variability and uncertainties in the forcings.
A number of model projections had external forcings that poorly matched observational estimates due to the
exclusion of nonCO
2
forcing agents. However, all models included projected future CO
2
concentrations,
providing a common metric for comparison, and these are shown in Figure S4. Most of the historical climate
model projections overestimated future CO
2
concentrations, some by as much as 40 ppm over current levels,
with projected CO
2
concentrations increasing up to twice as fast as actually observed (Meinshausen, 2017).
Of the 1970s climate model projections, only Mi70 projected atmospheric CO
2
growth inline with observa-
tions. Many 1980s projections similarly overestimated CO
2
, with only the Hansen 88 Scenarios A and B pro-
jections close to observed concentrations.
The rst three IPCC assessments included projections based on simple energy balance models tuned to gen-
eral circulation model results, as relatively few individual model runs were available at the time. From the
AR4 onward IPCC projections were based on the multimodel mean and model spread. We examine indivi-
dual models from the rst three IPCC reports on both a temperature versus time and implied TCR basis in
Figure S5.
4. Conclusions and Discussion
In general, past climate model projections evaluated in this analysis were skillful in predicting subsequent
GMST warming in the years after publication. While some models showed too much warming and a few
showed too little, most models examined showed warming consistent with observations, particularly when
mismatches between projected and observationally informed estimates of forcing were taken into account.
We nd no evidence that the climate models evaluated in this paper have systematically overestimated or
Table 1
Model Skill Scores Over the Projection Period, Where 1 Represents
Perfect Agreement With Observations and Less Than 0 Represents
Worse Performance Than a NoChange Null Hypothesis
Model Timeframe ΔT/Δtskill ΔT/ΔFskill
Ma70 19702000 0.84 [0.57 to 0.99] 0.51 [0.11 to 0.94]
Mi70 19702000 0.91 [0.69 to 0.99] 0.41 [0.26 to 0.90]
B70 19702000 0.78 [0.45 to 0.97] 0.63 [0.06 to 0.96]
RS71 19712000 0.19 [0.16 to 0.25] 0.42 [0.28 to 0.59]
S72 19722000 0.83 [0.49 to 0.99] 0.83 [0.43 to 0.98]
B75 19752010 0.85 [0.64 to 0.98] 0.72 [0.31 to 0.97]
N77 19772017 0.67 [0.44 to 0.84] 0.79 [0.48 to 0.98]
ST81 19812017 0.76 [0.53 to 0.94] 0.82 [0.52 to 0.98]
H81(1) 19812017 0.93 [0.81 to 0.99] 0.74 [0.59 to 0.93]
H81(2a) 19812017 0.77 [0.66 to 0.91] 0.87 [0.69 to 0.99]
H88(A) 19882017 0.38 [0.01 to 0.68] 0.81 [0.63 to 0.98]
H88(B) 19882017 0.48 [0.08 to 0.77] 0.79 [0.41 to 0.98]
H88(C) 19882017 0.66 [0.48 to 0.89] 0.28 [0.46 to 0.84]
FAR 19902017 0.63 [0.29 to 0.87] 0.86 [0.68 to 0.99]
MS93 19932017 0.71 [0.20 to 0.97] 0.87 [0.61 to 0.99]
SAR 19952017 0.73 [0.58 to 0.95] 0.66 [0.49 to 0.91]
TAR 20012017 0.81 [0.15 to 0.98] 0.76 [0.13 to 0.98]
AR4 20072017 0.56 [0.35 to 0.92] 0.60 [0.37 to 0.93]
Note. Both temperature versus time (ΔT/year) and implied TCR (ΔT/ΔF)
median scores and uncertainties are shown.
10.1029/2019GL085378
Geophysical Research Letters
HAUSFATHER ET AL. 7of10
underestimated warming over their projection period. The projection skill of the 1970s models is particularly
impressive given the limited observational evidence of warming at the time, as the world was thought to
have been cooling for the past few decades (e.g., Broecker, 1975; Broecker, 2017).
A number of highprole model projectionsH88 Scenarios A and B and the IPCC FAR in particularhave
been criticized for projecting higher warming rates than observed (e.g., Michaels & Maue, 2018). However,
these differences are largely driven by mismatches between projected and observed forcings. H88 A and B
forcings increased 97% and 27% faster, respectively, than the mean observational estimate, and FAR forcings
increased 55% faster. On an implied TCR basis, all three projections have high model skill scores and are
consistent with observations.
While climate models have grown substantially more complex than the early models examined here, the
skill that early models have shown in successfully projecting future warming suggests that climate models
are effectively capturing the processes driving the multidecadal evolution of GMST. While the relative sim-
plicity of the models analyzed here renders their climate projections operationally obsolete, they may be use-
ful tools for verifying or falsifying methods used to evaluate stateoftheart climate models. As climate
model projections continue to mature, more signals are likely to emerge from the noise of natural variability
and allow for the retrospective evaluation of other aspects of climate model projections.
References
Arrhenius, S. (1896). On the inuence of carbonic acid in the air upon the temperature of the ground. Philosophical Magazine and Journal
of Science,5(41), 237276.
Benson, G. S. (1970). Carbon dioxide and its role in climate change. Proceedings of the National Academy of Sciences,67(2), 898899. https://
doi.org/10.1073/pnas.67.2.898
Borenstein, S., & Foster, N. (2018). Warned 30 years ago, global warming 'is in our living room'. New York, NY: Associated Press. https://
www.apnews.com/dbd81ca2a7244ea088a8208bab1c87e2 June 18, 2018. (last accessed Aug 22, 2019).
Broecker, W. (2017). When climate change predictions are right for the wrong reasons. Climatic Change,142(12), 16. https://doi.org/
10.1007/s105840171927y
Broecker, W. S. (1975). Climatic change: Are we on the brink of a pronounced global warming? Science,189(4201), 460463. https://doi.
org/10.1126/science.189.4201.460
Cowtan, K., Hausfather, Z., Hawkins, E., Jacobs, P., Mann, M. E., Miller, S. K., et al. (2015). Robust comparison of climate models with
observations using blended land air and ocean sea surface temperatures. Geophysical Research Letters,42, 65266534. https://doi.org/
10.1002/2015GL064888
Cowtan, K., & Way, R. G. (2014). Coverage bias in the HadCRUT4 temperature series and its impact on recent temperature trends.
Quarterly Journal of the Royal Meteorological Society,140, 19351944. https://doi.org/10.1002/qj.2297
Dessler, A. E., & Forster, P. M. (2018). An estimate of equilibrium climate sensitivity from interannual variability. Journal of Geophysical
Research: Atmospheres,123, 86348645. https://doi.org/10.1029/2018JD028481
Eyring, V., Cox, P. M., Flato, G. M., Gleckler, P. J., Abramowitz, G., Caldwell, P., et al. (2019). Taking climate model evaluation to the next
level. Nature Climate Change,9(2), 102110. https://doi.org/10.1038/s415580180355y
Frame, D. J., & Stone, D. A. (2012). Assessment of the rst consensus prediction on climate change. Nature Climate Change,3(4), 357359.
https://doi.org/10.1038/nclimate1763
Geoffroy, O., SaintMartin, D., Olivié, D. J. L., Voldoire, A., Bellon, G., & Tytéca, S. (2012). Transient climate response in a twolayer energy
balance model. Part I: Analytical solution and parameter calibration using CMIP5 AOGCM experiments. Journal of Climate,26(6),
18411857. https://doi.org/10.1175/JCLID1200195.1
Gettelman, A., Hannay, C., Bacmeister, J. T., Neale, R. B., Pendergrass, A. G., Danabasoglu , G., et al. (2019). High climate sensitivity in the
Community Earth System Model Version 2 (CESM2). Geophysical Research Letters,46, 83298337. https://doi.org/10.1029/
2019GL083978
Gregory, J. M., & Forster, P. M. (2008). Transient climate response estimated from radiative forcing and observed temperature change.
Journal of Geophysical Research,113, D23105. https://doi.org/10.1029/2008JD010405
Gregory, J. M., Ingram, W. J., Palmer, M. A., Jones, G. S., Stott, P. A., Thorpe, R. B., et al. (2004). A new method for diagnosing radiative
forcing and climate sensitivity. Geophysical Research Letters,31, L03205. https://doi.org/10.1029/2003GL018747
Gregory, J. M., & Mitchell, J. F. B. (1997). The climate response to CO
2
of the Hadley Centre coupled AOGCM with and without ux
adjustment. Geophysical Research Letters,24(15), 19431946. https://doi.org/10.1029/97GL01930
Hansen, J., Fung, I., Lacis, A., Rind, D., Lebedeff, S., Ruedy, R., et al. (1988). Global climate changes as forecast by Goddard Institute
for Space Studies threedimensional model. Journal of Geophysical Research,93, 93419364. https://doi.org/10.1029/
JD093iD08p09341
Hansen, J., Johnson, D., Lacis, A., Lebedeff, S., Lee, P., Rind, D., & Russell, G. (1981). Climate impact of increasing atmospheric carbon
dioxide. Science,213(4511), 957966. https://doi.org/10.1126/science.213.4511.957
Hansen, J., Russell, G., Lacis, A., Fung, I., Rind, D., & Stone, P. (1985). Climate response times: Dependence on climate sensitivity and
ocean mixing. Science,229(4716), 857859. https://doi.org/10.1126/science.229.4716.857
Hansen, J., Sato, M., Ruedy, R., Nazarenko, L., Lacis, A., Schmidt, G. A., et al. (2005). Efcacy of climate forcings. Journal of Geophysical
Research,110, D18104. https://doi.org/10.1029/2005JD005776
Hargreaves, J. C. (2010). (2010). Skill and uncertainty in climate models. Wiley Interdisciplinary Reviews: Climate Change,1,556564.
https://doi.org/10.1002/wcc.58
Hausfather, Z., Cowtan, K., Clarke, D. C., Jacobs, P., Richardson, M., & Rohde, R. (2017). Assessing recent warming using instrumentally
homogeneous sea surface temperature records. Science Advances,3(1). https://doi.org/10.1126/sciadv.1601207
10.1029/2019GL085378
Geophysical Research Letters
HAUSFATHER ET AL. 8of10
Acknowledgments
Z. H. conceived the project, Z. H. and H.
F. D. created the gures, and Z. H., H. F.
D., T. A., and G. S. helped gather data
and wrote the article text. A public
GitHub repository with code used to
analyze the data and generate gures
and csv les containing the data shown
in the gures is available online
(https://github.com/hausfath/
OldModels). Additional information on
the code and data used in the analysis
can be found in the supporting
information. We would like to thank
Piers Forster for providing the
ensemble of observationallyinformed
radiative forcing estimates. No
dedicated funding from any of the
authors supported this project.
Hawkins, E., & Sutton, R. (2012). Time of emergence of climate signals. Geophysical Research Letters,39, L01702. https://doi.org/10.1029/
2011GL050087
Held, I. M., Winton, M., Takahashi, K., Delworth, T., Zeng, F., & Vallis, G. K. (2010). Probing the fast and slow components of global
warming by returning abruptly to preindustrial forcing. Journal of Climate,23, 24182427. https://doi.org/10.11 75/2009JCLI3466.1
IPCC (AR4) (2007). In S. Solomon, D. Qin, M. Manning, Z. Chen, M. Marquis, K. B. Averyt, M. Tignor, & H. L. Miller (Eds.), Climate change
2007: The physical science basis, Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on
Climate Change. Cambridge, UK and New York, NY: Cambridge University Press. ISBN 9780521880091 (pb: 9780521705967)
IPCC (AR5) (2013). In T. F. Stocker, D. Qin, G.K. Plattner, M. Tignor, S. K. Allen, J. Boschung, A. Nauels, Y. Xia, V. Bex, & P. M. Midgley
(Eds.), Climate change 2013: The physical science basis. Contribution of Working Group I to the Fifth Assessment Report of the
Intergovernmental Panel on Climate Change (p. 1535). Cambridge, UK and New York, NY: Cambridge University Press.
IPCC (FAR) (1990). Climate change: The IPCC scientic assessment. Report prepared by Working Group I. In J. T. Houghton, G. J. Jenkins,
& J. J. Ephraums (Eds.), Intergovernmental Panel on Climate Change (p. 365). Cambridge, UK and New York, NY: Cambridge University
Press.
IPCC (SAR) (1996). In J. T. Houghton, L. G. Meira Filho, B. A. Callander, N. Harris, A. Kattenberg, & K. Maskell (Eds.), Climate change
1995: The science of climate change, Contribution of Working Group I to the Second Assessment Report of the Intergovernmental Panel on
Climate Change. Cambridge, UK and New York, NY: Cambridge University Press. ISBN 0521564336 (pb: 0521564360)
IPCC (TAR) (2001). In J. T. Houghton, Y. Ding, D. J. Griggs, M. Noguer, P. J. van der Linden, X. Dai, K. Maskell, & C. A. Johnson (Eds.),
Climate change 2001: The scientic basis, Contribution of Working Group I to the Third Assessment Report of the Intergovernmental Panel
on Climate Change. Cambridge, UK and New York, NY: Cambridge University Press. ISBN 0521807670 (pb: 0521014956)
Knutti, R., Masson, D., & Gettelman, A. (2013). Climate model genealogy: Generation CMIP5 and how we got there. Geophysical Research
Letters,40, 11941199. https://doi.org/10.1002/grl.50256
Lenssen, N. J. L., Schmidt, G. A., Hansen, J. E., Menne, M. J., Persin, A., Ruedy, R., & Zyss, D. (2019). Improvements in the GISTEMP
uncertainty model. Journal of Geophysical Research: Atmospheres,124, 63076326. https://doi.org/10.1029/2018JD029522
Manabe, S. (1970). The dependence of atmospheric temperature on the concentration of carbon dioxide. In S. F. Singer (Ed.), Global effects
of environmental pollution (Chap. 3, pp. 2529). Dordrecht: Springer.
Manabe, S., Smagorinsky, J., & Strickler, R. F. (1965). Simulated climatology of a general circulation model with a hydrologic cycle. Monthly
Weather Review,93, 769798. https://doi.org/10.1175/15200493(1965)093<0769:SCOAGC>2.3.CO;2
Manabe, S., & Stouffer, R. J. (1993). Centuryscale effects of increased atmospheric CO
2
on the oceanatmosphere system. Nature,
364(6434), 215218. https://doi.org/10.1038/364215a0
Manabe, S., & Strickler, R. F. (1964). Thermal equilibrium of the atmosphere with a convective adjustment. Journal of the Atmospheric
Sciences,21, 361385. https://doi.org/10.1175/15200469(1964)021<0361:TEOTAW>2.0.CO;2
Manabe, S., & Wetherald, R. T. (1967). Thermal equilibrium of the atmosph ere with a given distribution of relative humidity. Journal of the
Atmospheric Sciences,24(3), 241259. https://doi.org/10.1175/15200469(1967)024<0241:TEOTAW>2.0.CO;2
Manabe, S., & Wetherald, R. T. (1975). The effects of doubling the CO
2
concentration on the climate of a general circulation model. Journal
of the Atmospheric Sciences,32,315. https://doi.org/10.1175/15200469(1975)032<0003:TEODTC>2.0.CO;2
Marvel, K., Schmidt, G. A., Miller, R. L., & Nazarenko, L. S. (2016). Implications for climate sensitivity from the response to individual
forcings. Nature Climate Change,6, 386. Retrieved from. https://doi.org/10.1038/nclimate2888
Mauritsen, T., Bader, J., Becker, T., Behrens, J., Bittner, M., Brokopf, R., et al. (2019). Developments in the MPIM Earth System Model
version 1.2 (MPIESM 1.2) and its response to increasing CO
2
.Journal of Advances in Modeling Earth Systems,11(4), 9981038. https://
doi.org/10.1029/2018MS001400
Meinshausen, M., Vogel, E., Nauels, A., Lorbacher, K., Meinshausen, N., Etheridge, D. M., et al. (2017). Historical greenhouse gas con-
centrations for climate modelling (CMIP6). Geoscientic Model Development,10(5), 20572116. https://doi.org/10.5194/
gmd.10.2057.2017
Michaels, P., & Maue, R. (2018). Thirty years on, How well do global warming predictions stand up? The Wall Street Journal, June 21st.
Mitchell, J. M. (1970). A preliminary evaluation of atmospheric pollution as a cause of the global temperature uctuation of the past
century. In S. F. Singer (Ed.), Global Effects of Environmental Pollution (Chap. 12, pp. 139155 ). Dordrecht: Springer.
Morice, C. P., Kennedy, J. J., Rayner, N. A., & Jones, P. D. (2012). Quantifying uncertainties in global and regional temperature change
using an ensemble of observational estimates: The HadCRUT4 dataset. Journal of Geophysical Research,117, D08101. https://doi.org/
10.1029/2011JD017187
Nordhaus, W. (1977). Strategies for the control of carbon dioxide (Cowles Foundation Discussion Papers). Cowles Foundation for Research
in Economics, Yale University. Retrieved from https://econpapers.repec.org/RePEc:cwl:cwldpp:443
Otto, A., Otto, F. E. L., Boucher, O., Church, J., Hegerl, G., Forster, P. M., et al. (2013). Energy budget constraints on climate response.
Nature Geoscience,6, 415. https://doi.org/10.1038/ngeo1836
Rahmstorf, S., Cazenave, A., Church, J. A., Hansen, J. E., Keeling, R. F., Parker, D. E., & Somerville, R. C. J. (2007). Recent climate
observations compared to projections. Science,316(5825), 709709. https://doi.org/10.1126/science.1136843
Rahmstorf, S., Foster, G., & Cazenave, A. (2012). Comparing climate projections to observations up to 2011. Environmental Research Letters,
7(4), 44035. https://doi.org/10.1088/17489326/7/4/044035
Rasool, S. L., & Schneider, S. H. (1971). Atmospheric carbon dioxide and aerosols: Effects of large increases on global climate. Science,
173(3992), 138141. https://doi.org/10.1126/science.173.3992.138
Reichler, T., & Kim, J. (2008). How well do coupled models simulate today's climate? Bulletin of the American Meteorological Society,89,
303312. https://doi.org/10.1175/BAMS893303
Richardson, M., Cowtan, K., Hawkins, E., & Stolpe, M. B. (2016). Reconciled climate response estimates from climate models and the
energy budget of Earth. Nature Climate Change,6, 931. https://doi.org/10.1038/nclimate3066
Rohde, R., Muller, R. A., et al. (2013). A new estimate of the average Earth surface land temperature spanning 1753 to 2011. Geoinfor
Geostat: An Overview 1:1. https://doi.org/10.4172/gigs.1000101
Rohrschneider, T., Stevens, B., & Mauritsen, T. (2019). On simple representations of the climate response to external radiative forcing.
Climate Dynamics.,53(56), 31313145. https://doi.org/10.1007/s00382019046864
Sawyer, J. S. (1972). Manmade carbon dioxide and the greenhouseeffect. Nature,239(5366), 2326. https://doi.org/10.1038/239023a0
Schmidt, G. A., Bader, D., Donner, L. J., Elsaesser, G. S., Golaz, J. C., Hannay, C., et al. (2017). Practice and philosophy of climate model
tuning across six U.S. modeling centers. Geoscientic Model Development,10, 32073223. https://doi.org/10.5194/gmd.10.3207.2017
Schneider, S. H. (1975). On the carbon dioxideclimate confusion. Journal of the Atmospheric Sciences,32, 20602066. https://doi.org/
10.1175/15200469(1975)032<2060:OTCDC>2.0.CO;2
10.1029/2019GL085378
Geophysical Research Letters
HAUSFATHER ET AL. 9of10
Schneider, S. H., & Thompson, S. L. (1981). Atmospheric CO
2
and climate: Importance of the transient response. Journal of Geophysical
Research,86(C4), 31353147. https://doi.org/10.1029/JC086iC04p03135
Stouffer, R. J., & Manabe, S. (2017). Assessing temperature pattern projections made in 1989. Nature Climate Change,7(3), 163165. https://
doi.org/10.1038/nclimate3224
Stouffer, R. J., Manabe, S., & Bryan, K. (1989). Interhemispheric asymmetry in climate response to a gradual increase of atmospheric CO
2
.
Nature,342(6250), 660662. https://doi.org/10.1038/342660a0
United States. Cong. Senate (1988). Committee on Energy and Natural Resources. Greenhouse Effect and Global Climate Change.
Hearings, June 23, 1988. 100th Cong. 1st sess. Washington: GPO.
van Vuuren, D. P., Edmonds, J., Kainuma, M., Riahi, K., Thomson, A., Hibbard, K., et al. (2011). The representative concentration path-
ways: An overview. Climatic Change,109(1), 5. https://doi.org/10.1007/s105840110148z
Vial, J., Dufresne, J.L., & Bony, S. (2013). On the interpretation of intermodel spread in CMIP5 climate sensitivity estimates. Climate
Dynamics,41(11), 33393362. https://doi.org/10.1007/s0038201317259
Vose, R. S., Arndt, D., Banzon, V. F., Easterling, D. R., Gleason, B., Huang, B., et al. (2012). NOAA's merged landocean surface tem-
perature analysis. Bulletin of the American Meteorological Society,93(11), 16771685. https://doi.org/10.1175/BAMSD1100241.1
10.1029/2019GL085378
Geophysical Research Letters
HAUSFATHER ET AL. 10 of 10
... Ito et al 5 made their projections without knowing the relationship between COVID-19 fluctuations and anxiety disorder incidencevarying assumptions underpinning projection scenarios will be validated (or not) as they correspond to accumulating observations over time (in a similar vein to how, for instance, climate models are appraised). 6 Illness-death models of chronic disease (e.g. diabetes) require no remission parameter (i.e. an individual cannot transition from having diabetes to being susceptible to diabetesit is a 'one-way street' in that sense). ...
Article
Full-text available
Ito et al present an illness–death model projecting 82 scenarios for the prevalence of anxiety disorders in Germany from 2019 to 2030 following the COVID-19 pandemic. We suggest the modelling framework used by Ito et al has promising applications for mental health epidemiology.
... This study finds that in the northern permafrost regions, compound extreme events associated with extreme high temperatures exhibit an increasing trend, while those associated with extreme low temperatures show a decreasing trend. This observation aligns with the overall trends in extreme temperature changes in the northern permafrost regions [50], as well as with previous findings on compound extreme events [51,52]. ...
Article
Full-text available
The northern permafrost regions are increasingly experiencing frequent and intense extreme events, with a rise in the occurrence of compound extreme events. Many climate-related hazards in these areas are driven by such compound events, significantly affecting the stability and functionality of vegetation ecosystems. However, the cumulative and lagged effects of compound extreme events on vegetation remain unclear, which may lead to an underestimation of their actual impacts. This study provides a comprehensive analysis of the spatiotemporal variations in compound extreme events and the vegetation response to these events in the northern permafrost regions from 1982 to 2022. The primary focus of this study is on examining the cumulative and lagged effects of compound extreme climate events on the Kernel Normalized Difference Vegetation Index (kNDVI) during the growing seasons. The results indicate that in high-latitude regions, the frequency of extreme high temperature–precipitation compound events and high temperature–drought compound events have increased in 58.0% and 67.0% of the areas, respectively. Conversely, the frequency of extreme low temperature–drought compound events and extreme low temperature–precipitation compound events has decreased in 70.6% and 57.2% of the areas, with the high temperature–drought compound events showing the fastest increase. The temporal effects of compound extreme events on kNDVI vary with vegetation type; they produce more cumulative and lagged effects compared with single extreme high-temperature events and fewer effects compared with single extreme precipitation events, with compound events significantly affecting forest and grassland ecosystems. Notably, extreme high temperature–precipitation compound events exhibit the strongest cumulative and lagged effects on vegetation, while extreme low temperature–drought compound events influence wetland and shrubland areas within the same month. This study underscores the importance of a multivariable perspective in understanding vegetation dynamics in permafrost regions.
... It does not falsify any conclusion as reported in the IPCC assessment reports. Most of the models do quite well (Hausfather et al., 2020), be it that some details may need improvement. ...
Article
Full-text available
Climate change imposes severe impacts on society and the economy. Solving this problem may require a major redesign of how available resources on our planet are exploited, and those changes induce resistance. Therefore, the general public and politicians welcome claims that climate change is not as bad as scientists conclude. An article in this issue by May and Crok casting doubt on the conclusions of climate science was submitted to this journal and apparently passed peer review. On that basis, the authors claim authority and a high status for their article in posts on social media. However, there are serious problems with this article. It applies a number of logical fallacies frequently used by climate denialists. Rather than debunking all of these fallacies, the present article uses the one by May and Crok to showcase several of these fallacies. These include examples of rhetorical tricks denialists frequently use to cast doubt on the major findings of climate science such as use of fake experts, cherry picking, creating false expectations, and misrepresentations of available understanding.
... In climate change research, the selection of GCMs is a critical step in predicting future changes in meteorological data [34]. In previous studies, multiple indicators, such as the correlation coefficient and root mean square error (RMSE), were commonly used to select GCMs. ...
Article
Full-text available
The ridge–furrow plastic mulching technique has been widely applied due to its benefits of increasing temperature, conserving moisture, reducing evaporation, and boosting yields. Hydrus-2D is a computer model designed to simulate the two-dimensional movement of water in soil characterized by a low cost and high flexibility compared to field experiments. This study, based on field experiment data from Jianping County, Liaoning Province, China, during 2017–2018, developed Hydrus-2D models for two distinct field management practices: non-mulched flat cultivation (NM-FC) and mulched ridge tillage (M-RT). Furthermore, it simulated the dynamic changes in farmland water variations during the summer maize growth period (2021–2100) under climate change scenarios, specifically medium and high emission pathways (SSP2-4.5 and SSP5-8.5), based on the FGOALS-g3 model, which exhibits the highest similarity to the climate pattern of Jianping County in the Coupled Model Intercomparison Project Phase 6 (CMIP6) global climate models and the Shared Socioeconomic Pathways (SSPs). The results showed that in the future FGOALS-g3 model, net radiation exhibited a significant upward trend under the SSP2-4.5 scenario (Z = 2.38), while the average air temperature showed a highly significant increase under both SSP2-4.5 and SSP5-8.5 scenarios, with Z-values of 6.48 and 8.90, respectively. The Hydrus-2D model demonstrated high simulation accuracy in both NM-FC and M-RT treatments (R2 ranging from 0.86 to 0.96, with RMSE not exceeding 0.011), accurately simulating the dynamic changes in soil water content (SWC) under future climate change. Compared to NM-FC, M-RT reduced evaporation, increased transpiration, and effectively decreased the leakage caused by increased future precipitation, resulting in a 0.04 and 0.01 cm3/cm3 increase in surface and deep soil SWC, respectively, during the summer maize growing season, significantly improving water use efficiency. Moreover, M-RT treatment reduced the impact coefficients of climate change on various water balance parameters, stabilizing changes in these parameters and SWC under future climate conditions. This study demonstrates the significant advantages of M-RT in coping with climate change, providing key scientific evidence for future agricultural water resource management. These findings offer valuable insights for policymakers and farmers, particularly in developing adaptive land management and irrigation strategies, helping to improve water use efficiency and promote sustainable agricultural practices.
Article
Anthropogenically forced climate change signals are emerging from the noise of internal variability in observations, and the impacts on society are growing. For decades, Climate or Earth System Models have been predicting how these climate change signals will unfold. While challenges remain, given the growing forced trends and the lengthening observational record, the climate science community is now in a position to confront the signals, as represented by historical trends, in models with observations. This review covers the state of the science on the ability of models to represent historical trends in the climate system. It also outlines robust procedures that should be used when comparing modeled and observed trends and how to move beyond quantification into understanding. Finally, this review discusses cutting-edge methods for identifying sources of discrepancies and the importance of future confrontations.
Preprint
Full-text available
This paper proposes a new, verifiable model for determining the price of goods and services, addressing limitations of existing supply and demand theories. The model accounts for factors influencing production costs, including environmental impact, and incorporates both individual business decisions and large-scale economic forecasting. A key advantage is its potential for validation against real-world economic data, unlike current theories, leading to more informed and justifiable economic policies. Data collection limitations are acknowledged, but the paper demonstrates the model's conceptual soundness. Ultimately, the model aims to improve economic predictions and separate economic policy from political bias.
Article
Full-text available
AI models are criticized as being black boxes, potentially subjecting climate science to greater uncertainty. Explainable artificial intelligence (XAI) has been proposed to probe AI models and increase trust. In this review and perspective paper, we suggest that, in addition to using XAI methods, AI researchers in climate science can learn from past successes in the development of physics-based dynamical climate models. Dynamical models are complex but have gained trust because their successes and failures can sometimes be attributed to specific components or sub-models, such as when model bias is explained by pointing to a particular parameterization. We propose three types of understanding as a basis to evaluate trust in dynamical and AI models alike: (1) instrumental understanding, which is obtained when a model has passed a functional test; (2) statistical understanding, obtained when researchers can make sense of the modeling results using statistical techniques to identify input–output relationships; and (3) component-level understanding, which refers to modelers' ability to point to specific model components or parts in the model architecture as the culprit for erratic model behaviors or as the crucial reason why the model functions well. We demonstrate how component-level understanding has been sought and achieved via climate model intercomparison projects over the past several decades. Such component-level understanding routinely leads to model improvements and may also serve as a template for thinking about AI-driven climate science. Currently, XAI methods can help explain the behaviors of AI models by focusing on the mapping between input and output, thereby increasing the statistical understanding of AI models. Yet, to further increase our understanding of AI models, we will have to build AI models that have interpretable components amenable to component-level understanding. We give recent examples from the AI climate science literature to highlight some recent, albeit limited, successes in achieving component-level understanding and thereby explaining model behavior. The merit of such interpretable AI models is that they serve as a stronger basis for trust in climate modeling and, by extension, downstream uses of climate model data.
Conference Paper
A lot of effort is required from the humanity to preserve the global climate from changes that could make the living conditions on Earth unsustainable. Decarbonization of the energy sector is meant to give the key contribution by humans to keep the global temperature rise by the end of this century below levels determined by the Paris Agreement. Transition from fossil to renewable and other energy sources that do not emit greenhouse gases is an inevitable goal for the humanity to preserve natural cabon cycle from a risky inbalance. As the climate system on Earth is extremely complex, there is no possibility for climate change phenomena to be examined in a laboratory like the majority of the thermo-hydaulic phenomena is, the thermal engineers and scientists are left to depend only on historic records and mathematical simulations of an extremely large quantity of climate change drivers studied within several tens or hundreds of scientific disciplines. Still, many of the interrelations or feedbacks among climate phenomena remain quite uncertain or entirely unknown, so that many of the climate change drivers need to be identified and explaned to make humanity aware of all the risks to be prevented in fighting against climate change beyond repair. The paper lists the major known thermo-hydraulic and other drivers of the climate system and related open questions in the field of the thermal science as a pathway for further research and studies of thermal and hydaulic kind in the climate system.
Article
Full-text available
The 21st century presents formidable challenges, including climate change, energy supply, health and disease invasion, and the need for a sustainable environment. Among these, climate change poses a significant threat, particularly to biodiversity and food security, with narrowly adapted and endemic species being the most vulnerable. India, with its unique geography, history, and culture, is distinguished by its remarkable diversity of natural ecosystems. It is one of the twelve mega biodiversity hotspots globally, reflecting the breadth of its biological diversity. This study offers a comprehensive analysis of the impacts of climate change on four interconnected domains in India: agriculture, land resources, water resources, and biodiversity. Our findings reveal that rising temperatures, altered precipitation patterns, and the increased frequency of extreme weather events are significantly undermining India's agricultural productivity, which in turn poses serious risks to food security. Land degradation and desertification are accelerating in vulnerable regions, while water resources are under increasing pressure due to factors such as glacial retreat, shifting monsoon patterns, and heightened evapotranspiration rates. Furthermore, India's rich biodiversity is at risk, with shifts in species distribution and potential extinctions on the horizon. The study underscores the intricate interplay between these sectors, highlighting the urgent need for integrated adaptation strategies. We conclude by proposing a framework for policy interventions that address these challenges holistically. This framework emphasizes the importance of sustainable resource management, climate-resilient agricultural practices, and ecosystem conservation as essential strategies for mitigating the impacts of climate change on India's natural and agricultural systems.
Article
Full-text available
Over the last 20 years, we have dramatically improved hydrometeorological data including isotopes, but are we making the most of this data? Stable isotopes of oxygen and hydrogen in the water molecule (stable water isotopes – SWI) are well known tracers of the global hydrological cycle producing critical climate science. Despite this, stable water isotopes are not explicitly included in influential climate reports (e.g. Intergovernmental Panel on Climate Change, IPCC) except for paleoclimate reconstructions. Continuous developments in modelling approaches have now made isotope-enabled modelling of climate and hydrology more powerful and easier to perform, reducing prediction uncertainty and providing more robust simulations. We argue that it is time to incorporate stable water isotopes and isotope-enabled modelling into mainstream hydroclimatic forecasting with the prospect of vastly improving climate change predictions and evidence.
Article
Full-text available
Earthquake ruptures dynamically activate coseismic off‐fault damage around fault cores. Systematic field observation efforts have shown the distribution of off‐fault damage around main faults, while numerical modeling using elastic‐plastic off‐fault material models have demonstrated the evolution of coseismic off‐fault damage during earthquake ruptures. Laboratory‐scale microearthquake experiments have pointed out the enhanced high‐frequency radiation due to the coseismic off‐fault damage. However, the detailed off‐fault fracturing mechanisms, subsequent radiation, and its contribution to the overall energy budget remain to be fully understood because of limitations of current observational techniques and model formulations. Here, we constructed a new physics‐based dynamic earthquake rupture modeling framework, based on the combined finite‐discrete element method, to investigate the fundamental mechanisms of coseismic off‐fault damage, and its effect on the rupture dynamics, the radiation and the overall energy budget. We conducted a 2‐D systematic case study with depth and showed the mechanisms of dynamic activation of the coseismic off‐fault damage. We found the decrease in rupture velocity and the enhanced high‐frequency radiation in near field due to the coseismic off‐fault damage. We then evaluated the overall energy budget, which shows a significant contribution of the coseismic off‐fault damage to the overall energy budget even at depth, where the damage zone width becomes narrower. The present numerical framework for the dynamic earthquake rupture modeling thus provides new insights into earthquake rupture dynamics with the coseismic off‐fault damage.
Article
Full-text available
The Community Earth System Model Version 2 (CESM2) has an equilibrium climate sensitivity (ECS) of 5.3 K. ECS is an emergent property of both climate feedbacks and aerosol forcing. The increase in ECS over the previous version (CESM1) is the result of cloud feedbacks. Interim versions of CESM2 had a land model that damped ECS. Part of the ECS change results from evolving the model configuration to reproduce the long‐term trend of global and regional surface temperature over the twentieth century in response to climate forcings. Changes made to reduce sensitivity to aerosols also impacted cloud feedbacks, which significantly influence ECS. CESM2 simulations compare very well to observations of present climate. It is critical to understand whether the high ECS, outside the best estimate range of 1.5–4.5 K, is plausible.
Article
Full-text available
We outline a new and improved uncertainty analysis for the Goddard Institute for Space Studies Surface Temperature product version 4 (GISTEMP v4). Historical spatial variations in surface temperature anomalies are derived from historical weather station data and ocean data from ships, buoys, and other sensors. Uncertainties arise from measurement uncertainty, changes in spatial coverage of the station record, and systematic biases due to technology shifts and land cover changes. Previously published uncertainty estimates for GISTEMP included only the effect of incomplete station coverage. Here, we update this term using currently available spatial distributions of source data, state‐of‐the‐art reanalyses, and incorporate independently derived estimates for ocean data processing, station homogenization, and other structural biases. The resulting 95% uncertainties are near 0.05 °C in the global annual mean for the last 50 years and increase going back further in time reaching 0.15 °C in 1880. In addition, we quantify the benefits and inherent uncertainty due to the GISTEMP interpolation and averaging method. We use the total uncertainties to estimate the probability for each record year in the GISTEMP to actually be the true record year (to that date) and conclude with 87% likelihood that 2016 was indeed the hottest year of the instrumental period (so far).
Article
Full-text available
Plain Language Summary Tsunami hazard assessment is routinely based on assessing the impacts of long‐period waves generated by vertical seafloor motions reaching the coast tens of minutes after the earthquake in typical subduction‐zone environments. This view is inadequate for assessing hazard associated with strike‐slip earthquakes such as the magnitude 7.5 2018 Palu earthquake, which resulted in tsunami effects much larger than would normally be associated with horizontal fault motion. From an extraordinary collection of 38 amateur and closed circuit television videos we estimated tsunami arrival times, amplitudes, and wave periods at different locations around Palu Bay, where the most damaging waves were reported. We found that the Palu tsunamis devastated widely separated coastal areas within a few minutes after the mainshock and included unusually short wave periods, which cannot be explained by the earthquake fault slip alone. Post‐tsunami surveys show changes in the coastline, and this combined with video footage provides potential locations of submarine landslides as tsunami sources that would match the arrival times of the waves. Our results emphasize the importance of estimating tsunami hazards along coastlines bordering strike‐slip fault systems and have broad implications for considering shorter‐period nearly instantaneous tsunamis in hazard mitigation and tsunami early warning systems.
Article
Full-text available
Global warming in response to external radiative forcing is determined by the feedback of the climate system. Recent studies have suggested that simple mathematical models incorporating a radiative response which is related to upper- and deep-ocean disequilibrium (ocean heat uptake efficacy), inhomogeneous patterns of surface warming and radiative feedbacks (pattern effect), or an explicit dependence of the strength of radiative feedbacks on surface temperature change (feedback temperature dependence) may explain the climate response in atmosphere-ocean coupled general circulation models (AOGCMs) or can be useful for interpreting the instrumental record. We analyze a two-layer model with an ocean heat transport efficacy, a two-region model with region specific heat capacities and radiative responses; a one-layer model with a temperature dependent feedback; and a model which combines elements of the two-layer/region models and the state-dependent feedback parameter. We show that, from the perspective of the globally averaged surface temperature and radiative imbalance, the two-region and two-layer models are equivalent. State-dependence of the feedback parameter introduces a nonlinearity in the system which makes the adjustment timescales forcing-dependent. Neither the linear two-region/layer models, nor the state-dependent feedback model adequately describes the behavior of complex climate models. The model which combines elements of both can adequately describe the response of more comprehensive models but may require more experimental input than is available from single forcing realizations.
Article
Full-text available
A new release of the Max Planck Institute for Meteorology Earth System Model version 1.2 (MPI-ESM1.2) is presented. The development focused on correcting errors in and improving the physical processes representation, as well as improving the computational performance, versatility, and overall user friendliness. In addition to new radiation and aerosol parameterizations of the atmosphere, several relatively large, but partly compensating, coding errors in the model's cloud, convection, and turbulence parameterizations were corrected. The representation of land processes was refined by introducing a multilayer soil hydrology scheme, extending the land biogeochemistry to include the nitrogen cycle, replacing the soil and litter decomposition model and improving the representation of wildfires. The ocean biogeochemistry now represents cyanobacteria prognostically in order to capture the response of nitrogen fixation to changing climate conditions and further includes improved detritus settling and numerous other refinements. As something new, in addition to limiting drift and minimizing certain biases, the instrumental record warming was explicitly taken into account during the tuning process. To this end, a very high climate sensitivity of around 7 K caused by low-level clouds in the tropics as found in an intermediate model version was addressed, as it was not deemed possible to match observed warming otherwise. As a result, the model has a climate sensitivity to a doubling of CO 2 over preindustrial conditions of 2.77 K, maintaining the previously identified highly nonlinear global mean response to increasing CO 2 forcing, which nonetheless can be represented by a simple two-layer model.
Article
Full-text available
Earth system models are complex and represent a large number of processes, resulting in a persistent spread across climate projections for a given future scenario. Owing to different model performances against observations and the lack of independence among models, there is now evidence that giving equal weight to each available model projection is suboptimal. This Perspective discusses newly developed tools that facilitate a more rapid and comprehensive evaluation of model simulations with observations, process-based emergent constraints that are a promising way to focus evaluation on the observations most relevant to climate projections, and advanced methods for model weighting. These approaches are needed to distil the most credible information on regional climate changes, impacts, and risks for stakeholders and policy-makers.
Article
Full-text available
Estimating the equilibrium climate sensitivity (ECS; the equilibrium warming in response to a doubling of CO2) from observations is one of the big problems in climate science. Using observations of interannual climate variations covering the period 2000 to 2017 and a model-derived relationship between interannual variations and forced climate change, we estimate that ECS is likely 2.4–4.6 K (17–83% confidence interval), with a mode and median value of 2.9 and 3.3 K, respectively. This analysis provides no support for low values of ECS (below 2 K) suggested by other analyses. The main uncertainty in our estimate is not observational uncertainty but rather uncertainty in converting observations of short term, mainly unforced climate variability to an estimate of the response of the climate system to long-term forced warming.
Article
Full-text available
Model calibration (or “tuning”) is a necessary part of developing and testing coupled ocean–atmosphere climate models regardless of their main scientific purpose. There is an increasing recognition that this process needs to become more transparent for both users of climate model output and other developers. Knowing how and why climate models are tuned and which targets are used is essential to avoiding possible misattributions of skillful predictions to data accommodation and vice versa. This paper describes the approach and practice of model tuning for the six major US climate modeling centers. While details differ among groups in terms of scientific missions, tuning targets, and tunable parameters, there is a core commonality of approaches. However, practices differ significantly on some key aspects, in particular, in the use of initialized forecast analyses as a tool, the explicit use of the historical transient record, and the use of the present-day radiative imbalance vs. the implied balance in the preindustrial era as a target.