Regions of significant influence on unforced global mean surface air temperature variability in climate models
Abstract
We document the geographic regions where local variability is most associated with unforced global mean surface air temperature (GMT) variability in Coupled Model Intercomparison Project Phase 5 coupled global climate models (GCMs) at both the subdecadal and interdecadal timescales. For this purpose, Regions of Significant Influence on GMT are defined as locations that have a statistically significant correlation between local surface air temperature (SAT) and GMT (with a regression slope greater than 1), and where local SAT variation leads GMT variation in time. In both GCMs and observations, subdecadal timescale GMT variability is most associated with SAT variation over the eastern equatorial Pacific. At the interdecadal timescale, GMT variability is also linked with SAT variation over the Pacific in many GCMs, but the particular spatial patterns are GCM dependent, and several GCMs indicate a primary association between GMT and SAT over the Southern Ocean. We find that it is difficult to validate GCM behavior at the interdecadal timescale because the pattern derived from observations is highly depended on the method used to remove the forced variability from the record. The magnitude of observed GMT variability is near the ensemble median at the subdecadal timescale but well above the median at the interdecadal timescale. GCMs with a stronger subdecadal relationship between GMT and SAT over the Pacific tend to have more variable subdecadal GMT while GCMs with a stronger interdecadal relationship between GMT and SAT over parts of the Southern Ocean tend to have more variable GMT.
1 Introduction
Global mean surface air temperature (GMT) is one of the most recognized metrics of climate change on a range of timescales. Therefore, variability in GMT is of great interest to the scientific community as well as to the general public. Changes in GMT can result both from external radiative forcings [Bindoff et al., 2013] and from internally generated unforced variability [Hasselmann, 1976; Hawkins and Sutton, 2009; Leith, 1978]. External radiative forcings change GMT by imposing an energy imbalance at the top of the atmosphere and include both anthropogenic (e.g., changes in greenhouse gas concentration, aerosols, and land use) and natural (e.g., changes in solar radiation and volcanic aerosols) influences. Unforced variability in GMT emerges out of the internal dynamics of the ocean-atmosphere system, involving a vertical redistribution of heat within the climate system (e.g., the movement of heat between the ocean and the atmosphere) [Meehl et al., 2013; Trenberth et al., 2002] and/or a change in the Earth's top-of-atmosphere energy budget which effects the total amount of heat in the system [Brown et al., 2014; Palmer and McNeall, 2014].
It is well established that the observed centennial-scale warming of GMT since the industrial revolution has been primarily caused by external forcings and increased greenhouse gas concentrations in particular [Bindoff et al., 2013]. Equally well established is that nonseasonal variability in GMT, on subdecadal timescales, tends to be dominated by unforced variability [Brown et al., 2012; Easterling and Wehner, 2009; Hawkins and Sutton, 2009]. The primary exception to this is the response to a major volcanic eruption. At the interdecadal timescale (variability with characteristic timescales from ~10 to ~100 years), however, the magnitude of unforced GMT variability is still uncertain. This is exemplified in the ongoing debate concerning the relative importance of forced versus unforced influences on interdecadal variability in the rate of warming over the twentieth century (i.e., strong warming from the 1910s to 1940s, steady temperatures until the 1970s, strong warming until the late 1990s). Several studies have argued that this variability (superimposed on top of long-term forced warming) was a result of corresponding variability in external radiative forcings [Hansen et al., 2005; Stott et al., 2006, 2000; Wilcox et al., 2013; Wild, 2009]. Alternatively, these fluctuations may have been caused by changes in sign of unforced modes of variability internal to the climate system [Chylek et al., 2014; Crowley et al., 2014; England et al., 2014; Meehl and Teng, 2014; Swanson and Tsonis, 2009; Tung and Zhou, 2013; Wu et al., 2011].
Even within the literature that attributes twentieth century interdecadal GMT variation to unforced variability, there is substantial disagreement on what modes of variability (and thus geographic regions) are most associated with GMT changes. The two regions that are most often evoked as being particularly influential on interdecadal GMT variability are the North Atlantic Ocean and the Pacific Ocean. The North Atlantic is typically thought to influence GMT via the Atlantic Multidecadal Oscillation (AMO) [Kerr, 2000], and the Pacific Ocean is usually thought to influence GMT via low-frequency variability in the El Niño–Southern Oscillation (ENSO) [Vimont, 2005; Zhang et al., 1997], the Interdecadal Pacific Oscillation (IPO) [Power et al., 1999], or the Pacific Decadal Oscillation (PDO) [Mantua et al., 1997]. Most studies do not agree on the relative influence of these regions on GMT and tend to emphasize either the North Atlantic's influence [Chen and Tung, 2014; Chylek et al., 2014; DelSole et al., 2010; Delworth and Knutson, 2000; Hunt, 2011; Keenlyside et al., 2008; Kravtsov and Spannagle, 2008; McGregor et al., 2014; Muller et al., 2013; Schlesinger and Ramankutty, 1994; Semenov et al., 2010; Swanson et al., 2009; Tung and Zhou, 2013; Wu et al., 2011; Wyatt and Curry, 2013; Zhang et al., 2007] or the Pacific's influence [England et al., 2014; Huber and Knutti, 2014; Kosaka and Xie, 2013; Meehl et al., 2013; Meehl and Teng, 2014; Risbey et al., 2014; Trenberth and Fasullo, 2013].
- Do current GCMs act in accordance with the established finding that the eastern equatorial Pacific has the largest influence on unforced GMT variability on subdecadal timescales? Do GCMs accurately simulate the magnitude of unforced subdecadal GMT variability?
- Do GCMs lend more support to the notion that interdecadal variability in GMT is associated with surface air temperature (SAT) variability in the North Atlantic or the Pacific? (SAT and sea surface temperature anomalies are highly correlated at these timescales.) Are there other locations that influence interdecadal GMT variability in some GCMs? Do GCMs accurately simulate the magnitude of interdecadal GMT variability?
- What are the first-order physical processes that contribute to GMT variability in GCMs and is there any relationship between the magnitude of simulated GMT variability and the underlying modes associated with that variability?
The remainder of the paper is structured as follows: Section 2 gives the background and justification of a methodology (Regions of Significant Influence (ROSI)) that seeks to identify the locations that are most influential on unforced GMT variability. Section 3 discusses the observational and GCM data that the ROSI methodology is applied to. Section 4 presents the key results of the ROSI methodology, applied at both the subdecadal and interdecadal timescales. Section 5 discusses the primary findings inferred across the multimodel ensemble with an emphasis on first-order physical processes that may be responsible for the GCM-simulated variability. This section also analyzes the relationship between the underlying modes associated with GMT variability and the magnitude of simulated GMT variability across the model suite. Finally, a summary is presented in section 6.
2 Regions of Significant Influence on GMT in Observations and GCMs
Changes in GMT do not typically result from globally coherent variations in SAT. For example, it is well known that dynamics associated with the El Niño–Southern Oscillation (ENSO) can affect GMT [Foster and Rahmstorf, 2011; Privalsky and Jensen, 1995; Trenberth et al., 2002; Wigley, 2000], but this effect is heavily influenced by SAT anomalies in the central/eastern equatorial Pacific rather than emerging from a coherent, globally uniform change in SAT [Deser et al., 2010; Messié and Chavez, 2011; Trenberth et al., 2002]. Therefore, it is reasonable to expect that SAT variation (and by proxy, modes of variability) will be more or less associated with GMT change depending on the region.
In order to quantify the influence of various modes of variability on GMT, predefined climatic indices are sometimes compared to GMT via regression/correlation analysis. For example, Muller et al. [2013] performed cross-correlation analysis between various climate indices and land GMT, while others have used ENSO indices or the AMO as predictors of GMT in multiple linear regression analysis [Chylek et al., 2014; Foster and Rahmstorf, 2011; Zhou and Tung, 2012]. Such methodology can be problematic if the climate index does not fully capture the underlying physical mode's influence on GMT. For example, the AMO is sometimes defined as the detrended, basin mean, sea surface temperature of the Atlantic from 0°N to 60°N [Enfield et al., 2001]. However, temperatures do not necessarily vary coherently within this domain [Delworth et al., 2007; Deser et al., 2010; Zhang and Wang, 2013]. If only part of this domain has a strong association with GMT, then the act of averaging over the entire domain will obscure this relationship. This is especially true in the case of the IPO and PDO indices which have positive and negative temperature anomalies incorporated into their characteristic pattern [Mantua et al., 1997; Power et al., 1999]. Furthermore, an underlying physical mode may work to influence GMT through complex teleconnections and feedbacks that may not be represented by a given index.
- Local SAT and GMT had a statistically significant correlation (at 95% confidence level) at zero time lag, suggesting high coherence between a physical climatic mode at the given location and GMT. Statistical significance was calculated with a Monte Carlo method that accounted for persistence in the time series data (see supporting information).
- Local SAT change preceded GMT changes, suggesting that local dynamics were primarily influencing, rather than responding to, changes in the global climate. The lead-lag relationship was obtained by identifying the time lag with the highest cross-correlation value between the local SAT time series and the GMT time series with a maximum lag of ±6 months for the subdecadal timescale and ±24 months for the interdecadal timescale. Results are not overly sensitive to the maximum lag size (Figure S14 in the supporting information).
- The linear regression coefficient (at zero time lag) between local SAT and GMT (“a” in equation 1) was greater than 1, indicating that local SAT had larger-magnitude variability than GMT.
(1)This condition ensured that ROSIs would be regions where local SAT variability worked to enhance the magnitude of GMT variability. A regression coefficient less than 1, even with a perfect correlation, would imply that local SAT variability worked to reduce the magnitude of GMT variability. However, results are not overly sensitive to the inclusion/exclusion of this criterion in the ROSI definition (Figure S14).
The goal of imposing these constraints was to identify the locations over the globe that were most responsible for GMT variability while disregarding those locations that primarily responded to global variability or tended to damp GMT variability.
It should be noted that other definitions of ROSIs are possible and the present definition excludes certain variability that could justly be argued to be “influential” on GMT. For example, local modes of variability may lag GMT in time but still play a major role in maintaining a GMT anomaly. Additionally, even if a local mode leads GMT, it may lead by a longer amount of time than the maximum lag length imposed in our ROSI definition. For example, the North Atlantic Oscillation has been shown to lead Northern Hemisphere mean temperature by 15–20 years [Li et al., 2013]. In this case the ROSI would not identify the “original cause” of the large-scale variability but rather a more proximate cause of the large-scale variability.
3 Unforced Variability in GCMS and in Observations
In order to identify the geographic locations that have the largest association with unforced GMT variability, it was necessary to isolate the unforced component of variability from both observations and GCMs. For GCMs, we utilized preindustrial control runs that incorporated no external radiative forcings, and thus, all variability was necessarily unforced. We used 34 of these runs (first 2400 months, deseasonalized) from the Coupled Model Intercomparison Project Phase 5 (CMIP5) multimodel ensemble [Taylor et al., 2011] and removed any long-term model drift (via linear detrending at each global grid location) that may have been due to incomplete model spin-up.
For observations we used the monthly NASA Goddard Institute for Space Studies (GISS) Surface Temperature analysis [Hansen et al., 2010] from 1880 to 2013 because it provided the most extensive spatial coverage of the three most widely used instrumental data sets (the others being HadCRUT4 and NOAA Merged Land-Ocean Surface Temperature Analysis). In order to obtain unforced variability from the observational record, it was necessary to remove the component of variability that resulted from external radiative forcings. This was done in two separate ways, referred to below as Method (1) and Method (2).
In Method (1), unforced variability was obtained by subtracting the CMIP5 “Historical” (1900–2005) and “RCP 6.0” (2006–2013) experiment ensemble mean (representing externally forced variability) from the observational data set at each point in space and time (see Ting et al. [2009] for a detailed explanation of this method). Only those GCMs that participated in both the Historical and RCP 6.0 experiments were incorporated in the ensemble mean to avoid any discontinuities that might arise from different GCMs participating in each experiment. Method (1) presumes that the CMIP5 ensemble mean accurately represents the temporal and spatial structure of forced variability over the contemporary climate record. However, some studies have implied that the time-varying aerosol forcing in the CMIP5 historical experiment (which has a high degree of uncertainty [Hansen et al., 2011; Otto et al., 2013; Suzuki et al., 2013]) has been “overfit” to interdecadal variations in GMT over parts of the twentieth century [Chylek et al., 2014; DelSole et al., 2010; Tung and Zhou, 2013; Wu et al., 2011], and thus, it masks some interdecadal unforced variability over that time (“overfitting” masks unforced variability by mistakenly attributing it to time-varying external forcings). To allow for this possibility, we also obtain unforced variability in the observational record via Method (2), in which time series were detrended with empirical mode decomposition [Huang et al., 1998] at every grid point. This method is similar in concept to a linear detrend except that it allows for the slope of the secular component (representing forced change) to vary in time, and thus, it is better suited than linear detrending for isolating unforced variability [Wu et al., 2011, 2007].
Once the unforced variability was obtained both in GCMs and in observations, variations were decomposed into subdecadal and interdecadal components. The interdecadal component was identified with a low-pass filter (Locally Weighted Scatterplot Smoothing [Cleveland, 1979]) that passed variability on the timescale of 120 months (10 years) and longer (results are not particularly sensitive to the use of different low-pass filters or different cutoff timescales, see supporting information). The subdecadal component was defined as the residual of a given time series after the interdecadal component was subtracted. Grid level SAT time series as well as GMT time series were both decomposed into subdecadal and interdecadal components prior to subsequent analysis.
4 GCM-by-GCM ROSIs
4.1 Subdecadal Timescale Variability
As stated above, a variety of methodologies have established that ENSO variability has an impact on GMT at subdecadal timescales [Foster and Rahmstorf, 2011; Privalsky and Jensen, 1995; Trenberth et al., 2002; Wigley, 2000]. However, Banholzer and Donner [2014] recently suggested that central Pacific El Niño events (i.e., Dateline El Niño or El Niño Modoki events [Ashok et al., 2007; Trenberth and Stepaniak, 2001]) may have a much weaker influence on GMT than traditional eastern Pacific El Niño events. This result illustrates that there is still much to learn about the specific causes of unforced GMT variability, even at the subdecadal timescale, and thus, it is useful to document the regions most associated with subdecadal GMT variability in CMIP5 GCMs.
In all 34 GCMs, as well as in both observational estimates using different techniques to isolate unforced variability, there is a positive relationship between tropical Pacific SAT and GMT at the subdecadal timescale (Figure 1) and tropical Pacific SAT tends to lead GMT in time (Figure S1). Additionally, in every GCM but two (MRI-CGCM3 and inmcm4), and in both observational estimates, the ROSI extended to the west coast of South America which supports the notion that traditional, eastern Pacific ENSO events have the largest impact on GMT [Banholzer and Donner, 2014]. Most GCMs also had relatively large regression coefficients (a in equation 1) in the Polar Regions and in the tropical Atlantic and Indian Ocean. These regions, however, were characterized by SAT variability that tended to lag behind the GMT, and thus, they were not classified as ROSIs (Figure S1).

The standard deviations of the two observed subdecadal GMT time series were ranked seventeenth and eighteenth out of 36 (Figure 1), which suggests that there is not a systematic GMT magnitude bias in the multimodel ensemble (despite large biases from individual GCMs). Also, the fact that the two observational standard deviations were so similar (~0.124 K for both Method (1) and Method (2)) indicates that this result is relatively insensitive to the method used to isolate unforced variability.
4.2 Interdecadal Timescale Variability
The eastern equatorial Pacific has been identified as a primary influence on subdecadal timescale GMT in numerous other studies with independent methodologies. Thus, the agreement of the ROSI methodology with these previous findings (section 4.1) suggests that it may be useful for illuminating the locations most associated with unforced GMT variability on the interdecadal timescale as well.
Figure 2 shows the regression coefficients between local SAT and GMT as well as the ROSIs at the interdecadal timescale (lead-lag times are shown in Figure S2). The GCMs display a much more diverse range of patterns for the interdecadal timescale than for the subdecadal timescale. Additionally, ROSIs tend to be more spatially heterogeneous and dispersed across the globe instead of being concentrated exclusively in a single region.

Despite the large diversity of patterns apparent in Figure 2, only a few GCMs highlight the North Atlantic as a large ROSI (MIROC5, inmcm4, and GISS-E2-R-p3), although many show positive regression coefficients there: ACCESS1-0, MIROC5, IPSL-CM5B-LR, CNRM-CM5, CSIRO-Mk3-6-0, GFDL-ESM2M, GISS-E2-R-p3, IPSL-CM5A-LR, inmcm4, HadGEM2-CC, GFDL-ESM2-G, CMCC-CMS, and IPSL-CM5A-MR. Many GCMs with large positive regression coefficients in the North Atlantic are not identified as ROSIs because they tend to lag, rather than lead GMT change (Figure S2).
It is interesting to note, however, that even if the North Atlantic is not identified as a ROSI, dynamics in this region may affect unforced GMT variability through global teleconnections. In particular, relative warmth of the North Atlantic has been suggested to intensify the Pacific Walker Circulation and induce a negative IPO pattern in the Pacific [Chikamoto et al., 2012; Hong et al., 2013; Kucharski et al., 2011; McGregor et al., 2014] which, in turn, is usually associated with unforced GMT cooling [Huber and Knutti, 2014; Kosaka and Xie, 2013; Meehl et al., 2011, 2013; Risbey et al., 2014; Trenberth and Fasullo, 2013]. Under this paradigm, we may expect to see negative regression coefficients between North Atlantic SAT and GMT because a relatively cool North Atlantic should induce a weaker Pacific Walker Circulation, warmer central/eastern Pacific SATs, and warmer GMT overall. Several GCMs produce patterns reminiscent of this description (bcc-csm1-1, CMCC-CESM, CanESM2, and NorESM1-MECESM1-BGC) but the majority do not.
Most GCMs do indicate a large ROSI over the equatorial Pacific Ocean (ACCESS1-0, MIROC5, CNRM-CM5, CESM1-CAM5, CMCC-CESM, CSIRO-Mk3-6-0, MPI-ESM-PGMT, MIROC-ESM, IPSL-CM5A-MR, MPI-ESM-LR, MIROC-ESM-CHEM, ACCESS1, CanESM2, NorESM1, CESM1-WACCM, NorESM1, CESM1-BGC, CCSM4, CESM1-FASTCHEM, and inmcm4) which supports the results of a number of recent studies [Huber and Knutti, 2014; Kosaka and Xie, 2013; Meehl et al., 2011, 2013; Risbey et al., 2014; Trenberth and Fasullo, 2013]. However, some GCMs have very few ROSIs in the Pacific (or North Atlantic) and instead have ROSIs scattered across the globe (HadGEM2-CC and GFDL-ESM2G) or concentrated over the Southern Ocean/Antarctica (bcc-csm1-1-m, GFDL-CM3, bcc-csm1-1, GISS-E2-H-p3, MRI-CGCM3, and GISS-E2-H-p2). The diversity of patterns across the GCMs indicates that the results obtained from any single-model study should be interpreted with caution.
Ideally, it would be possible to robustly assess GCM performance by comparing the regression patterns in Figure 2 to observations. Unfortunately, this task is challenging because the observational regression pattern and ROSI distribution are highly sensitive to the method used to isolate unforced variability at the interdecadal timescale (red-framed panels in Figure 2). When Method (1) is used, the Pacific's SAT field stands out as being particularly influential on unforced GMT; however, when Method (2) is used, the North Atlantic's SAT field appears to be more associated with GMT. The difference between the two methods is also quite apparent in the GMT time series (red-framed panels in Figure 3). Method (2) produces a large downward fluctuation from the early 1940s to the late 1970s and a subsequent upward fluctuation until the mid-2000s, consistent with the unforced oscillation identified in Swanson et al. [2009], Tung and Zhou [2013], and Wu et al. [2011]. Conversely, Method (1) produces very little interdecadal unforced variability throughout the middle of the twentieth century and then produces a large downward trend from the late 1990s to the present which corresponds to the recent GCM overestimation of global warming [Fyfe et al., 2013]. The disagreement between the two methods exemplifies the uncertainty of whether this interdecadal SAT variability, particularly over the North Atlantic region, can be attributed to time-varying aerosol forcing [Booth et al., 2012; Mann et al., 2014] or to unforced variability [Chylek et al., 2014; Ting et al., 2009; Tung and Zhou, 2013; Wu et al., 2011; Zhang et al., 2013]. In particular, Method (1) attributes more twentieth century North Atlantic SAT variability to external aerosol forcings because of the region's proximity to strong aerosol emissions in the GCMs. However, it is possible that the aerosol forcing in some GCMs may be overfit to observations, and thus, Method (1) may mistake some unforced variability for forced variability [Zhang et al., 2013]. This remains an unsettled issue, and the purpose of the present work is not to resolve it. Instead, we simply note that observational estimates of unforced interdecadal variability are highly sensitive to the method used to remove the forced component of variability. Additionally, the modeled regression patterns and ROSI distributions over the Southern Ocean and Antarctica are very difficult to compare to observations due to lack of data coverage until the latter part of the twentieth century. These factors combine to make it exceedingly difficult to assess the performance of the GCMs at this timescale. Furthermore, there is little difference between simulated interdecadal variability between GCMs that closely matched observed subdecadal variability and GCMs that did not (Figure S3) indicates that subdecadal timescale skill is a poor constraint on interdecadal timescale features.

Despite the differences in the phasing of the observed unforced interdecadal GMT variability between Method (1) and Method (2), Figure 3 indicates that the magnitude of unforced GMT variability is relatively consistent. The two observational GMT time series had similar standard deviations (0.09 K for Method (1) and 0.1 K for Method (2)) and were ranked third and fourth out of 36. This suggests that the simulated interdecadal GMT variability in the CMIP5 multimodel ensemble may be biased low on average (this finding is also supported by the spectral comparison shown in Figure S4). In particular, the low bias appears to be a due mostly to reduced interdecadal variability simulated over most of the world oceans relative to observations (Figure S6). It should be noted, however, that the recent downward fluctuation apparent in the Method (1) time series might be inflated if the increase in radiative forcings has been overestimated over the past decade [Fyfe et al., 2013; Schmidt et al., 2014].
5 Patterns Across the Multimodel Ensemble
5.1 Multimodel Mean Regression Patterns
Figure 4 shows the GCM mean regression relationship between local SAT and GMT at both the subdecadal and interdecadal timescales (Figures 4a–4c) as well as the cumulative spatial distribution of ROSIs (i.e., the number of GCMs that identified a given grid location as a ROSI) (Figures 4d–4f). For the interdecadal timescale, there was sufficient diversity among SAT versus GMT regression patterns (Figure 2) to support separating the GCMs into categories prior to averaging across models. Separating the GCMs into two categories with k-means clustering [Mucha, 1986] was useful in this regard as it divided the GCMs with a large Southern Ocean emphasis in their SAT versus GMT regression pattern from those with more of an emphasis on the Pacific Ocean (the cluster assignment for each GCM is shown in Figure 2). In order to gain insight on the underlying physical processes responsible for the simulated unforced GMT variability, GCM mean maps of anomalous sea level pressure (SLP, Figures 4g–4i), surface upward energy flux (Qsfc, Figures 4j–4l), and top-of-atmosphere downward radiative flux (QTOA, Figures 4m–4o) regressed against GMT are also shown (a positive sign for either Qsfc or QTOA indicates anomalous energy flux into the atmosphere). SLP is useful for assessing how large-scale atmospheric circulation and mass change with GMT, while Qsfc and QTOA give insight into the geographic locations of anomalous energy flow into and out of the atmosphere associated with GMT variation. In order to identify the most robust patterns, we focus here on the GCM mean results across the three categories (Subdecadal Timescale, Interdecadal Timescale Cluster 1, and Interdecadal Timescale Cluster 2).

5.1.1 Subdecadal Timescale
At the subdecadal timescale, the central equatorial Pacific represents a particularly influential location on GMT (Figures 4a and 4d). GMT tends to be anomalously high (low) when there is anomalously high (low) SLP over the western Pacific Ocean and Indian Ocean (Figure 4g). Such a SLP pattern is associated with weakened (strengthened) easterly equatorial trade winds and an anomalously warm (cool) eastern Pacific due partly to decreased (increased) upwelling from below the thermocline as well as the advection of anomalously warm (cool) surface water from the western (eastern) Pacific [Neelin et al., 1998].
The relationship between Qsfc in the equatorial Pacific and GMT indicates that a large portion of unforced variability in GMT can be traced to the anomalous sequestration (La Niña) and release (El Niño) of heat over the equatorial Pacific Ocean (Figure 4j). There is also a large positive Qsfc anomaly in the north-central Pacific on subdecadal timescales which comes about because El Niño (La Niña) events are associated with enhanced (reduced) convection at the equator [Trenberth et al., 1998] and a quasi-stationary Rossby wave response that causes a stronger (weaker) Aleutian Low. The change in the Aleutian Low strength implies anomalous northerly (southerly) winds over the north-central Pacific and thus anomalously positive (negative) Qsfc [Alexander, 1990; Alexander et al., 2002; Emery and Hamilton, 1985; Lau and Nath, 1994; Wang et al., 2012]. The opposite is the case along the west coast of North America, which gives rise to the “horseshoe-like” dipole pattern in North Pacific SAT (Figure 4a). Positive (negative) GMT anomalies are also associated with anomalously positive (negative) downward QTOA imbalances over the equatorial Pacific (Figure 4m). These imbalances are likely due to the ENSO shortwave heat flux feedback [Bellenger et al., 2013; Bony et al., 1997] in the eastern equatorial Pacific and changes in outgoing longwave radiation (due to changes in convection and atmospheric water vapor) in the western equatorial Pacific (Figure S7). These anomalous imbalances are partially offset by anomalies of the opposite sign over the subsident regions of the circulation in the subtropical Pacific in both hemispheres [Pierrehumbert, 1995].
5.1.2 Interdecadal Timescale
At the interdecadal timescale, Cluster 1's SAT regression pattern (Figure 4b) and ROSI distribution (Figure 4e) are suggestive of a large influence from low-frequency ENSO variability and/or the IPO on GMT. For Cluster 1, anomalously high (low) SLP over the western Pacific and Indian Ocean (Figure 4h) implies relaxed (enhanced) easterly trade winds which can cause the circulation of the Pacific shallow overturning cells to weaken (strengthen), leading to anomalously low (high) subsidence of warm water [England et al., 2014; Meehl et al., 2013] and anomalously high (low) Qsfc over the western equatorial Pacific (Figure 4k). For Cluster 2, the SAT regression pattern (Figure 4c) and ROSI distribution (Figure 4f) are suggestive of a large influence from low-frequency variability over the Southern Ocean. The SLP anomalies over much of the midlatitudes are of the same sign but less intense than in Cluster 1 (Figure 4i), except over the Southern Ocean and Antarctica where the positive relationship between SLP (e.g., the Southern Annular Mode) and GMT is much stronger.
High (low) GMT in both interdecadal clusters is associated with anomalously high (low) Qsfc over parts of the Southern Ocean (Figures 4k and 4i). For Cluster 1, this may be due to a teleconnection between the sea ice extent over the Pacific-facing Southern Ocean and interdecadal ENSO variability [Okumura, 2013; Yuan, 2004]. For Cluster 2, the large magnitude and spatial extent of the Qsfc versus GMT regression pattern over the Southern Ocean suggest that dynamics involving variability in Antarctic Bottom Water formation and sea ice feedbacks may play a critical role in GMT variability in these GCMs [Latif et al., 2013; Martin et al., 2013].
Both Cluster 1 and Cluster 2 appear to receive at least some GMT variability from the Atlantic Ocean. In particular, when GMT is high (low), the North Atlantic Oscillation [Hurrell, 1995] tends to be positive (negative) (Figures 4h and 4i), and Qsfc over the subpolar North Atlantic tends to be anomalously positive (negative), indicating a large anomalous flux of energy into (out of) the atmosphere (Figures 4k and 4i). This would suggest that high (low) GMT occurs during periods of high (low) oceanic heat convergence associated with corresponding changes in the Atlantic Meridional Overturning Circulation (AMOC) [Gulev et al., 2013]. However, since there is not a significant ROSI count over the subpolar North Atlantic (Figures 4e and 4f), the AMOC influence on GMT in these GCMs may be nonlocal (i.e., the AMOCs influence may not be communicated through its effect on local SAT). This is consistent with the finding that variability in the AMOC is not independent from IPO variability in at least one GCM [Meehl et al., 2013].
Anomalous QTOA flux also plays a role in interdecadal GMT variability for both clusters. Specifically, anomalously high (low) GMT is associated with anomalously positive (negative) QTOA flux over the central and eastern Pacific in Cluster 1 (Figure 4n). The eastern equatorial Pacific is characterized by large-scale atmospheric subsidence, relatively cool surface temperatures (for the latitude) and marine stratiform clouds [Klein and Hartmann, 1993] that have a net cooling effect on the surface [Hartmann et al., 1992]. Therefore, a decrease (increase) in these clouds due to SAT warming (cooling) and increased (decreased) convection should cause a positive (negative) QTOA anomaly [Sun et al., 2003; Wallace et al., 1989] which can enhance (suppress) the initial SAT anomaly (Figure S7). In the western Pacific, SAT warming (cooling) is associated with increased (decreased) deep convective towers and atmospheric water vapor that can reduce (increase) outgoing longwave radiation sufficiently to cause a positive (negative) QTOA anomaly (Figure S7). Anomalously high GMT is also associated with anomalously positive QTOA over the Weddell Sea in both clusters (more so in Cluster 2 (Figure 4o) than in Cluster 1 (Figure 4n)). This provides evidence that many GCMs may have positive feedbacks on their unforced GMT variability that involve sea ice [Drijfhout et al., 2013; Latif et al., 2013].
5.2 Relationship Between the Magnitude of GMT Variability and Its Underlying SAT, SLP, Qsfc, and QTOA Regression Patterns
Figures 1 and 2 are ordered in terms of the standard deviation of GMT at their respective timescales. From visual inspection, GCMs with more variable GMT tend to have different SAT versus GMT regression patterns than GCMs with less variable GMT. For example, at the subdecadal timescale (Figure 1), several of the GCMs with the most variable GMT (top left of Figure 1) have a higher positive SAT versus GMT regression coefficient over the tropical eastern Pacific and a more negative SAT versus GMT regression coefficient over the midlatitude western Pacific in both hemispheres. This leads to the supposition that the magnitude of a GCM's GMT variability may be related to its underlying SAT versus GMT regression pattern. In order to investigate this, we regressed the local SAT versus GMT pattern for each GCM against the corresponding standard deviation of each GCM's GMT. Regression coefficients were calculated across GCM space in a manner similar to an analysis performed by Soden and Vecchi [2011]. The intermodel regression slopes are mapped in Figure 5 along with analogous calculations applied to the regression coefficients of SLP versus GMT, Qsfc versus GMT, and QTOA versus GMT.

The subdecadal (SAT versus GMT) versus STDEV(GMT) plot (Figure 5a) indicates that GCMs with more variable GMT tend to have a more emphasized ENSO-like pattern with larger SAT versus GMT regression coefficients in the tropical Pacific as well as over Alaska and the southeast Pacific. Additionally, GCMs with more variable subdecadal GMT tend to have smaller SAT versus GMT regression coefficients over much of North America, Eurasia, and Antarctica. This indicates that there is less land and polar amplification of ENSO SAT response in GMCs with more variable GMT. On the other hand, at the interdecadal timescale, GCMs with more variable GMT tend to have a less emphasized ENSO/IPO SAT versus GMT regression pattern (Figure 5b). Instead, GCMs with more variable GMT have a much stronger SAT versus GMT regression slope over the Weddell Sea and the Indian and Atlantic sectors of the Southern Ocean.
The subdecadal (SLP versus GMT) versus STDEV(GMT) pattern also indicates that ENSO-like variability is emphasized more in GCMs with large GMT variance. In particular, SLP tends to decrease more in the eastern Pacific and increase more over the Indian Ocean per degree of GMT warming (Figure 5c) for GCMs with more variable GMT. On the other hand, the opposite pattern is found at the interdecadal timescale where GCMs with more variable GMT tend to have a less-positive regression coefficient between SLP and GMT over the Indian Ocean and a less-negative regression coefficient over the eastern Pacific (Figure 5d).
At the subdecadal timescale, the (Qsfc versus GMT) versus STDEV(GMT) plot (Figure 5e) indicates that larger anomalous heat fluxes over the tropical Pacific and northwest Pacific (per degree of GMT change) are associated with more variable GMT. Additionally, the subdecadal (QTOA versus GMT) versus STDEV(GMT) (Figure 5g) looks similar to the QTOA versus GMT pattern (Figure 4m) which indicates that larger positive TOA feedbacks in the central equatorial Pacific are associated with more variable GMT.
At the interdecadal timescale, the (Qsfc versus GMT) versus STDEV(GMT) and the (QTOA versus GMT) versus. STDEV(GMT) (Figures 5f and 5h) are relatively noisy, and there are only a few locations with statistically significant intermodel regressions. It is the case that more QTOA and Qsfc flux (per degree of GMT change) over parts of the Southern Ocean near Antarctica is associated with more variable GMT. In particular, GCMs with more variable GMT tend to have larger changes in absorbed shortwave radiation in these regions (Figure S8), implicating positive feedbacks involving sea ice. This may imply that GCMs that simulate the type of variability described in Latif et al. [2013] and Martin et al. [2013] are more likely to produce large-magnitude interdecadal GMT variability. More variable GMT is also associated with less QTOA flux (per degree of GMT) over the tropical eastern Pacific, which further supports the notion that GCMs with more variable GMT tend to have less of an emphasis on the IPO and more of an emphasis on dynamics in the Southern Ocean.
6 Summary
We have performed a regression analysis between local SAT and GMT variability in both GCMs and observations in order to document the geographic locations that are most associated with unforced GMT variability at both the subdecadal and interdecadal timescales. We defined Regions of Significant Influence (ROSIs) on GMT which are locations that had statistically significant SAT versus GMT correlations, had SAT that led GMT in time, and had a SAT versus GMT regression coefficient greater than 1 meaning that local SAT variability tended to enhance rather than damp GMT variability.
This study confirms that almost all CMIP5 GCMs act in accordance with the established finding that subdecadal timescale GMT variability is associated with ENSO activity. In particular, GMT variability tends to be associated with equatorial Pacific SAT anomalies that extend to the west coast of South America supporting recent observationally based findings [Banholzer and Donner, 2014]. It was shown that the anomalous energy necessary to change subdecadal GMT comes primarily from the tropical and northern Pacific Ocean (with contributions from positive feedbacks at the top of the atmosphere in the equatorial Pacific). Additionally, the magnitude of GCM-simulated subdecadal variability was not systematically biased (i.e., there were roughly an equal number of GCMs that overestimated the magnitude as underestimated the magnitude compared to observations). At the subdecadal timescale, higher amplitude ENSO-like spatial patterns (per degree of GMT change) were shown to be associated with more variable GMT across the GCMs.
On the interdecadal timescale, the CMIP5 GCMs are relatively inconsistent in the spatial patterns that are most responsible for variability in GMT, and thus, any results from single-model studies should be interpreted with caution. Overall, most GCMs display a ROSI pattern that emphasizes the Pacific Ocean and is reminiscent of the Interdecadal Pacific Oscillation (IPO). The North Atlantic (i.e., AMO) does not stand out as being particularly influential on GMT in most GCMs. Interestingly, SAT over the Southern Ocean especially near the Weddell Sea does have a strong association with GMT variability in many GCMs. Interdecadal GMT variability tends to be enhanced due to anomalous heat flux at the ocean surface (over the western equatorial Pacific, the Southern Ocean, and the subpolar North Atlantic) as well as at the top of the atmosphere (over the equatorial and eastern Pacific, over the Weddell Sea, and over Baffin Bay and the Greenland Sea).
For interdecadal variability, observational regression patterns between SAT and GMT as well as the ROSI distribution were very sensitive to the method used to remove the forced variability from the record, making it difficult to validate the GCMs. The magnitude of unforced interdecadal GMT variability in observations, however, was relatively insensitive to the method used to remove the forced variability. The magnitude of observed unforced GMT variability was larger than in 32 GCMs (out of 34), which indicates that the GCM ensemble may systematically underestimate unforced GMT variability at the interdecadal timescale. GCMs with more variable GMT tend to have less of an emphasis on their tropical Pacific SAT versus GMT pattern and more of an emphasis on their Southern Ocean SAT versus GMT pattern indicating that GCMs with more vigorous Southern Ocean variability may be better able to simulate the correct magnitude of GMT variability.
Acknowledgments
We would like to acknowledge the World Climate Research Programme's Working Group on Coupled Modelling, which is responsible for CMIP, and we thank the climate modeling groups for making their model's (listed in the panels in Figures 1-3) output available for analysis. The CMIP5 data used for this study can be downloaded at http://cmip-pcmdi.llnl.gov/cmip5/data_portal.html. We acknowledge NASA Goddard Institute for Space Studies Surface Temperature Analysis (GISTEMP) for making their data available for download at http://data.giss.nasa.gov/gistemp/. We would also like to acknowledge Heidi Winner for providing editorial services. This work was partially supported by NSF grant AGS-1147608.





