Robust comparison of climate models with observations using blended land air and ocean sea surface temperatures
This article was corrected on 29 FEB 2016. See the end of the full text for details.
Abstract
The level of agreement between climate model simulations and observed surface temperature change is a topic of scientific and policy concern. While the Earth system continues to accumulate energy due to anthropogenic and other radiative forcings, estimates of recent surface temperature evolution fall at the lower end of climate model projections. Global mean temperatures from climate model simulations are typically calculated using surface air temperatures, while the corresponding observations are based on a blend of air and sea surface temperatures. This work quantifies a systematic bias in model‐observation comparisons arising from differential warming rates between sea surface temperatures and surface air temperatures over oceans. A further bias arises from the treatment of temperatures in regions where the sea ice boundary has changed. Applying the methodology of the HadCRUT4 record to climate model temperature fields accounts for 38% of the discrepancy in trend between models and observations over the period 1975–2014.
1 Introduction
Climate model projections of the global mean temperature response to future greenhouse gas emissions provide an important basis for decision making concerning mitigation and adaptation to climate change. However, model projections are subject to uncertainty in the size of the temperature response, which arises primarily from the scale of the amplifying effect of the cloud feedback and the temporal evolution of climate forcings [Flato et al., 2013; Andrews et al., 2012; Sherwood et al., 2014]. Comparison of model projections against the observed rate of warming over recent decades can provide a test of the ability of models to simulate the transient evolution of climate. The comparison is complicated by the need to accurately simulate changes in atmospheric composition and solar radiation, as well as accounting for the unforced variability of the climate system [Schmidt et al., 2014]. The fact that the observations fall at the lower end of the envelope of model simulations over the last decade has led to suggestions that climate model forecasts may overestimate the potential future warming resulting from increasing greenhouse gas concentrations [Fyfe et al., 2013].
Observational records of global mean surface temperature are typically determined from air temperature measurements on land, blended with sea surface temperature (SST) observations measured in the top few meters of the ocean [Morice et al., 2012; Kennedy et al., 2011a]. Temperature records may be based on spatially incomplete data [Morice et al., 2012; Vose et al., 2012] or on data that have been infilled to provide an estimate of the global mean temperature [Hansen et al., 2010; Rohde et al., 2013; Cowtan and Way, 2014]. Observations of temperature are typically converted into anomalies (i.e., changes with respect to some baseline period) to allow observations from different environments to be meaningfully combined.
A homogenous global temperature record would ideally be based on a property which is independent of the surface type (land, ocean, or ice), such as air temperatures at a uniform height above the surface. However, sea surface temperature observations have historically been used in preference to marine air temperatures due to inhomogeneities in older marine air temperature data sets [Kent et al., 2013]. Infilled temperature records typically extrapolate air temperatures over sea ice, because the insulating effect of ice and snow isolates the air from the water [Kurtz et al., 2011], an approach which is supported by observations [Rigor et al., 2000], atmospheric reanalyses [Simmons and Poli, 2014], and satellite data [Comiso and Hall, 2014].
Global averages of the observational temperature records are typically compared to near‐surface air temperature from an ensemble of climate model simulations (e.g., Intergovernmental Panel on Climate Change Fifth Assessment Report Working Group 1, Figure 9.8 [Flato et al., 2013]). When comparing against spatially incomplete records, the model temperature fields may be masked to reduce coverage to match the observations or make the assumption that the observed regions are representative of the unobserved regions. This assumption may not hold for the last two decades of accelerated Arctic warming [Simmons and Poli, 2014; Saffioti et al., 2015]. Although in some cases the model simulations were masked for coverage, most studies have used the surface air temperature field from models rather than blended land‐ocean temperatures, with the notable exception of Marotzke and Forster [2015] and some attribution studies, e.g., Knutson et al.[2013].
A true like‐with‐like comparison would involve blending the air and sea surface temperature fields from the models in a manner consistent with the observational records. The purposes of this work are to evaluate the impact of comparing air temperatures from models with the blended observational data and to establish guidelines for the determination of blended temperature comparisons. These require changes both in the way global mean temperature from models is evaluated and ideally also in the preparation of blended observational data sets.
2 Data and Methods
The impact of using blended temperatures was evaluated for climate model simulations from the Coupled Model Intercomparison Project Phase 5 (CMIP5) archive [Taylor et al., 2012] using a combination of the historical and Representative Concentration Pathway 8.5 (RCP8.5) emissions scenarios. The calculation of a gridded blended temperature record requires the surface air temperature (“tas” in CMIP5 nomenclature), sea surface temperature (“tos”), sea ice concentration (“sic”), and the proportion of ocean in each grid cell (“sftotf”). After eliminating incompatible data sets (Figure S1 in the supporting information), there were 84 useable model runs from 36 models. The Climate Data Operators software package (available at http://www.mpimet.mpg.de/cdo,version1.6.8) was used to convert all fields onto a standard 1 × 1° grid, using distance‐weighted interpolation to avoid the loss of coverage when interpolating fields containing missing values (however, similar results were obtained using nearest‐neighbor interpolation or the native ocean grids).
For each model simulation, a global mean temperature series is calculated from the unblended surface air temperature field for comparison. A blended temperature field is then calculated using the air and sea surface temperature fields, using the land mask and sea ice concentration. In the blended temperature field, the air temperature for the whole grid cell is used as an estimate of the air temperature over land and sea ice, while the sea surface temperature is used for the proportion of the cell occupied by open water. Ideally, there would be separate simulated estimates for air temperature over land and ocean in fractional grid boxes, but these are not standard diagnostics in the CMIP5 models. The blended temperature field, Tblend, therefore takes the following form:
(1)If a sea surface temperature or sea ice concentration cell is missing (e.g., for the Commonwealth Scientific and Industrial Research Organisation model sea surface temperatures are missing for ice cells), wair is set to 1.0, ensuring that the blended temperature matches the air temperature. The difference between the latitude‐weighted global means of the blended temperature and of the unblended air temperature provides a measure of the bias in the model‐observation comparison.
Implicit assumptions in the implementation of the blending calculation may influence the results; therefore, three possible variants of the calculation were investigated:
- The calculation may be performed over the whole globe, or alternatively, the fields may be masked to reduce coverage to that of the observational data. The full coverage calculation provides a measure of the bias in a comparison with an infilled record, while the masked calculation provides a measure of the bias in a comparison with an incomplete coverage data set such as HadCRUT4 [Morice et al., 2012].
- The calculation may be performed using absolute temperatures, which are output by the climate model runs, or using temperature anomalies which are conventionally used for blending in the case of the observational record. In the latter case, anomalies are calculated with respect to the period 1961–1990 for consistency with HadCRUT4.
- The blending calculation can be performed using the monthly varying sea ice cover or a fixed sea ice coverage in order to isolate any confounding effects due to the change of a grid cell from ice to open water. For the fixed sea ice case, sea surface temperatures are only used for grid cells for which the sea ice concentration is zero for the corresponding month of every year from 1961 onward. In this case the remaining grid cells are considered 100% sea ice and thus take the same value as in the unblended case.
These three options can be employed in any combination. The differences between the air‐temperature‐only calculation and two variants of the blended calculation (absolute versus anomaly based) are illustrated in Figure 1.

One further method was implemented with the aim of providing a better comparison to the HadCRUT4 temperature data. This requires reproducing the HadCRUT4 algorithm, the coarse HadCRUT4 grid, and the coverage of observations within each large grid cell. The steps are as follows:
- The air and sea surface temperatures are converted to anomalies using the HadCRUT4 baseline period (1961–1990).
- The air temperatures are masked to include only grid cells containing a nonzero land fraction.
- Sea surface temperatures are masked to include only cells with no more than 5% sea ice. While the HadCRUT4 calculation does not explicitly take sea ice into account, observations from ships and buoys are confined to open water.
- The remaining air and sea temperatures in each cell of the coarse 5 × 5° grid used by HadCRUT4 are averaged, omitting any values excluded by the previous steps.
- The air and sea temperatures are masked to match the coverage of the air and sea temperatures in the HadCRUT4 data, respectively.
- The temperatures are then blended: cells containing only an air or sea temperature take that value; otherwise, the air and sea temperatures are blended according to the land fraction for the grid cell. (As with HadCRUT4, the land fraction is bounded by a minimum value of 0.25 for coastal cells so that air temperature observations on small islands are not eliminated.)
- Following the HadCRUT4 convention, the global mean temperature is calculated from the mean of the cosine‐weighted hemispheric means.
Improved compatibility between the model‐derived temperatures and the observational data is achieved at a cost of complexity and of producing a set of model results which are only comparable to a specific observational data set.
3 Results
The difference between the global mean blended temperature and the global mean air temperature was determined for 36 CMIP5 models with 84 historical/RCP8.5 simulations, using global data (i.e., no coverage mask) and blending absolute temperatures with a variable sea ice boundary (Figure 2). The blended temperatures show consistently less change than air temperature, with blended temperatures lower than air temperatures over recent decades. Over the period 2009–2013 the difference between multimodel global mean blended and air temperatures is 0.031 ± 0.010°C (1σ) relative to 1961–1990, and this difference is estimated to increase in magnitude with time to 0.17 ± 0.04°C by the year 2100.

The effect is broadly similar in magnitude across all the models both during the historical period and over the 21st century with the exception of the Beijing Climate Centre model, “bcc‐csm.” The different behavior of the “bcc‐csm” model appears to arise from surface air temperature being almost equal to the skin temperature (“ts” in the CMIP5 nomenclature) in that model alone (Figure S2). Preindustrial control simulations were examined (where available) to determine whether model drift due to nonequilibrium initial conditions contributes to the difference between air and sea surface temperatures. In every case the difference between the blended and air temperature trends at the end of the control run was at least an order of magnitude smaller than the effect identified here (Figure S3).
The mean difference across all models between the global mean blended and global mean air temperatures was compared for the previously described variants of the blending calculation and for the HadCRUT4 method (Figure 3). The difference between the blended and air temperatures is greater when using anomalies (as in the observational record) than when using absolute temperatures. The reason arises from changes in the ice edge. As ice melts, grid cells switch from taking air temperatures to taking sea surface temperatures. When blending anomalies, the temperature anomaly is determined with respect to a period in the past when air temperatures over the ice were lower, while the sea surface temperatures under the ice (constrained by the freezing point of seawater) are unchanged. Thus, the transition from air temperature anomaly (which is warmer than the baseline period) to sea surface temperature anomaly (which is roughly the same as during the baseline period) introduces a cool bias at the point when the ice melts (Figure S4).

When blending is performed using absolute temperatures, the blended temperature change is consistently around 95% of the air temperature change, for both the RCP8.5 scenario and the RCP4.5 scenario where temperatures have largely stabilized by 2100 (Figure S5). When blending is performed using temperature anomalies, the blended temperature change is reduced to about 92% of the air temperature change for the RCP8.5 scenario. The role of ice melt in the difference between blending absolute temperatures and temperature anomalies is confirmed by fixing the sea ice coverage; in this case both absolute and anomaly calculations give identical results (although the impact of blending is now underestimated due to the omission of large regions of formerly ice‐covered ocean).
Masking the model data to match the HadCRUT4 observations reduces the difference between the global mean blended and air temperatures slightly when using anomalies and increases it slightly when using absolute temperatures. This behavior arises from the change in sign of the difference between the blended and air temperatures in ice melt cells between the anomaly and absolute cases (Figure S6).
When emulating the HadCRUT4 method, the difference between the air and blended temperatures is marginally greater than the result from the masked blended anomaly calculation. The difference arises primarily from the handling of ice edge cells. The coarse 5 × 5° grid of the HadCRUT4 also contributes to spreading the effective area over which the ice edge plays a role.
The differences between the air and sea surface temperature changes are small compared to the uncertainties and bias corrections in the sea surface temperatures [Kennedy et al., 2011b, 2011a], and so observational data are of limited use in detecting this bias. The comparison of daily sea surface temperatures to nighttime‐only marine air temperatures is confounded by diurnal range effects as well as inhomogeneities in the observations, with the MOHMAT and HadNMAT2 marine air temperature data [Rayner et al., 2003; Kent et al., 2013] showing substantial differences to the SSTs not seen in the models (Figure S7). Similarly, uncertainties in the assimilated observations limit the utility of atmospheric reanalyses for this purpose (Figure S8).
What are the implications of using blended temperatures on a model‐observation comparison for the CMIP5 models? Figure 4 shows a comparison of the 84 RCP8.5 model runs against the HadCRUT4 data, using either air or blended temperatures and the HadCRUT4 blending algorithm (i.e., with the HadCRUT4 coverage and averaging conventions). When using air temperatures, the HadCRUT4 data fall below the 90% range of climate model simulations for the years 2011–2013. When using the blended temperatures, the observations are at the lower end of the 90% range for 2011 and 2012 and within it for 2013.

The recent divergence between the models and the observations occurs after 1998, the period commonly associated with the so‐called global warming “hiatus” [Fyfe et al., 2013; Fyfe and Gillett, 2014; Tollefson, 2014]. Several contributory factors to the divergence have been identified, including an increase in moderate volcanic eruptions [Solomon et al., 2011; Ridley et al., 2014; Santer et al., 2014a, 2014b], a reduction in solar activity, a decrease in stratospheric water vapor concentration [Solomon et al., 2010], internal variability [Meehl et al., 2011, 2013; Trenberth and Fasullo, 2013; Kosaka and Xie, 2013; Mann et al., 2014; Steinman et al., 2015; Dai et al., 2015], and a bias due to the omission of the Arctic, which is warming more rapidly than projected by the models [Cowtan and Way, 2014; Saffioti et al., 2015]. The contribution of internal variability to the remaining discrepancy between the models and the observations is beyond the scope of this analysis.
Using an impulse response model, Schmidt et al. [2014] estimate the temperature impact of the slower than predicted growth in forcing due to volcanoes, solar cycle, and also the possible cooling effect of an increase in aerosol emissions over the hiatus period. Other studies have found negligible or even a warming contribution of aerosols on hiatus temperature trends [Regayre et al., 2014; Gettelman et al., 2015; Thorne et al., 2015], although Schmidt et al. [2014] include nitrate aerosols which are omitted from the other studies. The model outputs were also adjusted using the estimated impacts from Schmidt et al.[2014] due to volcanoes, solar cycle, and greenhouse emissions but not aerosols (Figure 4b). When using blended temperatures, the observations lie well within the 90% range of RCP8.5 runs for the whole of the last decade. Similar results are obtained from adjustments to the model temperatures derived using the Bern2.5D climate model of intermediate complexity [Huber and Knutti, 2014]. Notably, Thorne et al. [2015] did not find a detectable reduction in the recent temperature increase when using updated forcings in a large ensemble of NorESM simulations.
The impact of using blended rather than air temperatures accounts for 25% of the difference between the models and the observations over the period 2009–2013. The adjustments by Schmidt et al. [2014] due to the overestimated forcings account for another 27% of the difference when omitting the tropospheric aerosol term or 41% of the difference when including aerosols. Over the period 1975–2014 the use of blended rather than air temperatures accounts for 38% of the difference in trend between the models and the observations (Table S1) or almost all of the difference if the last 5 years are omitted, consistent with the results of Marotzke and Forster[2015]. The model simulations suggest that the 40 year trend in HadCRUT4 is suppressed by 0.016 ± 0.004°C/decade compared to an air temperature record with the same coverage, and 0.030 ± 0.011°C/decade compared to a global air temperature record.
Comparisons to the infilled reconstructions of Cowtan and Way [2014] and Rohde et al. [2013] require different variants of the blending calculation (Text S1) but lead to similar conclusions. Comparisons to the other temperature data sets will in turn require an appropriate choice of blending method or development of a custom method appropriate to that data set. The comparison will depend on explicit and/or implicit assumptions in the blending and anomaly calculations and is therefore best addressed by the record providers.
4 Discussion
These results have implications in three areas: first in the comparison of climate model ensembles to the observational record, second in estimating climate sensitivity, and third in the preparation of observational temperature records.
When comparing models to observations, the comparison should be strictly performed using blended land/ocean temperatures rather than air temperatures from the models. The size of the difference between the blended and air temperatures is sensitive to assumptions in the blending calculation and in particular whether blending is performed using absolute temperatures or anomalies. The most conservative approach is to blend absolute temperatures from the models (i.e., air temperature over land and ice, and sea surface temperature for the oceans), in which case the global mean blended temperatures will typically show 5% less warming than the air temperatures. However, the actual impact of the use of blended temperatures on the observational record is nearly twice as great owing to the blending of anomalies in the observational data.
Replication of the HadCRUT4 blending algorithm on the model outputs leads to a reduction in the model‐observation divergence of 0.053 ± 0.014°C over the years 2009–2013, or about a quarter of the divergence over that period. However the replication is not exact: for example, the results will depend on the climatology by which anomalies are calculated for ocean cells which were sea ice during the baseline period [Rayner et al., 2006]. The comparison would also be further improved by the inclusion of a land‐only surface air temperature field in future CMIP phases.
Comparison to other versions of the temperature record should ideally also involve reproducing the blending method for that particular observational data set. However, comparison to multiple observational data sets at the same time is then inconvenient, because the model ensemble will be different for each observational record. Alternatively, instead of modifying the model temperatures to match the methodology of a particular observational record, each observational record can be modified to produce an estimate of the global mean air temperature. The required correction is determined from the difference between the blended and air temperatures from the models using the methodology of the corresponding observational record. All the observational records may then be compared simultaneously.
Estimates of climate sensitivity, at least over decadal to centennial time scales, will be lower for blended temperatures than for air temperatures. Estimates of transient climate response (TCR) should therefore be quoted with an indication of whether the value was determined using observed air or blended temperatures and, in the case of blended temperatures, whether blending was performed using absolute temperatures or anomalies. In the case of blended absolute temperatures, TCR values are likely to be about 95% of those for air temperatures or 92% for blended anomalies. Estimates of TCR from the observational record are based on blended temperatures and thus are expected to underestimate TCR by nearly 10% in comparison to quoted figures for the models.
There are two implications for observational records. First, a blended record from air temperatures over land and sea ice and sea surface temperatures over open ocean slightly underestimates the change in temperature diagnosed using global air temperatures alone. Second, the blending calculation should ideally be conducted with absolute temperatures to avoid introducing a cool bias due to the transformation of cells from sea ice to open water, particularly for infilled records. Otherwise, the approach of fixing the sea ice extent (Text S1) mitigates the problem at the cost of introducing a different but smaller bias. The new data set of Karl et al. [2015] incorporates adjustments to SSTs to match nighttime marine air temperatures [Huang et al., 2015] and so may be more comparable to model air temperatures. The difference between air and sea surface temperature trends diagnosed here provides support for an increase in temperature trends when using marine air temperatures, as reported in Karl et al.[2015].
Finally, we emphasize that robust comparisons of observations and models require a like‐with‐like approach and encourage further development of appropriate diagnostics from model simulations to facilitate such comparisons.
Acknowledgments
The HadCRUT4 data are available from http://www.metoffice.gov.uk/hadobs/hadcrut4/. The CMIP5 model outputs are available from http://pcmdi9.llnl.gov/esgf-web-fe/. The CDO package is available from https://code.zmaw.de/projects/cdo. Computer code is available from http://www-users.york.ac.uk/∼kdc3/papers/robust2015/. K.C. is grateful to the University of York for the provision of computing resources and to ETH‐Zurich for data access. E.H. is funded by the UK Natural Environment Research Council. R.G.W. is funded by the Natural Sciences and Engineering Research Council of Canada. The authors would like to thank Gavin Schmidt, Reto Knutti, Markus Huber, John Kennedy, and Mark Richardson for data and advice.
The editor thanks two anonymous reviewers for their assistance in evaluating this paper.
Erratum
- In the originally published version of this paper, there were errors due to different treatments of land cells in the CMIP “tos” and “sic” fields from different models: while most models set land cells to a missing value, a few set them to a constant. The paper has been updated to correct the different treatments of land cells. The estimated bias due to the use of air rather than blended temperatures is reduced by 5–7% compared to the original manuscript. Specifically, the following errors have been corrected: In the first paragraph of the Results section, 0.033 has been changed to 0.031; 0.18 has been changed to 0.17. In the fourth paragraph of the Results section, 91% has been changed to 92%. In the penultimate paragraph of the Results section, the first instance of 27% has been changed to 25%; 0.017 has been changed to 0.016. In the third paragraph of the Discussion section, 0.056 has been changed to 0.053; 0.015 has been changed to 0.014. In the fifth paragraph of the Discussion section 91% has been changed to 92%; “about 10%” has been changed to “nearly 10%”. Figures 2 and 3 have been updated to reflect the corrected treatments of land cells. Corrections to Figure 4 have not been made since the changes are imperceptible. The Supporting Information file has been corrected with the following: Text S1: For the HadCRUT4 infilled data, the blending bias has been changed from 18% to 17%. For Berkeley Earth, the figure is 24% (previously 25%). Table S1: Some values have changed by 1 in the third decimal place. Also, the blending bias has been updated from 89% to 83% Figure S3 has been updated so that the MIROC5 model no longer shows evidence of drift in the early part of the simulation. Figure S5 has been updated so that four models show reduced bias. Figure S6 shows slight differences around Arctic coasts. Updated Figures S9–S11 show imperceptible changes to the blue line.
Number of times cited: 32
- Hans Visser, Sönke Dangendorf, Detlef P. van Vuuren, Bram Bregman and Arthur C. Petersen, Signal detection in global mean temperatures after “Paris”: an uncertainty and sensitivity analysis, Climate of the Past, 14, 2, (139)
- A. J. Simmons, P. Berrisford, D. P. Dee, H. Hersbach, S. Hirahara and J.‐N. Thépaut, A reassessment of temperature variations and trends from global reanalyses and monthly surface climatological datasets, Quarterly Journal of the Royal Meteorological Society, 143, 702, (101-119), (2016).
- Robert G. Way, Frank Oliva and Andre E. Viau, Underestimated warming of northern Canada in the Berkeley Earth temperature product, International Journal of Climatology, 37, 4, (1746-1757), (2016).
- Michael E. Mann, Sonya K. Miller, Stefan Rahmstorf, Byron A. Steinman and Martin Tingley, Record temperature streak bears anthropogenic fingerprint, Geophysical Research Letters, 44, 15, (7936-7944), (2017).
- Timothy J. Osborn, Phil D. Jones and Manoj Joshi, Recent United Kingdom and global temperature variations, Weather, 72, 11, (323-329), (2017).
- Renzo A. Mosetti, Do the inaccuracies of climate models limit the possibility of a human control of global warming?, The European Physical Journal Plus, 132, 5
- Andrew P. Schurer, Michael E. Mann, Ed Hawkins, Simon F. B. Tett and Gabriele C. Hegerl, Importance of the pre-industrial baseline for likelihood of exceeding Paris goals, Nature Climate Change, 7, 8, (563)
- Peter Thorne, Briefing: Global surface temperature records: an update, Proceedings of the Institution of Civil Engineers - Forensic Engineering, 10.1680/jfoen.17.00001, 170, 2, (50-53), (2017).
- Benjamin D. Santer, John C. Fyfe, Giuliana Pallotta, Gregory M. Flato, Gerald A. Meehl, Matthew H. England, Ed Hawkins, Michael E. Mann, Jeffrey F. Painter, Céline Bonfils, Ivana Cvijanovic, Carl Mears, Frank J. Wentz, Stephen Po-Chedley, Qiang Fu and Cheng-Zhi Zou, Causes of differences in model and satellite tropospheric warming rates, Nature Geoscience, 10, 7, (478)
- James S. Risbey, Michael R. Grose, Didier P. Monselesan, Terence J. O'Kane and Stephan Lewandowsky, Transient response of the global mean warming rate and its spatial variation, Weather and Climate Extremes, 18, (55)
- Boyin Huang, Peter W. Thorne, Viva F. Banzon, Tim Boyer, Gennady Chepurin, Jay H. Lawrimore, Matthew J. Menne, Thomas M. Smith, Russell S. Vose and Huai-Min Zhang, Extended Reconstructed Sea Surface Temperature, Version 5 (ERSSTv5): Upgrades, Validations, and Intercomparisons, Journal of Climate, 10.1175/JCLI-D-16-0836.1, 30, 20, (8179-8205), (2017).
- Martin B. Stolpe, Iselin Medhaug and Reto Knutti, Contribution of Atlantic and Pacific Multidecadal Variability to Twentieth-Century Temperature Changes, Journal of Climate, 10.1175/JCLI-D-16-0803.1, 30, 16, (6279-6295), (2017).
- Scott Power, François Delage, Guomin Wang, Ian Smith and Greg Kociuba, Apparent limitations in the ability of CMIP5 climate models to simulate recent multi-decadal change in surface temperature: implications for global temperature projections, Climate Dynamics, 10.1007/s00382-016-3326-x, 49, 1-2, (53-69), (2016).
- Anson H. Cheung, Michael E. Mann, Byron A. Steinman, Leela M. Frankcombe, Matthew H. England and Sonya K. Miller, Comparison of Low-Frequency Internal Climate Variability in CMIP5 Models and Observations, Journal of Climate, 10.1175/JCLI-D-16-0712.1, 30, 12, (4763-4776), (2017).
- Gareth S. Jones and John J. Kennedy, Sensitivity of Attribution of Anthropogenic Near-Surface Warming to Observational Uncertainty, Journal of Climate, 30, 12, (4677)
- K. Haustein, M. R. Allen, P. M. Forster, F. E. L. Otto, D. M. Mitchell, H. D. Matthews and D. J. Frame, A real-time Global Warming Index, Scientific Reports, 10.1038/s41598-017-14828-5, 7, 1, (2017).
- Ed Hawkins, Pablo Ortega, Emma Suckling, Andrew Schurer, Gabi Hegerl, Phil Jones, Manoj Joshi, Timothy J. Osborn, Valérie Masson-Delmotte, Juliette Mignot, Peter Thorne and Geert Jan van Oldenborgh, Estimating Changes in Global Temperature since the Preindustrial Period, Bulletin of the American Meteorological Society, 10.1175/BAMS-D-16-0007.1, 98, 9, (1841-1856), (2017).
- Paul-Arthur Monerie, Marie-Pierre Moine, Laurent Terray and Sophie Valcke, Quantifying the impact of early 21st century volcanic eruptions on global-mean surface temperature, Environmental Research Letters, 10.1088/1748-9326/aa6cb5, 12, 5, (054010), (2017).
- Michael E. Mann, Byron A. Steinman, Sonya K. Miller, Leela M. Frankcombe, Matthew H. England and Anson H. Cheung, Predictability of the recent slowdown and subsequent recovery of large‐scale surface warming using statistical methods, Geophysical Research Letters, 43, 7, (3459-3467), (2016).
- Lena Hellmann, Leonid Agafonov, Fredrik Charpentier Ljungqvist, Olga Churakova (Sidorova), Elisabeth Düthorn, Jan Esper, Lisa Hülsmann, Alexander V Kirdyanov, Pavel Moiseev, Vladimir S Myglan, Anatoly N Nikolaev, Frederick Reinig, Fritz H Schweingruber, Olga Solomina, Willy Tegel and Ulf Büntgen, Diverse growth trends and climate responses across Eurasia’s boreal forest, Environmental Research Letters, 11, 7, (074021)
- Michael E. Mann, Stefan Rahmstorf, Byron A. Steinman, Martin Tingley and Sonya K. Miller, The Likelihood of Recent Record Warmth, Scientific Reports, 6, 1
- Stephan Lewandowsky, James S. Risbey and Naomi Oreskes, The “Pause” in Global Warming: Turning a Routine Fluctuation into a Problem for Science, Bulletin of the American Meteorological Society, 10.1175/BAMS-D-14-00106.1, 97, 5, (723-733), (2016).
- Ed Hawkins and Rowan Sutton, Connecting Climate Model Projections of Global Temperature Change with the Real World, Bulletin of the American Meteorological Society, 97, 6, (963)
- Boyin Huang, Peter W. Thorne, Thomas M. Smith, Wei Liu, Jay Lawrimore, Viva F. Banzon, Huai-Min Zhang, Thomas C. Peterson and Matthew Menne, Further Exploring and Quantifying Uncertainties for Extended Reconstructed Sea Surface Temperature (ERSST) Version 4 (v4), Journal of Climate, 10.1175/JCLI-D-15-0430.1, 29, 9, (3119-3142), (2016).
- Marena Lin and Peter Huybers, Revisiting Whether Recent Surface Temperature Trends Agree with the CMIP5 Ensemble, Journal of Climate, 10.1175/JCLI-D-16-0123.1, 29, 24, (8673-8687), (2016).
- Bala Rajaratnam, Joseph Romano, Michael Tsiang and Noah S. Diffenbaugh, Debunking the climate hiatus, Climatic Change, 133, 2, (129)
- Jianjun Yin, Jonathan Overpeck, Cheryl Peyser and Ronald Stouffer, Big Jump of Record Warm Global Mean Surface Temperature in 2014–2016 Related to Unusually Large Oceanic Heat Releases, Geophysical Research Letters, , (2018).
- Iselin Medhaug, Martin B. Stolpe, Erich M. Fischer and Reto Knutti, Reconciling controversies about the ‘global warming hiatus’, Nature, 10.1038/nature22315, 545, 7652, (41-47), (2017).
- Reto Knutti, Maria A. A. Rugenstein and Gabriele C. Hegerl, Beyond equilibrium climate sensitivity, Nature Geoscience, 10.1038/ngeo3017, 10, 10, (727-736), (2017).
- Robert J. H. Dunn, Kate M. Willett, Andrew Ciavarella and Peter A. Stott, Comparison of land surface humidity between observations and CMIP5 models, Earth System Dynamics, 10.5194/esd-8-719-2017, 8, 3, (719-747), (2017).
- Mark Richardson, Kevin Cowtan, Ed Hawkins and Martin B. Stolpe, Reconciled climate response estimates from climate models and the energy budget of Earth, Nature Climate Change, 10.1038/nclimate3066, 6, 10, (931-935), (2016)., (2016).
- Chris Huntingford, Hui Yang, Anna Harper, Peter M. Cox, Nicola Gedney, Eleanor J. Burke, Jason A. Lowe, Garry Hayman, William J. Collins, Stephen M. Smith and Edward Comyn-Platt, Flexible parameter-sparse global temperature time profiles that stabilise at 1.5 and 2.0 °C, Earth System Dynamics, 10.5194/esd-8-617-2017, 8, 3, (617-626), (2017).




