Volume 34, Issue 2 e2019GB006393
Research Article
Open Access

Sources of Uncertainty in Regional and Global Terrestrial CO2 Exchange Estimates

A. Bastos

Corresponding Author

A. Bastos

Department of Geography, Ludwig-Maximilians Universität, München, Germany

Correspondence to: A. Bastos,

[email protected]

Search for more papers by this author
M. O'Sullivan

M. O'Sullivan

College of Engineering, Mathematics and Physical Sciences, University of Exeter, Exeter, UK

Search for more papers by this author
P. Ciais

P. Ciais

Laboratoire des Sciences du Climat et de l'Environnement, CEA-CNRS-UVSQ, UMR8212, Gif-sur-Yvette, France

Search for more papers by this author
D. Makowski

D. Makowski

University Paris-Saclay, AgroParisTech, INRAE, UMR 211, Thiverval-Grignon, France

CIRED, Nogent-sur-Marne, France

Search for more papers by this author
S. Sitch

S. Sitch

College of Life and Environmental Sciences, University of Exeter, Exeter, UK

Search for more papers by this author
P. Friedlingstein

P. Friedlingstein

College of Engineering, Mathematics and Physical Sciences, University of Exeter, Exeter, UK

Search for more papers by this author
F. Chevallier

F. Chevallier

Laboratoire des Sciences du Climat et de l'Environnement, CEA-CNRS-UVSQ, UMR8212, Gif-sur-Yvette, France

Search for more papers by this author
C. Rödenbeck

C. Rödenbeck

Max Planck Institute for Biogeochemistry, Jena, Germany

Search for more papers by this author
J. Pongratz

J. Pongratz

Department of Geography, Ludwig-Maximilians Universität, München, Germany

Max Planck Institute for Meteorology, Hamburg, Germany

Search for more papers by this author
I. T. Luijkx

I. T. Luijkx

Department of Meteorology and Air Quality, Wageningen University and Research, Wageningen, Netherlands

Search for more papers by this author
P. K. Patra

P. K. Patra

Research Institute for Global Change, Japan Agency for Marine-Earth Science and Technology, Yokohama, Japan

Search for more papers by this author
P. Peylin

P. Peylin

Laboratoire des Sciences du Climat et de l'Environnement, CEA-CNRS-UVSQ, UMR8212, Gif-sur-Yvette, France

Search for more papers by this author
J. G. Canadell

J. G. Canadell

CSIRO Oceans and Atmosphere, Canberra, ACT, Australia

Search for more papers by this author
R. Lauerwald

R. Lauerwald

Department of Geoscience, Environment and Society, Université Libre de Bruxelles, Bruxelles, Belgium

Search for more papers by this author
W. Li

W. Li

Laboratoire des Sciences du Climat et de l'Environnement, CEA-CNRS-UVSQ, UMR8212, Gif-sur-Yvette, France

Ministry of Education Key Laboratory for Earth System Modeling, Department of Earth System Science, Tsinghua University, Beijing, China

Search for more papers by this author
N. E. Smith

N. E. Smith

Department of Meteorology and Air Quality, Wageningen University and Research, Wageningen, Netherlands

Search for more papers by this author
W. Peters

W. Peters

Department of Meteorology and Air Quality, Wageningen University and Research, Wageningen, Netherlands

Centre for Isotope Research, University of Groningen, Groningen, Netherlands

Search for more papers by this author
D. S. Goll

D. S. Goll

Department of Geography, University of Augsburg, Augsburg, Germany

Search for more papers by this author
A.K. Jain

A.K. Jain

Department of Atmospheric Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA

Search for more papers by this author
E. Kato

E. Kato

Institute of Applied Energy, Minato, Japan

Search for more papers by this author
S. Lienert

S. Lienert

Climate and Environmental Physics, Physics Institute and Oeschger Centre for Climate Change Research, University of Bern, Bern, Switzerland

Search for more papers by this author
D. L. Lombardozzi

D. L. Lombardozzi

Climate and Global Dynamics Laboratory, National Center for Atmospheric Research, Boulder, CO, USA

Search for more papers by this author
V. Haverd

V. Haverd

CNRM, Université de Toulouse, Mto-France, CNRS, Toulouse, France

Search for more papers by this author
J. E. M. S. Nabel

J. E. M. S. Nabel

Max Planck Institute for Meteorology, Hamburg, Germany

Search for more papers by this author
B. Poulter

B. Poulter

Biospheric Sciences Lab, NASA, Greenbelt, MD, USA

Search for more papers by this author
H. Tian

H. Tian

International Center for Climate and Global Change Research, School of Forestry and Wildlife Sciences, Auburn University, Auburn, AL, USA

Search for more papers by this author
A. P. Walker

A. P. Walker

Environmental Sciences Division and Climate Change Science Institute, Oak Ridge National Laboratory, Oak Ridge, TN, USA

Search for more papers by this author
S. Zaehle

S. Zaehle

Max Planck Institute for Biogeochemistry, Jena, Germany

Search for more papers by this author
First published: 07 February 2020
Citations: 73

Abstract

The Global Carbon Budget 2018 (GCB2018) estimated by the atmospheric CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0002 growth rate, fossil fuel emissions, and modeled (bottom-up) land and ocean fluxes cannot be fully closed, leading to a “budget imbalance,” highlighting uncertainties in GCB components. However, no systematic analysis has been performed on which regions or processes contribute to this term. To obtain deeper insight on the sources of uncertainty in global and regional carbon budgets, we analyzed differences in Net Biome Productivity (NBP) for all possible combinations of bottom-up and top-down data sets in GCB2018: (i) 16 dynamic global vegetation models (DGVMs), and (ii) 5 atmospheric inversions that match the atmospheric CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0003 growth rate. We find that the global mismatch between the two ensembles matches well the GCB2018 budget imbalance, with Brazil, Southeast Asia, and Oceania as the largest contributors. Differences between DGVMs dominate global mismatches, while at regional scale differences between inversions contribute the most to uncertainty. At both global and regional scales, disagreement on NBP interannual variability between the two approaches explains a large fraction of differences. We attribute this mismatch to distinct responses to El Niño–Southern Oscillation variability between DGVMs and inversions and to uncertainties in land use change emissions, especially in South America and Southeast Asia. We identify key needs to reduce uncertainty in carbon budgets: reducing uncertainty in atmospheric inversions (e.g., through more observations in the tropics) and in land use change fluxes, including more land use processes and evaluating land use transitions (e.g., using high-resolution remote-sensing), and, finally, improving tropical hydroecological processes and fire representation within DGVMs.

Key Points

  • Top-down and bottom-up estimates of net land-atmosphere CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0004 fluxes agree well globally but show important mismatches at regional scales
  • Regional mismatches are dominated by differences between inversions and interannual variability in CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0005 fluxes
  • Mismatches between top-down and bottom-up data sets are explained by sensitivity to climate and by uncertainty in land use change forcing

1 Introduction

The United Nations Framework Convention on Climate Change Paris Agreement from 2015 (UNFCCC, 2015) has the goal to limit the increase in global average temperature well below 2 °C above preindustrial levels, and pursue efforts not to exceed the target of 1.5 °C above preindustrial levels. To achieve this overarching goal, the agreement calls for “a balance between anthropogenic emissions by sources and removals by sinks of greenhouse gases in the second half of this century.”

On the policy side, the problem is expected to be tackled at the national level through Nationally Determined Contributions, that is, mitigation policies defined by countries (UNFCCC, 2015). The collective progress toward the overall goal of the agreement is to be assessed regularly in the “global stocktake” process to support the update of Nationally Determined Contributions according to the best available science. However, the ultimate goal of stabilizing global mean surface temperature requires evaluating common progress at the global scale, as well as the evolution of natural sinks. It is therefore crucial that bottom-up estimates of CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0006 fluxes (e.g., inventories and models) provide accurate estimates that are consistent with the global atmospheric greenhouse gas mole fractions.

Global carbon (C) budgets for the anthropogenic perturbation have been estimated in the successive Intergovernmental Panel on Climate Change (IPCC) assessment reports (IPCC, 2001, 2007, 2013) and have been revised and updated in the Global Carbon Budget (GCB) by the Global Carbon Project almost every year since 2005 (Le Queré et al., 2009; Le Quéré, Andrew, Friedlingstein, Sitch, Pongratz, et al., 2018; Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b).

Fossil fuel and cement production emissions ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0007) can be estimated from historical energy statistics (Boden et al., 2017; UNFCCC, 2018) with a 1 standard deviation (1 urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0008) uncertainty of 5–11% (Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b; Quilcaille et al., 2018). The fluxes from changes in land use and management ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0009) cannot be directly measured and are estimated in Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al. (2018b) by bookkeeping models (Hansis et al., 2015; Houghton & Nassikas, 2017), with a reported uncertainty of 0.7 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0010 (1 urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0011) for decadal average urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0012 over the industrial era. Global carbon uptake by the ocean and land can also not be directly measured, and are estimated in the GCB by process-based models for the land, and process- and data-driven models for the ocean (Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b; Sitch et al., 2013).

In the Global Carbon Budget 2018 (GCB2018; Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b), the net annual balance between the anthropogenic sources and the sinks of CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0013 from process-based models does not exactly match the accurately observed atmospheric CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0014 growth rate ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0015; Dlugokencky & Tans, 2018). The residual flux resulting from this gap, the “budget imbalance,” can be interpreted as a measure of the limitations of the data sets used and of the imperfect process understanding by the modeling community (Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b). Since the budget imbalance does not show a significant trend since the 1960s but high year-to-year variability, Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al. (2018b) proposed that errors in the land and ocean sinks explain most of the imbalance term, but uncertainty in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0016 is also known to be high (Arneth et al., 2017; Piao et al., 2018). In GCB2018, estimates of ocean and land net CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0017 fluxes from process-based models were compared with those from four observation-based atmospheric inversions (Chevallier et al., 2005; Rödenbeck et al., 2003; Saeki & Patra, 2017; van der Laan-Luijkx et al., 2017) for the globe and over three latitudinal bands, where large differences between process-based models and inversions, but also between inversions were found. A recent study (Gaubert et al., 2019) further compared 10 different inversion systems, including the four mentioned above, showing an overall fair match with aircraft CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0018 measurements and varying consistency ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-001910% on 3-year mean varying with the inversion) with urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0020. However, their results showed that differences in fossil fuel emission priors used affected the ocean-land partitioning, setting limits to the accuracy of inversions for quantifying regional surface-atmosphere CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0021 fluxes. Even though inversions were adjusted for differences in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0022 priors in Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al. (2018b), large disparities were still found for regional fluxes estimated by the four atmospheric inversions, especially the balance between the Northern Hemisphere and the tropics.

Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al. (2018b) proposed that regional-level analyses may uncover sources of errors in the inferred fluxes. For example, comparisons of net CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0023 surface flux estimates from atmospheric inversions and dynamic global vegetation models (DGVMs) allowed assessing in detail the response of the tropical net CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0024 exchange to the 2015–2016 El Niño event, and evaluating the DGVMs' ability to simulate the anomalies in carbon fluxes in 2015/16 (Bastos et al., 2018; Gloor et al., 2018; Rödenbeck et al., 2018; van Schaik et al., 2018). Likewise, the studies produced in the framework of the REgional Carbon Cycle Assessment and Processes (RECCAP; Canadell et al., 2012) provided a first evaluation of global and regional budgets, budget component fluxes and uncertainties. RECCAP produced multiple global syntheses of regional CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0025 fluxes between 1990 and 2009 and their driving processes (Andres et al., 2012; Ciais et al., 2014; Chen et al., 2013; Houghton et al., 2012; Khatiwala et al., 2013; Peters et al., 2012; Peylin et al., 2013; Sitch et al., 2013; Wanninkhof et al., 2013) as well as independent budgets for nine land, four ocean and coastal regions.

In this study, we take a closer look at the regional mismatch between the top-down and bottom-up data sets in GCB2018 (Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b) to identify the sources of uncertainty in the terrestrial components, specifically to:
  • (i) identify the regions driving uncertainties in global budget estimates;
  • (ii) quantify the relative contribution of uncertainty in net CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0026 surface flux estimates from inversions (top-down) and from DGVMs (bottom-up);
  • (iii) identify those processes contributing the most to uncertainty in global and regional budgets.

We focus on a group of 18 land regions from Tian et al. (2018), which are consistent with the previous nine RECCAP large regions and with ecoclimatic specificities, but follow political borders. Evaluating the agreement between top-down and bottom-up estimates of CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0027 fluxes over such regions allows assessing the confidence and limitations of the state-of-the-art science to provide information on regional budgets, relevant for the global stocktaking process yet with traceable consistency with the urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0028.

In this study, we rely on net CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0029 exchange from inversions, considered as uncertain estimates of surface CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0030 fluxes consistent with urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0031 and compare inversions' estimates to those from the 16 DGVMs in GCB2018. Since Gaubert et al. (2019) recently analyzed the sources of uncertainty between inversions, here we focus on predictors that can explain the differences between DGVMs and inversions, or differences between individual DGVMs. By statistically modeling the mismatch between inversions and DGVMs, we are able to attribute uncertainty in the global budgets to a few key regions mainly in the tropics, and to identify those processes contributing the most to regional and global uncertainties.

2 Data

2.1 Atmospheric Inversions

Atmospheric inversions use an optimization process by which atmospheric CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0032 mole fraction measurements are used to constrain a priori estimates of the spatial distribution of surface CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0033 fluxes using an atmospheric transport model. In the optimization, information about errors of measurements and of priors, errors in the transport model, and the spatiotemporal structure of fluxes is also included (Peylin et al., 2013).

The four inversion systems used in this study (Table 1) are all based on in situ CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0034 observation measurements but differ in several aspects, such as the observational data assimilated and period covered, the transport model and prior fluxes used, the model grid, the spatiotemporal a priori correlation structure, as well as in the fossil fuel data sets used. A complete description of the different atmospheric inversions can be found in Table A3 in Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al. (2018b). Here we use two versions of CarboScope (Rödenbeck et al., 2003), both covering a period longer than 30 years but with variable number of assimilated sites: s76 (1976–2017, eight stations) and s85 (1985–2017, 21 stations). Thus, we have used in total five inversion data sets (CAMS, two CarboScope versions, MIROC, and CarbonTracker Europe; Table 1).

Table 1. Top-Down and Bottom-Up Data Sets Used in This Study, Including References and Period Covered
Data set/model Reference Period Fire N-dep SC WH
Inversions (top-down)
Data set
Copernicus Atmosphere Chevallier et al. (2005) 1979–2017
Monitoring Service (CAMS)
CarboScope s76 Rödenbeck et al. (2003) 1976–2017
CarboScope s85 1985–2017
MIROC Patra et al. (2018) 1997–2017
CarbonTracker Europe (CTE) van der Laan-Luijkx et al. (2017) 2001–2017
DGVMs (bottom-up)
Model
CABLE-POP Haverd et al. (2018) 1979–2017 N Y Y Y
CLASS-CTEM Melton and Arora (2016) Y Y N N
CLM5.0 Oleson et al. (2013) Y N Y Y
DLEM Tian et al. (2015) N Y N Y
ISAM Meiyappan et al. (2015) N Y N Y
JSBACH Mauritsen et al. (2018) Y Y Y Y
JULES Clark et al. (2011) N N N N
LPJ Poulter et al. (2011) Y N Y Y
LPJ-GUESS Smith et al. (2014) Y Y Y Y
LPX-Bern Lienert and Joos (2018) Y Y N N
OCN Zaehle et al. (2010) N Y N Y
ORCHIDEE Krinner et al. (2005) N N N Y
ORCHIDEE-CNP Goll et al. (2017) N Y N N
SDGVM Walker et al. (2017) Y Y N N
SURFEX Joetzjer et al. (2015) Y N N N
VISIT Kato et al. (2013) Y N Y Y
  • Note. For the DGVMs, we indicate whether they simulate fires, nitrogen deposition (N-dep), shifting cultivation (SC), and wood harvest (WH), and in some models further include irrigation and nitrogen fertilization. A complete description of the processes in each model can be found in Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al. (2018b).

Because they are based on CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0035 concentration measurements, atmospheric inversions estimate the net surface-atmosphere CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0036 fluxes including both the natural (fires, storms, pests, and diseases) and anthropogenic disturbance terms, the subsequent recovery, as well as the carbon taken up from the atmosphere over land but then passed on to the oceans through freshwaters, estuaries, and coastal areas (Hartmann et al., 2009; Mayorga et al., 2010; Regnier et al., 2013).

As in Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al. (2018b) and Peylin et al. (2013), we adjusted the ocean and land fluxes for differences in fossil fuel emission ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0037) priors over large latitudinal bands. As reference urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0038, we chose the data used by CAMS, the Emission Database for Global Atmospheric Research (Olivier et al., 2017) scaled to the Carbon Dioxide Information Analysis Center (Marland et al., 2008) estimates. However, this is only a first-order correction as the biases in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0039 not only affect the flux estimation of the region in question but also the neighboring regions (Saeki & Patra, 2017). The inversion surface fluxes were then remapped to a regular 1°  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0040 1° latitude/longitude grid and then aggregated to the 18 land regions.

Inversions determine total CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0041 fluxes between the surface and the atmosphere whereas the GCB approach with land and ocean biogeochemical models determines the anthropogenic budget of CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0042. The difference between total and anthropogenic CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0043 fluxes is that there is a background preindustrial uptake of CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0044 on land that sustains carbon export from soils to rivers and to the ocean, where compensatory outgassing of CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0045 occurs. This requires an adjustment of inversion CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0046 fluxes to transform them into anthropogenic CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0047 fluxes, as performed in Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al. (2018b) over latitudinal bands.

Based on the data-driven estimates of fluvial exports of organic (Mayorga et al., 2010) and inorganic carbon (Hartmann et al., 2009) to the coast, Zscheischler et al. (2017) produced a spatially explicit data set of climatological land-ocean carbon transfers at 1°  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0048 1° latitude/longitude resolution and includes the fluxes from dissolved inorganic carbon from atmospheric origin and from weathering and dissolved and particulate organic carbon (DOC and POC). In this study, the DOC and POC exports of this data set were rescaled per basin to match the estimates of Resplandy et al. (2018). After aggregating these rescaled estimates to the 18 land regions, we subtracted the fluvial carbon exports from the regionally aggregated inversion net surface CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0049 fluxes, to calculate regional net biospheric production (NBP) that can be compared with the DGVM estimates. These data are available in (Bastos, 2019).

2.2 DGVMs

DGVMs simulate water, energy and biogeochemical exchanges between the surface and the atmosphere through ecosystems activity, including growth, turnover and decomposition of vegetation, and soil carbon processes. DGVMs simulate the response of ecosystems to changes in environmental conditions (increasing CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0050, changing climate, and altered nitrogen deposition) and land use activity.

Here we use a set of 16 DGVMs from the TRENDY-v7 intercomparison (Sitch et al., 2015) that contributed to GCB2018. In the simulations for GCB2018 (Simulation S3), all 16 models are forced with: (i) observed climate from the Climate Research Unit (Harris et al., 2014) and the Japanese 55-year Reanalysis (Kobayashi et al., 2015) data sets, the CRU-JRA reanalysis, following the methodology in Viovy (2016); (ii) global CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0051 concentration from NOAA/ESRL (Dlugokencky & Tans, 2018); (iii) land use change transitions and land management fields from the Land Use Harmonization (LUH2v2.1h) from Hurtt et al., 2011 (2011, 2017), based on HYDE 3.1 (Klein Goldewijk et al., 2011); and (iv) gridded data of nitrogen (N) deposition when N cycling is simulated by models.

Some models simulate fire emissions ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0052) and nutrient cycling (nitrogen in ten models, and nitrogen and phosphorus in one model), including the effects of N deposition (Table 1). While all DGVMs simulate the fluxes resulting from forest clearing, pasture and crop conversion, abandonment and regrowth and crop harvest, they implement them using different assumptions about the areas being converted (e.g., gross versus net conversion). The way DGVMs simulate management practices and the fate of released carbon also varies between models. Ten models simulate wood harvest and six include shifting cultivation, roughly corresponding to gross transitions (Table 1). Only few DGVMs simulate crop fertilization or irrigation. To calculate urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0053, the S3 simulation is compared to the S2 simulation, which is forced with CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0054 and climate changes only, and keeping a fixed land cover map in 1700 (Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b).

For urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0055 estimated in this way, fluxes resulting from environmental change over managed lands as compared to intact vegetation are included (Pongratz et al., 2014). Due to the effect of CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0056 fertilization, intact vegetation would at present provide a slightly stronger sink, which implies that emissions from, for example, deforestation are now slightly higher than if compared to the preindustrial state. This term is referred to as loss of additional sink capacity (LASC) as proposed by Pongratz et al. (2014). In TRENDY-v7, an additional simulation was performed where preindustrial CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0057 and climate were kept fixed, and only land use changes and management were allowed to change between 1700 and 2017 (S4). urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0058 estimated in this way does not include the LASC term and is therefore more comparable in terms of processes to the ones estimated by the bookkeeping methods, except for the values of C densities used.

Outputs of monthly NBP from all three simulations (S2, S3, and S4) were first resampled to a common 1°  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0059 1° latitude/longitude grid, and then aggregated for each region. The regional fluxes from S2 and S3 simulations are available from (Bastos, 2019).

2.3 Bookkeeping Models

Bookkeeping models track changes in above- and below-ground biomass carbon densities resulting from land use change processes such as deforestation/afforestation, cropland or pasture expansion, wood harvest, shifting cultivation, and forest regrowth after land abandonment. In GCB2018, two bookkeeping models that estimate urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0060 at the global scale have been considered: the bookkeeping model of Houghton and Nassikas (2017), referred as HN2017 henceforth, and the “Bookkeeping of Land Use Emissions” model (BLUE) described in Hansis et al. (2015). To estimate urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0061, bookkeeping models first calculate the changes in biomass, soil and the atmosphere carbon pools resulting from a given transition following specific response curves. The resulting fluxes for a given year can be calculated as the difference in carbon stocks between two consecutive years. The carbon density values for above- and below-ground biomass used by both models are based on recent history (ca. 1980s) measurements. This means that the C densities used have an implicit transient effect of increasing CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0062 and climate, leading to somewhat higher CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0063 emissions than S4 by DGVMs.

Even though based on similar principles, the two models differ in their mathematical formulation, spatial implementation, assumptions made about LUC transitions, processes included, forcing data used and, consequently, in the sources of uncertainty.

The HN2017 model covers the period of 1700–2015 and is based on regional statistics from the Food and Agricultural Organization (FAO) on changes in the areas of croplands and pastures since 1961 and changes in the areas of forests and other land since 1990 (FAO, 2015; FAOSTAT, 2015), while BLUE used the same data set as the DGVMs, that is, LUH2v2.1h. In the HN2017 model, calculations are performed at the country scale, and in the recent version, the model now incorporates country specific C densities. In this version, the model does not include shifting cultivation. From 1997 onward, peat fires (Giglio et al., 2016) and emissions from peat drainage (Hooijer et al., 2010) are added in Southeast Asia. The fluxes resulting from fire suppression are considered, but only in the United States. The HN2017 model allocates pasture preferentially to grasslands. This may result in lower CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0064 emissions by reducing deforestation rates (Reick et al., 2010).

The BLUE model uses a spatially explicit modeling scheme to calculate urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0065 on a pixel basis between 850 and 2017. It relies on biome-specific C densities and exponential response curves to track the changes in soil and biomass carbon pools following land use conversion or due to management. New cropland and pasture are taken proportionally from natural vegetation types and rangelands clear the natural vegetation for forest areas and degrade other natural land (Hansis et al., 2015; Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b). BLUE further includes shifting cultivation. In the GCB2018 version, BLUE used the same data set as several of the DGVMs, that is, the LUH2v2.1h, to calculate urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0066 at 0.25°  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0067 0.25° latitude/longitude resolution. Because BLUE uses a spatially explicit framework and the same LUC forcing as DGVMs, we use it as a primary basis for comparison with process-based model estimates of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0068. We then compare BLUE with HN2017 to evaluate the potential contribution of the forcing used to estimate urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0069 by DGVMs and BLUE to the mismatch between DGVMs and inversions.

2.4 Additional Data

In order to evaluate possible sources of uncertainty, we considered additional data sets to explain the differences between DGVM and inversion fluxes (see section 2.5): the Oceanic Niño Index (ONI) from NOAA's Climate Prediction Center (https://origin.cpc.ncep.noaa.gov/products/analysis_monitoring/ensostuff/ONI_v4.shtml) from December to March; annual average total water storage (TWS) over each of the 18 regions from the Gravity Recovery and Climate Experiment (GRACE) reconstruction (Humphrey et al., 2018); satellite-based land cover maps from the European Space Agency's Climate Change Initiative Land-Cover (LC-CCI) (ESA, 2017) from 1992–2015 aggregated from 250 m  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0070 250 m to 1°  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0071 1° latitude/longitude resolution; carbon emissions data from the Global Fire Emissions Database version 4.1s (GFED4.1s, https://doi.org/10.3334/ORNLDAAC/1293) available from 1997–2017, described in Randerson et al. (2017); and burned area from ESA-CCI Fire-CCI v5.1 (Fire-CCI; Chuvieco et al., 2018), covering the period 2001–2017.

2.5 Methods

The goal here is to quantify the mismatch between NBP estimated by inversions and DGVMs. As discussed in section 2.1, inversions differ in several aspects between each other. Likewise, DGVMs show variable skill in simulating fundamental processes of the terrestrial C cycle (see Figure B2 in Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b) and do not all simulate the same processes (Table 1). Some of the differences across data sets may arise from complex interactions between processes within models, and not be traceable to a particular process.

We define urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0072 as the difference in year urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0073 ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0074) between NBP estimates of the urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0075th inversion ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0076, with urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0077, depending on the year urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0078) available over that year and the urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0079th DGVM ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0080).
urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0081(1)

For a given year, the NBP values estimated by the inversions ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0082) can be then considered as a sample from a distribution of surface fluxes compatible with urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0083, with the size of this sample changing over the study period (1979–2017) as more inversions become available. For each year we can generate a set of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0084 values of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0085 for the globe ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0086), as well as for each of the 18 regions ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0087).

For each (inversion, DGVM) pair, the urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0088 can be decomposed into the 18 region components:
urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0089(2)
We can, therefore, quantify the contribution of each region to the variance ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0090) of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0091 as
urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0092(3)

The contribution of each region urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0093 to the variance of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0094 is therefore the quotient between the sum of the regional variance and covariance terms in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0095 and the variance of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0096, for each (inversion, DGVM) pair.

To identify possible drivers of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0097, we test several statistical models using likely predictors for the mismatch between inversions and DGVMs. Linear mixed effects (LME) models allow modeling urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0098 as a function of fixed effects (e.g., El Niño/La Niña cycles), but also of random effects describing several sources of variability (DGVMs, years, inversions). We start by fitting to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0099 the following simple LME model without any predictor region by region (the lme function from the R package lme4; Bates et al., 2014) described in equation 4 (LME urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0100):
urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0101(4)
where urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0102 is the average mismatch (i.e., urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0103 averaged over all models, years, and inversions) over the considered region. The random effects urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0104, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0105 and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0106 (all with zero mean and variance urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0107, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0108, and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0109, respectively) describe the variability of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0110 across the DGVMs with index urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0111, the between-year variability of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0112 with index urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0113, and the interaction between models and years on urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0114, respectively. The interaction term accounts for the possibility that some models may have larger urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0115 in some years, and smaller urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0116 in other years. The urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0117 term describes the variability across the five inversions and their interactions with year and DGVMs with index urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0118 and variance urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0119. For each region, a set of LME models with the possible combinations of one, two or all three random effects are fit to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0120, the model that best explains urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0121 for each region is retained, by choosing the one with lowest Akaike Information Criterion and with significant fit ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0122 value  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0123). We then expand the LME in equation 4 in order to explain part of the variance of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0124 using different sets of predictors. We test four hypotheses (equation 5) that are supported by literature and for which data are available. We define a set of LME models (equation 5) (LME urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0125) with one or more predictors ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0126, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0127, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0128, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0129) and associated coefficients ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0130) corresponding to the following hypotheses:
  1. urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0131 can be explained by a linear time trend, for example, if models under- or overestimate the sensitivity to a process with a strong trend component such as CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0132 increase (Graven et al., 2013; Thomas et al., 2016);
  2. Since urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0133 corresponds to the difference between top-down and bottom-up estimates of NBP, and NBP includes urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0134, which is highly uncertain (Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b; Piao et al., 2018), urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0135 might be explained by errors in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0136;
  3. Differences can be due to too strong response of DGVMs to ENSO (Bastos et al., 2018);
  4. In fire-dominated regions, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0137 may be explained by the fact that some DGVMs do not simulate fires (Table 1), or because those that do still show limited ability to correctly represent spatial and interannual variability in burned area and fire emissions (Li et al., 2014; Poulter et al., 2015; Yue et al., 2014, 2015).
The full model includes the four predictors and is expressed as
urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0138(5)

Because the predictors have different units, and variability across regions may also vary significantly for urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0139 and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0140, we centered and standardized these variables (i.e., the mean was subtracted and the result divided by the standard deviation). As done for the random effects, we define multiple models with one, two, three, or four predictors as fixed effects, and choose the best model fit (lowest Akaike information criterion and significant fit). We fit the LME urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0141 separately for 1997–2017 when including fire emissions as a predictor.

When any of these variables is found to provide the best fit, we explore how the predictors may contribute to explain the differences. In the case of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0142 we compare the estimates from DGVMs with those of BLUE and HN2017 and evaluate the role of the forcing LUC to explain urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0143 in that region. As ENSO effects on NBP are mainly related to changes in temperature and water availability in the tropics (warm/dry during El Niño and cool/wet during La Niña) we evaluate the sensitivity of fluxes to soil water availability and temperature by performing a multiple linear regression of spatially aggregated fluxes with regionally averaged annual temperature from CRU-JRA and water availability. Water availability was estimated by annual TWS from GRACE for inversions and simulated soil moisture for DGVMs. Where urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0144 is found to contribute to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0145, we compare simulated emissions from fire as well as burned area with the reference data sets.

We further group DGVMs by “flavors,” corresponding to whether they represent (or not) the processes highlighted in Table 1: fire, N deposition, shifting cultivation and wood harvest. We evaluated whether the different groups of DGVMs have significantly lower biases ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0146) than the set of 16 DGVMs at global and regional scale (supporting information Table S1), and additionally add the process representation as a predictor in the LME urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0147 fit (Figure S7).

3 Results

3.1 Global Differences and the “Budget Imbalance”

The values of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0148 reconstructed from the inversions fluxes on average match those from GCB2018 (Figure S1), with differences of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-01490.04 urn:x-wiley:gbc:media:gbc20956:gbc20956-math-01500.41, 0.11 urn:x-wiley:gbc:media:gbc20956:gbc20956-math-01510.36, 0.14 urn:x-wiley:gbc:media:gbc20956:gbc20956-math-01520.36, 0.29 urn:x-wiley:gbc:media:gbc20956:gbc20956-math-01530.42, and 0.00 urn:x-wiley:gbc:media:gbc20956:gbc20956-math-01540.48 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0155, for CAMS, CarboScope s76, CarboScope s85, MIROC, and CarbonTracker Europe, respectively, over the period covered by each inversion. For comparison, the budget imbalance term in GCB2018 for 2008–2017 was 0.5 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0156.

The ensemble of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0157 calculated following equation 1 for all (inversion, DGVM) pairs is shown in Figure 1. Over the 1979–2017 period, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0158 averaged across all DGVMs for each inversion (bold lines) show strong multiannual fluctuations. From 1983–1989 (except 1982) and over the late 1990s, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0159 was strongly positive, which indicates stronger NBP from DGVMs compared to inversions; following the year 2000, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0160 showed a decreasing trend. These multiannual tendencies are punctuated by years of strong peaks, for example, 1982, 1991/1992, 2009/2010, and 2015 (negative urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0161) and 1984, 1988/1989, 2000, and 2011 (positive). The trough in 1991/1992 suggests that DGVMs miss the large abnormal sink deduced from urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0162, which is a known feature from all previous budgets (Le Queré et al., 2009; Le Quéré, Andrew, Friedlingstein, Sitch, Pongratz, et al., 2018; Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b) and likely linked to the underestimation by DGVMs of the land sink enhancement in response to the eruption of Mt. Pinatubo (Lucht et al., 2002; Mercado et al., 2009). Other years are associated with positive and negative phases of El Niño (Bastos et al., 2018; Bowman et al., 2017), suggesting a possible contribution to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0163 of mismatches in the response of DGVMs and inversions to ENSO.

Details are in the caption following the image
Mismatch between DGVMs and inversions for the global land sink estimates, calculated following equation 1. The thin colored lines correspond to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0164 for each inversion and each model, with the colors indicating the respective inversion (CAMS in dark blue; the two CarboScope inversions s76 and s85 in dark and light cyan, respectively; MIROC in pink; and CarbonTracker Europe [CTE] in purple). The bold lines show the multimodel ensemble mean of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0165 for each inversion. The budget imbalance calculated by Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al. (2018b) is shown in black (inverted sign to match the sign of equation 1) for comparison.

The “budget imbalance” time series from GCB2018 (black line, Figure 1) shows very similar variability to the group of ensemble means, supporting the hypothesis that errors in the land sink representation explain a large fraction of the GCB2018 imbalance term. The values of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0166 show an offset compared to this term though, which is in part explained by the adjustment of lateral fluxes to Resplandy et al. (2018) values. Even though over the 39-year period no clear trend can be distinguished in the whole ensemble, seven out of the 80 pairs inversion-DGVM show significant trends, evaluated by the Mann-Kendall test.

The variability across (inversion, DGVM) pairs is much higher than the variability from year-to-year of the ensemble means. The 1 urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0167 of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0168 is 0.9 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0169 (Figure 1). This value is comparable to the uncertainty reported for the GCB2018 imbalance term ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0170 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0171 ).

3.2 Regional Contributions to the Global Differences

We quantify the contribution of each of the 18 regions to the variance of the spatially averaged differences over the globe ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0172, Figure 2) for the urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0173 DGVM pairs. All (inversion, DGVM) pairs single out Brazil (BRA) as the region contributing the most to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0174 at the global scale (16–27%, averaged accross DGVMs), followed by Oceania (OCE, 7–17%). Inversions also agree on a moderate contributions of northern South America (NSA, 4–7%), southern South America (SSA, 3–6%) and China (CHN, 3–11%). Four out of five inversions estimate a strong contribution of Southeast Asia (SEAS) to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0175 ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-01761% for CAMS, 6–11% for the other four). CAMS assigns a strong negative contribution of EU ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-01776%) versus a strong positive contribution of RUS (21%) to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0178, while the other inversions indicate comparable contributions of EU and RUS (0–7%). This suggests that inversions have dipoles in their allocation of the Eurasian CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0179 sink with inversions having more uptake in Russia and less in Europe. This is likely due to the relatively sparse observation network in RUS that does not allow to separate easily the flux of nearby regions.

Details are in the caption following the image
The 18 study regions: United States (USA), Canada (Canada), Europe (EU), Northern Africa (NAF), central Asia (CAS), Russia (RUS), Korea and Japan (KAJ), China (CHN), Southeast Asia (SEAS), Oceania (OCE), south Asia (SAS), Middle East (MIDE), southern Africa (SAF), equatorial Africa (EQAF), southern South America (SSA), Brazil (BRA), northern South America (NSA), and central America (CAM) are shown in the top panel. In the lower panel, the contribution of each region to the variance of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0180 is shown for each inversion, calculated for the multi-DGVM mean for each of the five inversions. The bar order corresponds to CAMS, CarboScope s76 and s85, MIROC, and CarbonTracker Europe.

If the contribution of a given region to the variance of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0181 changed over time, differences between the three “long” and the two short inversions (Table 1) can be explained partly by their different validity periods. However, large differences are found even between inversions with comparable validity periods, so they are most likely due to differences in the inversion systems. Moreover, we find a large spread of results between (inversion, DGVMs pairs) for several regions, indicating a relevant contribution of the uncertainty in spatial patterns of NBP from DGVMs to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0182.

As for the globe, the spread of regional urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0183 ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0184) for the 5  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0185 16 members ensemble in certain regions can be very high (Figure S2). There is not necessarily a correspondence between the regions with larger urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0186 spread and those regions identified above for their contribution to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0187. The regions showing larger average spread over the 39-year period are, in decreasing order, RUS (1.3 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0188), USA, and EU (1.0 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0189), and BRA and CAN (0.9 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0190). In some years the spread in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0191 can reach 2.6 and 1.8 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0192 for RUS and BRA, respectively, and 1.6 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0193 in USA, EU, and CHN, which is close to the magnitude of the global sink (Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b). In most regions, it is hard to distinguish between individual inversions, showing that the variability across DGVMs also contributes considerably to the range of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0194. Moreover, some regions show consistent interannual or long-term variability in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0195 and its spread. For example, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0196 is predominantly positive in the 1980s and 1990s, and becomes mostly negative following 2010. In OCE, a very large spread across data sets is observed for some years (e.g., 1999 and 2011) but not during most of the study period.

3.3 Decomposition of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0206

The intercept ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0207) of the LMERE fit corresponds to the average stationary value of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0208 for the period 1979–2017 (Figure 3 top panel). For the globe, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0209 is positive, indicating that DGVMs estimate a stronger global CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0210 sink than inversions. Regionally, negative urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0211 values are found over most of the Northern Hemisphere's extratropics (except CAN and RUS) and positive values in the tropics and southern extratropics. Negative (positive) values indicate that DGVMs report a weaker (stronger) sink in the northern (tropical and southern) regions. The largest urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0212 terms are found in South America ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0213 0.2 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0214), followed by EUR, CHN and USA ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-02150.15 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0216). The tropics and southern extratropics have average urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0217 of 1.0 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0218 and the northern regions of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-02190.5 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0220.

Details are in the caption following the image
Coefficients of the linear mixed effects model fit only with random effects only (equation 4) to the time series of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0197 over each region. The intercept urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0198 corresponds to the error urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0199 averaged over all models and all years over each region (top panel). A positive (negative) value of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0200 indicates an overestimate (underestimate) of NBP by the DGVMs compared to inversions over the study period (1979–2017). Regions where urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0201 values are not significantly different from zero are masked out in white. The coefficients for the random effects for each region and for the globe are shown in the bottom panel. A high value indicates a strong contribution of random effects from differences between DGVMs ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0202), between-year variability ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0203), model year variability ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0204), and variability between inversions ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0205).

The variance of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0221 and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0222 attributed to each random term of LMERE is shown in Figure 3 (bottom panel). Interannual variability ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0223) is the term contributing the most to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0224, followed by the between-DGVM variability ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0225) and model year combined ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0226).  Variability between inversions ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0227) contributes the least to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0228, but at the regional scale it is the most important factor in half of the regions, and is especially high in USA, CAN, EU, RUS, and CHN.

Interannual variability is the second most relevant term regionally, especially in BRA, OCE, NSA, and SSA where the interannual term dominates over the difference between inversions, although for the latter two the variance is rather small. Between-DGVM differences do not contribute much to the regional differences, except in RUS.

If random effects could explain all of the variance of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0229, the residuals of the model fit should be randomly distributed around 0 and not change over time. However residuals of the LMERE fit show clear trends or multiannual patterns (Figure S3). This misfit of LMERE suggests that deterministic processes that explain interannual variability in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0230 are missing or have a common bias in either inversions or DGVMs (probably being attributed to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0231). The most evident cases are the decadal variations in the residuals for the globe, the sharp decreasing trend in the misfit in BRA since circa 2005, and long-term trends in SSA, EU, and RUS.

3.4 Effects of Predictors

For each region and for the globe, we test a set of key predictors (fixed effects model, LMEFE, equation 5) that might help explain urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0232. The predictors tested are the (i) a simple linear trend (“year” as predictor); (ii) the Oceanic Nino Index (ONI); (iii) urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0233 from the BLUE bookkeping model between 1979 and 2017, and (iv) the same predictors plus urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0234 for 1997–2017 (the period covered by GFED4.1s). For each region, we tested additional predictors such as TWS, burned area and changes in forest and cropland areas. These additional predictors are correlated with the previous ones (e.g., soil water with ENSO, or urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0235 from cropland/forest area changes) and were not found to provide additional information. The regression coefficients of the LMEFE fit to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0236 and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0237 are shown in Figure 4 for the two periods.

Details are in the caption following the image
Coefficients of the LME model fit with fixed effects (equation 5). urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0238, ONI, and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0239 correspond to emissions from land use change, the Oceanic Niño Index, and emissions from fire, respectively. The regions abbreviations are described in Figure 2 . The symbols indicate statistically significant results: crosses for urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0240 value  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0241 and asterisks for urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0242 value  urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0243. The top panel shows values for the period 1979–2017, and the bottom panel for the period 1997–2017.

The coefficients indicate the effect of a unit change in the predictor (e.g., urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0244) to a corresponding unit change in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0245. We find that urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0246 is dominated by ONI and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0247, with a small trend component, during 1979–2017, and only by the trend during 1997–2017, as other effects cancel out regionally. All coefficients for the globe have negative sign, indicating that positive anomalies of each predictor contribute to explain an underestimate of the CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0248 sink by DGVMs relative to inversions. This means, for example, that a positive anomaly in land use change emissions (above average) would coincide with lower NBP in DGVMs compared to inversions.

For the 39-year period, a strong negative relationship is found between urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0249 and ONI, implying that DGVMs tend to estimate a weaker sink or stronger source than inversions in response to El Niño, and the opposite during La Niña events. The global relationship of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0250 and ONI is mainly driven by NSA, BRA, SAF, and SEAS, which also show strong negative effects of ONI, and partly offset by an opposing effect in CAS, MIDE, and SAS.

The LME urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0251 indicates urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0252 is explained by emissions from land use change in those regions where land use changes are more intense. The minus sign indicates that the higher urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0253 (positive for emissions), the more DGVMs underestimate the net sink globally compared to inversions. Such a negative contribution from LUC emissions is found in BRA, EU, EQAF, and SEAS. We tested LMEFE using urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0254 simulated by DGVMs (Experiments S2 and S3), which yielded very similar results. This suggests that too strong emissions from LUC in DGVMs might result in a weaker sink compared to inversions (which implicitly include urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0255).

We find a weak negative trend contribution to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0256, revealing a significant, though small, divergence of inversions and DGVMs over time by urn:x-wiley:gbc:media:gbc20956:gbc20956-math-025726 Tg C decade urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0258. In other words, DGVMs underestimate the rate of increase of the global terrestrial sink compared to inversions. Significant trend components of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0259 are found in 8 regions (negative in NSA, BRA, SSA, RUS, and CAS and positive in EU, KAJ, and SEAS), but their values are rather small (from 1–10 Tg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0260 in absolute magnitude).

Significant effects of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0261 are found in NSA, CAS, and SEAS (Figure 4), the first two negative (DGVMs estimating lower sink/stronger source associated with fires than inversions) and positive in SEAS (DGVMs estimating stronger sink/weaker source associated with fires than inversions). Including urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0262 reduces the variance of the residuals of the LME fit to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0263 during 1997–2017 by 80%, compared to a reduction of 58% for the fit without fire in that period (Figure S3). For one DGVM simulating fire using a state-of-the-art fire module, Yue et al. (2015) have shown a tendency of the model to overestimate fire emissions and burned area in regions roughly corresponding to NSA, BRA, and CAS, and to underestimate urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0264 and burned area in SEAS. This is discussed in section 3.5.3.

3.5 Sources of Uncertainty

3.5.1 ENSO, Temperature, and Soil Moisture Variability

The sensitivity of soil moisture from DGVMs to ONI in the regions with significant effects (NSA, BRA, SAF, and SEAS, negative and CAS, MIDE, and SAS, positive) is generally consistent with that of TWS from the GRACE reconstruction (Humphrey et al., 2018), although slightly underestimated for NSA and BRA (Figure S4). In SAF and SEAS the sensitivity of SM from DGVMs to ONI is close to that of TWS, but in the former DGVMs show a large range. The differences between DGVMs and inversions in these regions can be further related with the sensitivity of modeled NBP to the hot/cooler (e.g., by too strong respiration response to warming) or dry/wet anomalies linked with El Niño/La Niña events (e.g., by too strong water stress controls on productivity). Therefore, we evaluate the sensitivity of NBP to temperature and simulated soil moisture.

In NSA and BRA, inversions tend to estimate a positive sensitivity of NBP to temperature (higher uptake for warmer conditions, Figure 5), indicating a positive effect of temperature on productivity, which is mainly limited by radiation rather than water (Nemani et al., 2003). On the contrary, DGVMs tend to estimate negative sensitivity of NBP to temperature (lower NBP with warming). On the other hand, DGVMs estimate higher sensitivity of NBP to water availability than inversions (Figure 5). Higher sensitivity to water availability is also observed in SEAS, but inversions and DGVMs show similar sensitivity to temperature in this region.

Details are in the caption following the image
Sensitivity of NBP to annual temperature (CRU-JRA, top panel) and to soil moisture for inversions (GRACE reconstruction from (Humphrey et al., 2018)) and DGVMs (with simulated soil moisture) (bottom panel). The bars indicate the range of sensitivities from each set of data, and the crosses show the ensemble mean.

3.5.2 Land Use Change

The results of LME urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0265 (Figure 4) point to a negative effect of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0266 in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0267, indicating that DGVMs might overestimate emissions from LUC or underestimate LUC-related recovery sinks. However, the urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0268 data used relies on the same LUC forcing as DGVMs (BLUE) as a predictor. Therefore, the agreement between BLUE and DGVMs' estimates of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0269 is partly explained by their use of a common forcing. In this sense, BLUE cannot be seen as a fully independent estimate of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0270. We find a remarkable disagreement between regional estimates of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0271 from DGVMs and BLUE with those from HN2017, especially in NSA, BRA, and SEAS. This points to a strong influence of the underlying LUC data used to force models, more than to structural differences between bookkeeping models or differences between bookkeeping and process-based models.

We compare urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0272 from DGVMs with the two bookkeeping models (BLUE and HN2017, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0273 and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0274, Figure 6). Simulation S3 (climate, CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0275 and LUC), although more suitable to compare with NBP from inversions, includes in their urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0276 the LASC term ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0277). The difference between S3 and S2 (climate and CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0278 only) is thus not directly comparable to bookkeeping model estimates. Therefore, we further include results from Simulation S4 (LUC only under preindustrial climate and CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0279) that does not include the LASC to calculate urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0280.

Details are in the caption following the image
Comparison of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0281 from different data sets for the (top) globe, (middle) BRA, and (bottom) SEAS (where urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0282 is a significant predictor of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0283).

Globally, the multimodel ensemble mean (MMEM) of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0284 is above the estimates by the two bookkeeping models during most of the 39-year period (Figure 6), but urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0285 values are mostly between the two bookkeeping models, when they would be expected to be slightly lower. The two bookkeeping models differ on average by 0.5 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0286, with BLUE systematically above HN2017, possibly because it includes more processes, particularly shifting cultivation (Arneth et al., 2017), or because HN2017 use grasslands preferentially over forest for pasture expansion (Hansis et al., 2015).

In BRA, the LASC has a relatively small effect (Figure 6). The MMEM for urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0287 is generally close to the estimates of BLUE and shows the same dynamics: a decreasing trend from the 1980s until circa 2009, and then a sudden increase in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0288 of about 0.5 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0289, followed by a period with high emissions and strong interannual variability. This 2009–2017 period coincides with a strong decrease in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0290 in BRA (Figure S2), that is, DGVMs simulating a much weaker sink than inversions. HN2017, on the contrary, indicates lower emissions in the 1980s, a slow increasing trend until circa 2005 and a decreasing trend afterward.

In SEAS, peat burning and drainage are a major component of LUC-related fluxes (Moore et al., 2013). Since DGVMs do not simulate peat, the comparison of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0291 with bookkeeping models should be adjusted for peat burning and drainage emissions (also note that part of these fluxes in models could be included in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0292, which are discussed in the next section). When these emissions are removed from urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0293 estimated by bookkeeping models (Figure 6): urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0294 is close to the higher value estimated by BLUE, and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0295 is between the two bookkeeping models, but only following the year 2000. Before that, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0296 is well below both bookkeeping models, which is consistent with the lower equilibrium C densities in this simulation, as compared with bookkeeping values. The year 2000 also marks a period with higher urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0297 in BLUE, indicating a possible effect of the LUC forcing.

3.5.3 Fire

Fire emissions from GFED4.1s are found to be a significant predictor for urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0298 in NSA, CAS and SEAS. In the first two regions, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0299 effects on urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0300 are negative, indicating that higher fire emissions in GFED4.1s are associated with too weak CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0301 uptake by DGVMs compared with inversions. In SEAS, the effect is the opposite, which suggests an underestimation by DGVMs of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0302.

In Figure 7 (left panel) we compare urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0303 from those DGVMs that simulate fire (7 out of 16) with GFED4.1s fire emissions (Giglio et al., 2016). In NSA and CAS the ensemble of DGVMs estimates fire emissions about twice as high on average as those reported by GFED4.1s, but also much more variable (Figure S6). On the contrary, in SEAS, DGVMs estimate fire emissions that are, on average lower than the total emissions reported by GFED4.1s (about 60%). These results are consistent with overestimation of fire emissions in NSA and CAS leading to an underestimation of the CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0304 sink compared with inversions, and the opposite in SEAS.

Details are in the caption following the image
Comparison for the three regions where fire has a significant effect on urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0305 of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0306 (cross markers, left panel) and burned area (cross markers, right panel) simulated by DGVMs (green bars indicate the DGVM range) and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0307 reported by GFED4.1s and burned area from Fire-CCI Burned Area v5.1.

Biases in emissions from fire can be due to biases in simulated burned area or in the fuel properties and fire characteristics (e.g., human influence on ignitions). The comparison of simulated burned area with Fire-CCI burned area product (Chuvieco et al., 2016) indicates an overestimation by DGVMs of burned area in CAS and underestimation in SEAS, consistent with urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0308 biases. However, this is not the case in NSA, where DGVMs estimate too high fire emissions in spite of underestimating burned area.

4 Discussion

Tropical regions explained most of the variance in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0309, especially Brazil and northern South America, Southeast Asia and Oceania. These are, at the same time, the regions less constrained by observations. However, the regions with higher biases (Figure 3) are generally those with better observational coverage (USA, CAN, EU). These regions are strong urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0310 emitters, suggesting that spatial and temporal differences between prescribed urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0311 might still affect surface fluxes estimated by inversions, which were only adjusted for latitudinal bands.

At regional scale, negative values of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0312 (DGVMs underestimating the long-term sink) dominate in the Northern Hemisphere (especially Europe, China and USA), and positive values in the south, mostly in South America. We find that regional differences can be mostly attributed to variability across inversions, followed by interannual variability in the fluxes.

We found a strong contribution of interannual variability in NBP to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0313, associated with El Niño–Southern Oscillation (ENSO) and fluxes from land use change ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0314), especially in Brazil and Southeast Asia, the two regions also contributing the most to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0315 variance.

Using a similar set of DGVMs and some of the inversions used here, Bastos et al. (2018) have shown that DGVMs estimated a positive source anomaly in response to El Niño in 2015/2016 that was 0.3–0.6 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0316 stronger than inversions. The coefficient of ONI estimated here ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-03170.3 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0318/s.d.u., s.d.u. being the ONI values in standard deviation units) would imply a source anomaly difference between the inversions and DGVMs of about 0.5 Pg C yr urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0319 for these years ( urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0320.), consistent with their estimate.

Several studies have shown that variability in tropical CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0321 fluxes (mainly driven by ENSO) is better explained by variations in soil water storage than by temperature alone (Bastos et al., 2013; Humphrey et al., 2018; Poulter et al., 2014). The regions with stronger negative coefficients for ONI (NSA, BRA, SAF, and SEAS, Figure 4) are those where El Niño events impose strong warm and dry conditions, and those with positive coefficients (CAS, MIDE, and SAS) are associated with cool and wet conditions during El Niño (Bastos et al., 2013; Mason & Goddard, 2001). The combination of these two results hints at differences in the sensitivity of NBP from inversions and DGVMs to changes in water availability or temperature related with ENSO (section 3.5.1), or possibly also to changes in atmospheric circulation/mixing patterns linked to ENSO. The opposing sign in the sensitivity to temperature between inversions and DGVMs (Figure 5) may be due to an overestimate of the sensitivity of respiration to temperature (Bastos et al., 2018; van Schaik et al., 2018), or indirectly linked to the higher sensitivity of NBP to water availability in DGVMs compared to inversions. Models differ in prescribed soil depth and root water access for transpiration, but in general do not have deep rooting and ground water access of plants (Fan et al., 2017), which may explain why the sensitivity of regional CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0322 fluxes to water availability in DGVMs is higher than that of inversions with TWS. Still, inversions also show high uncertainty in these regions, which hampers the attribution of errors to one or other data source.

Excluding the years corresponding to the Mt. Pinatubo eruption, the strong peaks in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0323 are associated with moderate or strong El Niño (1982, 2009/2010, and 2015) or La Niña events (1984, 1988/1989, 2000, and 2011). In most of these years,the GCB2018 budget imbalance term shows similar changes as urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0324. This suggests that improvements in DGVMs modeling of tropical vegetation sensitivity to water availability could help reducing this gap in future GCBs.

The higher values of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0325 are consistent with the effect of CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0326 fertilization on natural ecosystems resulting in higher LUC-related emissions if a pristine forest is converted to managed land. However, urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0327 values (Figure 6) remain mostly between the two bookkeeping models, when they would be expected to be slightly lower, given that they rely on preindustrial C densities. Since most DGVMs do not include shifting cultivation, which is included in the BLUE model, estimates of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0328 from DGVMs are likely too high. The negative effect of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0329 to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0330 in BRA and SEAS, and consequently in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0331, suggests therefore a possible overestimation of LUC-related emissions due to the forcing used, for example, higher deforestation rates in LUH2 than in FAO/FRA.

The strong peak in emissions from LUC following 2009 in DGVMs and in BLUE is mainly explained by sharp increases in cropland area during 2009–2017 coinciding with increased forest loss in the LUH2v2.1h land cover data set (Figure S5). On the contrary, in the FAOSTAT (2015) data used by HN2017, most of the cropland area increase occurs before 2009, and forest loss slows down only after 2005. The LC-CCI also reports a slow down of forest area loss and stabilization of cropland in Brazil after 2005. A slowdown in deforestation rates in Brazil over this period has further been reported by Hansen et al. (2013) using high-resolution satellite imagery. This indicates that the overestimation of DGVMs of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0332 inferred from the MMEM appears to be related with biases in deforestation rates from LUH2v2.1 in this region.

In SEAS, the two forcings show opposing changes in deforestation rates starting from circa 2000 onward (Figure S5), coinciding with higher urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0333 in DGVMs and BLUE (Figure 6) LUH2v2.1h indicates a faster decrease of forest area, while (FAO, 2015) shows a tendency for deceleration in deforestation. LUH2v2.1h also reports higher cropland expansion values, with much faster rates following 2005. The study by Hansen et al. (2013) indicated that gross forest loss rates increased in Indonesia between 2000–2012. Li et al. (2018) showed that net forest area changes in Southeast Asia and Indonesia from Houghton et al. (2012) and from LUH2v2h between 1992–2012 are consistent with the changes reported by Hansen et al. (2013), but much stronger than reported in LC-CCI data. LUH2v2h also reports higher increase in cropland areas when compared to LC-CCI (Figure S5). Li et al. (2018) pointed that these differences could be from the increase in plantations such as oil palm, which is categorized as cropland in FAOSTAT but detected as forest from satellites. It is thus not fully clear if the higher emissions by DGVMs in SEAS compared to HN2017 are indeed because of the LUC forcing.

We found that biases in burned area explain well biases in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0334 and its contribution to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0335 and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0336, but not in NSA. A possible explanation for this disagreement might be related with simulated soil moisture and fuel dryness in this region (Van Leeuwen et al., 2013). For example, some models show very high urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0337 in 1997/1998 and 2015 in NSA compared to GFED4.1s. These very strong peaks are possibly a consequence of how vegetation response to drought during El Niño and consequent fire emissions (Patra et al., 2005) are simulated by DGVMs, perhaps leading to an overestimation of fire intensity and combustion completeness (since burned area is still lower than that of Fire-CCI in those years). In order to understand the errors in simulated fire emissions by DGVMs, a much more detailed analysis of the different factors controlling urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0338, from burned area to fuel characteristics and fire dynamics, is needed, which is beyond the scope of this study.

DGVMs tend to simulate a slower increase in CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0339 uptake than reported by inversions in several regions and globally. The weak negative trend in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0340 is consistent with studies showing that DGVMs may underestimate the response of vegetation to the effect of CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0341 fertilization (Fernndez-Martnez et al., 2019; Thomas et al., 2016), or the effect of N deposition in Asia (Liu et al., 2013), or forest regrowth (Pugh et al., 2019). The positive values for the trend in EU, KAJ, and SEAS indicate that DGVMs show a stronger long-term increase in the land sink compared to inversions. This might be because DGVMs underestimate the negative effects of warming and decrease in water availability concurrent with increasing CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0342 in some regions (Buermann et al., 2018). However, inversions might also indirectly overestimate the land sink by underestimating the ocean sink intensification in recent years (Landschützer et al., 2015).

We found a significant contribution of uncertainty in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0343 and of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0344, as well as a small trend component, to regional and global urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0345. It is thus worth evaluating whether those DGVMs including representation of key LUC processes such as shifting cultivation and wood harvest (Arneth et al., 2017), N deposition or fires perform better compared to the ensemble of 16 DGVMs.

By fitting the LME to each subset of DGVMs, we find that simulating fire, N deposition or wood harvest does not lead to major reductions in uncertainty (Table S1). The DGVMs including shifting cultivation, though, show significant reduction of the bias globally, as well as in SEAS and EQAF, regions where these practices are relevant (Heinimann et al., 2017). When adding the process representation as a predictor in the LME urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0346 (Figure S7), including some of these processes as predictors are found to improve the LME fit, but the coefficients are generally nonsignificant, excepting N deposition globally and wood harvest in some regions. These results indicate that including gross transitions in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0347 estimates is a key improvement to DGVMs, in order to reduce uncertainties in global and regional budgets.

The fact that models with fire do not perform much better than those without fire may be because models do not realistically simulate burned area and urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0348, as discussed above, but also because those DGVMs not simulating fire might compensate by having higher sensitivity of decomposition to temperature (as found for tropical regions in Figure 5). Simulating wood harvest also does not seem to necessarily reduce the bias in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0349, and is only a weak predictor in NSA, NAF and KAJ. This is possibly because the wood harvest fluxes are generally small, compared to other terms, for example, shifting cultivation (Wilkenskjeld et al., 2014), or because trade of wood products is not accounted for, and therefore the location of sinks and sources related with wood production and consumption is not well captured by DGVMs. Finally, N deposition if found to significantly contribute to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0350 (the negative sign indicating that models simulating N deposition tend to have lower urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0351), but when fitting the LME urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0352 model to that subset of DGVMs, no reduction in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0353 is found. This suggests that uncertainty from other terms is probably higher.

5 Conclusions

In this study we attempted to identify the sources of uncertainty explaining the budget imbalance term of the latest GCB (Le Quéré, Andrew, Friedlingstein, Sitch, Hauck, et al., 2018b). We compared DGVM outputs of the NBP between 1979 and 2017 with results from an ensemble of atmospheric inversions, whose fluxes are consistent with the growth rate of atmospheric CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0354. We showed that the difference between NBP estimates by DGVMs and inversions matches the main variation patterns of the GCB2018 budget imbalance term. At the global scale, interannual variability and between-DGVM variability contribute the most to urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0355, and variability across inversions contributes the least. Based on our results, we could identify key regions and processes that deserve special attention from the Earth System and C cycle communities:
  • (i) the reduction of uncertainty in atmospheric inversions, for example, through more in situ observations in the tropics, or the use of remote-sensing;
  • (ii) improvements in urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0356 estimates, by including shifting cultivation DGVMs and also by evaluating thoroughly land use change estimates at country level, especially for the period overlapping high-resolution satellite records;
  • (iii) the improvement of tropical hydroecological processes and fire representation within DGVMs;
  • (iv) accurate estimates of lateral carbon fluxes and their variability, to guarantee a fair comparison between top-down and bottom-up approaches.

There are several ongoing efforts to tackle each of the processes highlighted above. Satellite-based inversions might help to reduce uncertainty in the flux partitioning between land, ocean, and different continents, and regional inversions might provide better constraints of regional CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0357 fluxes. Above-ground biomass products at annual time steps and moderate resolution are currently available at least for the tropics (Brandt et al., 2018), providing a truly independent estimate of losses and gains of carbon in ecosystems due to both natural and anthropogenic processes. Considerable work is being put into improving tropical phenology, vegetation-water interactions, mortality, and nutrient limitations by several modeling teams.

The predictors tested in this work reduced the variance of residuals of urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0358 only by 25–51%, and residuals still show large variability at the global and regional scale. This points to other processes not discussed in this study either on the DGVM side (e.g., response of forests to climate change, forest regrowth or forest aging, or their response to nitrogen deposition to nutrient limitations) or on the inversion side (e.g., errors in transport modeling, inversion set up, atmospheric monitoring network) that might deserve more attention in the future. Such efforts will hopefully allow reducing uncertainty in global and regional CO urn:x-wiley:gbc:media:gbc20956:gbc20956-math-0359 budgets in the near future. This, in turn, is fundamental for a science-based evaluation of the effectiveness of mitigation policies implemented in light of the Paris Agreement.

Acknowledgements

This work was partly supported by the European Space Agency Climate Change Initiative ESA-CCI RECCAP2 project (ESRIN/ 4000123002/18/I-NB). P. K. P. is partly supported by the Environment Research and Technology Development Fund (2-1701) of the Ministry of the Environment, Japan. A. J.'s effort was supported in part by the Department of Energy (DESC0016323) and NSF (NSF AGS 12-43071). P. C. acknowledges support from the European Research Council Synergy Project SyG-2013-610028 IMBALANCE-P, and P. C. and D. M. the ANR CLAND Convergence Institute. W. P. received funding from the European Research Council (ERC) for the Airborne Stable Isotopes of Carbon from the Amazon (ASICA) project, Contract 649087. N. E. S. received funding from the Netherlands Organisation for Scientific Research (NWO) for the Ruisdael Observatory. I. T. L. received funding from NWO under Contract 016.Veni.171.095. CT Europe simulations were performed using a grant for computing time (SH-312, 16666) from NWO. The CESM project is supported primarily by the National Science Foundation (NSF). This material is based upon work supported by the National Center for Atmospheric Research, which is a major facility sponsored by the NSF under Cooperative Agreement 1852977. Computing and data storage resources, including the Cheyenne supercomputer (doi:10.5065/D6RX99HX), were provided by the Computational and Information Systems Laboratory (CISL) at NCAR. Oak Ridge National Laboratory is operated by UT-Battelle, LLC, under Contract DE-AC05-00OR22725 with the U.S. Department of Energy. The authors would like to thank Corinne Le Quéré and Johannes Winckler for constructive comments in the preparation of this study and the ESA Soil Moisture CCI and the Fire-CCI teams for producing and maintaining the data sets used. The GRACE reconstructed data set was kindly provided by V. Humphrey.