The Southern Ocean Carbon Cycle 1985–2018: Mean, Seasonal Cycle, Trends, and Storage
Abstract
We assess the Southern Ocean CO2 uptake (1985–2018) using data sets gathered in the REgional Carbon Cycle Assessment and Processes Project Phase 2. The Southern Ocean acted as a sink for CO2 with close agreement between simulation results from global ocean biogeochemistry models (GOBMs, 0.75 ± 0.28 PgC yr−1) and pCO2-observation-based products (0.73 ± 0.07 PgC yr−1). This sink is only half that reported by RECCAP1 for the same region and timeframe. The present-day net uptake is to first order a response to rising atmospheric CO2, driving large amounts of anthropogenic CO2 (Cant) into the ocean, thereby overcompensating the loss of natural CO2 to the atmosphere. An apparent knowledge gap is the increase of the sink since 2000, with pCO2-products suggesting a growth that is more than twice as strong and uncertain as that of GOBMs (0.26 ± 0.06 and 0.11 ± 0.03 Pg C yr−1 decade−1, respectively). This is despite nearly identical pCO2 trends in GOBMs and pCO2-products when both products are compared only at the locations where pCO2 was measured. Seasonal analyses revealed agreement in driving processes in winter with uncertainty in the magnitude of outgassing, whereas discrepancies are more fundamental in summer, when GOBMs exhibit difficulties in simulating the effects of the non-thermal processes of biology and mixing/circulation. Ocean interior accumulation of Cant points to an underestimate of Cant uptake and storage in GOBMs. Future work needs to link surface fluxes and interior ocean transport, build long overdue systematic observation networks and push toward better process understanding of drivers of the carbon cycle.
Key Points
-
Ocean models and machine learning estimates agree on the mean Southern Ocean CO2 sink, but the trend since 2000 differs by a factor of two
-
REgional Carbon Cycle Assessment and Processes Project Phase 2 estimates a 50% smaller Southern Ocean CO2 sink for the same region and timeframe as RECCAP1
-
Large model spread in summer and winter indicates that sustained efforts are required to understand driving processes in all seasons
Plain Language Summary
The ocean takes up CO2 from the atmosphere and thus slows climate change. The Southern Ocean has long known to be an important region for ocean CO2 uptake. Here, we bring together all available data sets that estimate the Southern Ocean CO2 uptake, from models that simulate ocean circulation and physical and biological processes that affect the ocean carbon cycle, from surface ocean observation-based estimates, from atmospheric transport models that ingest atmospheric CO2 observations, and from interior ocean biogeochemical observations. With these data sets, we find good agreement on the mean Southern Ocean CO2 uptake 1985–2018, which is 50% smaller than previous estimates when recalculated for the time period and spatial extent used in the previous estimate. However, the estimates of the temporal change of the Southern Ocean CO2 uptake differ by a factor of two and thus are not in agreement. We further highlight that knowledge gaps exist not only in winter when observations are typically rare, but equally in summer when biology plays a larger role, which is typically represented too simplistically in the dynamic models.
1 Introduction
The Southern Ocean (Figure 1) is the primary conduit between the surface and the deep ocean (Morrison et al., 2022; Talley, 2013) making it a key region for the global carbon cycle and the climate system across time-scales from paleo to present day and into the future (Canadell et al., 2021). First, water mass formation of Antarctic surface water occurs during large-scale upwelling of deep, old and carbon-rich water masses due to strong westerly winds (Marshall & Speer, 2012; Russell et al., 2006). Part of this water moves northwards by Ekman transport and contributes to the formation of Southern mode and intermediate waters (Ito et al., 2010; Morrison et al., 2022; Sallée et al., 2012) together with subtropical water masses (Iudicone et al., 2016). Another part moves southward and circulates in the large gyres of the Weddell and Ross Seas (Klatt et al., 2005). A fraction of these Antarctic surface waters densify on the Antarctic shelves through cooling and brine rejection during sea-ice formation on the Antarctic shelves to then flow down the Antarctic slope and form Antarctic Bottom Water (Jacobs, 2004; Orsi et al., 1999).

Study region. The Southern Ocean covers three biomes: The subtropical seasonally stratified (STSS), the subpolar seasonally stratified (SPSS), and the ice (ICE) biome. The biomes are defined following Fay and McKinley (2014). We further consider the Atlantic, Pacific, and Indian Ocean sectors separately in parts of the analysis. The dashed lines show the REgional Carbon Cycle Assessment and Processes Project Phase 2 Southern Ocean northernmost extent (35°S), the RECCAP1 Southern Ocean northernmost extent (44°S), and RECCAP1's boundary for the circumpolar region (58°S).
In pre-industrial times, the Southern Ocean was a net source of CO2 to the atmosphere due to upwelling of carbon-rich deep waters (Mikaloff Fletcher et al., 2007). Importantly, the large-scale upwelling that drove the natural outgassing fluxes in the polar and subpolar Southern Ocean still occurs today. However, since industrialization, increasing atmospheric levels of CO2 have shifted the equilibrium of CO2 partial pressure between the ocean and the atmosphere in the favor of the latter, thus overcompensating the natural outgassing (e.g., Hoppema, 2004). The contemporary net flux in the Southern Ocean can thus be understood as the sum of the outgassing of natural CO2 and uptake of anthropogenic CO2 (Gruber et al., 2009; Gruber, Landschützer, & Lovenduski, 2019). Importantly, the Southern Ocean has acted as the primary region of uptake for anthropogenic CO2 (Caldeira & Duffy, 2000; Frölicher et al., 2015; Khatiwala et al., 2009; Mikaloff Fletcher et al., 2006; Orr et al., 2001; Sarmiento et al., 1992), which is attributed to upwelling of old water masses (with low anthropogenic carbon) in a region of high wind speeds, as well as subsequent transport of excess carbon from the surface into the ocean interior through the formation of Subantarctic Mode and Antarctic Intermediate Water (Bopp et al., 2015; Langlais et al., 2017; Mikaloff Fletcher et al., 2006; Sallée et al., 2012; Waugh et al., 2006). In the absence of evidence of substantial changes in the biological carbon pump over the past decades, the role of biology for anthropogenic carbon uptake is thought to be small (Holzer & DeVries, 2022; Murnane et al., 1999). However, the biological carbon pump can have a strong imprint on the net fluxes during the summer when primary production draws down natural CO2 at the surface (e.g., Bakker et al., 1997; Hoppema et al., 1995, 1999; E. Jones et al., 2012, 2015) despite it being generally considered inefficient due to widespread limitation by iron (De Baar et al., 1995; Ryan-Keogh et al., 2023; Smetacek et al., 2012) and possibly other trace metals (Browning et al., 2021).
While the general importance of the Southern Ocean for the ocean carbon sink is recognized, it is also the region with the largest uncertainty in the mean and trend of the sink (Friedlingstein et al., 2022; Hauck et al., 2020). This is partly because the observation-based estimates and model-based estimates measure different components of the ocean carbon sink, and assumptions on fluxes associated with river discharge need to made, which carry high uncertainty themselves (Aumont et al., 2001; Lacroix et al., 2020). Further, the decadal variability of the Southern Ocean and the underlying mechanisms thereof are a key contributor to the uncertainty and are a topic of continued discussion (Canadell et al., 2021; Gruber, Landschützer, & Lovenduski, 2019; Hauck et al., 2020; Landschützer et al., 2015; Le Quéré et al., 2007; McKinley et al., 2020). A stagnation in the growth of the Southern Ocean carbon sink in the 1990s is commonly attributed to a strengthening of the westerly winds and associated intensified upwelling of carbon- and nutrient-rich deep water (Hauck et al., 2013; Le Quéré et al., 2007; Lovenduski et al., 2007). Indeed, evidence for this stronger upwelling is indirectly observed by enhanced surface nutrient concentrations in all Southern Ocean basins (Ayers & Strutton, 2013; Hoppema et al., 2015; T. Iida et al., 2013; Panassa et al., 2018; Pardo et al., 2017). The early 2000's marked the start of the so-called reinvigoration of the Southern Ocean carbon sink (Landschützer et al., 2015). The strength of the reinvigoration is uncertain due to the observation-based products potentially overestimating the trends owing to data sparsity (Gloege et al., 2021; Hauck et al., 2023; Landschützer et al., 2015), while further analysis on the trends in the models is needed. Furthermore, the drivers of the reinvigoration are less well understood than for the stagnation, but may be linked to changes in the atmospheric forcing (Gruber, Landschützer, & Lovenduski, 2019) and/or changes in the overturning circulation (DeVries et al., 2017). There is also evidence that both the stagnation and the reinvigoration are part of a global response to variations in atmospheric CO2 growth rate, ocean temperature and circulation induced by the 1992 eruption of Mount Pinatubo (Eddebbar et al., 2019; McKinley et al., 2020).
The Southern Ocean carbon sink is projected to continue to play an important role in the future carbon cycle as shown by Earth System Model simulations (Canadell et al., 2021; Hauck et al., 2015; Kessler & Tjiputra, 2016; Terhaar et al., 2021). However, there are indications that system changes may occur, such as a shift to a larger proportion of the CO2 uptake occurring in the polar Southern Ocean (Hauck et al., 2015), and a strong sensitivity of Southern Ocean carbon storage to physical ventilation and warming (Bourgeois et al., 2022; Katavouta & Williams, 2021; Terhaar et al., 2021).
In this study, we aim to synthesize and assess information on the Southern Ocean carbon sink over the period 1985 to 2018 in the framework of the REgional Carbon Cycle Assessment and Processes project, Phase 2 (RECCAP2). This work builds on a previous assessment, RECCAP Phase 1 (referred to as RECCAP1 for clarity), for the period 1990 to 2009 (Lenton et al., 2013). In RECCAP1, the Southern Ocean was defined as the ocean south of 44°S (building on earlier classification in the atmospheric inversion community), which, however, cut through the major anthropogenic CO2 uptake region at the northern edge of the Southern Ocean. The assessment was based on five global ocean biogeochemical models, 11 atmospheric inversions, 10 ocean inversions and a single pCO2 observation-based data set, the climatology of Takahashi et al. (2009). RECCAP1 resulted in a best estimate of the net Southern Ocean CO2 uptake (1990–2009) of 0.42 ± 0.07 PgC yr−1 based on all models (including inversions), with a surface pCO2-based climatology (Takahashi et al., 2009) suggesting a lower number of 0.27 ± 0.13 PgC yr−1 Lenton et al. (2013). The interannual variability was estimated to be ±25% around this mean value. The largest proportion of the mean flux occurred in the region 44–58°S which spans large parts of the Subantarctic Zone and of the Polar Frontal Zone with similar contributions from the Atlantic, Pacific and Indian Ocean sectors. In the Antarctic Zone (south of 58°S), individual estimates did not agree on the sign of the net CO2 flux.
A major advance since RECCAP1 is the release and ongoing updates of the Surface Ocean CO2 Atlas (SOCAT, Bakker et al., 2016), which currently provides 33.7 million quality-controlled and curated surface ocean pCO2 measurements with an accuracy of <5 μatm in the 2022 release (Bakker et al., 2022). The release of SOCAT allowed for the development of the surface ocean pCO2 observation-based products (pCO2-products) that interpolate and extrapolate sparse ship-based observations from SOCAT to global coverage. Based on these maps of surface pCO2, the air-sea CO2 flux is then calculated using gas-exchange parameterizations and input data fields such as sea surface temperature (SST) and wind fields (Wanninkhof, 2014). Since RECCAP1, a diverse set of statistical and machine-learning approaches have been developed (e.g., Chau et al., 2022; Gregor et al., 2019; Landschützer et al., 2014; Rödenbeck et al., 2014). The pCO2-products allowed for observation-based investigation of interannual and decadal variability. They confirmed the reported stagnation of the Southern Ocean carbon sink in the 1990s (Le Quéré et al., 2007), and identified the aforementioned reinvigoration in the 2000s (Landschützer et al., 2015; Ritter et al., 2017). However, these pCO2-products have made the Southern Ocean's long-standing issue of sparse observations even more evident. Observing system simulation experiments (OSSEs) have shown that these methods are prone to regional and temporal biases (Denvil-Sommer et al., 2021) and some pCO2-products may overestimate the decadal variability by 30% (Gloege et al., 2021). In fact, a recent study showed that the SOM-FFN pCO2-product used in the reinvigoration study of Landschützer et al. (2015) overestimates the model-based decadal trend 2000–2018 by 130% in an ocean model subsampling experiment (Hauck et al., 2023). However, these OSSEs have also shown that augmenting ship-based observations with well-placed, high accuracy pCO2 observations from autonomous platforms can reduce these biases (Denvil-Sommer et al., 2021; Djeutchouang et al., 2022; Gloege et al., 2021; Hauck et al., 2023).
The gap in ship-based pCO2 observations is slowly being addressed by a second major advance, that is autonomous measurement devices. Among these are pH-equipped biogeochemical Argo floats (BGC-floats) (Johnson et al., 2017; Williams et al., 2016). With this approach, float pH measurements are combined with multi-linear regression-derived alkalinity (Carter et al., 2016, 2018, 2021; Williams et al., 2016), to estimate pCO2. Although uncertainties of the BGC-float based estimates of pCO2 are, to date, higher (theoretical uncertainty of 11 μatm, Williams et al., 2017) than for direct pCO2 measurements (2 μatm, Bakker et al., 2016), some of these indirect pCO2 estimates fill critical gaps in the sparsely sampled winter months. These novel data, either on their own (Gray et al., 2018) or as additional input for pCO2-products (Bushinsky et al., 2019), reported a strong winter outgassing of CO2 in the subpolar Southern Ocean for the years 2015 through 2017 that also led to a substantially smaller estimate of the annual Southern Ocean CO2 uptake for these years. However, these larger-than-expected winter outgassing estimates were challenged by airborne flux estimates and direct pCO2 measurements from a circumpolar navigation by an uncrewed sailing drone (Long et al., 2021; Sutton et al., 2021). The sailing drone observations were in agreement with ship-based pCO2-product estimates throughout all seasons (Sutton et al., 2021). The authors attributed the discrepancy between BGC-floats and other estimates to either a bias of the float measurement devices or interannual variability. In support of the latter argument, the BGC-Argo-based air-sea CO2 flux in the years 2017–2019 also did not reveal the strong winter outgassing signal of the years 2015 and 2016 (Sutton et al., 2021).
Another advance since RECCAP1 is that more global ocean biogeochemical models (GOBMs) have become available with improvements in resolution and physical and biogeochemical process representation (Friedlingstein et al., 2022; Wanninkhof et al., 2013). While the ability of the GOBMs to capture interannual variability of air-sea CO2 fluxes (FCO2) was questioned by the larger variability of pCO2-product estimates (Le Quéré et al., 2018), the lower interannual variability of GOBMs now falls within the range of the larger ensemble of pCO2-products (Hauck et al., 2020; McKinley et al., 2020). For the decadal variability of FCO2, there is a moderate agreement between GOBMs and pCO2-products on a stagnation of the sink in the 1990s and an increase of the sink in 2002–2011 but with a larger amplitude of the multi-year/decadal variability in the pCO2-products (Gruber et al., 2023; Hauck et al., 2020; McKinley et al., 2020). Although the GOBMs compare reasonably well to global and Southern Ocean observations of surface ocean pCO2 (Hauck et al., 2020), their estimates of the global ocean carbon sink remain below those of interior ocean anthropogenic carbon accumulation estimates from 1994 to 2007 (Gruber, Clement, et al., 2019), atmospheric inversions, observed O2/N2 ratios (Friedlingstein et al., 2022; Tohjima et al., 2019), and a similar underestimation was found in Earth System Models (Terhaar et al., 2022).
The final major advance in the last decade are regional and global data-assimilating global ocean biogeochemistry models (Carroll et al., 2020; Verdy & Mazloff, 2017). These models bring together the process-based knowledge from GOBMs, but use data assimilation schemes to minimize mismatches between simulated fields, and physical and biogeochemical observations.
Despite these recent advances in observations and models, the Southern Ocean is still the region with the largest discrepancy in mean CO2 flux (although within the uncertainty of the fluxes associated with river discharge, which are implicitly included in the observation-based estimates, but not in the models, see Sections 2.2.1 and 2.3.1) and variability, as well as largest model spread (Canadell et al., 2021; Fay & McKinley, 2021; Friedlingstein et al., 2022). In this study, we aim to quantify the Southern Ocean (following the RECCAP2 biome shown in Figure 1) surface CO2 fluxes and interior storage of anthropogenic carbon over the period 1985–2018 from different classes of models and observations, and to identify knowledge gaps and ways forward.
This study is organized in the following way. In our methods, we describe the region (Section 2.1), the data sets that we use throughout this synthesis (Section 2.2), and how the data were processed (Section 2.3). Our results contain first the estimates of the mean fluxes 1985–2018 and their decomposition into anthropogenic and natural fluxes, and atmospheric CO2 versus climate effects (Section 3.1). This is followed by an analysis of summer and winter fluxes and the full seasonal cycle, where we also decompose pCO2 into seasonal thermal and non-thermal contributions (Section 3.2). In an annually resolved perspective, we revisit mean fluxes accounting for data sets with short temporal coverage (Section 3.3) before we then analyze the regionally averaged temporal trends of CO2 flux (Section 3.4). The latter also encompasses a pCO2 comparison with in situ pCO2 observations, as well as atmospheric CO2 and climate effects as drivers of the trends. In the final part of the results, the study then evaluates the GOBM simulation results with observation-based estimates of ocean interior storage of anthropogenic carbon in the Southern Ocean (Section 3.5). The discussion first summarizes the results with a comparison of the RECCAP1 and RECCAP2 results (Section 4.1). We also discuss the drivers of the seasonal cycle (Section 4.2), the interannual and decadal variability (Section 4.3), and the zonal asymmetry of the fluxes in the Southern Ocean (Section 4.4). Lastly, we discuss how our study links with and can inform observational programs (Section 4.5), before presenting a conceptual characterization of the Southern Ocean carbon cycle in the conclusions (Figure 14 in Section 5).
2 Methods
2.1 Regions
We use the RECCAP2 regions (De Vries et al., 2023) to define the Southern Ocean and its northern boundary (Figure 1). This definition of the Southern Ocean covers the subtropical seasonally stratified biome (STSS), the subpolar seasonally stratified biome (SPSS), and the ice biome (ICE) and is based on the global open ocean biome classification of Fay and McKinley (2014). This covers a larger area than the definition used in RECCAP1 (44–58°S, 58–75°S, Lenton et al., 2013) and has the advantage that it does not cut through the subtropical region with its large CO2 flux into the ocean. The northernmost extent of the Southern Ocean in this definition is 35°S. For parts of our analysis, we further separate the Atlantic, Indian, and Pacific Ocean sectors along longitudes of 20°E, 147°E, and 290°E (Figure 1).
2.2 Data Sets
Here, we introduce data sets across four different data classes that are used for the assessment of the Southern Ocean CO2 fluxes and storage, namely: ocean biogeochemistry models (14), surface pCO2-based data-products (11), data assimilated and ocean inverse models (3), and atmospheric inversion models (6).
2.2.1 Ocean Biogeochemistry Models
We used 13 global ocean biogeochemistry models (GOBMs) and 1 regional ocean biogeochemistry model (Table 1). These models simulate ocean circulation and biogeochemical fluxes caused by physics (advection, mixing, gas-exchange) and by biological processes. They are forced with atmospheric fields from reanalysis products, for example, by either heat and freshwater fluxes directly or by air temperature, wind speed, precipitation and humidity, which are converted to heat and freshwater fluxes using bulk formulae (see references in Table 1; Large et al., 1994). From these 14 models, 11 models are global ocean models with roughly 1° × 1° resolution, and two global models (FESOM_REcoM_HR and ORCA025-GEOMAR) and the regional model (ROMS-SouthernOcean-ETHZ) are available in ca. 0.25° × 0.25° resolution. Details of global model set-ups are given in DeVries et al. (2023). The ROMS-based regional Southern Ocean model has a northern boundary at 24°S.
Data set | Time period | Specific information | Reference |
---|---|---|---|
Global ocean biogeochemistry models | Simulations | ||
CCSM-WHOI | 1985–2017 | A, B, C, D | Doney et al. (2009) |
CESM-ETHZ | 1985–2018 | A, B, C, D | Lindsay et al. (2014) and S. Yang and Gruber (2016) |
CNRM-ESM2-1 | 1985–2018 | A, B, C, D | Séférian et al. (2019), Berthet et al. (2019), and Séférian et al. (2020) |
EC-Earth3 | 1985–2018 | A, B, C, D | Döscher et al. (2022) |
FESOM-REcoM-HR | 1985–2018 | A, B | Hauck et al. (2013) and Schourup-Kristensen et al. (2014, 2018) |
FESOM-REcoM-LR | 1985–2018 | A, B, C, D | Hauck et al. (2013), Schourup-Kristensen et al. (2014), and Hauck et al. (2020) |
MOM6-Princeton | 1985–2018 | A, B | Liao et al. (2020) and Stock et al. (2020) |
MPIOM-HAMOCC | 1985–2018 | A, B, C, D | Ilyina et al. (2013), Paulsen et al. (2017), and Mauritsen et al. (2019) |
MRI-ESM2-1 | 1985–2018 | A, B, C, D | Urakawa et al. (2020) |
NorESM-OC1.2 | 1985–2018 | A, B, C, D | Schwinger et al. (2016) |
ORCA025-GEOMAR | 1985–2018 | A, B, C, D | Madec and the NEMO team (2016), Kriest and Oschlies (2015), and Chien et al. (2022) |
ORCA1-LIM3-PISCES (IPSL-NEMO-PISCES) | 1985–2018 | A, B, C, D | Aumont et al. (2015) |
NEMO-PlankTOM12.1 | 1985–2018 | A, B, C, D | Le Quéré et al. (2016), Buitenhuis et al. (2019), and Wright et al. (2021) |
Regional ocean biogeochemical models | Simulations | ||
ROMS-SouthernOcean-ETHZ | 1985–2018 | A, B, D | A. Haumann (2016) and Nissen et al. (2018) |
Data-assimilated models | |||
B-SOSE | 2013–2018 | Verdy and Mazloff (2017) | |
ECCO-Darwin | 1992–2017 | Carroll et al. (2020, 2022) | |
OCIMv2021 | 1780–2018 | A, B, C | DeVries (2022) |
- Note. Sorted by data class, here: Global Ocean Biogeochemistry Models (GOBMs), Regional Ocean Biogeochemistry Model, and data assimilated models.
For the ocean-models listed above, up to four different simulations were provided (see also Table S1 in Supporting Information S1 and DeVries et al., 2023). These differ in whether atmospheric CO2 and all other atmospheric forcing variables vary on interannual time scales, are repeated for a single year, or follow a multi-year climatology. In simulation A, the historical run, both atmospheric CO2 and all other physical forcing variables vary on interannual time scales. In simulation B, the preindustrial control run, a repeated year or climatological physical atmospheric forcing is used, and the atmospheric CO2 levels are held constant at pre-industrial levels. In simulation C, the atmospheric CO2 varies interannually and only the physical atmospheric forcing is climatological. In simulation D, the atmospheric CO2 levels are held constant at pre-industrial levels, whereas the physical atmospheric forcing varies interannually. These simulations allow for the separation of the effects of the increase in atmospheric CO2 and climate change and variability on air-sea CO2 fluxes: the steady-state and non-steady state components of both natural and anthropogenic carbon. Here anthropogenic refers to the direct effect of increasing atmospheric CO2 and non-steady state encompasses the effects of climate change and variability. For a detailed explanation, please see DeVries et al. (2023) and further explanation in Le Quéré et al. (2010), McNeil and Matear (2013), Hauck et al. (2020), Crisp et al. (2022), and Gruber et al. (2023). Simulation A includes all components of the carbon fluxes. In the control simulation B, only the steady-state component of natural carbon is considered. In simulation C, only the steady-state components of both natural and anthropogenic carbon are accounted for. Finally, in simulation D, only the steady state and non-steady state components of natural carbon are represented.
The majority of models do not account for the river-induced outgassing of carbon (DeVries et al., 2023; Terhaar et al., 2023), hence the air-sea CO2 flux in simulation A corresponds to the SOCEAN definition used in the Global Carbon Budget (Friedlingstein et al., 2022), which differs from pCO2-product estimates by the river-induced term. Note that the river-induced term will be discussed in greater detail in Section 4.1. In addition, simulation A may include a model bias (mean offset) and drift (temporally changing offset). We assess the model drift of the air-sea CO2 flux by calculating the linear trend of the integrated CO2 flux time series for the period 1985–2018 in simulation B for each model and each biome. The time series plots and the linear trends reported in Figure 8 are drift corrected by subtracting the trend from simulation B. We note that this drift-correction only marginally impacts the reported trends in the result section, as the trends in simulation B are small compared to the mean fluxes for all models (see Text S1 and Figure S1 in Supporting Information S1). In contrast to a global bias (any deviation of the global mean CO2 flux from 0 in simulation B, see Hauck et al. (2020)), the regional bias in the simulated flux cannot be assessed by the set of simulations as it cannot be separated from the natural steady-state air-sea CO2 flux (Terhaar et al., 2023), which is non zero on a regional level.
We use the full suite of models in all analyses, with two exceptions. First, we excluded the MPIOM-HAMOCC model in all seasonal analyses (Figures 4-7) because its amplitude of the seasonal cycle is a factor 3–6 larger than in the other models in the three main Southern Ocean biomes (Figure S2 in Supporting Information S1), and including this outlier would skew the ensemble mean disproportionately. The exaggerated seasonal cycle in the MPIOM-HAMOCC model was found in earlier studies and is attributed to excessive net primary production in the Southern Ocean (Mongwe et al., 2018). Second, the decomposition into natural and anthropogenic CO2 fluxes was not possible with GOBMs that only provided simulations A and B (MOM6-Princeton and FESOM-REcoM-HR). See Section 2.3.5 for further restrictions on GOBM use and interpretation for the interior ocean anthropogenic carbon accumulation.
2.2.2 Surface pCO2-Based Data-Products
As a second data class, we use surface ocean pCO2 observation-based data products (pCO2-products) (Table 2, for more details see DeVries et al. (2023)). These pCO2-products extrapolate or interpolate sparse ship-based measurements of pCO2 using statistical modeling approaches. All pCO2-based data-products use SOCAT as the target data set. The majority of pCO2-products use similar gridded prediction data sets to fill the gaps, including SST, sea surface salinity (SSS), mixed-layer depth, and chlorophyll-a estimates for the open ocean. We use eight such pCO2-products that all cover the full time-series 1985–2018 for the ensemble mean of pCO2-products. AOML_EXTRAT covers a shorter period, and is thus not included in the ensemble mean 1985–2018, but is included in the ensemble mean 2015–2018. The largest methodological difference between the pCO2-products stems from the algorithm choice. The majority of the methods use regression approaches (a.k.a. machine learning) such as artificial neural networks (e.g., MPI-SOM-FFN) and gradient boosted decision trees (e.g., CSIR-ML6) to capture the relationship between the ship-based measurements and the predictor variables. The Jena-CarboScope product includes a mechanistic understanding of mixing, entrainment, and fluxes of CO2 into and out of the mixed layer (Rödenbeck et al., 2014). The HPD-LDEO method adjusts global ocean biogeochemistry model estimates of pCO2 to be closer to observed ship-based measurements and is thus an observation-based posterior correction to the GOBM estimates (Gloege et al., 2022).
Data set | Time period | Specific information | References |
---|---|---|---|
pCO2-products | |||
AOML-EXTRAT | 1998–2018 | Wanninkhof (2023) | |
CMEMS-LSCE-FFNN | 1985–2018 | Chau et al. (2022) | |
CSIR-ML6 | 1985–2018 | Gregor et al. (2019) | |
Jena-CarboScope (Mixed layer scheme) | 1985–2018 | Rödenbeck et al. (2013, 2022) | |
JMA-MLR | 1985–2018 | Y. Iida et al. (2021) | |
LDEO-HPD | 1985–2018 | Gloege et al. (2022) | |
NIES-ML3 | 1985–2018 | Zeng et al. (2022) | |
OceanSODA-ETHZ | 1985–2018 | Gregor and Gruber (2021) | |
MPI-SOM-FFN | 1985–2018 | Landschützer et al. (2016, 2020) | |
Jena-CarboScope (SOCCOM) | 2015–2018 | Bushinsky et al. (2019) updated | |
MPI-SOM-FFN (SOCCOM) | 2015–2018 | Bushinsky et al. (2019) updated | |
Watson2020 | 1988–2018 | Watson et al. (2020) | |
LDEO-climatology (Takahashi legacy) | Climatology | Takahashi et al. (2009) | |
Atmospheric inversions | Ocean prior | ||
Jena CarboScope | 1957–2020 (1990–2020) | CarboScope pCO2-product | Rödenbeck et al. (2018) |
CAMS | 1979–2020 (1990–2020) | CMEMS-LSCE-FFNN pCO2-product | Chevallier et al. (2005) |
NISMON-CO2 | 1990–2020 | JMA-MLR pCO2-product | Niwa et al. (2017) |
CarbonTrackerEurope (CTE) | 2001–2020 | CarboScope pCO2-product | van der Laan-Luijkx et al. (2017) |
UoE | 2001–2020 | Takahashi climatology | Feng et al. (2016) |
CMS-Flux | 2010–2020 | MOM6 GOBM | Liu et al. (2021) |
- Note. Sorted by data class, here: pCO2-products and atmospheric inversions. The atmospheric inversions were provided only since 1990.
2.2.3 Data-Assimilated Models
We use three data-assimilating models (Table 1). The Biogeochemical Southern Ocean State Estimate (B-SOSE, Verdy & Mazloff, 2017) is an eddy-permitting 1/6-degree resolution data-assimilating model, which assimilates the data from Southern Ocean Carbon and Climate Observations and Modeling (SOCCOM) BGC-Argo floats as well as shipborne and other autonomous observations (i.e., GLODAP and SOCAT) over the period 2013–2018. In situ and satellite observations of the physical state are also assimilated. B-SOSE is based on the MIT general circulation model (MITgcm, Campin et al., 2011) and uses software developed by the consortium for Estimating the Circulation and Climate of the Ocean (ECCO, Stammer et al., 2002; Wunsch & Heimbach, 2013) to build on the SOSE physical model framework by adding the Nitrogen version of the Biogeochemistry with Light, Iron, Nutrients, and Gases (N-BLING; evolved from Galbraith et al. (2010)) biogeochemical model. Consistency with the data is achieved by systematically adjusting the model initial conditions and the atmospheric state through the 4D-Var assimilation methodology. This B-SOSE assimilation methodology does not break the model biogeochemical or physical budgets. The budgets are closed, which allows one to understand signal attribution, though limits the control over the solution. For this reason B-SOSE is only consistent with the data on the timescales longer than approximately 90 days; the mesoscale eddies are reproduced statistically and not deterministically. Even with this assimilation methodology some seasonal biases still exist, and B-SOSE is still a work in progress.
The ECCO-Darwin data-assimilation model (Carroll et al., 2020) is based on a global ocean and sea ice configuration (about 1/3°) of the MIT general circulation model and is available from January 1992 to December 2017. Besides being global and covering a longer duration than B-SOSE, this product also uses a different biogeochemical model and assimilation technique. The ECCO circulation estimates used in this version are coupled online with the Darwin ecosystem model (Dutkiewicz et al., 2009), which represents the planktonic ecosystem dynamics coupled with biogeochemical cycles in the ocean. The Wanninkhof (1992) parameterization of gas transfer velocity is used and is the National Oceanic and Atmospheric Administration Marine Boundary Layer Reference product (Dlugokencky et al., 2021). The biogeochemical observations used to evaluate and adjust ECCO-Darwin include (a) surface ocean fugacity (fCO2) from the monthly gridded Surface Ocean CO2 Atlas (SOCATv5, Bakker et al., 2016), (b) GLODAPv2 ship-based profiles of NO3, PO4, SiO2, O2, dissolved inorganic carbon (DIC), and alkalinity (Olsen et al., 2016), and (c) BGC-Argo float profiles of NO3 and O2 (Drucker & Riser, 2016; Riser et al., 2018). To adjust the model's fit to the global biogeochemical observations, the Green's function approach is used to adjust biogeochemical initial conditions and model parameters.
OCIMv2021 is an inverse model that assimilates observations of temperature, salinity, CFCs, and radiocarbon to achieve an estimate of the climatological mean ocean circulation (DeVries, 2022). This steady-state circulation model is used together with an abiotic carbon cycle model and atmospheric CO2 forcing to simulate anthropogenic carbon uptake and its redistribution within the ocean. It uses a monthly time-step and simulates the period 1780 to 2018. No assimilation takes place during this period.
2.2.4 Atmospheric Inversions
Six atmospheric inversions are available for our analysis (Table 2). Atmospheric inversions make use of the worldwide network of atmospheric CO2 observations. They ingest a data set of fossil fuel emissions, which are assumed to be well known, into an atmospheric transport model and then solve for the spatio-temporal distribution of land and ocean CO2 fluxes while minimizing the mismatch with atmospheric CO2 observations (Friedlingstein et al., 2022). Thus, the resulting land and ocean carbon fluxes are bound to the atmospheric CO2 growth rate, but the estimated regional fluxes depend on the number of stations in the observational network. The inversions also start from prior estimates of land and ocean fluxes. For four inversion data sets that we use here, the ocean prior is taken from pCO2-products that are used in this analysis as well (Table 2). One inversion (UoE) uses the Takahashi climatology as a prior and one (CMS-Flux) an ocean biogeochemical model. The atmospheric inversions are thus not independent from the other data classes (Friedlingstein et al., 2022, their Table A4). The atmospheric inversion data were submitted for RECCAP in the same version as in the Global Carbon Budget 2021 (Friedlingstein et al., 2022), but only since 1990. The three inversions starting later (2001 or 2010) are only included in averages reported for 2015–2018 (Figures 4 and 5), and as individual lines in the time-series figure (Figure 8).
2.3 Processing
Throughout this study, we report the air-sea CO2 exchange as the net flux (FCO2), which is the sum of natural, anthropogenic, and river-induced air-sea CO2 flux (see e.g., Crisp et al., 2022; DeVries et al., 2023; Hauck et al., 2020). As the GOBMs vary widely in their choices on river carbon and nutrient input into the ocean and burial at the seafloor (see DeVries et al., 2023; Terhaar et al., 2023), an adjustment is applied to make all data classes comparable.
2.3.1 River Flux Adjustment
Globally, the majority of GOBMs produce a small imbalance of riverine carbon inflow and burial globally (<0.14 PgC yr−1), which is smaller than the current best estimate of river-induced CO2 ocean outgassing of 0.65 PgC yr−1 (Regnier et al., 2022). The imbalances are due to manifold choices and illustrate the lack of a closed land-ocean carbon loop in the GOBMs. As the GOBMs do not adequately account for the river discharge and its fate within the ocean, and thus for river-derived ocean CO2 outgassing (Terhaar et al., 2023), we account for this outgassing by using the spatial patterns of river-induced air-sea CO2 fluxes from Lacroix et al. (2020) that are scaled to the global value of 0.65 PgC yr−1 (Regnier et al., 2022). Southern Ocean outgassing from rivers amounts to 0.04 PgC yr−1, that is, around 6% of the global river flux. It is distributed over the Southern Ocean biomes as follows (positive outgassing): 0.00036 PgC yr−1 in the ICE biome, 0.053 PgC yr−1 (SPSS biome), −0.014 (STSS biome). The estimated riverine CO2 fluxes were added to biome-integrated fluxes in simulation A for all GOBMs, so that these are comparable to the pCO2-products. They are not added to spatial maps of CO2 fluxes due to large uncertainties in the regional attribution by Lacroix et al. (2020). The riverine fluxes are one (ICE) to multiple (SPSS, STSS) orders of magnitude smaller than the mean fluxes quantified in this study. The uncertainty associated with the river flux adjustment is discussed in Section 4.1.
2.3.2 Treatment of Different Area Coverage
Air-sea CO2 fluxes in all data classes were integrated over the area available for each GOBM, pCO2-product etc., that is, fluxes were not scaled to the same ocean area here. Relative to the ocean area in the RECCAP mask, the covered ocean areas in the GOBMs and data-assimilating models corresponds to 96.2%–100% (minimum for CCSM-WHOI) and to 95.6%–100% in the pCO2-products (minimum for JMA-MLR). These differences mainly stem from the ICE biome. We assume that the discrepancy arising from differences in covered area are smaller than the uncertainty arising from any extrapolation to the same area.
2.3.3 Trend Calculation
The reported linear trends between 1985 and 2018, 1985 and 2000 or 2001 and 2018 were obtained from a linear regression of the annual mean time series (using Python's scikit-learn). The slope of the respective fit was multiplied by a factor of 10 to obtain decadal CO2 flux trends. The start and end year of the trend calculation were chosen based on the RECCAP2 protocol and are supported by the reported stagnation of the Southern Ocean carbon sink in the 1990s and the reinvigoration since the early 2000s (Hauck et al., 2020; Landschützer et al., 2015; Le Quéré et al., 2007).
2.3.4 pCO2 Decomposition
2.3.5 Anthropogenic Carbon Inventories
Anthropogenic CO2 (Cant) is defined as the change in ocean DIC since preindustrial times due to the direct effect of increasing CO2 concentration in the atmosphere. It is computed as the DIC difference between experiments A and D. The accumulation of Cant can be separated into a steady-state component (, DIC difference between experiments C and B), that is influenced only by the increased atmospheric CO2, and a nonsteady state component (), which considers the effect of climate variability and change on Cant (and which is maximally 10%–20% of Cant, Text S2 and Figures S3 and S4 in Supporting Information S1). Here we focus mainly on the change in Cant that has occurred over the period 1994–2007 (hereafter ΔCant), to correspond to the years covered by the eMLR(C*) observation-based estimate (Gruber, Clement, et al., 2019). The eMLR(C*) method (Clement & Gruber, 2018) uses ocean measurements of DIC from GLODAP2 (Olsen et al., 2016) over more than 30 years as the foundation to determine ΔCant between nominal years 1994 and 2007. The method has been shown to be accurate at global and basin scales, but is more uncertain at sub-basin scales and should not be used below 3,000 m depth. The (2 sigma) uncertainty of the eMLR(C*) product is estimated to be around 19% for the Southern Hemisphere (Gruber, Clement, et al., 2019). The eMLR(C*) method differs fundamentally from past indirect or model-based methods used to estimate Cant accumulated since pre-industrial times (DeVries, 2014; Gruber et al., 1996; Sabine et al., 2004; Waugh et al., 2006). Of these, we used the 1800–1994 cumulative Cant estimate based on Sabine et al. (2004), which is characterized by an uncertainty of about 20% globally (Matsumoto & Gruber, 2005; Sabine et al., 2004). In terms of GOBMs, we used all those listed in Table 1, with the exception of FESOM-REcoM-HR and MOM6-Princeton who provided only experiments A and B. For most GOBMs, we analyze , to allow for a more accurate comparison with the observation-based data set (eMLR(C*)). However, for MPIOM-HAMOCC and CNRM-ESM2-1 it was only possible to compute , because of physical forcing inconsistencies between experiments A and D. We believe that the advantage of including all GOBMs in the analysis outweighs the disadvantages of having an incoherent definition of Cant among GOBMs. It should be noted that the spin-up procedure of ROMS-SouthernOcean-ETHZ, which uses atmospheric CO2 from 1969 to 1978 (for a 10 year spin-up of the biogeochemical component), makes it suitable only for the analysis of ΔCant between 1994 and 2007, and not of cumulative Cant until 1994 nor of air-sea Cant fluxes in specific years. As explained in the RECCAP2 model evaluation chapter (Terhaar et al., 2023), all GOBMs are forced with a very similar atmospheric CO2 mixing ratio (xCO2) over the historical period. However, the atmospheric xCO2 in the pre-industrial control simulations across the GOBM ensemble varies between 278 and 287.4 ppm, leading to an underestimate of the Cant storage for those models with a late starting date (Terhaar et al., 2023).
3 Results
3.1 Mean Air-Sea CO2 Fluxes 1985–2018
We start with a comparison of the average air-sea CO2 flux in the two data classes (GOBMs, pCO2-products) that cover the full period 1985–2018. We exclude data classes with fewer products for the sake of robustness, and show the comparison between all data classes in Sections 3.2 and 3.4. The mean net Southern Ocean air-sea CO2 flux 1985–2018 by the GOBM ensemble is −0.75 ± 0.28 and −0.73 ± 0.07 PgC yr−1 (flux into the ocean) for the pCO2-product ensemble mean (Figure 2a). While both ensemble means result in an almost identical ocean uptake of CO2, the GOBM ensemble spread is four times larger.

Temporal average of the Southern Ocean CO2 net flux (FCO2). A positive flux denotes outgassing from ocean to atmosphere. The temporal average is calculated over the period 1985 to 2018 for the global ocean biogeochemistry models (GOBMs) and pCO2-products (Tables 1 and 2). (a) The green and blue bar plots show the ensemble mean of the GOBMs and pCO2-based data-products, and open circles indicate the individual GOBMs and pCO2-products. The ensemble standard deviation (1σ) is shown by the error bars. The river flux adjustment added to the GOBMs is small (0.04 PgC yr−1), its distribution over the biomes is described in Section 2.3.1. (b) Zonal mean flux density of the different data sets. Thick green and blue lines show the ensemble means, and thin green and blue lines show the individual GOBMs and pCO2-products. Approximate boundaries for biomes are marked with black points on the x-axis. (c, d) Maps of spatial distribution of net CO2 flux for ensemble means of GOBMs, and pCO2-products.
All Southern Ocean regions are sinks of CO2 based on the ensemble averages of the GOBMs and pCO2-products (Figure 2). The subtropical seasonally stratified biome (STSS), which is a subduction area with deep winter mixed layer depth and intermediate chlorophyll concentration (Fay & McKinley, 2014), is the largest sink according to all data sets (GOBMs: −0.53 ± −0.17 PgC yr−1, pCO2-based products:-0.62 ± 0.05 PgC yr−1, Figure 2a). Second is the subpolar seasonally stratified biome (SPSS) (GOBMs: −0.13 ± 0.14 PgC yr−1, pCO2-products: −0.06 ± 0.02 PgC yr−1), which is characterized by upwelling of old water, rich in natural carbon but with low anthropogenic carbon content. The upwelled water is also rich in nutrients, and thus a region with important biological activity. Note that three GOBMs simulate the SPSS to be a source of CO2 to the atmosphere. The marginal sea ice (ICE) biome is the weakest CO2 sink (GOBMs: −0.09 ± 0.13 PgC yr−1; pCO2-products: −0.04 ± 0.02 PgC yr−1) due to sea ice acting as a lid that prevents carbon outgassing in winter, and is the smallest of all three biomes covering an area of about 60% the size of STSS or SPSS (Fay & McKinley, 2014). Four individual models suggest that the ICE biome is a weak outgassing region, but no other data set supports this. Though small, the differences between GOBM and pCO2-product FCO2 at the biome-level compensate each other leading to the near-identical estimates for the whole Southern Ocean (Figures 2a and 2b).
In a zonal mean view (Figure 2b), the smallest uptake occurs between 62 and 55°S and the largest uptake around 40°S. However, the amplitude differs between data classes, with the pCO2-products having a larger difference between minima and maxima (1.96 mol C m−2 yr−1), than the GOBM ensemble mean (1.19 mol C m−2 yr−1). Some of the individual GOBMs deviate from this pattern (see Figure S5a in Supporting Information S1 for zonal means of individual models).
Regionally, significant differences emerge between the Atlantic, Indian and Pacific sectors of the Southern Ocean (Figures 2c and 2d). Within the STSS, large CO2 fluxes into the ocean occur in the Atlantic and Indian sector across all data classes (Figures 2b and 2c, mean flux density: −1.93 and −2.05 mol C m−2 yr−1 for GOBMs and pCO2-products, respectively, in the Atlantic sector, −1.44 and −1.89 mol C m−2 yr−1 in the Indian sector, and −1.22 and −1.54 mol C m−2 yr−1 in the Pacific sector). CO2 outgassing locations differ across the data classes. In the GOBM ensemble mean, the outgassing is mainly confined to the Indian sector of the SPSS, whereas it is more widely spread in the pCO2-product ensemble mean covering the Pacific and Indian Ocean sectors of the SPSS and the Indian sector in the ICE biome. The smooth appearance of the outgassing signal in the GOBM and pCO2-product ensemble means may be partly attributable to averaging over multiple data sets and months and years.
3.1.1 Decomposition Into Anthropogenic and Natural Carbon Fluxes and Climate Versus Atmospheric CO2 Effects on the Mean CO2 Flux
With the aid of the additional model simulations, we can decompose the net Southern Ocean air-sea CO2 flux into natural and anthropogenic components, and separate the indirect effects of physical climate change and the direct geochemical effect of increasing atmospheric CO2 mixing ratios. The GOBM ensemble mean indicates that the natural Southern Ocean carbon cycle without anthropogenic perturbation would be a small CO2 source to the atmosphere of 0.05 PgC yr−1, although with a large model spread as indicated by the standard deviation of 0.25 PgC yr−1 (Figure 3 green bars). In fact, six GOBMs simulate negative natural CO2 fluxes, that is, into the ocean, and six GOBMs simulate positive natural fluxes, that is, out of the ocean. This also illustrates that the GOBM spread of net fluxes (gray bars in Figure 3, standard deviation: 0.28 PgC yr−1) is, to the first order, dominated by the model differences of natural fluxes (standard deviation: 0.25 PgC yr−1), which may contain artifacts from model biases and drift (Terhaar et al., 2023). The spread of anthropogenic fluxes is smaller (0.13 PgC yr−1, dark red in Figure 3). The small natural outgassing signal in the ensemble mean is a balance of natural CO2 uptake in the STSS (−0.26 ± 0.14 PgC yr−1) and outgassing in the SPSS (0.21 ± 0.11 PgC yr−1) and ICE (0.10 ± 0.12 PgC yr−1) biomes. This is in qualitative agreement with the patterns of natural CO2 fluxes by Mikaloff Fletcher et al. (2007) who report outgassing of natural CO2 in the Southern Ocean with the largest contribution from the subpolar region.

Decomposition of the modeled net air-sea CO2 flux 1985–2018 into natural and anthropogenic CO2 fluxes; as well as into CO2 and climate effects. See method Section 2.2.1 for explanation on this decomposition. The separation into natural and anthropogenic CO2 fluxes is not possible for FESOM-REcoM-HR and MOM6-Princeton models as only simulations A and B are available. These models are only shown as crosses for net FCO2 but not used for averaging. Hence, separation within this figure is coherent, but the net FCO2 is slightly different from the net FCO2 in Figure 2.
The anthropogenic perturbation (−0.79 ± 0.13 PgC yr−1, dark red Figure 3) has turned the SPSS and ICE biomes, and possibly the entire Southern Ocean, from source to sink. The large anthropogenic flux contribution in the SPSS (−0.38 ± 0.08 PgC yr−1) suppresses the natural CO2 outgassing flux. The STSS is a sink for both natural and anthropogenic flux components. The direct effect of increasing atmospheric CO2 enhances the Southern Ocean sink by −0.74 ± 0.11 PgC yr−1 (light red Figure 3) and is the largest signal in the anthropogenic perturbation. A smaller component stems from the climate change effect on this steady state CO2-induced flux (Figure S6 in Supporting Information S1). The direct CO2 effect is largest in the SPSS (−0.34 ± 0.06 PgC yr−1) where old water masses reach the surface that are undersaturated in anthropogenic carbon, followed by the STSS and ICE biomes (−0.23 ± 0.03 and −0.17 ± 0.03 PgC yr−1). In the upwelling regions, the primary effect of rising atmospheric CO2 is thus to suppress the outgassing of natural carbon.
The effect of physical climate change and variability, that is, warming and changes in wind speed patterns and strength that provoke changes in circulation (Hauck et al., 2013; Le Quéré et al., 2007; Lovenduski et al., 2007), reduces the CO2 flux into the ocean (+0.04 ± 0.07 PgC yr−1, yellow in Figure 3), but is overall small in comparison to the direct CO2 effect. This climate change induced outgassing stems nearly entirely from the SPSS (+0.04 ± 0.04 PgC yr−1), with the largest contribution from the Indian sector followed by the Pacific (Figure S7 in Supporting Information S1). Thus, the climate change effect amplifies the natural CO2 outgassing, which is also the largest in the Indian and Pacific sectors of the SPSS. The climate effect is a combination of climate effects on natural and anthropogenic CO2 fluxes, which partly oppose each other (Figure S6 in Supporting Information S1).
3.2 The Seasonal Cycle of Air-Sea CO2 Fluxes in the Southern Ocean 2015–2018
We now shift our focus to seasonal fluxes by separating fluxes into separate winter (Figure 4) and summer (Figure 5) mean CO2 fluxes. For this, we examine the period 2015–2018, for which all data sets are available (see Figure S8 in Supporting Information S1 for an annual mean figure for 2015–2018 and Table S4 in Supporting Information S1 for an overview of numbers and comparison to annual means 2015–2018 and 1985–2018). The characteristics of the seasonal cycle and differences between data classes described below are robust to choice of period (2015–2018, 1985–2018).

Average winter (June–August) air-sea CO2 fluxes (FCO2) in the period 2015–2018, (a) averaged over biomes, (b) zonal mean flux density, and (c–f) maps of flux density. Same as Figure 2, but including also data sets with shorter coverage, and a map of the CO2 flux from the BGC-float pCO2-products (panel (e)), and Biogeochemical Southern Ocean State Estimate (f), and hence focusing on the period 2015–2018 for all data sets for comparability. Note that the MPI model is excluded here. The zonal mean of individual models are presented in Figure S5c in Supporting Information S1.

3.2.1 Winter
In winter, all but two data sets (one GOBM and BGC-float pCO2-products) agree that the Southern Ocean is a sink of CO2 (GOBMs: −0.83 ± 0.40 PgC yr−1, pCO2 products: −0.51 ± 0.17 PgC yr−1; Figure 4a, Table S4 in Supporting Information S1). The general pattern of strong uptake toward the north and a reduction toward the south is common to all data classes, though exceptions for individual GOBMs do exist (Figures 4a and 4b). Expounding on this, the strong uptake in the STSS is shown by all data sets, but further south the coherence disintegrates. Within the SPSS, there is considerable variation in position and magnitude of maximum outgassing with some GOBMs being a sink along the entire zonal mean (Figures 4a and 4b). Toward the southern reaches of the ICE biome, fluxes are more coherent as they are constrained by sea-ice cover in winter (Figure 4b). For the zonal means of individual GOBMs, see Figure S5 in Supporting Information S1.
The divergence between data class average flux estimates for the Southern Ocean are explained nearly entirely by differences in the SPSS (GOBMs: −0.15 ± 0.32 PgC yr−1 and pCO2 products: 0.13 ± 0.13 PgC yr−1, in Figure 4a). Note also that the spread of the individual GOBMs is the largest in the SPSS (0.32 PgC yr−1), although it is also substantial in the other biomes (STSS: 0.29 PgC yr−1, ICE: 0.13 PgC yr−1) (Figure 5a). The SPSS is also where we see the largest impact of the inclusion of floats in the BGC-float pCO2-products (Figures 4d and 4e), with the mean outgassing flux more than doubling that of the regular pCO2-product ensemble. Interestingly, differences between B-SOSE and BGC-float pCO2-products are large (∼1 PgC yr−1, whole Southern Ocean) and FCO2 from these data sets even have a different sign for whole Southern Ocean and SPSS biome (Figure 4) although they both include BGC float data.
The zonal differences and features of fluxes between data classes are also most distinct in the SPSS (Figures 4c–4f). In short, the Atlantic sector of the SPSS has the lowest flux (weak source or even sink), while the Indian and Pacific sectors dominate the outgassing. The data-assimilated model B-SOSE has stronger localized outgassing compared with the other data classes but bear in mind that B-SOSE is only one data sets (Figure 4f), while the other data classes (Figures 4c–4e) represent up to 13, thus potentially averaging out local signals. The outgassing hotspot at the boundary between the Atlantic and Indian sectors of the SPSS can also be recognized in the pCO2-products (Figure 4d). The second hotspot in the western Pacific SPSS is not distinguishable in the other data sets.
3.2.2 Summer
In summer, GOBMs, pCO2-products and inversions largely show CO2 uptake within the three Southern Ocean biomes, and outgassing north of the STSS (Figures 5a and 5b, Table S4 in Supporting Information S1). In contrast to winter, the GOBM ensemble mean for summer 2015–2018 (−1.04 ± 0.77 PgC yr−1) underestimates the CO2 uptake relative to the pCO2-product ensemble mean (−1.45 ± 0.17 PgC yr−1, Figure 5a). This also holds true for the data-assimilated models, where B-SOSE even simulates outgassing in the SPSS (Figures 5a, 5b, and 5f). Otherwise, the data-assimilated models, B-SOSE and ECCO-Darwin, deviate substantially from the other data classes. The differences between pCO2-products with and without BGC-float data are hardly apparent in summer (Figure 5a, compared to Figure 4a). This could be due to a smaller discrepancy between float and ship-data in summer, and/or a dominance of SOCAT data in summer for the ship + float estimate. For context, for the period 2015 through 2018, BGC-float data account for up to 70% of winter pCO2 monthly by 1° × 1° measurements in the Southern Ocean (SOCAT + floats), while in summer the floats represent only 20% (Bakker et al., 2016; Bushinsky et al., 2019).
While the STSS was a region of coherence between data classes in winter (Figure 4), it is the main source of the discrepancy between the GOBM and pCO2-product ensemble means in summer (GOBMs: −0.40 ± 0.28 PgC yr−1, pCO2-products: −0.73 ± 0.07 PgC yr−1). The discrepancy is comparatively smaller in the SPSS (GOBMs: −0.33 ± 0.34 PgC yr−1, pCO2-products: −0.41 ± 0.06 PgC yr−1). We note that CO2 fluxes for both GOBMs and pCO2-products show less variation from ICE to STSS in summer compared to winter (Figure 4b vs. Figure 5b, respectively). There is, nevertheless, an offset with lower GOBM CO2 uptake than in pCO2-products north of 55°S, and vice versa to the south. Also, the GOBM spread in the represented magnitude of the fluxes is large. In absolute terms, the GOBM ensemble spread of fluxes in summer (from −2.03 to +0.28 PgC yr−1) is larger than in winter (from −1.36 to 0.12 PgC yr−1) or than the spread in the annual mean (from −1.30 to −0.38 PgC yr−1; see Figure S5b in Supporting Information S1 for zonal means of individual GOBMs). This mirrors the difficulty in representing the balance between physical and biological processes in summer, which is further assessed in the next two Sections 3.2.3 and 3.2.4.
3.2.3 The Full Seasonal Cycle
We diagnose distinctly different seasonal cycles in the three biomes. The ICE biome has a rather clear maximum uptake in summer in the GOBM and pCO2-product ensemble means, as well as most individual data sets (Figure 6a). In the STSS, the pCO2-products suggest a weak seasonal cycle with a maximum uptake in autumn (Figure 6c), while the majority of GOBMs simulate a maximum CO2 uptake in winter and a substantially smaller flux in summer.

The seasonal cycle of air-sea CO2 flux in the Southern Ocean separated by biomes for all data sets as indicated in the legend, (a) subtropical seasonally stratified biome, (b) subpolar seasonally stratified biome, and (c) ice biome. Thin green and blue lines depict individual global ocean biogeochemistry models (GOBMs) and pCO2-products, and thick lines indicate their ensemble means. Note that the MPI model is excluded here. The ensemble standard deviation (1σ) is shown by the bars for each month. Panels (d–u) present the season of maximum CO2 uptake per grid cell in the individual GOBMs, data-assimilated models and the ensemble mean of the pCO2-products over the period indicated in the panels (varies by data set). See Figure S9 in Supporting Information S1 for the individual pCO2-products (panel (d–u) equivalents) and Figure S10 in Supporting Information S1 for the seasonal cycle in all nine subregions (equivalent to panels (a–c) but further split into Atlantic, Pacific, and Indian Ocean sectors).
The largest disagreement occurs in the SPSS, where the seasonal cycle transitions from winter outgassing in the ICE biome to summer outgassing in the STSS biomes. Here, in the SPSS, atmospheric inversions and pCO2-products (including the BGC-float pCO2-products), suggest the maximum CO2 uptake to be in summer. In winter, the BGC-float pCO2-products more than double the estimates of outgassing relative to the other pCO2-products (Figure 6b). The GOBM ensemble average roughly agrees with this seasonal pattern, but simulates a reduced seasonal cycle amplitude (Figure 6b). The reduced amplitude in the SPSS is also due to different seasonal cycle phasing within the GOBM ensemble, with 8 out of 13 GOBMs simulate the maximum uptake between November and January (Figures 6b and 6d–6r), illustrating the complexity of representing the transition between the different seasonal cycle regimes. As already seen in Figures 4 and 5, the seasonal cycle differs substantially between B-SOSE and BGC-float pCO2-products, particularly in the SPSS despite the new information provided by the BGC-floats. Further, the two data-assimilating models differ by a similar amount as other GOBMs differ from each other, suggesting that the model dynamics dominate over assimilated data.
In summary, most GOBMs and pCO2-products agree on a summer peak in the ICE biome (but exceptions exist, Figures 6d–6r), and a winter peak at the northern boundary of the Southern Ocean. The largest discrepancy between data sets is where and how swift this transition occurs. While the use of static biomes adds to the discrepancies seen in the averaged seasonal cycles (Figures 6a–6c), the disagreement between the phasing of individual GOBMs is likely a much larger contributor to these discrepancies (Figures 6d–6p). We now turn to an investigation of the thermal and non-thermal effects on the seasonal cycle, which may help explain these discrepancies.
3.2.4 Thermal Versus Non-Thermal Effects on the Seasonal Cycle
The seasonal cycle of CO2 fluxes in the Southern Ocean is a balancing act between competing thermal and non-thermal drivers (Mongwe et al., 2016, 2018; Prend et al., 2022). DIC draw-down by biological production leads to a summer minimum in pCO2 and maximum in CO2 uptake (Figures 7a–7c, Metzl et al., 2006; Mongwe et al., 2018). Whereas in autumn and winter, upwelling and entrainment of DIC-rich water into the mixed layer leads to a maximum in pCO2 and a minimum in CO2 uptake or even outgassing. Seasonal variations in mixed layer temperature further affect the solubility of CO2, with lower (higher) temperatures increasing (decreasing) solubility and thus promoting CO2 uptake (outgassing) (Takahashi et al., 2002).

(a–c): The climatological seasonal cycle of pCO2 on a monthly scale. (d–f): Seasonal cycle of the rate of change of the thermal (, dashed lines) and non-thermal (, solid lines) components of ocean surface pCO2 on monthly time scales given in μatm month−1 (Equation 2). The bars on the bottom show standard deviations of the non-thermal component. Models have been grouped into DIC-dominant/weak, where the DIC-weak models have a thermal contribution >0 for the mean of the subtropical seasonally stratified and subpolar seasonally stratified (shown in panel (g–i); see Figure S11 in Supporting Information S1 for individual global and regional ocean biogeochemistry models, and Table S2 in Supporting Information S1 for the DIC-dominant/weak model groups). (g–i): λpCO2, the difference of the thermal and non-thermal (dissolved inorganic carbon [DIC]) components of ocean surface pCO2 as in Mongwe et al. (2018). When λpCO2 > 0 (red) indicates temperature dominance, and λpCO2 < 0 (blue) indicates that the non-thermal component (i.e., DIC) is dominant. The MPI model is excluded in this analysis.
The thermal and non-thermal components of pCO2 can be decomposed to determine the dominant driver on monthly timescales (Figure 7; Mongwe et al., 2018). Here, we do this by estimating the absolute difference of the rate of change of the thermal () and non-thermal () components (Figure 7; Equation 3). The contribution of salinity and total alkalinity to seasonal pCO2 changes are small in the Southern Ocean and compensate for each other on a seasonal scale (e.g., Lauderdale et al., 2016; Sarmiento & Gruber, 2006), thus we here consider to be predominantly DIC-driven.
In general, the seasonal cycle phasing of the thermal component of the GOBMs agrees well with those of the pCO2-products (Figures 7d–7f). This should not come as a surprise, as GOBMs are forced by atmospheric reanalyses which assimilate observed SST (Doney et al., 2007). As a result, of the pCO2 seasonal cycle in the GOBMs (forced by reanalyses) compare much better to derived from the pCO2-products than fully coupled Earth System Models (Mongwe et al., 2016, 2018). The non-thermal contribution is thus the primary reason for the spread between GOBMs, and for the differences between GOBMs and pCO2-products (Figures 7d–7f). Thus, we group GOBMs based on whether they are predominantly DIC or thermally driven across all three biomes (Figures 7g–7i, Table S2 in Supporting Information S1), which we term DIC-dominant or DIC-weak respectively.
In DIC-weak GOBMs, the strong underestimation of causes these models to be too strongly temperature driven across the year (Figure 7). This reverses the seasonal cycle of pCO2 (Figure 7c) and shifts the timing of uptake toward the colder months (when CO2 solubility is largest). While in spring and summer, the role of biologically driven uptake is suppressed in favor of warming driven outgassing. This effect is largely confined to the STSS and to a lesser extent also the SPSS, and can account for the mismatch in the seasonal cycle of pCO2 and CO2 flux seen in some GOBMs (Figures 7b and 7c and Figures 6b and 6c). For example, in the STSS, nearly all GOBMs and specifically all DIC-weak GOBMs have a shifted season of maximum uptake from summer to winter/spring, that is, toward the colder months (Figures 7c and 7f and Table S2 in Supporting Information S1). In terms of the underlying mechanisms driving the underestimated , we hypothesize that a lack of deep vertical mixing in winter leads to too little entrainment of DIC-rich deep waters, while simultaneously allowing for too early primary production (which may then shift the growing season earlier and reduce biologically driven summer uptake). Notably, the bias in pCO2 is largest in summer (DJF), followed by autumn (MAM), and is about twice as large in the DIC-weak GOBMs than in the DIC-dominant GOBMs (Figure S13 in Supporting Information S1). This further supports the lesser importance of thermal processes in the STSS and SPSS regions evident in the pCO2-products.
In the ICE biome, fluxes of the GOBMs and pCO2-products tend to agree much more closely in terms of their representation of the seasonal cycle (Figure 6a). This is likely related to the strong role the seasonal advance and retreat of sea ice plays in air-sea CO2 fluxes, primarily through its effect as a physical barrier. The large difference in the seasonal cycle of pCO2 (Figure 7a) between the GOBMs and pCO2-products thus has little impact on the fluxes (Figure 6a). Further, sea ice can also have an effect on vertical mixing and light availability, thus impacting both physical and biological pathways of DIC into and out of the mixed layer (Bakker et al., 2008; Shadwick et al., 2021; M. Yang et al., 2021).
3.3 Mean Air-Sea CO2 Flux of All Data Classes
We next inspect the mean fluxes for data sets that are not available for the full time-period in an annually resolved perspective (Figure 8) in this section and the temporal evolution of the air-sea CO2 fluxes from 1985 to 2018 in the next Section 3.4.

Temporal evolution of the Southern Ocean air-sea CO2 flux for (a) the entire Southern Ocean, the (b) subtropical seasonally stratified, (c) subpolar seasonally stratified, and (d) ice biomes. The ensemble standard deviation (1σ) averaged over the whole time series, is shown by the bars. Panels (e–h) are the same as panels (a–d) for the global ocean biogeochemistry model (GOBM) ensemble average and pCO2-product ensemble average only, with linear trends between 1985–2000 and 2001–2018 as the dashed and dotted lines, respectively. The uncertainty range of the trend is calculated as one standard deviation of the trends across all GOBMs and pCO2-products, respectively. Note the different y-axis scales. The separation into Atlantic, Pacific, and Indian Ocean sectors is shown in Figure S12 in Supporting Information S1.
Acknowledging some agreement between GOBMs and pCO2-based product ensemble means despite large spread across GOBMs (Figure 8 bars), substantial deviations among individual data sets appear. B-SOSE (2015–2018) suggests a 0.25 PgC yr−1 smaller uptake than the GOBM and pCO2-product ensemble means for the entire Southern Ocean (Figure 8a). ECCO-Darwin has the largest flux estimate into the ocean in the SPSS and the entire Southern Ocean (1.30 PgC yr−1, 1985–2018). Notably, the two data-assimilated models B-SOSE and ECCO-Darwin differ by a factor of 2 for the Southern Ocean wide estimate. In agreement with previous reports (Bushinsky et al., 2019), BGC-float pCO2-products suggest Southern Ocean uptake to be 40% (0.4 PgC yr−1) smaller than the pCO2-products without BGC-float data (2015–2018). This discrepancy originates largely in the SPSS, where the BGC-float pCO2-products estimate outgassing of 0.14 PgC yr−1, and the ensemble mean of the SOCAT-only-based pCO2-products estimate a CO2 uptake of −0.13 PgC yr−1.
Smaller contributions to the deviation stem from the STSS and ICE biomes where BGC-float pCO2-products report a smaller uptake by 0.14 PgC yr−1 when compared with the regular pCO2-products. The Watson2020-product is generally close to the other pCO2-products, with the exception of the SPSS where it suggests a flux of −0.18 PgC yr−1 (1985–2018), which is larger than any other pCO2-product. The origin of the large SPSS difference in Watson2020 could, in part, be attributed to subtle differences in method choices in addition to different flux parameterizations (Watson et al., 2020). The atmospheric inversions produce a somewhat lower sink (−0.64 PgC yr−1, average over three inversions 1985–2018), but are generally close to the pCO2-products, as they mostly use surface pCO2-products as a prior (Table 2 and Friedlingstein et al., 2022).
3.4 Temporal Variability and Trends in Air-Sea CO2 Flux 1985–2018
While the STSS was a net-sink region throughout the period (Figure 8), the SPSS and ICE have turned from neutral (around 0 PgC yr−1) to net sink regions since 1985, based on GOBM and pCO2-product ensemble mean estimates. In the case of the pCO2-products, the trends should be interpreted in the context of the sparse observations, particularly prior to the 2000s (Figures 9a and 10c). However, most individual GOBMs support the shift from neutral to uptake, with only two GOBMs (CCSM-WHOI and EC-Earth3) simulating either the ICE or SPSS biomes as regions of outgassing after 2015.

Comparison of surface mean pCO2 for the whole Southern Ocean between global ocean biogeochemistry models (GOBMs) and pCO2-products with in situ observations (gridded SOCAT v2022 data set Sabine et al., 2013). (a) Time-series of annually averaged pCO2 from GOBMs (green), data-assimilated models (grays), and pCO2–products (blue). The darker shaded lines show the annual mean as calculated from the data sets subsampled to match the historic SOCAT sampling. The lighter shades show the annual mean calculated for the full coverage. The dark red line depicts the annual mean pCO2 from SOCAT observations without interpolation. The assimilation products (ECCO-Darwin and Biogeochemical Southern Ocean State Estimate) are kept separate as they have different time series lengths (shown by dashed and solid gray lines, respectively). The light red area plot (right y-axis) shows the number of monthly by 1° × 1° gridded SOCAT observations per year. (b) The bias of pCO2 for all data classes (subsampled to match SOCAT observations, dark lines in panel (a)) relative to SOCAT pCO2 observations (solid dark red line in panel (a)). (c) The root mean squared difference (RMSD) between SOCAT observations and estimates for all data classes. Bias and RMSD were calculated on a monthly by 1° × 1° resolution, and the bias and RMSD were averaged to annual means afterward. A plot of RMSE and bias for subpolar seasonally stratified and subtropical seasonally stratified biomes and different seasons is presented in Figure S13 in Supporting Information S1.

CO2 flux trend between 1985 and 2018. (a, b) Spatial maps of the net CO2 flux trend, for (a) the global ocean biogeochemistry models and (b) the pCO2-products. (c) Zonal mean CO2 flux trend 1985–2018 for the net CO2 flux (blue: pCO2-products and green: global ocean biogeochemistry models [GOBMs]) and the GOBM flux of Fnat,ss and Fant,ss, that is, the flux as expected from increasing atmospheric CO2 alone (green, dashed). (d) The sea surface temperature (SST) trend 1985–2018 in the GOBMs (green) and in the observational data set (black, NOAA Extended Reconstructed SST, ERSST, Version 5 (Huang et al., 2017)). Figures S14 and S15 in Supporting Information S1 split this analysis in the periods 1985–2000 (Figure S14 in Supporting Information S1) and 2001–2018 (Figure S15 in Supporting Information S1). Individual GOBM trends for Fnet, as well as Fnat,ss plus Fant,ss and SST are shown in Figure S16 in Supporting Information S1.
The temporal variability is quantified as the amplitude of “interannual variability” (IAV). This is calculated as the standard deviation of the detrended time-series, as defined in Rödenbeck et al. (2015) and Friedlingstein et al. (2022), which, in reality, captures both interannual and decadal variability components. Following this definition, the pCO2-products have a larger interannual variability for the Southern Ocean wide integrated flux (0.09 PgC yr−1, range 0.04–0.16 PgC yr−1) compared to the GOBMs (0.06 PgC yr−1, range 0.03–0.10 PgC yr−1, Table S3 in Supporting Information S1). Notably, the MPI-SOM-FFN pCO2-product, which formed the basis of previous reports on Southern Ocean decadal variability (Landschützer et al., 2015), has the largest IAV of 0.16 PgC yr−1, about 60% larger than the next largest pCO2-product IAV. This is in line with previous studies that found that the MPI-SOM-FFN approach may overestimate Southern Ocean variability by 30% (Gloege et al., 2021) and the decadal trend 2000–2018 by 130% (Hauck et al., 2023). The amplitude of interannual variability in the atmospheric inversion ensemble mean can only be calculated for the period since 1990 based on three ensemble members. The resulting IAV is higher than in any other data class (0.15 PgC yr−1, range 0.1–0.2 PgC yr−1), which appears to be variability on multi-year to decadal scale (Figure 8), but is probably less robust due to the small ensemble size and different period.
Within the Southern Ocean, the strongest IAV is found in the SPSS region (0.04 PgC yr−1 GOBMs, 0.05 PgC yr−1 pCO2-products, 0.08 PgC yr−1 atm. inversions), followed by the STSS (0.02 PgC yr−1 GOBMs, 0.03 PgC yr−1 pCO2-products, 0.06 PgC yr−1 atm. inversions) and ICE biome (0.02 PgC yr−1 for GOBMs and pCO2-products, 0.03 PgC yr−1 atm. inversions). Within the subpolar biome, the Indo-Pacific sector has a higher IAV (0.02 PgC yr−1 GOBMs and pCO2-products, 0.04–0.05 PgC yr−1 atm. inversions) than the Atlantic sector (0.01 PgC yr−1GOBMs, 0.02 PgC yr−1 pCO2-products and atm. inversions). The large contribution to interannual variability in the SPSS may well be linked to the largest amplitude of the seasonal cycle of CO2 flux (see Section 3.2.3).
To assess the decadal-scale trends, we fit linear trends to the periods 1985–2000 and 2001–2018 (Section 2.3.3, Figures 8e–8h). This choice is motivated by the fact that the year 2000 marks roughly the mid of the considered time period and the inflection point in global and Southern Ocean CO2 uptake (Gruber et al., 2023; Landschützer et al., 2015, 2016; Le Quéré et al., 2007). We acknowledge that trend calculations are sensitive to choice of start and end years (Easterling & Wehner, 2009; Fay & McKinley, 2013), but the general statements on the two periods hold also when start and end years are varied (Table S5 in Supporting Information S1). The pCO2-products suggest a stagnation of the flux in the STSS, and even a flux decrease in the SPSS prior to 2000. In contrast, GOBMs suggest a continued increase in the sink in the STSS and SPSS in the same period. In the ICE biome, GOBMs and pCO2-products result in an increasing trend (Figure 8h). After 2000, pCO2-products and GOBMs agree on a trend toward more CO2 uptake, which is significantly different from zero in all biomes except for pCO2-products in the ICE biome (see numbers in Figures 8e–8h). However, they differ substantially in magnitude between GOBM and pCO2-product ensemble means, particularly in the STSS (Figure 8f). The discrepancies in the magnitude of the trend act to decrease the gap between GOBM and pCO2-product ensemble means in the SPSS and ICE biomes, but lead to the divergence in the flux estimate in the STSS.
On a sub-biome level (i.e., Atlantic, Indian, and Pacific sectors), all three sectors in the STSS were CO2 sinks throughout the period and had weaker trends (less negative) before 2000 compared to the period after 2000 (Figure S12 in Supporting Information S1). In the SPSS, the Indian and Pacific sectors are characterized by intermittent outgassing and uptake patterns, in line with observations from BGC-floats (Prend et al., 2022). In the SPSS, only the Atlantic sector has a net uptake throughout the period, and the Indian Ocean sector shows the largest model spread of all three sectors (as in the STSS). In the ICE biome, a consistent quasi-linear evolution is apparent in all sectors. We further analyze divergence and drivers of trends in Section 3.4.2.
3.4.1 Comparison With In Situ pCO2
Here, we evaluate the accuracy of pCO2 across data classes since pCO2 is the dominant driver of air-sea CO2 flux variability at a monthly scale (Landschützer et al., 2016). All data sets are compared with observations (monthly gridded SOCAT v2022 data set Bakker et al., 2016, 2022; Sabine et al., 2013). The RECCAP2 data sets are subsampled to match the SOCAT observations in time and space, meaning that we do not assess sampling biases, but rather the mismatch between the observed and estimated pCO2.
The comparison of the RECCAP2 GOBMs and pCO2-products with gridded in situ pCO2 from SOCAT v2022 shows relatively good agreement (Figure 9a). The SOCAT pCO2 data shows large interannual variability due to spatially and temporally varying sampling efforts from year to year, particularly prior to 2000 when samples are fewer and thus carry more weight (Figure 9a). For example, in 1997, SOCAT pCO2 is anomalously low due to high sampling density in the Ross Sea during summer when primary production drives intense CO2 drawdown (Arrigo & van Dijken, 2007). The pCO2 products have a lower bias and a narrower spread than the GOBMs prior to 2000 (1.7 ± 4.3 and 10.7 ± 8.0 μatm, respectively), with the bias and the spread decreasing after 2000 for both classes (−0.3 ± 2.6 and −0.9 ± 3.9 μatm, Figure 9b). This comparison of simulated to observed pCO2 at observation sites demonstrates that GOBMs are capable of reproducing SOCAT pCO2 and its temporal evolution on large spatial and annual time-scales. Thus, for the period after 2000, the differences in CO2 flux trend for the entire Southern Ocean between GOBMs and pCO2-products (Figure 8) cannot be attributed to differences in pCO2 in the regions where observations were taken. Instead, the differences arise primarily from areas where no pCO2 observations exist, as also concluded in Hauck et al. (2020). The pCO2 time-series calculated from the full coverage results in a lower pCO2 value in the pCO2-products than in the GOBMs (Figure 9a), which could explain the stronger CO2 flux trend in the pCO2-products (Figure 8). This discrepancy between pCO2-products and GOBMs is larger in the last 10 years (2009–2019: 5.8 μatm) than the previous decade (1999–2008: 2.8 μatm, Figure 9a). Nevertheless, the root mean squared difference calculated from monthly mean data is higher in GOBMs than in pCO2-products (Figure 9c). This is expected as the pCO2-products are trained to fit the observations and further illustrates the GOBMs' deficiencies in simulating seasonal and spatial variability of the CO2 uptake.
The assimilation model, ECCO-Darwin, has a negative bias after 2000 (−13.5 ± 3.0 μatm; Figure 9b), but this negative bias is not strongly reflected in the mean of the non-subsampled data, with the mean pCO2 still being larger than that of the pCO2-products, which do not underestimate the pCO2 relative to SOCAT. This further emphasizes that sampling distribution may play an important role in the magnitude of the biases calculated in any model. The pCO2 summer sampling bias in the Southern Ocean has long been recognised as a potential source of biases in pCO2 estimates, particularly for the pCO2-products that rely heavily on the in situ data (Djeutchouang et al., 2022; Gregor et al., 2017; Metzl et al., 2006; Ritter et al., 2017). The SOCCOM project increased the number of winter samples with pH-enabled profiling floats (from 2014), suggesting stronger outgassing during winter than previously shown (Gray et al., 2018). In RECCAP2, the B-SOSE assimilation model and the BGC-float pCO2-products both make use of this data (Bushinsky et al., 2019; Verdy & Mazloff, 2017). Both of these estimates overestimate pCO2 relative to SOCAT pCO2 highlighting the challenge in consolidating ship-based SOCAT and BGC-float data.
3.4.2 Climate Versus CO2 Effects on Trends in CO2 Flux
Our analysis so far has indicated that the GOBMs reproduce seasonal temperature effects on CO2 flux reasonably well (Figures 7d–7f), and a larger uncertainty is associated with imprints of circulation and biological activity. Next, we inspect the zonal mean and spatial patterns of the CO2 flux trend between 1985 and 2018 (Figure 10).
Perhaps the most striking feature of the latitudinally averaged trends (Figure 10c) is that the uncertainty of the pCO2-products (blue) is much larger than that of the GOBMs (green). This is contrary to the seasonal cycle (Figures 7d–7f), where the pCO2-products are in much closer agreement than the GOBMs. This supports past studies which found that the sparse data in the Southern Ocean has a much larger influence on the accuracy of the decadal variability than on the seasonal variability (Gloege et al., 2021).
The pCO2-products place the largest trend of CO2 uptake in the entire ICE biome; however, note the large variability between pCO2-products (blue shading in Figure 10c). The pCO2-products show a smaller (negative) peak in the trends in the STSS between about 40 and 45°S. The GOBMs, in contrast, exhibit less zonal variability with a large meridional gradient in the ICE biome with the strongest (negative) growth rate of the trend between ∼60 and 65°S. The secondary peak in the STSS (observed in the pCO2-products) is not seen in the GOBM ensemble mean (green line).
Decadal variability, particularly for the pCO2-products, leads to weaker trends over the full period due to opposing trends over shorter time scales (see Figure 8, Figures S14 and S15 in Supporting Information S1). For example, the pCO2-products exhibit weakening CO2 uptake in the Pacific and eastern Indian sector of the SPSS for the full period (Figure 10b). However, the same location experiences a growth in the sink for the period 2001–2018 (Figure S14b in Supporting Information S1). Differences between GOBMs and pCO2-products in the ICE biome are large, but the smaller area of the higher latitude region decreases the influence of the discrepancy (Figure 8). Although the difference in flux density between GOBMs and pCO2-products is larger in the ICE biome, the discrepancy in the STSS contributes more to the total flux trend discrepancy due to the larger area of the STSS biome (Figure 8).
We can attribute the growth in the strength of the CO2 sink almost entirely to increasing atmospheric CO2 levels, as shown by simulation C in the GOBMs (Fss as black line in Figure 10c). The trends are only slightly modulated by climate change and variability, driving a weakening in the SPSS and strengthening in the northern part of the ICE biome (green Fnet vs. black Fss in Figure 10c). Thus, the effect of climate change and variability is substantially smaller than the uncertainty in the pCO2-products. But then what could be the cause of the discrepancies between the CO2 flux trends of the GOBMs and pCO2-products?
Past work has found that fully coupled Earth System Models in CMIP6 do not capture circulation trends accurately, which is also observed as a deviation in SST trends, with no cooling simulated in the polar Southern Ocean (Beadling et al., 2020). However, we demonstrate that the GOBMs simulate SST trends from 1985 to 2018 rather well, including the cooling in the southern parts (Figure 10d), as also shown for the thermally driven component of the pCO2 seasonal cycle (Figure 8). We can therefore not ascribe the discrepancies between GOBMs and pCO2-products in the trend of the CO2 flux to temperature biases in the SST trends of the GOBMs.
The well-represented SST trends in the GOBMs is related to the choice of forcing the GOBMs with reanalysis data that itself depends on SST observations (Doney et al., 2007). In contrast to fully coupled Earth System models in CMIP6 (Beadling et al., 2020), the suite of models used here capture the decadal trend pattern of warming along the northern flank of the Antarctic Circumpolar Current (ACC), and cooling in the south (Figure 10, Armour et al., 2016; F. Haumann et al., 2020). The lack of warming south of 50°S was previously related to the wind-driven upwelling of deep water that had not yet been exposed to anthropogenic warming and by northward heat transport (Armour et al., 2016). More recently, the cooling was suggested to be caused by increased freshwater export from the ice region, which increases stratification and thus reduces the upward heat flux from below by warm water masses (F. Haumann et al., 2020). While the GOBM ensemble mean captures the latitudinal structure of the SST trend well, it underestimates the magnitude of peak cooling at around 60°S as well as peak warming north of 40°S.
We thus ascribe data sparsity as a reason for potential biases in the trend in the pCO2-products, with the large uncertainty in pCO2-products underlining that their estimate is currently not robust. Trends simulated in GOBMs are consistent within the ensemble, suggesting that they are caused by rising CO2 acting on commonly reproduced characteristics of the Southern Ocean biomes. Nevertheless, biases in ocean circulation, sea ice and biology remain as possible reasons for biases in GOBMs.
3.5 Interior Ocean Storage of Anthropogenic Carbon 1994–2007
The focus of this section is the anthropogenic perturbation of DIC in a subset of the GOBMs (see Section 2.2.1), and in particular its accumulation rate over the period 1994 to 2007 (ΔCant), in comparison with the eMLR(C*) observational estimate (Gruber, Clement, et al., 2019) and the ocean inverse model OCIMv2021 (DeVries, 2022). The eMLR(C*) product uses a multiple linear regression approach to estimate ΔCant and captures both the influence of CO2-driven and climate-driven change in sea-air CO2 fluxes and transports, whereas OCIMv2021 captures only the CO2-driven changes.
All data classes agree in having the largest ΔCant inventories within and to the north of the STSS biome (Figure 11), whose southern boundary approximately corresponds to the northern edge of the ACC. This pattern is related to the mechanisms by which Cant is taken up at the surface and exported to depth (Bopp et al., 2015; Mikaloff Fletcher et al., 2006; Morrison et al., 2022). Subpolar upwelling exposes old Cant-poor waters to elevated atmospheric CO2 concentrations and this, combined with strong winds, drives a large influx of Cant in the SPSS biome (Figures 12a–12c). A small fraction of the Cant moves southward and is exported within Antarctic Bottom Waters, while the largest fraction is transported northward within the upper cell of the meridional overturning circulation. Cant air-sea fluxes remain elevated throughout the northward path, and are reinforced by the deep mixed layers in the regions where mode and intermediate waters are formed, which results in a secondary peak at around 40°S in some GOBMs, diluted by the ensemble mean (Figure 12c).

ΔCant yearly accumulation rate over the period 1994–2007 integrated until 3,000 m depth in the observationally constrained estimates (a) eMLR(C*) (Gruber, Clement et al., 2019) and (b) OCIM-v2021, in (c) “GOBMs high” (CESM-ETHZ, MRI-ESM2-1, NorESM-OC1.2, and NEMO-PlankTOM12.1) and in (d) “GOBMs low” (CCSM-WHOI, CNRM-ESM2-1, EC-Earth3, FESOM-REcoM-LR, ORCA025-GEOMAR, ORCA1-LIM3-PISCES, and MPIOM-HAMOCC). Individual global ocean biogeochemistry models are shown in Figure S4 in Supporting Information S1. The robustness of the patterns has been tested as explained in Text S4 in Supporting Information S1. Contours show the boundaries of the ice, subpolar seasonally stratified, and subtropical seasonally stratified biomes. Values below 3,000 m are not shown because of the low signal-to-uncertainty ratio in eMLR(C*).

Zonal integrals of ΔCant yearly accumulation rate from 1994 to 2007 and of air-sea Cant fluxes (positive downwards) averaged between 1994 and 2007 for (a, d) eMLR(C*), (b, e) OCIM-v2021, and (c, f) global ocean biogeochemistry models (GOBMs). (a–c) (black line) ΔCant column inventory (0–3,000 m) and (gray line) air-sea Cant fluxes; for the GOBMs, the distinction is made between “GOBMs high” (full lines) and “GOBMs low” (dashed lines). See figure caption 11 or Figure 13 for a list of GOBMs high and GOBMs low. (g–i) Anomalies of ΔCant accumulation rates in (g) OCIM-v2021 with respect to eMLR(C*), (h) GOBMs with respect to eMLR(C*), and (i) GOBMs with respect to OCIM-v2021. In all sections, contours show mean potential density (with a 0.03 kg m−3 spacing) referenced to the surface in World Ocean Atlas 2018 (Boyer et al., 2018), where thick lines indicate the 1,026.9 and 1,027.5 kg m−3 isopycnals. Anomalies of individual GOBMs shown in Figure S18 in Supporting Information S1 (with respect to eMLR(C*) and Figure S19 in Supporting Information S1 (with respect to OCIMv2021).
The effective transport of Cant into the ocean interior relies on a number of physical processes, the dominant of which is the northward transport by the overturning circulation of the Cant ventilated in the ocean interior by deep winter mixing (Frölicher et al., 2015; Morrison et al., 2022). The absorbed Cant spreads northward along density surfaces within mode and intermediate waters (Figures 12d–12f) and is circulated within and out of the Southern Ocean by the subtropical gyres (Frölicher et al., 2015; D. C. Jones et al., 2016; Waugh et al., 2019). As a result, the largest Cant inventories are displaced to the north with respect to the maximum air-sea Cant influx (Figures 12b and 12c). Another pathway by which the Cant inventory can build up without a corresponding surface influx is by southward advection and subsequent subduction of high-Cant Subtropical Waters (Iudicone et al., 2016; Morrison et al., 2022).
The observation-based product eMLR(C*) and the ocean inverse model OCIM-v2021 have similar ΔCant accumulation rates when integrated over the Southern Ocean for the period 1994 through 2007 (0.52 and 0.47 PgC yr−1, respectively, Figure 13), but differ in their horizontal (Figure 11) and vertical (Figure 12) patterns. The eMLR(C*) exhibits particularly low ΔCant values at subpolar and high latitudes (Figure 12g), especially in the Pacific sector (Figure 11). The GOBMs multi-model-mean of ΔCant accumulation rates over the same 1994 through 2007 period and integrated within the Southern Ocean (Figure 13) is 0.46 ± 0.11 PgC yr−1, that is, 7% lower than the mean of the two observational estimates considered here. 6 out of the 12 GOBMs fall within the 19% range of the observational eMLR(C*) uncertainty. Two thirds of all GOBMs (hereafter “GOBMs low”) have lower than observed ΔCant accumulation rates (0.39 ± 0.11 PgC yr−1, about 20% lower than the observational estimates). The remaining GOBMs (hereafter “GOBMs high”) have higher than observed ΔCant accumulation rates (0.58 ± 0.07 PgC yr−1, about 17% higher than the observational estimates). “GOBMs high” have a higher ΔCant storage than “GOBMs low” throughout the Southern Ocean (Figures 11c, 11d, and 12c), higher Cant air-sea fluxes (Figure 12c), higher SSS in the SPSS and STSS biomes and mixed layer depths especially in the SPSS biome (Texts S3 and S4 and Figure S17 in Supporting Information S1). Along the zonal mean section, all GOBMs show a southward shift in ΔCant with respect to eMLR(C*) shown by a north-south dipole in the upper 1 km (Figure 12h), as similarly found in the comparison between OCIM-v2021 and eMLR(C*) (Figure 12g). With respect to OCIM-v2021, GOBMs show higher ΔCant above 1000 m depth and lower ΔCant beneath (Figure 12i). This could point to insufficient ventilation of Cant in “GOBMs low” models (Figure S19 in Supporting Information S1), which represent the majority of the GOBMs. The amount of spread and the overall underestimate of ΔCant in the GOBMs is consistent with Earth System Models analyzed by Frölicher et al. (2015) and Terhaar et al. (2021), supporting the argument that biased ocean model dynamics and water mass properties rather than biases in the atmospheric forcing cause the Cant underestimate (Bourgeois et al., 2022; Terhaar et al., 2021).

Scatter plots showing relationships between ΔCant accumulation rates between 1994 and 2007 (integrated to 3,000 m) and different quantities namely (a) the cumulative Cant in 1994 integrated over the Southern Ocean, (b) air-sea Cant fluxes averaged between 1994 and 2007 and integrated over the Southern Ocean, (c) sea surface salinity (SSS) horizontally averaged over the subpolar seasonally stratified and subtropical seasonally stratified biomes (which show consistent SSS anomaly patterns, Figure S17 in Supporting Information S1). Shown are a subset of the global ocean biogeochemistry models (GOBMs) (see Section 2.3), the OCIM-v2021 data-assimilated model, the observation-based cumulative Cant until 1994 (C* method, Sabine et al., 2004) and the 1994–2007 ΔCant from (eMLR(C*) method, Gruber, Clement, et al., 2019), and SSS from EN4.2.1 (Good et al., 2013). Thin black lines show the linear fit of the data for the GOBMs only, with the explained variance (R2) and the p-value indicated for each regression. The gray shading in panel (a) indicates the 19% uncertainty levels around the mean of eMLR(C*) (Southern Hemisphere uncertainty estimate, based on Table 1, Gruber, Clement, et al., 2019) and the green shading the 20% uncertainty levels around the C*-based estimate of cumulative Cant until 1994 (global uncertainty estimate Sabine et al., 2004; Matsumoto & Gruber, 2005). Models that have a ΔCant storage higher than the average of the two observationally constrained data sets (“GOBMs high”) are shown in red, whereas the models in which it is lower (“GOBMs low”) are shown in blue. Because of its different spin-up procedure, ROMS-SouthernOcean-ETHZ is shown in the plots but has been excluded from the regression analysis. For OCIM-v2021, CNRM-ESM2-1, and MPIOM-HAMOCC the is shown, whereas in others the sum of steady state and nonsteady state is shown. As discussed in Text S2 in Supporting Information S1, accumulation rates are about 10%–20% of the total ΔCant.
Integrated over the Southern Ocean, we find that the model spread in ΔCant accumulation rates from 1994 to 2007 can be largely explained (81% variance explained) by the spread in accumulated Cant until 1994 (Figure 13), suggesting a coherent scaling between long-term and recent Cant accumulation rates. The model spread in ΔCant accumulation rates is also related with the spread in Cant air-sea fluxes averaged over 1994–2007 (78% variance explained). These results show that past long-term ΔCant accumulation rates are a better predictor for present ΔCant accumulation rate than present Cant air-sea fluxes. The reason for this is that Cant air-sea fluxes are linked to changes in Cant storage through ocean transport, which may differ substantially between models (Bourgeois et al., 2022; Frölicher et al., 2015; Terhaar et al., 2021). This becomes obvious when considering the myriad of processes involved, including the strength of the overturning circulation, the strength of the subtropical gyres, the isopycnal stirring by mesoscale eddies, and localized subduction dynamics (Morrison et al., 2022; Sallée et al., 2012). The different way in which the GOBMs simulate these transport processes is possibly linked to the large model spread in ΔCant accumulation rates among GOBMs. Past studies have found that SSS affects the surface ocean density in the formation regions of mode and intermediate waters and could be used as a constraint of the Cant air-sea fluxes, and thus of the Cant storage within the recently ventilated water masses (Terhaar et al., 2021). In this study and in Terhaar et al. (2023), we find that SSS explains a lower variance in the ΔCant accumulation rates (R2 = 0.61; Figure 13) and in the Cant air-sea fluxes (R2 = 0.57, Terhaar et al., 2023) with respect to the ESMs (R2 = 0.74) analyzed by Terhaar et al. (2021). The relationship may be weaker due to the different suite of models used in the ESM and GOBM ensembles and remaining biases associated with incomplete spin-up (Terhaar et al., 2023).
4 Discussion
4.1 Summary and Progress Since RECCAP1
We provide an updated estimate of the Southern Ocean carbon sink (see Figure 1 for regional extent). The numbers we present (Table 3) are not directly comparable with the RECCAP1 estimate (Lenton et al., 2013) due to different region definitions (Figure 1) and periods (1990–2009 vs. 1985–2018). The RECCAP1 regional definition of the Southern Ocean (44–75°S) cut across and missed a large part of the strong CO2 uptake north of the Subantarctic Front. Recalculating the RECCAP2 numbers for the RECCAP1 region and period each reduces the sink estimate by ∼50% resulting in only ∼25% of its original value. A key result thus is that RECCAP2 estimates a weaker Southern Ocean carbon sink by 50% compared to RECCAP1. In RECCAP2 we find that the SPSS and ICE biomes are a weaker sink than in RECCAP1, and with the absence of the STSS, a region of strong Cant uptake, the other regions gain importance.
Estimate | GOBMs (Pg C yr−1) | Observation-based (Pg C yr−1) |
---|---|---|
RECCAP2 1985–2018 | −0.75 ± 0.28 | −0.73 ± 0.07 |
RECCAP2 1985–2018 (44°–75°S) | −0.39 ± 0.24 | −0.30 ± 0.04 |
RECCAP2 1990–2009 (44°–75°S) | −0.22 ± 0.25 | −0.14 ± 0.09 |
RECCAP1 1990–2009 (44°–75°S) | −0.43 ± 0.38 | −0.27 ± 0.13 |
- Note. GOBMs: Global Ocean Biogeochemistry Models. Reported numbers are means ± one standard deviation. For RECCAP1, the median of all models is reported.
The observational and modeling communities have made substantial progress on quantifying and characterizing the Southern Ocean carbon sink since RECCAP1 (Lenton et al., 2013). The creation of the Surface Ocean CO2 Atlas and its annual updates have marked a step-change by facilitating the development of statistical models (a.k.a. pCO2-products). The large and diverse ensemble of pCO2-products help to identify the robust features of the Southern Ocean carbon sink. The pCO2-products have a relatively small spread compared to the global ocean biogeochemistry models in terms of mean and seasonal cycle, indicating that the uncertainty from differences in mapping methods is small. However, the spread in the trend estimates is in fact larger in the products than in the GOBMs (Figure 10). Further, the narrow spread in mean and seasonal cycle does not include the uncertainties due to sparse pCO2 observations in the Southern Ocean, particularly in winter and before the 2000's (Ritter et al., 2017). In addition, pCO2-products share the uncertainties with GOBMs associated with the bulk formulation of air-sea CO2 exchange (Fay et al., 2021; Wanninkhof et al., 2009). While they do have their shortcomings, the pCO2 products are an advance for constraining the Southern Ocean carbon sink compared to the atmospheric inversions that were used in RECCAP1 (Lenton et al., 2013). This is because the surface ocean pCO2 observations provide a more direct constraint on the air-sea CO2 flux than the relatively small atmospheric CO2 signals over the ocean that form the basis of the atmospheric inversions.
4.1.1 Model Deficiencies
The larger GOBM ensemble provides a more representative process-based estimate and the spread in GOBMs has been reduced since RECCAP1 (see Table 3, Lenton et al., 2013). The remaining spread is nevertheless large and points toward critical need for model development, where the largest sources of uncertainty stem from biological process description and circulation, which vary in importance depending on flux component (natural, anthropogenic, etc.), and spatio-temporal scale of interest. In terms of the anthropogenic component, the 12 GOBMs analyzed here have a 24% spread (standard deviation around the mean) in the Cant accumulation rates, which is marginally larger than the ∼20% uncertainty associated with the observational estimates of ΔCant and Cant (even though caution is warranted when directly comparing the uncertainty estimates, which are computed formally different across data classes; Gruber, Clement, et al., 2019; Sabine et al., 2004). Overall, the GOBM ensemble mean underestimates the observation-based estimates of the Cant accumulation up to 1994 by 19% and the change between 1994 and 2007 by 7%. Admittedly, the GOBM ensemble analyzed here is relatively small, and the underestimation of Cant and ΔCant is in the range of the uncertainty ranges of the observational estimates.
Despite the caveats of small ensemble size and overlapping uncertainty ranges, we can nonetheless speculate that the detected underestimation is likely related to a combination of physical, chemical, and methodological factors. First, our results point to too little or too shallow ventilation of mode and intermediate waters (Figure 12i), the causes of which can be related to insufficient vertical mixing or too sluggish northward export of the subducted waters (Morrison et al., 2022). However, while sea-surface salinity (SSS) was singled out as a strong predictor of Cant air-sea fluxes in an ESM ensemble analyzed by Terhaar et al. (2021), in our study and in Terhaar et al. (2023), SSS was not found to be a clear constraint of the anthropogenic CO2 uptake and its interior storage in the GOBMs. Rather, Terhaar et al. (2023) find that biases in the normalized surface Revelle factor could explain the underestimation of Cant uptake. Finally, the relatively high pre-industrial CO2 mixing ratios related to late starting dates in several GOBMs are likely causing an underestimation of the cumulative Cant storage, which is especially large in the Southern Ocean (Terhaar et al., 2023).
For the natural carbon fluxes, the difficulty in capturing the delicate balance between physical and biological processes is clearly manifested by the large model spread (Figure 3). In addition, the different spin-up procedures could play a role. Terhaar et al. (2023) indicate that the natural CO2 flux component may be biased toward uptake that is too strong, possibly related to GOBMs not being in steady-state (Terhaar et al., 2023), which is particularly relevant in the Southern Ocean where old water masses resurface. While long preindustrial spin-ups would bring the GOBMs closer to steady-state and thus reduce drift, they may come at the cost of less realistic surface conditions and their response to climate change and variability (Séférian et al., 2016). Interestingly, the two data-assimilated GOBMs differ to a large extent, illustrating that dynamical processes in these models may still override information gained from assimilated observations.
4.1.2 GOBM and pCO2-Product Discrepancies
The averages of the GOBM and pCO2-product ensembles agree for many key estimates, showing progress over the past 10 years: the mean and spatial distribution of the sink is in good agreement (Figure 2), although discrepancies of the magnitude and, particularly, the trends still persist (Figures 8 and 10; see also Canadell et al., 2021). The fact that these ensemble means agree so well in many respects provides some confidence in the Southern Ocean CO2 flux estimates because they are nearly independent. However, the agreement of GOBMs and pCO2-products on the mean CO2 flux is partly a result of compensation of regional and seasonal discrepancies (Figures 4, 5, and 8).
The concerning discrepancies in the trend cannot be explained by GOBM biases in warming trends as these are well reproduced (Figure 10). Similarly, the GOBM ensemble is not systematically biased toward formation of mode and intermediate waters that is too weak, in contrast to the ESMs, and an effect on the trend of the ocean carbon sink could not be evidenced (Terhaar et al., 2023). Further potential candidates for GOBM biases, which were not explored here, are stratification (Bourgeois et al., 2022), mixing, and mixed layer dynamics, which could also lead to excess carbon accumulation in the surface layer and thus may be the driver for the overestimation of the surface Revelle factor. In the pCO2-products, the trend is subject to biases by data sparsity (Gloege et al., 2021; Hauck et al., 2023).
The agreement in the mean air-sea CO2 flux between GOBMs and pCO2-products is also highly susceptible to the choice of river flux adjustment that either locates most outgassing of river-derived carbon in the Southern Ocean (Aumont et al., 2001) or in the tropical Atlantic (Lacroix et al., 2020). Reasons for the discrepancy between Aumont et al. (2001) and Lacroix et al. (2020) may be because of specific choices in nutrient and carbon input, lability of organic matter, resulting ocean model transport (see also the discussion in Terhaar et al., 2023). We here chose to use the river flux adjustment of Lacroix et al. (2020), scaled up to a global value of 0.65 PgC yr−1, resulting in a small adjustment for the Southern Ocean of 0.04 PgC yr−1. In contrast, the Southern Ocean (south of 20°S) adjustment based on Aumont et al. (2001) that is so far used in the Global Carbon Budget is higher by one order of magnitude (0.32 PgC yr−1) and can explain the large mismatch in the mean flux (but not its trend) between GOBMs and pCO2 products in the Southern Ocean in the Global Carbon Budget (Friedlingstein et al., 2022).
4.2 Seasonal Cycle and Thermal Versus Non-Thermal Drivers
As a community, we have a good understanding of the mechanisms that drive pCO2 seasonality in the Southern Ocean (Lenton et al., 2013), but we do not fully understand their magnitudes, opposing or synergistic, in different seasons and regions (Mongwe et al., 2018). Part of this lack of understanding is due to a lack of observations throughout all seasons, though particularly acute during winter (Bushinsky et al., 2019; Gray et al., 2018; Sutton et al., 2021). Further, complex biological processes affecting pCO2 in summer are more difficult to accurately describe in GOBMs (Mongwe et al., 2018).
The poor sampling distribution of pCO2 in the Southern Ocean, particularly the bias toward summer sampling, could result in a misrepresentation of the seasonal cycle in pCO2-products (Ritter et al., 2017). This may be shown by the narrow ensemble spread of the pCO2-products during winter (Figures 7g–7i). That being said, an observing system simulation experiment (OSSE) showed that the seasonal cycle in most of the Southern Ocean is in fact well captured by one pCO2 product (Gloege et al., 2021). The narrower GOBM spread of the non-thermal pCO2 component during winter may also suggest that winter-time processes (circulation) are less complex than summer (circulation and biology, Figures 7g–7i).
The introduction of biogeochemical Argo floats since the mid 2010's has increased the number of winter observations (relative to the available ship-based observations), albeit inferred from pH and estimated total alkalinity and thus associated with a higher uncertainty (Williams et al., 2017). The machine learning approaches that include float-based observations result in stronger winter outgassing (Figure 4, Bushinsky et al., 2019). Direct pCO2 measurements showed that the years used to train the machine learning model (2015–2018) may have had anomalously high pCO2 (Sutton et al., 2021). However, if this is in fact the case, and not related to sampling locations, this may indicate much larger interannual variability in the Southern Ocean than the majority of the pCO2-products currently estimate (Figure 8). Incorporating these data is thus potentially an important goal for pCO2-products, but it has proven difficult to incorporate the float-based pCO2 estimates further back in time than 2015, the start of the BGC-float record and account for their higher uncertainty (Bushinsky et al., 2019; Williams et al., 2017).
GOBMs also have a lower pCO2 ensemble spread during winter compared with summer and agree on the spatial location of the winter flux minimum (Figure 4). Nevertheless, the range in magnitude is still more than twice as large as those of the pCO2-products (Figures 7g–7i). Since the thermal component is well captured in GOBMs (Figures 7d–7f), the non-thermal physical drivers (i.e., circulation) determines the uncertainty observed in winter. In summer, GOBMs have difficulty capturing the delicate balance between biological and physical processes that leads to a large spread in model pCO2 and fluxes (Mongwe et al., 2018). GOBMs may thus benefit from more process-based studies that further our understanding of pCO2 drivers during summer, that is, biological productivity, respiration, remineralization, and sinking of organic carbon as part of the biological carbon pump.
4.3 Temporal Variability of CO2 Fluxes
Our analysis reduces the previously reported discrepancy in variability of Southern Ocean air-sea CO2 fluxes between data classes (GOBMs and pCO2-product ensemble means, Gruber, Landschützer, & Lovenduski, 2019). We relate the growing agreement to the larger ensemble of pCO2-products in our study, with the newer additions having a substantially lower variability than the two pCO2-products (Jena-CarboScope and SOM-FFN) used by Gruber, Landschützer, and Lovenduski (2019). A recent study using the same RECCAP data base also concluded that there is agreement on the magnitude of interannual variability between GOBMs and pCO2-products (Mayot et al., 2023).
The interannual to decadal variability of Southern Ocean air-sea CO2 fluxes was discussed extensively in the literature, and was often related to variations in the Southern Annual Mode (SAM) (Hauck et al., 2013; Lenton & Matear, 2007; Le Quéré et al., 2007; Lovenduski et al., 2007; Mayot et al., 2023; Nicholson et al., 2022). Also, regional wind variability linked to the zonal wavenumber 3 was suggested as a driver of interannual CO2 flux variability driving both the weakening in the 1990's and the strengthening in the 2000's (Keppler & Landschützer, 2019; Landschützer et al., 2015). The arguments of SAM or wave number 3 as dominant drivers of CO2 flux interannual variability might not be fully independent from each other, as previously a wave number 3 like pattern was reported to describe MLD anomalies during positive SAM events (Sallée et al., 2010).
The fact that the maximum IAV of GOBMs is found in the SPSS Indo-Pacific sector (Section 3.4, Figure S12 in Supporting Information S1) supports the argument of the above mentioned references that upwelling of carbon-rich deep water and related outgassing of natural carbon in response to a positive SAM and strengthening of westerly winds may be the dominant driver of interannual variability (DeVries et al., 2017). This is further supported by studies of atmospheric potential oxygen (APO), which can be used as a tracer of ocean-only processes from measurements of CO2 and O2 at atmospheric stations (Stephens et al., 1998). Nevison et al. (2020) showed that the interannual variations of APO seasonal minimum from stations in the Southern Hemisphere were strongly correlated with the SAM during years of positive phase. Further, they showed that GOBMs (as analyzed in this study) can capture the variability of CO2 and APO fluxes driven by the SAM variations during the austral winter months. However, the study of Nevison et al. (2020) also illustrated that the SAM index variability cannot fully explain the changes in APO seasonal winter minima suggesting that other factors or modes of variability such as ENSO could impact the CO2 and O2 air-sea fluxes of the Southern Ocean as also previously suggested in an ocean modeling study (Verdy et al., 2007).
On top of the interannual variability, on which pCO2 products and GOBMs seem to reach reasonable agreement, discrepancies in the CO2 flux trend since 2000 have emerged (Figure 8, Friedlingstein et al., 2022). These discrepancies highlight a major knowledge gap and urgently need to be resolved by critical analysis of potential biases in pCO2-products as well as GOBMs (see Section 4.1). While there is no evidence so far that adjustments of CO2 fluxes based on model evaluation of interfrontal salinity and Revelle factor affect the trend (Terhaar et al., 2023), data sparsity tends to lead to an overestimation of decadal variability and trend in at least two of the pCO2-products (Gloege et al., 2021; Hauck et al., 2023). Hence, both data classes need to be inspected for deficiencies.
4.4 Zonal Asymmetry of the Fluxes
While the primary spatial mode of variability in the Southern Ocean is from north to south, zonal variability in the dynamics, biogeochemistry, and carbon fluxes have been reported in the literature (Landschützer et al., 2015; Prend et al., 2022; Rintoul, 2018; Tamsitt et al., 2016). Similarly, we find substantial zonal asymmetry in both the mean states, and seasonal and interannual variability of the Southern Ocean CO2 fluxes (Figures S10 and S12 in Supporting Information S1); yet many of our results have been presented in a zonally averaged perspective for the sake of brevity.
In this work, we find that the largest zonal asymmetries in the Southern Ocean mean air-sea CO2 flux occur in the SPSS biome (Figures 4b–4e and Figure S12 in Supporting Information S1). Here, the Pacific and Indian sectors are larger sources (or weaker sinks) of CO2 to the atmosphere than the Atlantic sector. This is consistent with the pCO2-based products (Figures S12d–S12f in Supporting Information S1). The float-based pCO2-products amplify this winter outgassing dramatically. However, the GOBMs and the assimilative model ensemble averages do not show a coherent and convincing increase in outgassing in the Indian and Pacific relative to the Atlantic. The zonal asymmetry reported in the observation-based products is consistent with a recent BGC-float-based study that reported stronger outgassing in the Indian and Pacific sectors of the Southern Ocean (Prend et al., 2022). The authors attributed this dominance to stronger winter-time entrainment of deep waters to the surface in these regions. The zonal asymmetry is also apparent in the air-sea CO2 fluxes decomposed into natural and anthropogenic contributions (Figure S7 in Supporting Information S1). Here, too, the SPSS is the region with the greatest asymmetry. In the Indian sector, the large natural outgassing fluxes of the ensemble mean are nearly perfectly opposed by the anthropogenic uptake.
4.5 Link Large-Scale Synthesis to Observational Programs
The analysis presented here provides a synthesis of large-scale data sets with a focus on budgets, spatial, and temporal patterns of fluxes and carbon accumulation, and a first-order assessment of large-scale processes (e.g., thermal vs. non-thermal, anthropogenic vs. natural carbon fluxes). In particular, it highlights spatio-temporal windows for which discrepancies between data classes are largest (e.g., magnitude of winter outgassing, delicate balance of physical vs. biological processes in summer, magnitude of decadal trend of the Southern Ocean carbon sink). Importantly, this synthesis builds on contributions from many individual groups contributing repeat observations of surface and interior ocean biogeochemical properties from research vessels and ships of opportunity (e.g., Hoppema et al., 1998; Metzl et al., 1999; Pardo et al., 2017; Talley et al., 2016; van Heuven et al., 2014). The ship-based observations form the cornerstone for many of the data classes in this study: observation-based ocean interior estimates of CO2 storage assess changes in deep ocean measurements of CO2, the surface pCO2 estimates use observations from ships of opportunity, and the GOBMs are evaluated against ocean interior observations. And while sampling biases and gaps in the ship-based measurements may be filled by autonomous platforms with lower accuracy (e.g., BGC-floats), they will always require crossover validation measurements from the high-accuracy ship-board measurements. This emphasizes that the ship-based observations need to continue into the future to characterize the evolution of the Southern Ocean carbon cycle. This will only become more important, once stabilization of atmospheric CO2 will lead to a larger weight on ocean processes for control of air-sea fluxes relative to the current quasi-exponential growth rate of atmospheric CO2.
Further, detailed regional process studies employing a wide range of methodologies across disciplines are also important to further our holistic understanding of the Southern Ocean carbon cycle and to improve the description of biogeochemistry and ecosystem dynamics in GOBMs, particularly in summer. One example for such an interdisciplinary field program is along the continental shelf west of the Antarctic Peninsula where shipboard observations indicate a strong, near-shore summer undersaturation of surface pCO2 (Eveleth et al., 2017) and seasonal reduction in surface DIC (Hauri et al., 2015). The seasonal trends in the ocean CO2 system on the shelf reflect a combination of biological net community production (Ducklow et al., 2018) and meltwater input diluting surface DIC and alkalinity (Hauri et al., 2015). Regional ocean biogeochemical models simulate similar onshore-offshore gradients in surface chlorophyll, biological productivity, DIC, and pCO2 as well as the observed large interannual biophysical variability associated with year-to-year variations in seasonal sea-ice advance and retreat phenology (Schultz et al., 2021). Observed decadal trends for the region from the early 1990s to late 2010s indicate that reduced sea-ice extent associated with climate change drives an increase in upper ocean stability, phytoplankton biomass and biological DIC drawdown, resulting in a growing net downward air-sea CO2 flux during summer (Brown et al., 2019). Recent year-round, autonomous mooring observations of pCO2 and pH suggest a gradual increase in surface ocean pCO2 and DIC over the fall and winter, with CO2 outgassing during winter when pCO2 is supersaturated largely blocked by sea-ice cover (Shadwick et al., 2021; M. Yang et al., 2021). Similar large-scale programs are needed in other parts of the Southern Ocean given its size and importance in the global carbon cycle. On-going research activities, also as part of the Southern Ocean Observing System, for example, in the Ross (Smith et al., 2021) and Weddell Seas (Arndt et al., 2022) have the potential of being extended.
5 Conclusions
Here, we present a schematic overview that summarizes the main characteristics of the Southern Ocean carbon cycle 1985–2018, as derived in this analysis and its Supporting Information S1 (Figure 14). In general, the sink strength for atmospheric CO2 (net CO2 flux, FCO2) increases from South to North, but with important zonal asymmetry. The Atlantic and Indian Ocean sectors of the Subtropical Seasonally Stratified biome (STSS) are the regions that act as strongest sinks. In the Subpolar Seasonally Stratified biome (SPSS), the Atlantic sector stands out as the only sector acting as an annual mean CO2 sink. Also the seasonal cycle shows a distinct north-south gradient. In the ice-covered biome (ICE) the peak uptake occurs in summer and is driven by the seasonal cycle of DIC, that is, by physical DIC transport and biological processes. In contrast, the dominant driver of the seasonal cycle of CO2 uptake in the STSS is temperature, and thus the season of peak uptake occurs in winter. Trends in net CO2 uptake derived from Global Ocean Biogeochemistry Models (GOBMs) and surface ocean pCO2 observation based products (pCO2-products) disagree in all biomes, but the discrepancy is strongest in the Pacific sector of the STSS. In terms of anthropogenic CO2 (Cant), the strongest uptake occurs in the SPSS. This is not visible in the map of net CO2 flux, because the anthropogenic uptake manifests itself as a suppression of natural CO2 outgassing. The largest anthropogenic carbon storage occurs in the STSS and northward.

Main characteristics of the Southern Ocean carbon cycle 1985–2018. The surface ocean color shading depicts the net air-sea CO2 flux (FCO2) as the average of the ensemble means from pCO2-products and Global Ocean Biogeochemistry Models (GOBMs). Blue color denotes a CO2 flux into the ocean, and red color a flux out of the ocean. The section on the right shows the zonally integrated anthropogenic carbon (Cant) accumulation rate in the ocean interior from GOBMs. The section on the left shows the zonally averaged concentration of total (or “net”) carbon (Cnet). ICE: ice-covered biome, SPSS: Subpolar Seasonally Stratified Biome, and STSS; Subtopical Seasonally Stratified Biome.
Our analysis confirms the important role of the Southern Ocean in the global carbon cycle although RECCAP2 estimates a 50% smaller Southern Ocean CO2 sink for the same region and timeframe as RECCAP1. We have highlighted key knowledge gaps that need to be closed through extended observation systems and augmented process descriptions in the dynamic models in order to further reduce uncertainties.
Acknowledgments
We thank the editor for handling the manuscript and Galen McKinley and one anonymous reviewer for their constructive feedback. Funding to J. Hauck was provided by the Initiative and Networking Fund of the Helmholtz Association (Helmholtz Young Investigator Group Marine Carbon and Ecosystem Feedbacks in the Earth System [MarESys], Grant VH-NG-1301), by the ERC-2022-STG OceanPeak (Grant 101077209) and by the European Union's Horizon Europe research and innovation program under Grant 101083922 (OceanICU Improving Carbon Understanding). J.H., L.G., M.H., N.G., and P.M. acknowledge funding from the European Union's Horizon 2020 research and innovation programme under Grant 820989 (project COMFORT). L. Patara acknowledges funding from the project PA 3075/2-1 by the German Research Foundation and the North German Supercomputing Alliance (HLRN) for providing computing power for the experiments. L. Gregor, M. Hague, and N. Gruber further acknowledge funding from the European Union's Horizon 2020 research and innovation programme under Grant 821001 (SO-CHIC project), 821003 (project 4C). S. Bushinsky acknowledges funding from NASA Carbon Cycle Science (80NSSC22K0156), NOAA Climate Program Office's Climate Observations and Monitoring, Climate Variability and Predictability, and Global Ocean Monitoring and Observation programs (NA21OAR4310260), and NSF (OCE-2049631). S. Doney acknowledges support from the Palmer LTER program National Science Foundation Grant OPP-2026045. C. Le Quéré acknowledges support from the Natural Environment Research Council SONATA Grant NE/P021417/1 and the UK Royal Society Grant RP∖R1∖191063. J. Terhaar acknowledges funding from the Woods Hole Oceanographic Institution Postdoctoral Scholar Program, and the Swiss National Science Foundation under Grant 200020-20051. The work reflects only the authors' view; the European Commission and their executive agency are not responsible for any use that may be made of the information the work contains. Open Access funding enabled and organized by Projekt DEAL.
Open Research
Data Availability Statement
All RECCAP2 data is available at Zenodo: Müller (2023).
The processed data for this study is available at Hauck et al. (2023a).
Code for each figure is in the GitHub repository: https://github.com/RECCAP2-ocean/Southern-Ocean, with a frozen version deposited at Hauck et al. (2023b).