Toward Understanding of Long-Term Nitrogen Transport and Retention Dynamics Across German Catchments
Fanny J. Sarrazin and Pia Ebeling contributed equally to this work.
Abstract
Long-term nitrogen (N) transport and retention dynamics across catchments are not well understood. Using a process-based model for 89 German catchments, results across study catchments reveal that most N surplus (during 1950–2014) was removed by denitrification (mean ± standard deviation: 58 ± 15%) while the remaining fraction was mostly stored in the soil (14% ± 11%). The mean groundwater transit times in these catchments varied from 3.2 to 20.3 years. These results indicate that past N inputs could continue to affect surface and groundwater quality in the coming years. We identified four catchment groups with distinct archetypal N transport and retention dynamics, which are linked to the catchments' climate, topographic, and geological conditions. Overall, our results shed light on long-term N dynamics in German catchments and how they are linked to catchment characteristics, emphasizing the role of long-term N accumulation and transport for water quality management and evaluation programs.
Key Points
-
We provided insights into the long-term (1950–2014) nitrogen (N) transport and retention across various German catchments
-
Large-sample assessment shows that 57% of N surplus was removed by denitrification and 15% of N surplus was accumulated in the soil zone
-
Four catchment clusters with distinct nitrogen transport and retention dynamics were linked to climatic, topographic, and geological factors
Plain Language Summary
High nitrate concentrations in German water bodies are quite common. It is unclear to what degree current nitrogen levels in water bodies are due to current or past nitrogen (N) application on agricultural fields. What happened to excess N (N was not taken up by plants) in catchments has not been fully understood. In this study, we modeled the fate of excess N in 89 German catchments during the period 1950–2014. Our results suggest that most of the excess N was removed from the catchment in gaseous form (denitrification), and a substantial portion of the remaining excess N was stored in the soil zone. This soil N can leach into the deeper zone (groundwater) where it may travel for 3–20 years to reach the catchment outlet. We also identified four distinct groups of catchments with different behavior in terms of N transport and storage, which further exhibited different climatic, topographic, and subsurface conditions, suggesting that these factors could play a role in catchment N transport and retention. Overall, our results show that there could be a substantial delay between implemented management practices and resulting changes in surface or groundwater quality, which should be considered in water quality management.
1 Introduction
Human activities, especially agriculture, have drastically changed the Earth's landscape and disturbed the global nitrogen (N) cycle (Foley, 2017; Vitousek et al., 1997). N surplus (excess of N inputs) from global croplands increased more than fivefold from 16 Tg N yr−1 in 1961 to 86 Tg N yr−1 in 2010 (Zhang et al., 2021) and it is likely to continue increasing until at least 2050 (Bouwman et al., 2013). In many areas, excess use of N fertilizers on agricultural lands was identified as one of the main causes of surface water and groundwater deterioration, resulting in negative impacts on human health and aquatic ecosystems (EEA, 2021; Evans et al., 2019). Regulations at the national and international levels, for example, the Clean Water Act in the United States (EPA, 1972), the Nitrates Directive in Europe (EEC, 1991), and the Action Plan for the Zero Increase of Fertilizer Use in China (Ju et al., 2016), have been introduced to reduce excess N inputs to agricultural lands and to protect water quality. However, the implementation of such mitigation regulations does not always lead to immediate or clear responses of surface water and groundwater quality (Brown & Froemke, 2012; EEA, 2021; Smith et al., 1987). This requires a sound understanding of long-term N transport and retention.
The lag times from changes in N management practices and changes in groundwater or surface water quality vary from years to decades (Chen et al., 2014, 2018; Meals et al., 2010). The reason for these lag times was discussed to be the accumulation of N in the soil (mainly soil organic nitrogen—SON) as biogeochemical legacy and in the subsurface (unsaturated and groundwater, mainly dissolved inorganic nitrogen—DIN) as hydrological legacy (e.g., Basu et al., 2022; Chen et al., 2018; Van Meter et al., 2016, 2017). SON and groundwater DIN accumulations are controlled by mineralization and immobilization rates in the soil and groundwater transit times, respectively. Several studies suggest that most of the N surplus in the catchment is stored as SON while groundwater DIN is comparatively small (Ascott et al., 2017; Chen et al., 2018; Galloway et al., 2003; Liu et al., 2021; Van Meter et al., 2016). Nevertheless, groundwater DIN storage could affect stream water quality status over decades due to long transit times (Chen et al., 2018). There have been several studies explored the biogeochemical and hydrological lag times, for example, in the Mississippi River basin (Van Meter et al., 2016, 2017), the Susquehanna River basin (Van Meter et al., 2017), the Weser River basin (Sarrazin et al., 2022), and in other basins (Chen et al., 2018). The aforementioned studies, however, were conducted in individual or only a few catchments. Understanding and predicting long-term N transport and retention across a variety of landscape characteristics, hydroclimatic drivers, and anthropogenic impacts rather requires studies with a large sample of catchments.
In recent years, some studies have linked long-term N transport and retention with catchment attributes using a large sample of catchments to discuss underlying processes controlling the build-up of N legacies. However, only a few studies have explicitly separated the soil (biogeochemical legacy) and groundwater (hydrological legacy) N dynamics using different modeling approaches and for different study areas. For example, McDowell et al. (2021) found that lag times between soil N leaching and riverine N export in 34 catchments in New Zealand varied from 1 to 12 years with higher lag times in catchments with higher altitudes, less steep slopes, higher stream order, and higher evapotranspiration. In 14 nested catchments located in the Grand River Watershed, Liu et al. (2021) reported that about 82%–96% of the catchment N was stored in the soil and the remaining was stored in groundwater. The mean transit times in groundwater in these catchments ranged from 5 to 34 years with longer transit times found in catchments with higher tile drainage density.
Some recent studies have directly linked N surplus to riverine N export without an explicit separation between the soil zone and groundwater (e.g., Dupas et al., 2020; Ehrhardt et al., 2021). In these studies, “missing N” is often used to refer to the amount of N that can be either stored in the catchment or be permanently removed via denitrification. For example, lag times between N surplus and the peak riverine N export (mode of the N transport time distribution) in 16 catchments located in Western France were found to vary from 2 to 14 years, depending on catchment lithology (Dupas et al., 2020). In these catchments, about 45%–88% of N surplus was missing N. At a larger scale spanning over 238 catchments in Western Europe, the mode of N transport times were reported to be around 5 years, on average with a higher mode of N transport times in catchments with higher potential evapotranspiration and lower precipitation seasonality (Ehrhardt et al., 2021). They also found that catchments with thicker unconsolidated aquifers have a larger amount of missing N while a higher fraction of consolidated and porous aquifers show a smaller amount of missing N. While these studies provided empirical (data-based) evidence on the fate of missing N, there is generally a lack of understanding of the different components of the missing N (e.g., soil N storage, groundwater N storage, soil, and groundwater N denitrification) and their relation to catchment characteristics. This knowledge gap is important for a more mechanistic understanding of long-term N characteristics in catchments and allows better-targeted management strategies for abating N pollution.
The aims of this study are (a) to provide quantitative estimations of different components of the “missing N” across German catchments and (b) to discuss the linkages between long-term N transport and retention, and catchment characteristics. To this end, we investigated long-term N transport and retention in different terrestrial components (soil and groundwater) across 89 catchments in Germany with diverse settings. We used a parsimonious, process-based model that allows for an explicit characterization of biogeochemical and hydrological legacies. Moreover, we discussed how our findings could be used for management purposes and provide potential implications for other catchments.
2 Materials and Methods
2.1 Study Area and Data
The study uses data from 89 catchments (out of which 70 are non-nested catchments) located in Germany (Figure 1). Among 19 nested catchments, larger catchments cover smaller catchments and areas beyond smaller catchments. In total, the study area has a non-overlapping area of 120,596 km2, which is about one-third of the German territory. The catchment area varies between 19 and 49,760 km2 with a median area of 742 km2, covering both German lowlands and mountainous areas. Agriculture is the dominant land use in most of the study catchments, accounting for (median value) 56% of the catchment area. Unconsolidated rocks are dominant in the Northern German lowlands and South-Eastern Germany, while consolidated rocks dominate in Central and South-Western Germany (Figure S9 in Supporting Information S1). The distribution of precipitation, air temperature, topographic gradient (slope), aquifer depth, and other catchment characteristics indicate that the selected catchments have diverse settings (Figure S8 in Supporting Information S1).
The catchment-scale annual N-surplus from 1950 to 2014 was calculated from the fractional contribution from agricultural and non-agricultural land uses (forest, buildup, and other vegetated and non-vegetated lands), based on their relative areas (Figure S10 in Supporting Information S1). Land uses were constructed by combining the Corine Land Cover data set (EEA, 2019), the History Database of the Global Environment data set (HYDE data set, Goldewijk et al., 2017), and statistical agricultural area data of Germany (Statistisches Bundesamt, 2021) similar to Sarrazin et al. (2022). The N surplus for agricultural areas is available at the county level (NUTS 3—Nomenclature of Territorial Units for Statistics 3) for the period 1995–2014 (Häußermann et al., 2020) and at the state level (NUTS 1) for the period 1950–1998 (Behrendt et al., 2003). The two data sets were harmonized to create consistent time series of N surplus for the period 1950–2014 following Ehrhardt et al. (2021) and Ebeling et al. (2022). The N surplus for non-agricultural areas was estimated as the sum of atmospheric N deposition (Lamarque et al., 2012; Tilmes et al., 2016) and biological N fixation. Biological N fixation rates of 16 kg ha−1 yr−1 for the forest and 2.7 kg ha−1 yr−1 for the other vegetated land were taken based on the mean rates reported in Cleveland et al. (1999) for temperate forest and natural grassland, respectively (Sarrazin et al., 2022). The catchment-scale annual N point sources for the period 1950–2014 were constructed using the methodology of Morée et al. (2013) and information on population counts (HYDE data set), protein supply (FAO, 1951; 2021a; 2021b), and population connection to sewer and wastewater treatment plants (WWTPs; Eurostat, 2016, 2021; Seeger, 1999) (for further details see Sarrazin et al., 2022). The reconstructed N loading from WWTPs was constrained to follow the N loading reported by the authority for the period 2012–2016 (Büttner, 2020; Yang et al., 2019), following Sarrazin et al. (2022).
Daily instream nitrate concentrations were reconstructed from irregularly observed instream NO3-N data using Weighted Regression on Time, Discharge and Season (WRTDS, Hirsch et al., 2010) and were aggregated (discharge-weighted mean) to yearly estimates. Simulated daily discharges from the mesoscale Hydrologic Model (mHM, Kumar et al., 2013; Samaniego et al., 2010) were bias-corrected using piece-wise linear regression and used for gap filling if observed discharges were not available for WRTDS (Ehrhardt et al., 2021). Further details on instream nitrate (NO3-N) concentrations and discharge data at outlets of selected catchments can be obtained from Ebeling et al. (2022). For all of the selected gauging stations, the minimum time series length of instream NO3-N concentrations was 20 years (Figure S7 in Supporting Information S1) and the median number of observations was 426 (min = 154, max = 1,294). In general, the performance of the WRTDS is acceptable with a median RMSE of 0.7 (min = 0.16, max = 3.0), R2 of 0.63 (0.2, 0.8), and BIAS of 0.02 (−0.24, 0.03) (Figure S7 in Supporting Information S1).
2.2 Representation of N Transport in the Catchment
In this study, we used a parsimonious representation of soil N dynamics and a mechanistic representation of N transport in groundwater using the concept of StorAge Selection (SAS) function (Botter et al., 2011; Nguyen et al., 2021, 2022; Van der Velde et al., 2010). The model, called the StorAge Selection function for Nitrate (SAS-N, Figure S1 in Supporting Information S1), consists of two dominant N storages representing the soil zone and groundwater (e.g., Nguyen et al., 2021; Van Meter et al., 2017). The SAS-N model (a) can be considered as an improved version of the catchment-scale lumped transfer function approach (Ehrhardt et al., 2021) with an explicit representation of the soil and groundwater compartments, and (b) has a more realistic representation of groundwater transport with dynamics groundwater transit times compared to other models (e.g., Van Meter et al., 2017). The SAS-N model operates at a yearly time step and is driven by N surplus and effective precipitation (the difference between precipitation and actual evapotranspiration). N surplus can be accumulated in the soil zone as SON, denitrified, or leached to the groundwater as DIN (nitrate). Leached N to the groundwater can be further denitrified and exported to the stream using the SAS approach (Benettin et al., 2013; Nguyen et al., 2022). N point sources (e.g., from WWTPs) are added to the riverine N export and routed to the catchment outlet taking into account instream removal (Sarrazin et al., 2022). A detailed description of the SAS-N model is given in the Text S1 in Supporting Information S1.
The SAS-N model contains six calibration parameters (Text S1 and Table S1 in Supporting Information S1). These parameters were identified by running the model for each catchment with 50,000 parameter sets generated by uniform Latin Hypercube Sampling (LHS) within their pre-defined ranges (Table S1 in Supporting Information S1), assuming that there is no correlation among initial parameter values. The model was run from 1800 to 2014 with 1800–1949 taken as the warm-up period. The model performance was evaluated against (a) instream nitrate concentrations at the corresponding catchment outlet with the root mean square error and (b) observed groundwater nitrate concentrations (see Text S2 and Text S3 in Supporting Information S1 for more detail on the model performance). The mean results of the 30 best simulations from each catchment were used for all of the following analyses. As the number of initial parameter sets is large and behavioral parameter sets (corresponding to the 30 best simulations) could be a combination of any parameter values, parameter correlation could exist among some behavioral parameters (Figure S5 in Supporting Information S1).
2.3 Cluster Analysis
The objectives of the cluster analysis were to find distinct archetypes of long-term N transport and retention and to characterize their relationships with catchment attributes. In water quality studies, the k-means clustering algorithm (Hartigan & Wong, 1979) has been used, for example, to understand patterns and controls of catchment-scale nitrate storage (Ascott et al., 2017), groundwater geochemistry (Frapporti et al., 1993), and aquifer vulnerability (Javadi et al., 2017). As an unsupervised machine learning approach, k-means clustering does not require prior knowledge about the underlying patterns of the data sets. The modeled long-term (1950–2014) mean behavioral N fluxes and stores (in terms of % N surplus) characterizing transport and retention processes, including the transit times, from the 30 best model simulations (behavioral simulations) for each catchment were used for the clustering (Text S4 in Supporting Information S1). Then, statistical properties of various catchment attributes (Figure S8 in Supporting Information S1) within each cluster were calculated to identify differences in the catchment attributes among clusters. The tuning parameter of the k-means is the number of clusters that we optimized using a combination of the silhouette (Rousseeuw, 1987), elbow (Kodinariya & Makwana, 2013), and gap statistic (Tibshirani et al., 2001) methods to have a robust estimation (Figure S11 in Supporting Information S1).
3 Results and Discussion
3.1 Long-Term N Transport and Retention
The simulated mean long-term (1950–2014) N fluxes and stores across all catchments (Figure 3a) show that only 27 (mean of mean behavioral simulations) ± (standard deviation of mean behavioral simulations) 13% of N surplus was exported to the stream, in other words, the “missing N” accounts for 73% ± 13% of N surplus (equivalent to 35 ± 6 kg ha−1 year−1). These estimated values are well within the range reported by Ehrhardt et al. (2021) for Western European catchments. Results from our study suggest that the majority of N surplus was removed by denitrification in the soil zone (30% ± 15%) and groundwater (27% ± 11%). This is in line with the findings from Sarrazin et al. (2022), who showed that more than half of the N surplus in the Weser catchment in Germany was removed via denitrification. Seitzinger et al. (2006) also found that denitrification in the soil was generally higher than in groundwater at a global scale. About 14% ± 11% of N surplus that entered the catchments between 1950 and 2014 was accumulated in the soil zone while that was only 1% ± 0.9% in the groundwater. A dominance of soil N accumulation over groundwater N accumulation in catchments has been confirmed in earlier studies across western France (Dupas et al., 2020), the Danube (Malagó et al., 2017), the Weser (Sarrazin et al., 2022), and the Mississippi (Van Meter et al., 2016) river basins. An independent estimation based on groundwater N-stocks and maximum increase in groundwater nitrate concentration also showed that only around 1% of N surplus was accumulated in the European groundwater zone (Howarth et al., 1996). Although groundwater N accumulation was low compared to soil N accumulation, groundwater nitrate concentrations in 26% of groundwater observation wells in Germany still exceeded the threshold of 50 mg/L of for drinking water (EEA, 2018). In catchments with relatively high soil N accumulation and long transit times in groundwater, stream water quality status could be affected by past N inputs applied several years ago. Here we found that the mean transit times of discharge (and dissolved N), the time elapsed since a water parcel enters the groundwater to the time it leaves the catchment via discharge, varied between 3.2 and 20.3 years with a median value of 7.1 years (Figure 2b). It should be noted that there is also variability in the simulated long-term N fluxes and stores among behavioral simulations within a catchment. In general, higher simulated fluxes or storages have higher standard deviations, except the instream N export because it is the calibrated variable (Figure S12 in Supporting Information S1).
The time series of N fluxes and stores among different catchments show a wide range of variations in levels but also similarities in patterns (Figures 2c–2f). The N surplus, mean behavioral soil N and groundwater N storages from all catchments show a significant increasing trend (Mann-Kendall trend test (MK, Mann, 1945; Kendall, 1975) with p-value < 0.001) during the 1950–1988 period (Figures 2c–2e). After 1988, N surplus declined significantly (MK, p-value < 0.05, mean slope = −0.93) in 74 catchments, out of which 13 and 3 catchments nevertheless showed an increasing trend in soil N and groundwater N storages (MK, p-value < 0.05, mean slope < −0.77), respectively. While the median N surplus across all 89 catchments in 2014 was reduced by 57% compared to that of 1988, the median of the mean behavioral soil, groundwater N storages, instream N concentrations and loadings decreased only by 15% and 16%, 23%, and 49%, respectively (Figures 2c–2f, blue lines). The small reduction of groundwater N storage since 1988 found in this study is also in line with a slight decline in observed groundwater nitrate concentrations in recent decades across Germany (Van Grinsven et al., 2012).
3.2 Linking N Characteristics to Landscape Attributes
Results from the k-means analysis indicate that the study catchments can be grouped into four clusters based on their underlying N transport and retention dynamics (Figures 3a and 3b and Figure S5 in Supporting Information S1). In general, catchments in the same cluster are located closer to each other (Figure 3a). This is expected as spatial similarity in neighborhood exists for many hydrological and water quality processes (Detenbeck et al., 1996; Western et al., 2004). The number of catchments within each cluster varies from 12 to 32 and each catchment cluster shows distinct long-term N dynamics (Figures 3a and 3b). The salient features of the four clusters can be summarized as: catchment cluster one has high soil N leaching and instream N export, and short groundwater transit times, catchment cluster two is characterized by high soil and groundwater N accumulations and long groundwater transit times, catchment cluster three shows high soil denitrification, and catchment cluster four has high groundwater denitrification.
Regarding the catchment attributes, catchments in cluster one are characterized by high altitude, high precipitation, high topographic slopes, and low topographic wetness index (Figure 3c). We argue that these conditions lead to a dominance of fast shallow flow paths with short transit times in both soil and groundwater, resulting in low soil and groundwater N storage, high soil N leaching, low denitrification, and high instream N export relatively to N surplus (Figure 3a). In addition, shallow aquifers and a high fraction of consolidated rocks in catchment cluster one are also factors that may lead to low N storage and short transit times in groundwater (Figures 3b and 3c). The catchments in cluster one are also minimally disturbed mountainous forested catchments with low N surplus (Figure 3c). In contrast, catchments in cluster two can be interpreted as managed lowland catchments (low altitudes and slopes) with agriculture-dominated landscapes and high N surplus (Figure 3c). Lower precipitation and higher aridity in these catchments could cause lower soil moisture that restricts soil denitrification and flushing (leaching) of soil N, leading to higher soil N storage. Lower topographic slopes and deeper aquifers observed in these catchments facilitate deeper flow paths with longer transit times. Long transit times in combination with low aquifer denitrification rate could be an explanation for the relatively high fraction of groundwater N accumulation compared to the other clusters. High N accumulation in the catchments leads to low instream N export.
Catchment cluster three is located in a comparable range of altitudes to catchment cluster one but with lower slopes and higher fractions of agriculture, lower precipitation, higher precipitation seasonality, and lower mean temperature. Soil denitrification in catchment cluster three was found to be the highest among the four catchment clusters. The precipitation seasonality with higher summer precipitation causes higher soil moisture during the warm and biologically active season and could thus enhance soil denitrification. Additionally, high soil pH might cause high soil denitrification in cluster three, as shown for southern Germany (Müller et al., 2022). Groundwater denitrification in catchment cluster three is relatively low compared to the others due to low soil N leaching. Catchment cluster four is located in the lowland areas as is catchment cluster two, but with slightly higher precipitation, causing higher soil N leaching and lower soil N storage (Figures 3a and 3c). The mean fraction of sedimentary aquifers in catchment cluster four is the highest among the four catchment clusters with deeper aquifers. This could indicate long transit times, high anoxic conditions and abundance of electron donors (Ebeling et al., 2021; Knoll et al., 2019), resulting in high groundwater denitrification in the catchment cluster four.
4 Limitations and Outlook
In this study, riparian denitrification was implicitly captured in either groundwater or instream denitrification. This could lead to higher estimated instream or groundwater denitrification rates or lower estimated groundwater transit times. In addition, including the 19 nested catchments could cause bias in the general results due to duplication of the results for the same areas, however, larger catchments also cover areas beyond the smaller subcatchments. The number of clusters could change when more or fewer catchments are considered due to increasing available data or adding new constraints for catchment selection. Furthermore, we did not consider different forms of N surplus and the carbon-to-nitrogen ratio, which can affect N dynamics in soils (Robertson & Groffman, 2015). Nevertheless, our findings are in line with results from existing studies within the study area or elsewhere (Section 3.1) and the cluster analysis gave plausible results regarding existing process understanding (Section 3.2). Here, we suggest using more data, especially data regarding long-term soil and groundwater nitrogen dynamics, to better constrain the model results (Lutz et al., 2022).
5 Summary and Implication
Our results suggest that overall there is a large amount of accumulated N in the soil zone as biogeochemical legacy, while the magnitude of N accumulation in groundwater (in form of dissolved inorganic N) is comparably low. Both biogeochemical and hydrological N legacies could have a significant impact on instream water quality for the next few decades as shown by the mean transit time of discharge could be up to 20.3 years. The k-means clustering revealed that N transport and retention characteristics can be explained by climatic (precipitation, aridity index), topographic (altitude, topographic slope), and aquifer (aquifer depth) characteristics.
We propose that results from the cluster analysis can be used for a qualitative assessment of long-term N characteristics in other catchments within and beyond the physical boundaries of our study area. In particular, our results have shown that catchments located in close spatial proximity tend to behave more similarly than catchments located more distant from each other. Therefore, long-term N characteristics in ungauged catchments can possibly be inferred from their neighboring catchments. On the other hand, knowing the catchment attributes could help to identify the catchment archetype for N transport (cluster) as demonstrated in this study (Section 3). The linkage between catchment characteristics and dominant N transport, storage and removal processes could inform the development of robust parameter regionalization techniques in future modeling studies (e.g., Kumar et al., 2013; Samaniego et al., 2010).
This study highlights the importance of considering N legacy effects in water quality modeling, management, evaluation programs, and having catchment-specific N management approaches as catchment responses to N surplus are highly heterogeneous. Neglecting N legacies in catchment water quality modeling could provide the right results for the wrong reasons, leading to false implications for management practices (Basu et al., 2022). In catchments with a high accumulation of N in the soil zone (e.g., low-land catchments), a long-term effort is needed to achieve good chemical status for the groundwater bodies as N in the soil zone will continue to leach to the groundwater, potentially causing elevated groundwater N concentrations over a long period. In such catchments, management practices should also focus on managing soil N storage. Depending on the forms of N inputs, N can be further accumulated in the soil, therefore, soil N management could be the choice of N forms in the applied N inputs. In addition to soil N accumulation, long transit times in groundwater, could delay the effects of current management practices on changes in surface water quality, which should be taken into account for evaluation programs (EEA, 2021).
Acknowledgments
We thank Stefanie R. Lutz for her comments on the initial version of the manuscript. We thank the Editor and the two anonymous reviewers for their thoughtful comments. Pia Ebeling was funded by the Deutsche Forschungsgemeinschaft (DFG).
Open Research
Data Availability Statement
The model code and model results are available at https://doi.org/10.5281/zenodo.6788552. All catchment attributes can be obtained from https://doi.org/10.4211/hs.0ec5f43e43c349ff818a8d57699c0fe1.