An examination of convective moistening of the lower stratosphere using satellite data
Abstract
In this paper, we use satellite data to test the hypothesis that deep convection moistens the lower stratosphere. Water vapor measurements from Earth Observing System-Microwave Limb Sounder and Atmospheric Chemistry Experiment-Fourier Transform Spectrometer over North America are binned according to the International Satellite Cloud Climatology Project deep convection indices. The results show that in the North American region (50–112°W, 10–50°N) the convection-impacted samples are significantly moister than the nonimpact samples in the lowermost stratospheric layer right above the tropopause, and a drier tendency is also noticed right above this moistened layer. Trajectory modeling is used to aid the identification of deep convection-impacted water vapor samples. However, we find that a substantial fraction of high-concentration (>8 ppmv) samples at 100 hPa cannot be attributed to nearby deep convections.
Key Points
- Convective moistening of lower stratosphere is evidenced by satellite data
- Dry anomaly is found immediately above the moistened layer
- Deep convection only explains a fraction of moist samples in lower stratosphere
1 Introduction
Despite its scarcity (~3–5 ppmv), stratospheric water vapor (SWV) is a climatically important atmospheric constituent due to its significant radiative and chemical impacts [Forster and Shine, 1999; Solomon et al., 2010; Anderson et al., 2012]. Recent studies have particularly addressed whether and how SWV variation may be coupled with tropospheric and surface temperatures and constitute a radiative feedback that affects climate sensitivity. Huang [2013] shows that the overall stratospheric radiative effect in coupled general circulation models may amount to 0.3 W m−2 K−1; Dessler et al. [2013] postulates that SWV alone may result in a feedback of this magnitude and especially highlight the effect of water vapor in the lower stratosphere.
Assessment of the climatic impact of SWV is impeded by the lack of understanding of processes that control its distribution and variation. An important process, transport by the overturning Brewer-Dobson Circulation, has been long recognized [Brewer, 1949]. However, it is uncertain how factors such as temperature in the tropical tropopause layer, strength of circulation, and vertical and horizontal mixing are weighted and interactively determine SWV distribution and variation [Fueglistaler et al., 2014]. Some long-term trends in the site record (e.g., the balloon measurements taken in Boulder, Colorado) cannot be fully explained by the known factors, and the abrupt decrease in global mean SWV after the year 2000 remains a mystery [Hartmann et al., 2013]. On a relevant note, it is important to bear in mind that accurately measuring water vapor at low concentrations and monitoring its global, climatic variations remain a challenge. The intercalibration issue between different in situ instruments has been a long-standing issue [e.g., Weinstock et al., 2009]. There are also considerable systematic biases between different satellite data sets [Hegglin et al., 2013]. Moreover, regional near-surface measurements and satellite data differ in terms of spatial representativeness, which potentially lead to opposite estimates of water vapor trend [Hegglin et al., 2014].
Besides methane oxidation that mostly affects the upper stratospheric water vapor budget, another process that affects SWV is overshooting deep convection, which penetrates the tropopause and is capable of directly injecting water vapor into the lower stratosphere. This moistening effect is evident from field measurements, e.g., those of Anderson et al. [2012]. It has been postulated that this may form an effective troposphere-stratosphere water vapor transport mechanism [Fu et al., 2006]. Hanisco et al. [2007] show isotopic evidence that summertime extratropical convective storms make a considerable contribution to stratospheric water vapor over the North American continent. Although global SWV climatology from satellite observations, such as Atmospheric Chemistry Experiment-Fourier Transform Spectrometer (ACE-FTS) [Randel et al., 2012] and Aura Microwave Limb Sounder (MLS) [Schwartz et al., 2013], show prominent seasonal SWV anomalies in the Asian and North American monsoon regions that seem to corroborate the account of convective impact, SWV and deep convection maxima are not spatially overlapped in both monsoon regions and the HDO signatures that are indicative of deep convection intriguingly differ between the two monsoon regions [Randel et al., 2012]. There is also noticeable discrepancy between the in situ SWV measurements and the collocated MLS samples during moistening events [Schwartz et al., 2013]. In short, it is unclear to what extent convection may affect SWV at regional or global scales. Satellite analysis of such effect is particularly lacking.
In this paper, we examine the relationship between SWV and deep convection (DC) by diagnosing satellite data sets, with a focus on the North American region (50–112°W, 10–50°N). Specifically, we (1) investigate the covariations of SWV and DC, (2) examine whether SWV concentrations differ with respect to DC occurrence, and (3) examine whether the high-concentration SWV samples can be explained by DC. The data sets and model used in our analysis will be described in the following section. Then the SWV-DC relationship diagnosed from the different perspectives will be presented. Discussion of the results and some conclusions will be given at the end.
2 Data and Methodology
2.1 Water Vapor Data
We mainly use data from Earth Observing System-Microwave Limb Sounder (MLS) aboard the NASA Aura satellite for describing SWV distribution and variation. Aura was launched in July 2004. It is a Sun-synchronous polar-orbiting satellite with a 1:45 P.M. equator crossing time and a period of about 100 min (i.e., circling the Earth about 14.5 times each day). The MLS retrieves temperature, water vapor, and other trace gases by measuring the thermal emission of the atmosphere at multiple microwave bands (the 190 GHz band is used for water vapor retrieval). Due to the 25 s measurement rate, the along-track spacing of samples is about 1.5° or about 165 km. Water vapor retrieval is given at fixed pressure levels, from 316 hPa upward. The vertical resolution of water vapor data is about 3 km in the upper troposphere and lower stratosphere (UTLS) region that we are most interested in. Details of retrieval algorithm are given by Livesey et al. [2006]. The uncertainty range of MLS SWV data is about 20% between 316 hPa and 147 hPa and 10% at 100 hPa and stratosphere [Read et al., 2007; Jiang et al., 2010]. The level 2 retrieval data (version 3.3) taken during the period from January 2005 to June 2008 are used in this study. As we are especially interested in the high-concentration samples, we have conducted a strict quality control process recommended by the MLS science team [Livesey et al., 2011]. Specifically, we have used these flags to screen the data: (1) only values located within 316–0.002 hPa are used, (2) data points for which L2gpPrecision is set negative are removed, (3) profiles for which “status” is an odd number are removed, (4) profiles whose “quality” field is less than 1.3 or “convergence” field is greater than 2.0 are removed, (5) profiles having high or low cloud status flag bits set are removed, and (6) profiles of which concentration is lower than the minimum measurable water vapor concentration at a specific level are removed.
In addition, we analyze version 3.0 SWV data from ACE-FTS, which is aboard a Canadian satellite, SCISAT, launched in August 2003 [Bernath et al., 2005]. ACE-FTS uses solar occultation in the spectral range of 750–4400 cm−1 to retrieve water vapor concentration. ACE-FTS has similar sampling footprint to that of MLS: about 300 km in the horizontal and 3–4 km in the vertical. One advantage of this instrument is that the isotopic composition of some species such as HDO can be made available. However, due to limitations of the solar occultation technique, ACE-FTS water vapor data are limited in spatial and temporal coverage and are much less abundant than MLS data. Hence, this data set is used here mainly for verification of some analysis results. We will focus on the MLS results in the following sections.
2.2 Deep Convection Data
Here we use deep convective tracking data from the International Satellite Cloud Climatology Project (ISCCP-CT) [Machado and Rossow, 1993] for the period (January 2005 to June 2008) that overlaps with MLS and ACE-FTS data sets. ISCCP data come from five geostationary satellites. We will use GOES-EAST data for the domain 50–112°W, 10–50°N because we are most interested in the North American region.
The ISCCP-CT data set was developed to identify deep convection by analyzing image pixels of the ISCCP pixel-level data (DX) data set [Rossow et al., 1996]. Pixels that have brightness temperatures less than 245 K are defined as convective system (CS), and those that have brightness temperatures less than 220 K are referred to as convective clusters (CC). Deep convective clouds range from 90 km to 350 km in size. Information including time, center position (latitude and longitude), radius, and family number of each CS are provided at 3 h time intervals and 30 km spatial intervals. Consecutive images are used to observe movement of each CS; the position and size of the CS are recorded. The same convective system detected at different time is given the same family number [Machado et al., 1998].
Here we identify DC by comparing the lowest brightness temperature (LBT) recorded in the ISCCP data set to a threshold temperature. Different threshold temperatures have been used in previous works for such purpose [e.g., Zipser et al., 2006; Kubar et al., 2007; Yuan and Li, 2009; Bedka et al., 2010; Takahashi and Luo, 2014]. We have tested different values ranging from 195 K to 245 K at 5 K interval. The results presented below are based on a threshold value of 195 K, to ensure that the DC cases selected are overshooting cases of interest. We will discuss possible impacts of this choice wherever necessary in the rest of the paper.
Based on the information provided by ISCCP-CT data, we divide MLS data into three groups: storm, nonstorm, and uncertain. Two latitude-longitude boxes are drawn here for categorizing the data. Both boxes are centered at the average center position (longitude and latitude) of each CS family within which a DC is identified. The size of the smaller box is given by the average radius of the family during its lifetime. The size of the larger box is the area covered by all CSs belonging to the family. MLS data samples falling into the smaller box are identified as “storm” samples, data outside the larger box as “nonstorm” samples, and the ones in-between as “uncertain” samples. The time window used for the storm search is set as the life period of the CS family, and the time is increased by 3 h on both ends for nonstorm search. Figure 1 illustrates an example in October 2007 of the data categorization.

2.3 Trajectory Model
The Hybrid Single-Particle Lagrangian Integrated Trajectory (HYSPLIT), version 4, is used here. This model was developed by NOAA Air Resources Laboratory [Draxler and Hess, 1998] and has been widely used to study aerosol transport and water vapor distribution [e.g., Strong et al., 2007]. We use this model to analyze the back trajectory (historical locations) of high SWV concentration samples.
Samples are tracked on isentropic surfaces [Newman et al., 2001]. A parcel at each position is traced back up to 10 days, and deep convections near the parcel locations are searched according to the same criterion used for finding storm samples above. Once a DC is encountered along the back trajectory or the trajectory extends outside the study domain (50–112°W), we will stop the tracking. To assess the uncertainty brought by the wind data, two reanalyses, National Centers for Environmental Prediction/National Center for Atmospheric Research (NCEP/NCAR) reanalysis 1 and Eta Data Assimilation System, are used here. The NCEP/NCAR reanalysis 1 provide 4 times daily data at 2.5° × 2.5° horizontal resolution since 1948. The data are presented at 17 fixed pressure levels. Details can be found in Kalnay et al. [2006]. The Eta Data Assimilation System (EDAS) data set, which covers the U.S. region, is on a 185 × 129 Lambert Conformal grid with 40 km horizontal resolution and is available at 26 fixed pressure levels ranging from 1000 hPa to 50 hPa at 3 h intervals. More information is provided at https://ready.arl.noaa.gov/edas40.php. This data set covers 60–140°W North American region, so the longitude coverage in the tropics is lower. We only use the EDAS data set to calculate trajectories for samples located in the 30–50°N band because of less overlap with the study domain in the low latitudes.
2.4 Auxiliary Data
In addition to the above data sets, ERA-Interim data set [Dee et al., 2011] is used to provide temperature profiles for calculating the environmental tropopause position defined by the World Meteorological Organization [1957] criterion.
3 Results
3.1 Covariations of SWV and DC
There are a total of 169,426 MLS profiles falling into the study domain (50–112°W, 10–50°N) during the period of January 2005 to June 2008, which are used in this paper. Figure 2a illustrates the zonal distribution of the MLS data at various levels within GOES-EAST view (50–112°W) during the same period. Only samples with water vapor concentration greater than 8 ppmv are shown here. Data are presented on the latitude-potential temperature cross section. The potential temperature (θ) is calculated as
, P0 = 1000 hPa, and T and P are temperature and pressure observed at the same level as the SWV data. It is clear from the figure that a number of moist samples lie above the climatological tropopause (380 K in the tropics and 330 K/380 K (winter/summer) in the midlatitudes). These are the cases potentially related to DC. To have an overview of the distribution of high water vapor concentrations, geographic locations of samples at 360–370 K and 380–390 K are shown in Figure 2. At 360–370 K in the North American region most high-concentration samples are detected in the midlatitudes (30–50°N) (Figure 2c). High water vapor concentration samples are much scarcer at 380–390 K. Samples with concentration greater than 12 ppmv are mostly located in the North American midlatitudes (Figure 2b). The fact that few high-concentration SWV samples lie in the tropics agrees with the understanding that vertical transport of water vapor is subject to dehydration due to cold tropopause temperature within the tropics [Holton and Gettelman, 2001]. Separating the samples into different seasons (not shown), we find that most of the high-concentration samples (>8 ppmv) that are located above the 370 K isentropic surface and poleward to 30° latitude occur in summer. This well corresponds to the seasonality of DC activity to be discussed below.

We divide the study domain into 10° × 10° grid boxes. The frequency of DC documented in the ISCCP-CT data set of each grid box is shown in Figure 3 for both summer and winter. Most deep convections in the study domain happen in summer, while the occurrence rate in winter is much lower (zero in many midlatitude boxes). This is consistent with what is found in previous studies [e.g., Laing and Fritsch, 2007].

To examine how SWV changes with DC, the correlation coefficient is calculated in each 10° × 10° box between water vapor concentration and DC frequency. The correlation result at 100 hPa is shown in Figure 4. At this level, SWV is positively correlated with DC, suggesting that deep convection explains the seasonal variation of lower stratospheric water vapor. It is also noticed that water vapor concentrations at different vertical levels vary in an unsynchronized manner. At higher vertical levels water vapor concentration tends to be anticorrelated with DC. For example, in the 10–20° N band, the water vapor anomaly at 56 hPa is opposite to DC fluctuation. In the 30–40°N band, a phase lag between the 100 hPa SWV and DC can also be seen.

We note that the seasonality of DC and thus the correlation between DC and SWV are sensitive to the LBT threshold for DC identification. We find that if including shallower DC using a threshold temperature higher than or equal to 210 K, the number of DC in midlatitudes becomes higher in winter because winter cyclones frequently occur in this region. Considering that the high-concentration samples in the lower stratosphere occur predominantly in summer in the region (see Figure 2 and discussions above), this means it is necessary to use a low threshold value (195 K is used here) to screen out the shallow convections irrelevant to the problem investigated here.
3.2 Storm Versus Nonstorm Profiles
We examine the influence of DC on water vapor distribution by comparing composite vertical profiles of storm and nonstorm groups categorized based on the procedure explained in section 2. Since the tropopause position at different latitudes is different, we divide the North American region (10–50°N, 50–112°W) into four 10° latitude bands. We calculate seasonal tropopause position based on both MLS and ERA-Interim temperature profiles. Both results suggest that in the low latitudes (10–30°N) tropopause does not have much seasonal variation and it is around 100 hPa. However, seasonal tropopause variation cannot be neglected in the midlatitudes (30–50°N). The lapse rate tropopause calculated based on MLS temperature profile is at 147 hPa in boreal summer and at 215 hPa in boreal winter in the 30–40° N band and is at 178 hPa and 215 hPa, respectively, in the 40–50°N band. The results calculated from ERA data are similar. So we examine vertical water vapor profiles in the 30–40° N and 40–50°N bands in summer (June–August) and winter (December–February), respectively. Results are shown in Figure 5.

The impact of deep convection is very noticeable. For example, in the 10–20°N band, MLS data show significant increase in water vapor concentration in the UTLS region in the storm samples as compared to the nonstorm samples (Figures 5a and 5b). This moistening signal is evident from the comparisons of most latitude bands.
Another noticeable feature is a dehydration signal at higher levels in the stratosphere, i.e., statistically significant decrease of water vapor concentration, e.g., around 40–60 hPa in the 10–20°N band. This signal is also observed in the other latitude bands. Using averaging kernels [Livesey et al., 2011] of MLS water vapor products, we find that moistening in adjacent moistened lower layers cannot fully explain the drying signal. This suggests that there may be physical reasons that account for the dehydration observed.
We also calculate the probability distribution function (PDF) of water vapor concentration at identified wetter (or drier) levels in different latitude bands. Figure 6 shows noticeable differences between the storm and nonstorm cases. The difference is especially noticeable in the low-latitude band, where concentration in the storm cases leans toward higher values than the nonstorm cases (Figures 6a and 6b). This is consistent with the results above that DC is strongly correlated with enhanced moisture in the UTLS.

The results are verified using ACE-FTS water vapor data (not shown). The moistening-drying pattern seen from ACE-FTS data agrees well with the MLS results in most cases except that the position of dry signal derived using ACE data is a bit higher than that derived using MLS data in the midlatitudes. This is likely due to the different vertical coordinates used in retrieval for the two satellites [Hegglin et al., 2013].
3.3 Trajectory Analysis
As shown by Schwartz et al. [2013], most high-concentration lower stratospheric water vapor samples are concentrated in the North American and Asian monsoon regions. If these high-concentration samples are due to convective injection, it is expected that the samples are located within a reasonable time-space window around deep convection. Table 1 shows how the samples binned by their concentration values are divided into the storm and nonstorm groups. Interestingly, we find that a substantial fraction of the moist samples cannot be explained by collocated deep convection. For example, at 100 hPa half of the >8 ppmv samples (137 out of 166 samples) belong to the nonstorm group, almost double that in the storm group (13 out of 166). A possible explanation is long-range transport of high water vapor concentration air parcels. Here this effect is analyzed using the HYSPLIT trajectory model.
Latitude Band | Group | <2 | 2–3 | 3–4 | 4–5 | 5–6 | 6–7 | 7–8 | >8 |
---|---|---|---|---|---|---|---|---|---|
10–20°N | Storm | 0 | 32 | 174 | 362 | 386 | 193 | 34 | 8 |
Nonstorm | 133 | 5,291 | 14,074 | 8,449 | 5,170 | 1,962 | 258 | 20 | |
20–30°N | Storm | 0 | 7 | 49 | 126 | 144 | 73 | 16 | 3 |
Nonstorm | 110 | 4,662 | 15,023 | 10,565 | 6,924 | 2,465 | 376 | 53 | |
30–40°N | Storm | 0 | 1 | 23 | 35 | 28 | 15 | 2 | 2 |
Nonstorm | 27 | 1,557 | 12,973 | 14,818 | 7,715 | 1,859 | 293 | 47 | |
40–50°N | Storm | 0 | 0 | 20 | 46 | 16 | 3 | 1 | 0 |
Nonstorm | 0 | 272 | 8,386 | 22,475 | 7,674 | 758 | 76 | 17 |
All samples with concentration greater than 8 ppmv in the North American region (10–50°N, 50–112°W) at 100 hPa are selected and set as the initial point to calculate back trajectories. With 20, 53, 47, and 17 samples in each latitude band, a total of 137 samples are examined in the trajectory analysis. Most of these samples are over the continent.
Since parcel release position influences trajectory result and there is uncertainty in the position of MLS samples (mainly due to its low vertical resolution), we calculate trajectories at multiple vertical levels (16, 16.5, 17, 17.5, and 18 km) around the level (100 hPa) where each sample is reported. As discussed in section 2, parcels are traced back at 48 h intervals up to 10 days. CSs that happened during the period are searched to determine whether each sample encountered DC. Here we define the influence region of one CS by the radius of each CS documented in the ISCCP data set. To simplify the procedure, 100 km is approximated as 1°. The search domain is enlarged by 1° in addition to account for the uncertainty in the CS influence region. If the trajectory crosses any CS influenced region, the high-concentration sample is tagged as “explained” by DC. For example, on 7 August 2005 8:00:00 UTC a high-concentration sample is detected at (17.8°N, 97°W). We trace back from this position and analyze back trajectories in five 48 h segments. As shown in Figure 7, the hourly positions of the parcel are recorded and the CSs that happened during the time are marked. In this case, two CS locations at 19°N that happened 45 h ago and two CS locations at 17°N that happened 20 h ago are found. This high-concentration sample will be regarded as being explained by DC within 20 h in history.

Following this procedure, each sample in the four latitude bands is traced back to search for DC encounters. Results derived using NCEP/NCAR reanalysis data are summarized in Figure 8. In the 10–20° N band, the tracking process is stopped at 8 days when there are no samples whose trajectory remains inside the domain with no DC encountered. Trajectory results explain 12 of the 20 samples in this band (Figure 8a). In comparison, 18/53 in the 20–30°N band, 26/47 in the 30–40°N band, and 11/17 in the 40–50°N band are explained. In summary, only a fraction of these moist samples can be explained by DC; the unexplained samples amounts to 70, more than 40% of all the moist samples (a total of 166). Results derived using the EDAS data are similar.

Note that we have adopted a strict criterion (195 K threshold) for selecting DC and thus may have underestimated the likelihood of DC encountering. To examine this uncertainty, we relax the criterion to 245 K and re-search for DC along the trajectory. As a result, now more than 80% samples can be explained in the 10–20°N and 40–50°N bands. However, the ratio has little change in the 20–30°N and 30–40°N bands; there are still about 30% samples that cannot be explained. Furthermore, each sample is traced back at five vertical levels, and if any of these five trajectories encounters DC, the sample is categorized as explained. But despite all the reasons that may have led to overestimation of DC encountering likelihood, a number of anomalously high concentration samples cannot be explained by either local vertical transport or long-range transport (up to 10 days and within the study domain).
4 Discussion and Conclusions
Using ISCCP convective tracking data to identify deep convection, we investigate the influence of deep convection on UTLS water vapor. We first compare MLS and ACE-FTS water vapor samples collocated with deep convection to those not collocated with deep convection. A typical pattern of a moistened UTLS layer overlaid by a drier upper layer is observed in both tropics and midlatitudes. We have also calculated the PDF of water vapor concentration at different UTLS levels, which corroborate the above results.
The drying signal above the moistened layer is interesting and was also identified by other studies [e.g., Ray and Rosenlof, 2007]. Our analysis based on the MLS weighting function suggests that this is not a spurious signal due to the vertical resolution of the retrieval data. The physical cause of this signal still eludes us and warrants further study.
It is found that many high-concentration water vapor samples cannot be explained by collocated deep convection. Using a back trajectory model, we investigate whether these samples can be caused by historical encounters with deep convection and long-range transport. Back trajectories are calculated by HYSPLIT model, using NCEP/NCAR and EDAS wind data. The attribution results derived from the two data sets are in good agreement. When tracked backward longer in time, more samples can be attributed to deep convection that happened along the trajectories. However, even when a relaxed criterion is used for determining the convective impact on the samples, a substantial fraction of samples, especially in the midlatitude bands, cannot be traced to any deep convection within the tracking domain (the North American region within the GOES-EAST field of view) and the time window (10 days). This may be due to small-scaled convection (radius less than 90 km) that is not observed by ISCCP data. Another plausible explanation is that these moist samples are due to longer-range (and time) transport. Pinning down the cause of these samples is beyond the scope of this paper but is an interesting topic for future work.
As discussed in the previous sections, DC identification (storm sample selection) is subject to the threshold LBT used in this study. A range of threshold values is tested. To minimize the impact of shallow (nonovershooting) convection on the storm composite, a rather low value of 195 K has been used for most of the results presented in this paper. On the other hand, the nonstorm composite is little impacted by this criterion owing to the much larger sample size. In general, the storm-nonstorm difference remains a moister UTLS layer overlaid by a drier layer, although the moistening in the lower stratosphere (at the MLS retrieval levels above the tropopause) becomes indiscernible in midlatitudes (30–40°N) when LBT threshold higher than 210 K is applied. In addition, we notice that the level that separates the drier upper layer from the moister lower layer goes upward when colder LBT threshold is used, which suggests that stronger overshooting deep convection “pushes” both moistened and dehydrated layers upward. Modeling results showed that high-capped DCs are more likely to moisten high stratosphere levels [Dessler et al., 2007]. Our analysis provides observational evidence for this argument.
Acknowledgments
We thank Bill Randel, Andrew Gettleman, Jonathan Jiang, Jonathan Wright, Longtao Wu, and Johnny Luo who made useful comments on the paper. We also thank two anonymous reviewers for their very helpful comments. This work is supported by a FRQNT funding 2015-NC-181248. Y.S. is also supported by a Stephen and Anastasia Mysak Graduate Fellowship. The NCEP Reanalysis data are provided by the NOAA/OAR/ESRL PSD, Boulder, Colorado, USA, from their Web site at http://www.esrl.noaa.gov/psd/.