Can Regional Climate Modeling Capture the Observed Changes in Spatial Organization of Extreme Storms at Higher Temperatures?
Abstract
The spatial extent and organization of extreme storm events has important practical implications for flood forecasting. Recently, conflicting evidence has been found on the observed changes of storm spatial extent with increasing temperatures. To further investigate this question, a regional climate model assessment is presented for the Greater Sydney region, in Australia. Two regional climate models were considered: the first a convection-resolving simulation at 2-km resolution, the second a resolution of 10 km with three different convection parameterizations. Both the 2- and the 10-km resolutions that used the Betts-Miller-Janjic convective scheme simulate decreasing storm spatial extent with increasing temperatures for 1-hr duration precipitation events, consistent with the observation-based study in Australia. However, other observed relationships of extreme rainfall with increasing temperature were not well represented by the models. Improved methods for considering storm organization are required to better understand potential future changes.
Key Points
- The spatial dependence of extreme rainfall to temperature simulated by two regional climate models is compared to observations
- The convection-resolving model failed to simulate the positive scaling of peak precipitation for 3-hr storms
- The model that adopts the Betts-Miller-Janjic convective scheme appears to best capture the observed precipitation-temperature scaling
1 Introduction
Intensifying short-duration rainfall extremes in a warming climate have been confirmed in numerous studies based on both observations and climate model projections (Beck et al., 2015; Demaria et al., 2017; Leahy & Kiely, 2011). This is of great concern to society as it can lead to an increased risk of flash flooding (Westra et al., 2014). These changes are generally attributed to the increased moisture holding capacity of the atmosphere at higher temperatures as governed by the Clausius-Clapeyron (CC) relationship. While the saturation water vapor pressure increases exponentially with temperature at the rate of approximately 7% per degree Celsius (see section 3.7.3; Wallace & Hobbs, 2006), higher rates of increase (super-CC scaling) may result due to the changes of storm dynamics at warmer temperatures (Emori & Brown, 2005; Loriaux et al., 2013; O'Gorman, 2012). Based on observational records, super-CC scaling was found in the hourly rainfall extremes for temperatures between 10 °C and 22 °C in Netherlands, Belgium, Switzerland, Australia, and Hong Kong (Lenderink et al., 2011; Lenderink & van Meijgaard, 2008, 2010; Wasko et al., 2015). The super-CC scaling is attributed to positive feedbacks from the excess latent heat released in extreme rainfall events (Lenderink & van Meijgaard, 2010). The precipitation event type also effects the scaling of subdaily extreme storms with temperature with different rates of scaling found for convective and stratiform events (Berg & Haerter, 2013; Molnar et al., 2015).
While the scaling of extreme rainfall with temperature may provide some guidance on potential changes to flood producing rains due to anthropogenic climate change, it is not the entire picture. Any changes in the spatial distribution of rainfall events could act either to intensify or offset the effects of increasing storm rainfall intensities. Observations in Australia indicate that storms tend to become more concentrated around the storm center with increasing temperatures, with smaller effective radii and more spatial variability (Wasko, Sharma, & Westra, 2016). This is consistent with model studies using the weather forecasting and research model with the Grell-Devenyi convective parameterization (Wang & Kotamarthi, 2015) where decreases in storm size regardless of season were predicted (Chang et al., 2016) and decreases in storm cell size use a downscaled global climate model (GCM) and a stochastic weather generator (Bordoy & Burlando, 2014). Additionally, extratropical cyclones under climate warming have been predicted to become more concentrated and intense (Pfahl et al., 2015; Shi & Durran, 2016). However, there is also evidence that storm areas may be larger at higher temperatures (Lenderink et al., 2017; Lochbihler et al., 2017), highlighting the urgent need for more research in this area to resolve these conflicting findings.
One way of providing further insight into this issue is through climate modeling studies. GCMs typically run at a coarse resolution of 100–300 km and are unable to resolve the physical processes related to short-duration extreme rainfall. Regional climate models (RCMs) provide a better representation of localized processes including some skill in the representing the temperature dependence of extreme rainfall (Singleton & Toumi, 2013). However, there are still concerns with the ability of RCMs to represent the timing of extreme storms caused by deep convection (Cortés-Hernández et al., 2016). A further problem is the suppressed variability in temperature scaling of rainfall compared to observations. The convection parameterization schemes in RCMs lead to these problems, because they misrepresent the local scale physical processes that drive deep convection and contribute to short-duration extreme rainfall.
These problems have been addressed using convection-resolving (i.e., horizontal resolution less than 4 km) models (CRMs; Kendon et al., 2017). Such models have shown added value in producing subdaily rainfall characteristics with improved representation of diurnal cycle of local scale convection (Argüeso et al., 2016) and occurrence of hourly heavy rainfall events (Ban et al., 2014). Model and observed scaling rates of precipitation extremes with temperature have also been found to be similar (Bao et al., 2017). However, the temperature scaling of the spatial distribution of extreme rainfall events from these models has not yet been investigated. We investigate the temperature and spatial rainfall scaling relationships from a 2-km CRM (CRM2) and compare these to the relationships from observations and a 10-km convection-parameterization model (CPM10) over the Greater Sydney region.
2 Data and Method
2.1 Observations
This study used the same subdaily observational records as in Wasko, Sharma, and Westra (2016) restricted to the Greater Sydney region to match the domain of the RCMs. Six-minute precipitation and 3-hourly 1.2-m dry-bulb temperatures were provided by the Australian Bureau of Meteorology for 53 gauges (Figure 1a). The temporal coverage of the observations varies with gauges. To maximize the data length used for analysis, the full record at each gauge was used. The 6-min observational rainfall was aggregated to hourly time scale to match the temporal resolution of the model simulations.

2.2 Regional Climate Model Simulations
The CPM10 and the CRM2 chosen for this study are part of the New South Wales/Australian Capital Territory Regional Climate Modeling project (Evans et al., 2014; Evans & Argüeso, 2015) and used the Weather Research and Forecasting (WRF) model version 3.3 (Skamarock et al., 2008). The CPM10 has output at hourly resolution using three different physical schemes (Table 1) from 1950 to 2010. The CPM10 model is nested inside the 50-km simulation that is driven by the 6-hourly boundary conditions extracted from the National Centers for Environmental Prediction-National Center for Atmospheric Research reanalysis project (NNRP). The detailed model design can be found in Evans et al. (2014), and extensive evaluation of the simulations has found a good representation of precipitation and temperature fields (Olson et al., 2016) including extremes (Cortés-Hernández et al., 2016; Evans et al., 2017) and climate mode teleconnections (Fita et al., 2017). Apart from the aforementioned 3 NNRP-driven CPM10 ensembles, the three 10-km RCMs have another 12 simulations with boundary conditions derived from 4 different GCMs. As the reanalysis data set is considered to better represent the observations than GCM simulations, the reanalysis-driven RCM will provide a better assessment of model skill compared to observations than the GCM-driven RCM. As such, this study only used the NNRP-driven CPM10 simulations. The CRM2 is double nested within the 50- and 10-km WRF simulations and covers the period from 1990 to 2009. The CPM10 simulations cover southeast Australia, whereas the CRM2 is only available over the study domain of the Greater Sydney region (Figure 1). The CRM2 simulation has also been evaluated against observations demonstrating good performance for temperature (Argüeso et al., 2014) and spatially varying biases for precipitation (Evans & Argüeso, 2015). The CRM2 simulation has previously been used to investigate the climate change impact on areal reduction factors (Li et al., 2015) and design rainfall (Li et al., 2017). Hourly precipitation and surface air temperature simulated by the CRM2 and the three ensemble members of the CPM10 were used. To ensure that the results from gridded simulations and point observations are comparable, only the nearest grid cells corresponding to the 53 subdaily gauges were used in analysis. The time period used for the spatial field analysis is 1990–2009 for both RCMs. Table S1 in the supporting information summarizes the statistics of temperature and precipitation at Sydney Observatory Hill (151.2050°E, −33.8607°S) for the observations and model simulations. All four model runs simulated the temperature statistics and the mean precipitation well. The models underestimate the 90th percentile rainfall with the CRM2 model providing the best match to the observations. The maximum rainfall simulated by the models is quite variable compared to observations. Given that the resolution of the CPM10 models is 10 km, some moderation of the extremes is expected, which is consistent with typical areal reduction factors. The overestimation of the maximum rainfall by CRM2 is consistent with the findings of Evans and Argüeso (2015).
CPM10 ensemble run | Planetary boundary layer scheme | Cumulus convection scheme | Cloud microphysics scheme | Shortwave/longwave radiation physics scheme |
---|---|---|---|---|
R1 | Mellor-Yamada-Janjic/Eta similarity | Kain-Fritsch | WRF double moment 5-class | Dudhia/rapid radiative transfer model |
R2 | Mellor-Yamada-Janjic/Eta similarity | Betts-Miller-Janjic | WRF double moment 5-class | Dudhia/rapid radiative transfer model |
R3 | Yonsei University/MM5 similarity | Kain-Fritsch | WRF double moment 5-class | National Center for Atmospheric Research community atmosphere |
2.3 Method
The aim of the analyses is to investigate the spatial organization of moisture within a storm following the method of Wasko, Sharma, and Westra (2016). The data used in analyses for both the observations and RCM simulations are from the 53 subdaily gauges locations. The method used for the observations is also applicable to the RCM simulations. The process starts by identifying independent 1 and 3-hr storms in the hourly precipitation time series using a minimum interevent dry period of 5 and 15 hr, respectively (Wasko & Sharma, 2014). The maximum 1 and 3-hr precipitation burst within each storm was paired with the 24-hr temperature average centered on the time of the maximum precipitation. The 24-hr temperature average was chosen to remove the effects of the diurnal circle (Rasuly, 1996). The precipitation-temperature pairs were then extended to form spatial fields. Potential spatial fields consist of all neighboring grid points with precipitation at the same time within a radius of 50 km. The spatial fields are then screened to ensure that the maximum precipitation occurs at the center and that they have at least four neighboring stations with rainfall. Spatial fields that did not meet these criteria were not used in the analyses. Therefore, by definition, the maximum precipitation occurs at the center of the spatial field.
Extreme events were then identified through exponential quantile regression. Only the spatial fields with maximum precipitation greater than the 90th percentile were used to represent the spatial distribution of moisture. Quantile regression was used to extract the extreme events instead of binning because it is unbiased with sample size and it does not make any assumption on bin size or subjectively exclude data that fall into the bins with small sample size (Morrison & Spencer, 2014; Wasko & Sharma, 2014). The 90th percentile was chosen here to extract the extreme events so that the results are comparable with those presented in Wasko, Sharma, and Westra (2016). We also attempted to use the 95th percentile as the threshold to separate the extreme and nonextreme events. However, there are not enough points above the 95th percentile line to fit a robust exponential curve at some locations.
- Peak precipitation (PP), which is the maximum precipitation at the original central gauge location of the spatial field.
- Total precipitation (PT), estimated as the two-dimensional integration of the precipitation plotted against the distance from the storm center.
- Effective radius (RE), defined as the centroid of the storm where pij is the ith precipitation observation within the spatial field j at a distance of dij from the center.

- Coefficient of variation (CV), defined as the ratio of the standard deviation of the precipitation in the spatial field to the mean.
- Fraction of zero precipitation observations (PZ) within the spatial field.
Consistent with Wasko, Sharma, and Westra (2016) an exponential regression was fitted for the first four statistics against temperature and a linear regression was used to examine the relationship between the PZ and temperature. As the statistics described above were estimated for the spatial field rather than a single point, the relationship between those statistics and temperature was also calculated for the spatial field.
3 Results
To examine the quality of the scaling estimation, the fitted exponential regression of the selected statistic scaling for 1-hr storms using the precipitation simulated by the CRM2 is shown in Figure S1 in the supporting information for a randomly selected location. Increases in the PP, the PT, and the CV were found, whereas decreases were found in the RE. The scaling relationships are reasonably consistent across the range of temperatures modeled during the study period.
Figure 2 presents the scaling of PP with temperature for 1-hr storms. The three ensemble members of CPM10 have similar positive scaling of PP to the observations. However, CRM2 underestimated the positive scaling and even simulated negative scaling along the coastal fringe, although these estimates are not statistically significant at the 5% level. Positive scaling of PP was also found for 3-hr storms with significant negative scaling estimated using the CRM2 simulations (Figure S2). Despite the improved skill in resolving the mechanisms that lead to localized short duration extreme rainfall, the biases in the extreme precipitation simulated by the model (Jakob et al., 2015) mean that the scaling relationships are not as well represented as in the convection-parameterization model. Similar results were found for 1-hr PT (Figure S3) such that CRM2 showed negative scaling while positive scaling was found in the observations. While all three ensemble members of CPM10 captured the positive scaling of the PT, CPM10-R1 and CPM10-R3 slightly overestimated the magnitude of the scaling at a few sites. A better match between CRM2 and observations of PT scaling was found for the 3-hr events, including for both negative and positive scaling locations, while for this duration none of the CPM10 simulations indicated negative scaling. The divergence in the scaling of PP between observation and the CRM2 is likely to be caused by the unrealistic simulation of 1-hr maximum precipitation burst exceeding the 90th percentile quantile regression curve as shown in Figure S5. The oversimulation of maximum precipitation has been noted before for this model (Evans & Argüeso, 2015). Unlike other statistics examined here, the PP scaling is more sensitive to the values at a single point location. Therefore, it is not surprising to see some divergence in the PP scaling between observations and the CRM2. As the CRM2 was double nested inside the 50- and 10-km WRF simulations, this finding suggests that it is more difficult to maintain the consistency between temperature and extreme precipitation at a finer scale.

Moving to the spatial distribution of extreme rainfall events (Figure 3) both the CRM2 and CPM10-R2 confirm the negative scaling of storm RE found by Wasko, Sharma, and Westra (2016). The magnitude of the scaling in the models is slightly smaller than in the observations for both the 1- and 3-hr storms (Figure S6). The two CPM10 models using the Kain-Fritsch scheme had markedly different RE scaling compared to the observations and the other two models. It is known that the Kain-Fritsch scheme tends to leave unrealistically deep saturated layers in postconvective sounding, thus activating the microphysics scheme to simulate too much stratiform precipitation (Fiori et al., 2014) and possibly resulting in increases in the storm size with higher temperatures.

The positive scaling of peak and PT and the negative scaling of RE estimated from observations suggest that extreme storms are likely to become more concentrated at the storm centers at warmer temperatures. This behavior was also reflected in the CPM10-R2 and CRM2 simulations for 1-hr duration storms, consistent with the conclusions of Wasko, Sharma, and Westra (2016). Two other statistics of storm spatial scaling were calculated, namely, the CV and the PZ. Observations show more variability within a storm (higher CV) with increasing temperatures (Figure S7) and more zero precipitation locations (Figure 4).

Increasing variability in the storm rainfall totals was simulated by all models (Figure S7), although CRM2, CPM10-R1, and CPM10-R3 simulated nonsignificant negative scaling for the few sites along the east coast. The observed positive scaling of the PZ was not seen in any of the model simulations (Figure 4), which is not surprising due to the widely reported drizzling effect in RCM simulations (Argüeso et al., 2013; Olsson et al., 2015; Teutschbein & Seibert, 2013). The results for 3-hr storms (Figures S8 and S9) are similar to those estimated for 1-hr storms.
Figure 5 summarizes the scaling for all statistics estimated from the observations, the CRM2, and the three ensemble members of the CPM10 for 1-hr storms. For all statistics, the model estimates of scaling are damped compared to the observations, with this effect much stronger in the convection resolving RCM. Figure 5 also highlights that the direction of scaling estimated by the models is generally consistent with the observations, with the exception of the RE. Different cumulus convection schemes lead to quite different conclusions on the relationship of storm area with temperature. Based on the method for defining storm size used here the Betts-Miller-Janjic cumulus convection scheme is able to simulate the observed spatial redistribution of moisture within a storm at higher temperatures as can the CRM2. Considering all the five sets of statistics, the CPM10-R2 best captures the observed changes in spatial organization of extreme storms as temperature increases.

Due to the scarcity of subdaily precipitation observations, the longest record available at each of the 53 rain gauges was used. This leads to varying temporal coverage for observational analysis at different locations. In contrast, a fixed time period (i.e., 1990–2009) was chosen for the analysis using model simulations. A sensitivity test was performed using the same analysis periods for a small subset of the stations, and the conclusions were consistent with those presented above.
4 Discussion and Conclusions
This study was motivated by conflicting findings on the influence of temperature on storm sizes (Lochbihler et al., 2017; Wasko, Sharma, & Westra, 2016). RCM simulations can provide further lines of evidence on the relationship between storm events and increasing temperatures. Two of the RCM simulations confirm the results of Wasko, Sharma, and Westra (2016) with the RE of storms decreasing with increasing temperature. The RCM simulations using the Kain-Fritsch scheme lead to the opposite conclusion; however, the known weaknesses in this scheme with respect to the proportion of stratiform rainfall lead to less confidence in this finding.
The studies of changing spatial storm characteristics have used very different methods for characterizing the size, shape, and properties of storm events as well as being implemented in parts of the world with quite different climates. For example, Wasko, Sharma, and Westra (2016) used gauge data for Australia, Lochbihler et al. (2017) used radar data for Netherlands, while other studies used models of varying resolution for North America (Chang et al., 2016; Guinard et al., 2015). For consistency with Wasko, Sharma, and Westra (2016), in this study, precipitation and temperature data were extracted from the RCM simulations only at the locations where rain gauges are available. The analysis also assumes a Eulerian frame of reference and does not account for storm advection. This is necessary due to the limited number of pluviograph stations within the study area. Using a Lagrangian frame of reference which tracks the storm cell may or may not produce different scaling results (Chang et al., 2016; Lochbihler et al., 2017). Using a storm tracking algorithm may enable a more extensive use of the RCM simulations across the full model domain and enable relaxing of the implicit assumption of circular storms, but then a direct evaluation of the RCM with observations would not have been possible. Any rainfall that occurred concurrently within a 50-km radius is considered as part of the same storm event. This distance was chosen to ensure that a sufficient number of neighboring gauge stations are included in the analysis. The choice of radius is a trade-off between missing some parts of large storms and enclosing several different local storms that occurred at the same time. However, because the analyses were restricted to locations with a large number of spatial fields (i.e., a minimum of 100 spatial fields) the results are considered representative of the overall characteristics of a majority of extreme storms.
In addition to the choice of radius, the underlying distribution of the rain gauge network also affects the estimation of extreme spatial field statistics to some extent. As shown in Figure 1, the spatial distribution of the rain gauges is not uniform with low station densities in the north west of the study area due to the topography, vegetation, and low population in this area. As this study used a subset of RCM outputs at the rain gauge locations, the spatial field membership is the same between the RCM and the station-based analysis, and the comparison of results is fair. However, future work is recommended to use the full RCM grid to allow the spatial fields to be defined consistently across the study domain and not be affected by the rain gauge density.
The use of climate model simulations to assess scaling introduces two other issues that may affect the conclusions. The rainfall estimates from each model grid are assumed to be representative of the point extreme value measured at a rain gauge of the same location. It is known that area-averaged extreme precipitation is often lower than the point value since it is unlikely that the extreme precipitation occurs concurrently at multiple points within a defined area (Li et al., 2015). This issue is more important for the CPM10 simulations which have grid cell areas of approximately 100 km2. Li et al. (2015) show for the same 2-km resolution CRM that the grid resolution is small enough to approximate the point rainfall. The scale mismatch between grid simulations and point observations may lead to biased estimates of precipitation-temperature relation. However, similar analysis performed on remotely sensed rainfall demonstrates that the scaling relations do hold on gridded products (Wasko, Parinussa, & Sharma, 2016), providing confidence in using climate model simulations to further explore scaling.
The second issue with the use of climate models is the bias in the simulation of precipitation. Here the simulations have not been bias corrected, assuming that the biases in the model are constant across all temperatures. Li et al. (2016) assessed biases in different storm types in these RCMs for daily precipitation. No significant changes in the bias and proportion of storms in the future were found; however, further work is required to extend this analysis to the subdaily simulations considered here. This is particularly important because the mechanisms explaining departures from the thermodynamic scaling rates (CC) include changes to storm dynamics (Lenderink & van Meijgaard, 2010) and mixing of storm event type (Berg & Haerter, 2013; Molnar et al., 2015). Therefore, if there are biases in different storm types in the climate model simulations and the storm types themselves vary with temperature, the climate model biases may influence the estimated scaling rates. However, the reasonable matches between the precipitation intensity scaling with observations found here suggest that this may not be a significant issue in these models.
Another concern is the effect of urban land use on the relationships between temperature and extreme precipitation. While the models adopted the land use data derived from satellite based global data sets, representing the early 1990s, the actual urban area has increased from 1990 to 2009. However, since all the stations used in the analysis have had limited changes in their surrounding urbanization, the urban expansion is expected to have negligible effect on the analysis used here.
The results here suggest that as temperatures increase, short duration storm events will become more intense over smaller areas, which has important implications for hydrologic engineering design and flood management. Because this study used two different convective parameterizations and an RCM run at a resolution that allows for convection to be fully resolved, it highlights better modeling methodologies to represent observed extreme precipitation scaling relationships. Modeling studies have indicated that future precipitation scaling rates may exceed those found in the historical record (Bao et al., 2017). Future work should consider whether this also applies to the scaling of storm spatial properties found here. The results show that both the convection resolving model and convection parameterized model are capable of simulating the intensification of PP and PT for 1-hr storms as temperature increases, which indicates that the thermodynamic mechanism of increasing moisture capacity causing increased precipitation is properly modeled by both RCMs at hourly time scale. The positive scaling of peak and PT and the negative scaling of RE reflect that at higher temperatures moisture is likely to redistribute from the storm boundaries to the storm center. This pattern was best captured by the convection-parameterized model that uses the Betts-Miller-Janjic cumulus convection scheme and the CRM for 1-hr storms, although the latter slightly underestimated the increased heterogeneity of the precipitation at higher temperatures. In addition, the CRM simulated weakened PP intensity for 3-hr storms at higher temperatures, which is contradictory to the observations. In this case, the CRM did not provide a better simulation of the relationship between temperature and extreme precipitation, which suggest that a better representation of convection does not always translate to a better simulation of precipitation. Precipitation simulation is also affected by other model parameterizations such as the cloud microphysics and planetary boundary layer. Sensitivities related to the cloud microphysics and the cloud-radiative feedback are considerably larger than those related to mesh refinement at kilometer scales (Prein et al., 2015). The scaling results presented here provide one way of evaluating the performance of RCMs, and we find that somewhat counterintuitively the CRM used here does not always outperform its convection-parameterized counterpart for this statistic.
Acknowledgments
Regional climate data have been provided by the New South Wales and Australian Capital Territory Regional Climate Model (NARCliM) project funded by the New South Wales Governmental Office of Environment and Heritage, University of New South Wales Climate Change Research Centre (CCRC), Australian Capital Territory Government Environment and Sustainable Development Directorate, and other project partners. The RCM data are available at http://climatechange.environment.nsw.gov.au/Climate-projections-for-NSW/Download-datasets. The rainfall data can be obtained from http://www.bom.gov.au/climate/data/stations/. This work was made possible by funding from the Australian Research Council as part of DP120100338 and FT110100576. This work was supported by an award under the Merit Allocation Scheme on the NCI National Facility at the ANU. F. Johnson and J. Li were supported through ARC Discovery Project DP150100411. C. Wasko acknowledges the support of a McKenzie Postdoctoral Fellowship provided by the University of Melbourne. This research/project was undertaken with the assistance of resources and services from the National Computational Infrastructure (NCI), which is supported by the Australian Government.