Gaussian Process Regression-Based Bayesian Optimisation (G-BO) of Model Parameters—A WRF Model Case Study of Southeast Australia Heat Extremes
Abstract
In Numerical Weather Prediction (NWP) models, such as the Weather Research and Forecasting (WRF) model, parameter uncertainty in physics parameterization schemes significantly impacts model output. Our study adopts a Bayesian probabilistic approach, building on prior research that identified temperature (T) and relative humidity (Rh) as sensitive to three key WRF parameters during southeast Australia's extreme heat events. Using Gaussian process regression-based Bayesian Optimisation (G-BO), we accurately estimated the optimal distributions for these parameters. Results show that the default values are outside their corresponding optimal distribution bounds for two of the three parameters, suggesting the need to reconsider these default values. Additionally, the robustness of the optimal parameter distributions is validated by their application to an independent extreme heat event, not included in the optimisation process. In this test, the optimized parameters substantially improved the simulation of T and Rh, highlighting their effectiveness in enhancing simulation accuracy during extreme heat conditions.
Key Points
-
Our study optimizes WRF model parameters for Southeast Australia heat extremes, enhancing the accuracy of the model simulation
-
G-BO method finds optimal parameter ranges, substantially improving the simulation of temperature and humidity
-
Results suggest updating WRF model's default settings for better extreme heat event simulations
Plain Language Summary
This study aims to enhance the accuracy of a numerical weather model called the Weather Research and Forecasting (WRF) model, especially for simulating extreme heat events in Southeast Australia. Typically, the accuracy of such models depends on specific settings, which are often set to default values. Our research used a method known as Gaussian process regression-based Bayesian Optimisation (G-BO) to determine the best range of values for these settings. We found that the default settings were not optimal. By applying G-BO, we identified more effective values that substantially improved the model's simulations of temperature and humidity during heat extremes. This improvement was consistent even when tested on an independent extreme heat event. These advancements are vital for more accurate weather forecasting, which is essential for emergency services, electricity management, and agriculture planning during extreme heat conditions.
1 Introduction
Recent studies highlight a significant increase in extreme weather globally, including intensified heatwaves that impact human and natural systems, especially in Southeast Australia (Masson-Delmotte et al., 2021; Perkins-Kirkpatrick & Lewis, 2020; Reddy, Perkins-Kirkpatrick, & Sharples, 2021). Accurate heatwave simulations using Numerical Weather Prediction (NWP) models are essential in this context. The success of these models depends on their initial conditions and the representation of atmospheric processes, despite computational limitations (Bjerknes, 1910). Parameterization in NWP models is a technique used to represent complex atmospheric processes that are too small-scale or intricate to directly resolve by the model. It involves simplifying these processes into manageable mathematical forms, often employing empirical or theoretical relationships. For example, processes like cloud formation and convection are represented through parameterization schemes, which use a set of tuneable parameters. These parameters, often constants or exponents in model equations, are critical for the accuracy of simulations (Di et al., 2015; Yang et al., 2012). The Weather Research and Forecasting (WRF) model, noted for its adaptability and high-resolution capabilities (Evans et al., 2014; Skamarock et al., 2021), is widely used in Southeast Australia for forecasting and simulating extreme events. While the sensitivity of various physics parameterization schemes in these simulations has been explored (Evans et al., 2012; Ji et al., 2022), the specific influence of parameter values within these schemes is an area of active research, with the potential to further refine and improve model simulations.
Parameter optimisation is a process in which the model parameters are tuned to match the simulated output variables with respective observations. One of the primary challenges with optimisation is the exponential increase in complexity with an increase in the number of tuneable parameters, resulting in a “curse of dimensionality” (Duan et al., 2006, 2017). Another challenge is the number of output variables considered in the optimisation's objective function. These complexities make the optimisation process computationally demanding by making observational constraints inconsistent, by causing the parameters to be correlated and making the parameters poorly constrained (Matear, 1995). Therefore, several studies (Baki et al., 2022a; Chinta et al., 2021; Di et al., 2015, 2017; Ji et al., 2018; Quan et al., 2016; Yang et al., 2012) first performed a sensitivity analysis to identify the sensitive parameters that influence the output variables of interest. This helps reduce the number of parameters optimize, thereby reducing the computational costs.
Several studies (Baki et al., 2022b; Chinta & Balaji, 2020; Di et al., 2018) performed optimisation of WRF model parameters either for a single variable (single objective) using adaptive surrogate model-based optimisation (ASMO) (Wang et al., 2014) or for multiple variables using Multi-Objective ASMO (Gong et al., 2016) and knee point-based multi-objective optimisation (KMO) (Wang et al., 2023) algorithms. The main goal of these studies was to identify a single optimum value for each parameter that minimizes the simulation error with respect to observations. However, this approach often overlooks the natural predictive uncertainties and erroneously presumes that a unique, ideal set of parameter values is always applicable for all scenarios (Hoversten et al., 2006). It's important to recognize that a single optimal parameter value might not always be attainable but even when it is, the uncertainties involved could be substantial. Moreover, while approaches like Pareto front analysis in multi-objective optimisation reveal multiple near-optimal solutions, they too have limitations. Specifically, Pareto optimality focuses on finding a balance among competing objectives, which might not effectively capture the underlying uncertainties or the complexity of the parameter space (Gupta et al., 1998; Van Straten & Keesman, 1991). In this context, Bayesian optimisation offers a significant advantage. It provides a probabilistic framework that accounts for uncertainties and explores the parameter space more comprehensively, offering a range of solutions with quantified uncertainties (Beven & Binley, 1992). This approach not only acknowledges the complexity inherent in such models but also adapts more fluidly to varying scenarios, making it a more robust and flexible method for parameter optimisation.
Bayesian optimisation employs probabilistic methods to account for parameter uncertainties in models (Chinta et al., 2023; Gong & Duan, 2017; Issan et al., 2023; Reiker et al., 2021; Xu et al., 2022). This approach represents input parameters as probability distributions from which multiple samples are drawn. These samples facilitate ensemble simulations, allowing the model to generate a range of predictions. The model's outputs are then compared with actual observations using an objective function, refining the parameter distributions into more accurate posterior distributions. Subsequently, simulations based on these refined distributions align more closely with observed data. However, this method demands significant computational resources, as it involves numerous simulations of the WRF model. To address this, machine learning (ML) strategies, particularly ML-based surrogate models, are increasingly vital (Chinta et al., 2024; Reddy, Chinta, Matear, et al., 2024; Wang et al., 2020). Once trained on a subset of existing simulations to understand the complex relationships between input parameters and outputs, surrogate models efficiently help explore the parameter space for Bayesian optimisation.
This study aims to optimize the WRF model parameters that influence different output variables corresponding to heat extremes using Bayesian optimisation. We do this by focusing on Southeast Australia during two extreme heat events. This study is organized as follows: Section 2 describes the data, events selected, WRF model configuration, introduces how surrogate models were developed, and presents the methodology of Bayesian optimisation. Section 3 presents the results and a detailed discussion of the optimized parameters. Section 4 summarizes the conclusions from this study.
2 Methods
2.1 WRF Model Configuration and Selected Extreme Heat Events
In the present study, the WRF model v4.4 (Skamarock et al., 2021) is adopted for the numerical simulations. The simulation domain is configured with a single domain (d01) across southeast Australia, as shown in Figure S1 in Supporting Information S1. The domain consists of 206 × 181 grid points in the horizontal direction, with a horizontal resolution of 12 km and 40 terrain-following σ vertical levels reaching up to the 50 hPa atmospheric level. The simulations are integrated with a time step of 72 s. For initial and lateral boundary conditions, the European Center for Medium-Range Weather Forecast Reanalysis fifth generation data set (ERA5) (Hersbach et al., 2020) at a horizontal resolution of 0.25° and a six-hourly interval is employed. The WRF model output variables, namely temperature at 2 m height (T) and relative humidity at 2 m height (Rh), are obtained at hourly intervals. This work extends our previous study (Reddy, Chinta, Matear, et al., 2024), where only three model parameters were identified to influence meteorological variables significantly during extreme heat events over southeast Australia. Consistent with our previous work, we adopt the same physics schemes as described in Table 1 of (Reddy, Chinta, Matear, et al., 2024). The description of three sensitive parameters and the respective physics schemes is presented in the supplementary Table S1 in Supporting Information S1.
The present study selected two southeast Australian extreme heat events like the previous study (Reddy, Chinta, Matear, et al., 2024) for the parameter optimisation. The first event spans 13 days, encompassing the heatwave period from January 26th, 12 UTC to February 8th, 12 UTC of 2009. The second extreme heat event simulation covers 15 days from December 16th, 12 UTC to December 31st, 12 UTC of 2019. Further, to assess the robustness of the optimized parameters, we consider an additional extreme heat event of 2013 covering the heatwave from 01st Jan 12 UTC to 18th Jan 12 UTC over southeast Australia. More details about the selection of events are presented in the supplementary information. For all the selected events, a 36 hr model spin-up is considered. The simulation results are compared against hourly data from the Bureau of Meteorology Atmospheric high-resolution Regional Reanalysis for Australia (BARRA2; (Su et al., 2022)) at 12 km horizontal resolution.
2.2 Gaussian Process Regression-Based Bayesian Optimisation (G-BO) Using Markov Chain Monte Carlo Sampling
We then train a surrogate model based on the generated parameter sample WRF simulations. Following the previous studies, we considered the Gaussian Process Regression (GPR; (Williams & Rasmussen, 1995, 2006)) as a surrogate model training with parameter samples as input and nMAE as target. We built the GPR model with a composite kernel that consists of three components: a constant kernel, a Radial Basis Function (RBF) kernel with a unit length scale, and a white noise kernel with a small noise value of 1−10, which refers to prediction with noisy responses (regression). The GPR model provides the mean predictions and the variance; in this study, we only considered the mean predicted values. The accuracy of the trained GPR model is evaluated through K-fold cross-validation (here, K = 8), and the dependence on sample size is illustrated in Figure S2 in Supporting Information S1. The results indicate that the 128 samples are adequate for GPR training in achieving good accuracy with a goodness of fit (R2) value of 0.99. Subsequently, the trained GPR is used to estimate the objective function (nMAE) in performing the optimisation of model parameters.
MCMC sampling systematically draws a representative set of samples from the target posterior distribution by constructing the Markov Chain. Here, the drawn sample from the probability distribution depends on the previously drawn sample. As the number of samples increases, the chain converges to the desired target posterior distribution (Roberts & Rosenthal, 2004). There are many MCMC algorithms, each considering different ways of constructing the Markov Chain while sampling, such as Gibbs sampling, Metropolis-Hastings algorithm (Hastings, 1970; Metropolis et al., 1953), and Affine invariant ensemble sampling. Out of these sampling algorithms, previous studies recommended the Affine invariant ensemble sampling because it reaches faster convergence by considering the ensemble of chains in parallel, invariant to the affine transformations of parameters, enabling easy sampling from anisotropic probability distributions and has only two hyperparameters (one is number of walkers (i.e., ensemble of chains) and the other is stretch move (updates the next step of a given walker)) (Goodman & Weare, 2010; Issan et al., 2023).
In this study, we implemented the Affine invariant ensemble sampling using the “emcee” Python package by choosing 50 walkers and stretch move as two (for more detailed description, refer to Mackey et al. (2013)). The MCMC sampling is sensitive to the initial point, where a low probable initial condition could be considered, which might not be representative of the target posterior distribution. Hence, the few initial samples were disregarded until the chain reached the stationary distribution, which is referred to as burn-in. In this study, an initial 1,000 steps were chosen as burn-in, after which the chains converge (Figure S3 in Supporting Information S1). Following the Mackey et al. (2013), we run the chains to 3,000 steps (i.e., around 50 times the integrated autocorrelation time (which is around 50)) to ensure the convergence of chains to the target distribution (Figure S3 in Supporting Information S1). More information about the autocorrelation times can be found in Goodman and Weare (2010) and Mackey et al. (2013).
3 Results and Discussion
3.1 Gaussian Process Regression Based-Bayesian Optimisation (G-BO) of Parameters for Improving Temperature and Relative Humidity
G-BO results of the three sensitive parameters in calibrating the hourly temperature (T) and relative humidity (Rh) individually and the T and Rh combined are presented in Figure 1. The most probable optimal values of the scattering tuning parameter (P14) are toward the lower end of the parameter range for all the three cases of optimisation (only T (Figure 1a), only Rh (Figure 1d), and T and Rh combined (i.e., T and Rh are given equal weights in constructing a combined single objective function for optimisation) (Figure 1g)). Here, the default value of the P14 is outside the 2σ interval of the optimal posterior, which clearly suggests that the default value is not the best for providing accurate information of T and Rh during extreme heat events over southeast Australia. Multiplier for the saturated soil water content parameter (P17) posterior distribution resembles a Gaussian for all three optimisation cases (T (Figure 1b), Rh (Figure 1e), and both T and Rh (Figure 1h)), with a mean value around 0.42, 0.37, and 0.40 (normalized values) when optimized for only T, only Rh, and both T and Rh, respectively. Here, the default P17 value is outside the 2σ of posterior for T; however, it is within the 2σ interval when optimized for only Rh and on the lower end of the posterior with less probability when optimized T and Rh combined. This suggests that the default value of P17 is less likely to accurately simulate the T and Rh during heat extremes in southeast Australia. Similar to P14, profile shape exponent for calculating the momentum diffusivity coefficient parameter (P22) posterior has high densities toward the lower bound of the parameter range when optimized for T (Figure 1c), Rh (Figure 1f) individually, and T and Rh combined (Figure 1i). The default value of P22 is not in the 2σ interval of optimal posterior, suggesting the default value should be reconsidered for this parameter.
The mean (and 2σ confidence interval) nMAE of G-BO posterior distribution of optimized parameter combinations for T and Rh combined case is 0.867 (0.863, 0.874) for T and 0.928 (0.924, 0.934) for Rh. Further, we compare the MAE spatial patterns of T and Rh between the default and optimized parameter distribution combinations, as shown in Figure 2. The MAE of default or optimized parameter combination is calculated with respect to BARRA2 data. We randomly sample 10 parameter sets from the G-BO posterior distribution (T and Rh combined) as the representative sample of optimal parameter values (see Table S2 in Supporting Information S1). These 10 parameter samples from the optimal distributions improved the MAE of the simulated T and Rh during all days of both events (2009 and 2019) by 11.2%–12% and 5.4%–6.8%, respectively. Further, MAE of the ensemble mean of 10 runs computed with respect to BARRA2 data is shown for the spatial comparison. Figures 2a and 2d show the average T and Rh, respectively, during all days of both events (2009 and 2019) using the BARRA2 data. The optimized ensemble mean reduced the MAE of T mostly across the domain compared to the default simulation (compare Figure 2b vs. Figure 2c). Particularly, the substantial reductions were seen in the regions of high average temperatures (greater than 33°C, see in Figure 2a) and in the northeast parts of the domain (Figure 2c). Similar to T, optimized ensemble mean improved the simulation of Rh compared to the default over the regions of the low average Rh that is, central parts of the domain and across the regions of average high Rh that is, northeast coast of the domain (compare Figure 2e vs. Figure 2f). Previous studies have also shown that the default parameter set has a substantial (cold) temperature and (wet) precipitation bias over southeast Australia, broadly consistent with the current results (Di Virgilio et al., 2019; Ji et al., 2022; Kala et al., 2015). The optimized ensemble mean improves the prediction of T and Rh mostly across the domain, particularly over the east coast and the northeast parts of the domain where the substantial biases observed in the default simulation. Further, for a more thorough evaluation of optimisation results, following Wang et al. (2023), we analyzed the frequency distribution of bias improvements between the default and optimal simulations of T and Rh. Results show that 59.1%–59.8% of the total land grids show a reduction in T error, and 57.2%–58.8% of the land grids also exhibit a decrease in Rh error.
Daily maximum temperature (Tmax) and daily minimum relative humidity (Rhmin) are the critical meteorological variables considered for identifying and quantifying the heat extremes, particularly the dry heat, over southeast Australia (Abram et al., 2021; Reddy, Sharples, et al., 2021). Hence, we compare the spatial patterns of the Tmax and Rhmin mean during all days of the two selected events (Figures 3a–3c and 3f-3h) and only on the extremely hot days of each event (2009 (Figure S4 in Supporting Information S1) and 2019 (Figure S5 in Supporting Information S1)). The optimized ensemble mean simulation improved the realism of the mean Tmax by around 2.5°C (compare Figure 3b vs. Figure 3c) and Rhmin (compare Figure 3g vs. Figure 3h) by approximately 0.1 over the domain compared to the default parameter values. Further, we compare the domain area-average time series of daily maximum temperature (Tmax) during all days of both the events (2009 and 2019) between BARRA2, default, and optimized ensembles (Figure 3d). Results show that the optimized ensemble clearly improved the accuracy of Tmax across all days of the two selected events compared to the default (Figure 3d). Further, the ensemble spread of domain average bias of Tmax and Rhmin across all the days of both events (2009 and 2019) is compared with the default domain average bias (Figures 3e and 3j). This shows that all the 10 optimized ensemble members clearly improved the cold Tmax and wet Rhmin bias compared to the default. Around 71.4%–74.5% and 65.4%–66.9% of total land grids observe an improvement in the Tmax and Rhmin, respectively, with the optimal parameter simulations compared to the default run. Furthermore, the optimized ensemble accurately simulated the hot region (region of Tmax greater than 44°C) on the extremely hot day of both the selected events (2009 (compare Figure S4b vs. Figure S4c in Supporting Information S1) and 2019 (compare Figure S5b vs. Figure S5c in Supporting Information S1) compared to the default. This suggests that the optimized ensemble simulations better capture the extremeness of the extreme heat events over southeast Australia compared to the default. The accurate information on extremeness of the extreme heat events is critical for planning emergency services, electricity demand management, and cattle safety and crop management (Asseng et al., 2011; Lindstrom et al., 2013; Loridan et al., 2016).
The G-BO optimized parameter distributions are further tested on an independent extreme heat event (2013 event) not used in the optimisation. Similar to the 2009 and 2019 events, the 2013 event optimized ensemble mean improves the simulation of T and Rh mostly across the domain, particularly over the east coast and the northeast regions of the domain, compared to the default parameters (Figures S6 and S7 in Supporting Information S1). Further, the Tmax and Rhmin results of the 2013 event show that the optimized ensemble improves the simulation compared to the default, which is consistent with the two optimized events (Figure S7 in Supporting Information S1). This further testing help demonstrates the robustness of the G-BO results. Additional experiments with respect to spatial resolution were performed at a 6 km horizontal grid. The optimal distribution range of the three key parameters improved the T and Rh of extreme heat events over southeast Australia on a 6 km grid, like the 12 km spatial resolution simulations (for more details, refer to supporting Text S2 in Supporting Information S1 and see Figures S8 and S9 in Supporting Information S1). This suggests the spatial resolution independence of the identified optimal parameter distributions. Overall, this study's G-BO methodology improved the simulation accuracy of T and Rh during heat extremes, specifically bettered the extremeness of the extreme heat information.
3.2 Physical Understanding of Optimal Distribution of Parameters
The scattering tuning parameter (P14) optimal posterior is toward the lower bound of its range with a maximum likelihood value around 0.5 × 10−5 to 0.6 × 10−5 m2 kg−1, which is lower than the default value (1 × 10−5 m2 kg−1). The lower the P14, the weaker the scattering, leading to more incoming solar radiation, which increases the surface heating and can amplify the daytime surface temperature (Dudhia, 1989; Montornès Torrecillas et al., 2015). This supports our result of lower P14 compared to the default as the optimal, which improves the temperature cold bias of default parameter simulation (Figure 3b vs. Figure 3c). This is more specifically seen in the Tmax because it is much affected by the P14 (Figure 3b vs. Figure 3c) (Reddy, Chinta, Matear, et al., 2024). Low P14 values favor low Rhmin; our results agree with this and show that the positive Rhmin bias in default simulation is improved in the optimized run with the P14 value lower than the default one (Figure 3g vs. Figure 3h).
The multiplier for saturated soil water content or soil porosity P17 in the land surface scheme is optimized to the posterior, with the maximum likelihood value ranging between 1.03 and 1.18, which is slightly higher than the default value (one). Our G-BO results show a higher Tmax with the calibrated parameter set compared to the default parameter simulation. Consistent with the results, previous studies suggest that low P17 favors a decrease in surface temperature, particularly during the daytime (Fonseca et al., 2019; Reddy, Chinta, Matear, et al., 2024; Temimi et al., 2020). Next is parameter P22, which is the profile shape exponent in the momentum diffusivity coefficient of the planetary boundary layer scheme. P22 optimized posterior maximum probability value is around 1.0 to 1.14, which is lower than the default value (two). Previous studies suggest that the low P22 weakens the turbulent mixing below the maximum height of momentum diffusivity, which may moderate the convective mass flux, leading to a lower Rh (Hong et al., 2006; Oke, 2002).
Our study has focused on the critical task of quantifying parameter uncertainty and optimizing parameter values relative to observations, which is fundamental for enhancing model reliability. It is important to recognize, however, that there are additional sources of uncertainty that also affect model accuracy. These include uncertainties in initial and boundary conditions, the accuracy of observational data, and inherent limitations within the model structure. While our results provide valuable insights for parameter optimisation, future studies could further improve model simulations by exploring these additional sources of uncertainty, thereby offering a more holistic approach to model accuracy and reliability. Our study has focused on improving the accuracy of T and Rh during extreme heat events over southeast Australia by optimizing the WRF model parameters. In particular, our results show significant improvements for extreme heat events, however, further research and optimisation would be necessary to determine the applicability and effectiveness of these parameters under different conditions.
4 Conclusions
We used the G-BO methodology to estimate the optimal distribution of the three WRF model parameters previously identified as the most important to simulate extreme heat conditions in Southeast Australia (Reddy, Chinta, Matear, et al., 2024). The parameters, scattering tuning parameter (P14), the multiplier for saturated soil water content (P17), and the profile shape exponent for calculating the momentum diffusivity coefficient (P22), have produced the greatest sensitivity to the simulated hourly temperature (T) and relative humidity (Rh) during the two considered southeast Australian extreme heat events (2009 and 2019). Unlike the previous studies, which focus on identifying single optimum parameter values, our methodology provides optimal parameter distributions, which allows us to quantify the parameter uncertainty and parameter correlations.
The key results from the parameter optimisation are: (a) for two of the three parameters optimized, the default WRF parameter values lie outside the optimal range suggesting the need to reconsidering the parameter values for simulating heat extremes in this region. (b) Randomly drawing 10 parameter samples from the optimal distributions improved the MAE of the simulated T and Rh during all days of both events (2009 and 2019) by 11.2%–12% and 5.4%–6.8%, respectively. (c) The mean spatial pattern of 10 optimal parameter simulations improves the default simulation negative bias of T and positive bias of Rh mostly across the domain. Most importantly optimal parameter sample substantially improved critical variables of the dry heat, such as daily maximum temperature (Tmax) and daily minimum relative humidity (Rhmin), compared to the default parameters over the domain. The changes in the optimized parameters from the default values are physically plausible and explainable from a physical parameterization perspective. An investigation of the optimal parameter distributions shows no correlations between the optimal parameters (Figure S10 in Supporting Information S1). Further, Reddy, Chinta, Matear, et al. (2024) showed the sensitivities of the three optimized parameters are mostly first order, which indicates the negligible interaction effects of parameters in influencing the T and Rh. A small spread in the ensemble from the optimized model indicates constrained parameter uncertainty (Figures 3d, 3e and 3i-3j). However, the discrepancies between the ensemble predictions and the observed data suggest that additional uncertainties from other sources are present within our model.
Further, to demonstrate the robustness of the optimal parameter distributions we use them to simulate a heat extreme event in 2013 in southeast Australia. The optimal parameter simulations improved the representation of the Tmax and Rhmin, over the default parameter values. Overall, G-BO methodology improved the simulation of T and Rh during heat extremes, specifically bettered the extremeness of the extreme heat information, which have significant implications for emergency services management and cattle and crop productivity. Apart from G-BO, there are other approaches that utilize GPR and BO, such as the Adaptive Surrogate Modeling-Based Sampling Strategy for Parameter Optimisation and Distribution Estimation (ASMO-PODE) (Gong & Duan, 2017). Unlike G-BO, ASMO-PODE starts with a small set of initial samples and uses a resampling strategy to iteratively perform the optimisation. Future studies could focus on comparing these methods and assess the computational cost to obtain an acceptable solution. The present study results may quite not be applicable to wet extremes, which needs to be further explored and is clearly outside the scope of this study. Future studies can apply the present study's methodology to other extreme events such as extreme rainfall and tropical cyclones, to name a few. Further, this study's approach can be applicable to other dynamical models in the atmospheric, oceanic, and biological sciences, to name a few.
Acknowledgments
The authors would like to thank National Computing Infrastructure (NCI) Australia for providing computational resources. We would like to thank Chun-Hsu Su for providing the BARRA2 reanalysis data. We acknowledge the funding support of the CSIRO and Australian Climate Service.
Open Research
Data Availability Statement
The ERA5 data is openly available at https://doi.org/10.24381/cds.adbb2d47 (Hersbach et al., 2020). The source code of WRF v4.4 is openly available at https://github.com/wrf-model/WRF/releases/tag/v4.4 (Skamarock et al., 2021). The BARRA2 dataset is available from the NCI THREDDS data server https://dapds00.nci.org.au/thredds/catalogs/ob53/catalog.html (Su et al., 2022). The code used to perform the optimisation analysis is available at https://doi.org/10.5281/zenodo.12511781 (Reddy, Chinta, Baki, et al., 2024). All the figures are generated with Python.