Volume 54, Issue 9
Research Article
Free Access

Unsurprising Surprises: The Frequency of Record‐breaking and Overthreshold Hydrological Extremes Under Spatial and Temporal Dependence

Francesco Serinaldi

Corresponding Author

E-mail address: francesco.serinaldi@ncl.ac.uk

School of Engineering, Newcastle University, Newcastle upon Tyne, UK

Willis Research Network, London, UK

Correspondence to: F. Serinaldi,

E-mail address: francesco.serinaldi@ncl.ac.uk

Search for more papers by this author
Chris G. Kilsby

School of Engineering, Newcastle University, Newcastle upon Tyne, UK

Willis Research Network, London, UK

Search for more papers by this author
First published: 29 June 2018
Citations: 17

Abstract

Record‐breaking (RB) events are the highest or lowest values assumed by a given variable, such as temperature and precipitation, since the beginning of the observation period. Research in hydroclimatic fluctuations and their link with this kind of extreme events recently renewed the interest in RB events. However, empirical analyses of RB events usually rely on statistical techniques based on too restrictive hypotheses such as independent and identically distributed (i/id) random variables or nongeneral numerical methods. In this study, we propose some exact distributions along with accurate approximations describing the occurrence probability of RB and peak‐over‐threshold (POT) events under general spatiotemporal dependence, which enable analyses based on more appropriate assumptions. We show that (i) the Poisson binomial distribution is the exact distribution of the number of RB events under i/id, (ii) equivalent binomial distributions are accurate approximations under i/id, (iii) beta‐binomial distributions provide the exact distribution of POT occurrences under spatiotemporal dependence, and (iv) equivalent beta‐binomial distributions provide accurate approximations for the distribution of RB occurrences under spatiotemporal dependence. To perform numerical validations, we also introduce a generator of spatially and temporally correlated binary processes, called BetaBitST. As examples of application, we study RB and POT occurrences for monthly precipitation and temperature over the conterminous United States and reanalyze Mauna Loa daily temperature data. Results show that accounting for spatiotemporal dependence yields strikingly different conclusions, making the observed frequencies of RB and POT events much less surprising than expected and calling into question previous results reported in the literature.

1 Introduction

Emphatic adjectives like unprecedented, surprising, or exceptional are often used to describe extreme hydroclimatic events, such as storms, floods, and droughts (Colucci et al., 2017; Coumou & Rahmstorf, 2012; Thompson et al., 2017). However, such adjectives are usually applied in their colloquial sense, while discussions concerning physical phenomena, engineering, and decision‐making should rely on words having a more precise meaning. Terms like surprise are intrinsically relative in the sense that they are related to the expectations of the observer and imply a comparison of the event under consideration with what is known based on the past experience (Itti & Baldi, 2009; Kjeldsen & Prosdocimi, 2018; Merz et al., 2015). The involvement of the observer's mindset in the definition and recognition of surprising events reveals the psychological nature of surprise. The notion of psychological surprise is not new in water science, where it was introduced by Fiering and Kindler (1984) and Matalas (2003), who built on the principles of the theory of investment decisions proposed by Shackle (1942). According to Shackle (1949), surprise is the state of mind following the occurrence of counterexpected or unexpected events, where the former occurs in a state of so‐called imperfect knowledge, whereas the latter in a state of incomplete knowledge. Imperfect knowledge and counterexpected events refer to the case in which all possible outcomes of a decision process are known but some of them are deliberately excluded from the so‐called inner group of credible (unsurprising) options (Shackle, 1942). On the other hand, incomplete knowledge and unexpected events concern the case in which all possible outcomes are not knowledgeable at the time a decision is made (Matalas, 2003).

Imperfect knowledge can be addressed by probability theory and (frequentist) statistical tools, whereas incomplete knowledge requires a more careful treatment since unexpected events fall outside the set of imagined options, and “Thus an important surprising event will require [an individual] more or less to create afresh his structure of expectations…We shall call this process the assimilation of the event into the structure of expectations” (Shackle, 1942). This assimilation process, resulting in updated mental models, is basically a Bayesian procedure where new data observations update the prior distribution of beliefs, thus yielding posterior belief distributions accounting for the new information. In this context, the index of surprise proposed by Itti and Baldi (2009) enables the quantification of surprise of unexpected events in terms of Kullback‐Leibler divergence (Kullback & Leibler, 1951), that is, the average of the log‐odd ratio of posterior and prior beliefs. Conversely, surprise of counterexpected events can be measured by indices such as Weaver's index (Weaver, 1948) relying on the ratio of the average amount of probability we can expect to realize per trial of the experiment in question and the probability associated to the realized event. The definition of Weaver's index is genuinely frequentist and implies that the set of outcomes is exhaustive (i.e., the knowledge is imperfect but not incomplete).

Dealing with hydroclimatic variables, we can use historical observations and/or other (empirical and theoretical) sources of information to assign a given probability to every event whose magnitude exceeds a given value. Therefore, also the largest observed value of a hydroclimatic variable (i.e., the so‐called record or record event) can be exceeded (broken) with a given probability. Moreover, under some assumptions discussed later, it can be shown that the probability of observing a record‐breaking (RB) event over a future time window, such as the design life of an infrastructure, is exceedingly high, thus making such event expected. Therefore, to what extent a future RB event exceeds the magnitude of the most recent record matters more than how frequently they occurred or may occur in the future (Matalas, 1997). In this respect, Kjeldsen and Prosdocimi (2018) studied the level of surprise of RB flood events in the United Kingdom by applying an index of surprise based on the magnitude of the top ranked events (Solow & Smith, 2005). It should be noted that Solow‐Smith's index is a frequentist metric dealing with counterexpected events (imperfect knowledge) rather than truly unexpected events (incomplete knowledge). However, despite the importance of event magnitude, several studies focused on the occurrence of RB events in hydroclimatology (Anderson & Kostinski, 2010; Bassett, 1992; Benestad, 2003, 2004; Finkel & Katz, 2017; Matalas, 1997; Meehl et al., 2009; Newman et al., 2010; Rahmstorf & Coumou, 2011; Vogel et al., 2001).

In light of the above remarks, both peak‐over‐threshold (POT) and RB events are counterexpected since their probability is low compared with that of other possible outcomes. However, as these extreme values are included in the set of possible (even if rare) outcomes, they should not be considered as unexpected (in Shackle's sense). Nonetheless, the analysis of the properties of POT and RB events (e.g., the cumulative number of RB values over a time window) is often used to infer the possible change of the frequency of occurrence of such events under climate change or other forcing factors. In other words, this type of analysis is often used to support a paradigm shift (e.g., from stationary to nonstationary representation) corresponding to an update of the prior belief, when the behavior of empirical values of RB and/or POT statistics computed under a specific model (e.g., i/id) shows substantial discrepancy with values expected under that model. This updating procedure implies implicitly or explicitly an attribution of the observed discrepancies to the factors (e.g., anthropogenic factors) embedded in the alternative model whose expectations are closer the empirical values computed under the new model. However, since a rigorous attribution requires the careful assessment of multiple lines of evidence leading to the identification, within a prespecified margin of error, of unique causes and exclusion of any other plausible alternative (Hasselmann, 1997; Mitchell et al., 2001; Serinaldi et al., 2018), performing POT and RB analysis under different reasonable prior models is paramount.

A possible alternative to i/id hypothesis is the assumption that the underlying process is stationary and correlated, that is, variables are nonindependent and identically distributed (ni/id). Correlation inflates the variability of the expected values and the width of confidence intervals (CIs). This behavior is related to information redundancy. Matalas and Langbein (1962) provided a first detailed discussion on the impact of both spatial and temporal correlations on the information content of the mean of a hydrological time series. Since then several studies focused on the effect of spatiotemporal correlation on different inference problems, such as analysis of variance (Jones, 1975), estimation of distribution quantiles (Koutsoyiannis, 2003), trend hypothesis testing (Bayazit & Önöz, 2007; Douglas et al., 2000; Hamed & Rao, 1998; Hamed, 2008, 2009a; Katz, 1988a; Kulkarni & von Storch, 1995; Serinaldi & Kilsby, 2016a; Serinaldi et al., 2018; Yue & Wang, 2002, 2004), and interaction between spatial and temporal correlations in various applications (Hamed, 2009b, 2011; Katz, 1988b; Katz & Brown, 1991), just to mention a few.

The ni/id hypothesis has also been considered as an alternative to i/id assumption in the context of RB analysis, in order to distinguish systematic changes and spurious fluctuations due to spatiotemporal dependence. Vogel et al. (2001) proposed analytical expressions (based on theoretical results and Monte Carlo simulations) of the moments of the number of RB events considering the average regional value of spatial correlation (under the assumption of multivariate Gaussian distribution). Benestad, (2003, 2004) explored the effect of temporal correlation on the RB statistics by Monte Carlo simulations, using a first‐order autoregressive (AR) model, while spatial correlation was either introduced by sampling a set of time series many times (Benestad, 2003) or removed by resampling the original data set (Benestad, 2004). Newman et al. (2010) further explored the effect of autocorrelation by Monte Carlo simulations considering fractional Gaussian noise (fGn) with a power law decaying autocorrelation function (ACF) and power spectral density. Meehl et al. (2009) applied block bootstrap to preserve short‐term (3‐year) autocorrelation and spatial correlation in order to assess the uncertainty of observed and modeled statistics of RB temperatures, averaged over the conterminous United States (CONUS).

From a theoretical point of view, results mainly refer to the statistics of the standard RB process corresponding to an infinite sequence of independent identically distributed (i/id) observations (Arnold et al., 2011, p. 7), while nonstandard variations corresponding to nonindependent or nonidentically distributed variables are less developed and often involve specific dependence structures (Arnold et al., 2011, pp. 208–215) or models whose applicability is generally limited to low‐dimensional cases (Nagaraja et al., 2002). Similar lack of general theoretical results also concerns the statistics of occurrence of events exceeding a given percentage threshold (hereinafter, POT events) under spatial and temporal dependence. In fact, while the number of POT events over a finite number of time steps and/or spatial locations is described by a binomial ( urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0001) distribution under i/id assumption, the lack of independence is usually treated by using Monte Carlo methods (e.g., Renard & Lang, 2007).

In this study, we show that the Poisson binomial ( urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0002) distribution describes the occurrence of RB events under i/id conditions of the underlying process, overcoming the computational burden of computing Stirling numbers of the first kind, while equivalent binomial ( urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0003) distributions provide accurate approximations under i/id hypothesis. Moreover, the beta‐binomial ( urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0004) distribution describes the number of POT occurrences under general spatiotemporal dependence, while equivalent beta‐binomial distributions ( urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0005) give accurate approximations of the distribution of RB occurrences under spatiotemporal dependence. We also show that the parameters of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0006, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0007, and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0008 distributions only depend on the rate of occurrence and/or the spatiotemporal correlation function (STCF).

In order to check the accuracy of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0009 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0010 distributions and their generality, we also introduce an extension of the so‐called BetaBit algorithm proposed by Serinaldi and Lombardo (2017) to generate binary processes with given temporal correlation structure. This approach, called BetaBitST, enables the simulation of binary random fields with desired STCF under minimal assumptions to be used as reference benchmarks in Monte Carlo analysis. Altogether, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0011, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0012, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0013, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0014, and BetaBitST represent a set of new practical tools to study and make inference on POT and RB occurrence processes.

This study is organized as follows. Section 2 introduces the heuristic reasoning that lead us to consider urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0015, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0016 as suitable distributions for the number of POT and RB events, respectively, under spatiotemporal dependence. Section 3 discusses the role of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0017 and the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0018 approximation under i/id hypothesis. Sections 4 and 5 provide technical details concerning urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0019 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0020 distributions. In section 6, we introduce the BetaBitST generator and use it to validate the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0021 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0022 models described in sections 4 and 5. We therefore analyze RB and POT occurrences in monthly temperature and precipitation across the CONUS and reanalyze Mauna Loa daily temperatures in section 7. Discussion and conclusions are reported in section 8.

2 Preliminary Remarks on the Occurrence of POT Events Under Independence and Nonstationarity

Since the main results discussed in the following sections build on previous findings reported by Obeysekera and Salas (2016), we first summarize their key results and use the models applied in their case study to support our discussion. Obeysekera and Salas (2016) studied the distribution of the number of events, Z, exceeding a given design (critical) value xd0 during the design (project) life spanning for instance n years, under the assumption that the discrete‐time stochastic process urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0023 (e.g., annual peak flow) is a sequence of independent nonidentically distributed (i/nid) random variables with univariate distributions urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0024. Under these conditions, the occurrence or absence of an extreme event exceeding xd0 in year j is described by a Bernoulli process urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0025 with state space urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0026, where j(=0,1,2,…) denotes discrete time, and Yj=1 if Xj>xd0; otherwise Yj=0. Under nonstationary conditions, the variables Yj are nonidentically distributed with time‐varying occurrence probability given by urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0027. With this notation, the number of events exceeding a given design value xd0 during n time steps is defined as urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0028, where urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0029. For i/nid Bernoulli trials, Z is distributed as a urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0030 distribution (Hong, 2013; Tejada & den Dekker, 2011; Wang, 1993; Zaigraev & Kaniovski, 2013) whose probability mass function (pmf) and cumulative distribution function (cdf) are, respectively,
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0031(1)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0032(2)

Hong (2013) introduced an exact formula with a closed‐form expression to compute the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0033 distribution function, developed an algorithm for efficient implementation, and studied the advantages and disadvantages of various approximation methods in order to overcome the computational burden associated with extensive enumerations involved in equations 1 and 2, especially for high n (i.e., n > 2,000).

In order to show the applicability of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0034 distribution, Obeysekera and Salas (2016) used probabilities pj resulting from a generalized extreme value (GEV) distribution describing the annual peak flows of the Assunpink Creek watershed in Trenton, New Jersey, with parameters varying along 100‐year design life according to the following parametrization:
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0035(3)
where μj = 44.587 + 0.306(j − 1968.027) is a time‐varying location parameter in year j and σ = 16.617 and ϵ = 0.136 are time‐invariant scale and shape parameters, respectively. To support our discussion, Figure 1 reproduces Figure 2 in the work by Obeysekera and Salas (2016). Figure 1a shows the time‐varying probabilities pj (for each year urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0036 of the design life) corresponding to the values urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0037 for j = 0, while Figure 1b displays the cdfs of the number of POT events computed by urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0038 distributions for the i/id case with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0039 and by the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0040 distribution for the i/nid case. It is worth noting that the GEV model in equation 3 is only used to highlight theoretical properties concerning the link between time‐varying probabilities and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0041 distribution and their subsequent applicability to RB analysis. We refer the reader to the supplementary material of Serinaldi and Kilsby (2015) for a critical discussion of the use of the GEV distribution in equation 3 to describe annual peak flows of the Assunpink Creek.
wrcr23461-fig-0001
(a) The time‐varying probabilities pj, for urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0042, corresponding to the model in equation 3 discussed by Obeysekera and Salas (2016) and denoted as OS2016 case. Each curve refers to a given value urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0043 at time j = 0. (b) The cumulative distribution functions of the number of peak‐over‐threshold events computed by urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0044 distributions for the stationary case with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0045 and by urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0046 distribution for nonstationary case. (c and d) Similar to panels (a) and (b), respectively, but compare OS2016's model and the stationary case with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0047. (e–h) Similar to panels (a) and (b) but for randomly varying probabilites pj corresponding to GEV location and scale parameters randomly varying around the values of μ0 and σ reported in the text and OS2016. GEV = generalized extreme value.
Figure 1b highlights the positive shift in the number of occurrences introduced by the increasing values of GEV location parameter. Of course, an opposite shift (i.e., fewer POT events) would occur if the GEV location parameter decreased in time. It should be noted that the first two central moments of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0048 (i.e., mean and variance) are given by the relationships (Hong, 2013)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0049(4)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0050(5)
Under i/id conditions, with pj=p0 for all js, equations 4 and 5 yield urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0051 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0052, corresponding to mean and variance of a urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0053 distribution describing the conditions at the beginning of the design life. The mean and variance of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0054 distribution can also be written as (Edwards, 1960; Poisson, 1837; Wang, 1993)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0055(6)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0056(7)
where urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0057 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0058 are the mean and variance of the set of probabilities urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0059, respectively. Therefore, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0060 increases as the set of probabilities urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0061 tends to be more and more homogeneous and attains its largest value when all pj values are identical (Tejada & den Dekker, 2011; Wang, 1993). From equations 6 and 7, it follows that one can approximate the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0062 distribution (describing a nonstationary Bernoulli process) with a simpler equivalent binomial urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0063 distribution describing the number of failures/successes over n trials for an equivalent stationary Bernoulli process with average rate of occurrence urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0064. The urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0065 model preserves exactly the mean of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0066 distribution, as urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0067, while the variance is at most overestimated by a quantity urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0068, thus giving a cautionary estimate of the variability of the number of successes/failures over n trials. Referring to the GEV example mentioned above, Figure 1c shows the nonstationary GEV probabilities pj (already shown in Figure 1a) and the average urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0069 corresponding to the equivalent stationary Bernoulli process, while Figure 1d highlights the remarkable agreement of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0070 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0071 models. In this example, the sequence urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0072 with p0=0.5 has urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0073 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0074. Obviously, the ratio of the two components of variance in the right side of equation 7 depends on the absolute values of pj and how they vary with time. However, this example and the case discussed in the next section support the use of an equivalent stationary Bernoulli process and the corresponding urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0075 distribution in practical applications.

To further explore this point, we also considered GEV distributions with location and scale parameters randomly varying around the values assumed at time j = 0 (i.e., μ0=44.587 and σ = 16.617). In particular, we assumed additive error models for both location and scale parameters such that μϵ = μ0+ϵ, with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0076, and σϵ = σ + ϵ, with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0077, where urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0078 denotes a Gaussian distribution. Figure 1e shows the exceedance probabilities corresponding to the model with randomly varying location around μ0 along with the reference probabilities resulting from the GEV distribution with constant parameters. Figure 1f highlights that the random fluctuations do not influence the distribution of the number of events. The same conclusion holds for the GEV model with randomly varying scale parameter (Figures 1g and 1h). The insensitivity of the probability of failure to randomly fluctuating parameters is an important aspect since it highlights a critical characteristic of frequency analysis relying on models with parameters depending on covariates exhibiting random behavior, such as teleconnection indices. In these cases, the resulting models are not truly nonstationary distributions, as is often incorrectly referred to in the literature, but simply compound stationary distributions with negligible or no effect on the probability of failure in a given time window. Confusing compound distributions with nonstationary distributions (whose parameters depend on time via deterministic functions) can lead to misleading interpretations and conclusions. We refer to Serinaldi and Kilsby (2015) and Serinaldi et al. (2018) for further discussion of these aspects. These concepts and remarks will be used in the next sections in the context of the analysis of POT and RB events under different conditions of spatial and temporal dependence.

3 Updating the Analysis of RB Events: The Role of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0079 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0080 Distributions

As mentioned in section 1, RB values are observations exceeding all previous observations of the same variable. The statistical theory of RB values was introduced by Chandler (1952), and the main theoretical results refer to an infinite sequence of i/id observations of a random variable X with cdf F, which is assumed to be nondegenerate in order to avoid the possibility of ties, that is, identical values in the sequence (Arnold et al., 2011; Glick, 1978). An observation xj is a RB high value if xj>xi for every i < j. An analogous definition holds for RB low values, whereby xj<xi for every i < j. With no loss of generality, we will refer to RB high values if not otherwise specified. When dealing with RB events, several statistics can be of interest, for example, RB time (i.e., the time of occurrence of an RB value), RB magnitude, RB increment process (i.e., the difference of magnitude of subsequent RB values), and inter‐RB time (i.e., the number of time steps between RB values). In this study, we focus on the number of RB events over a given time window.

Formally, given a sequence urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0081 of observations of the variable X, the occurrence of RB values in the sequence defines a binary process such that Yj=1 if urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0082; otherwise, Yj = 0 (Vogel et al., 2001). Therefore, the number of RB events in n trials (e.g., n years of annual data) is urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0083. We use the same notation applied in section 2 because the occurrence of RB events is a nonstationary Bernoulli process urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0084 with mean and variance (Glick, 1978)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0085(8)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0086(9)
while the mean and variance of the process Z are (Glick, 1978)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0087(10)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0088(11)
where urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0089 is the Euler constant. It is worth noting that Z, along with any other counting statistic, is not affected by the parent cdf F, as the counting process relies on ranks rather than on absolute values. Unlike the example in section 2, here the time‐varying probabilities pj do not come from a nonstationary parent distribution resulting from fitting to data (i.e., induction) but from conceptual considerations (i.e., deduction) according to the following reasoning (Glick, 1978): “The first observation necessarily must be a record…But, prior to observing any values, I know that the second of two numbers in random sequence has equal probability of being smaller or larger than the first. Hence the probability is exactly 50% that a second, independent observation will be a new record high surpassing the initial record, assuming that there cannot be an exact tie…From the same perspective, there is probability 1/3 that a third trial will be a new maximum, since the last of three repeated observations will be equally likely to be smallest, middle, or largest…Similarly, all 10 ranks are equally likely for the tenth observation; so maximum rank for the tenth observation has probability 1/10.” This deductive arguments support the use of tools developed for nonstationary processes in contrast to the application of nonstationary models relying on widespread but questionable data‐driven inductive inference (see Serinaldi et al., 2018).
While the first two moments of Z have a relatively simple expression, the exact pmf of Z was originally derived in terms of the Stirling numbers of the first kind urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0097 (David & Barton, 1962, pp. 178–183), where
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0098(12)
with
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0099(13)
Because the burden of combinatorial computations increases with n, Vogel et al. (2001) considered the asymptotic result for large sample size reported by Glick (1978)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0100(14)
which, however, does not provide a very accurate approximation (see Figure 2 in Vogel et al., 2001, and discussion below).

Recalling that the occurrence of RB events associated to an i/id parent process is a nonstationary Bernoulli process with pj = 1/j, it follows that the pmf and cdf of the number of occurrences over n trials are described by a urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0101 distribution, whose computation is readily available in statistical software (Hong, 2013). Moreover, according to remarks in section 2, we can also consider an equivalent Bernoulli process and the corresponding urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0102 distribution with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0103. Figure 2 compares the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0104 cdf and those obtained by equation 12 (referred to as Stirling distribution), equation 14 (denoted as Asy), and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0105 distribution for urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0106. Figure 2a shows that the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0107 and Stirling cdfs are identical for urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0108, while Stirling is not available for n = 1,000 because of computational infeasibility. Therefore, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0109 provides the exact distribution of Z and it can also be used for (relatively) large n values, avoiding the computational problems of Stirling numbers. Figure 2b confirms that the asymptotic approximation gives biased results even for n = 1,000, thus exhibiting a slow rate of convergence. On the other hand, the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0110 approximation is unbiased in terms of expectation E[Z] (Figure 2c) and only biased in terms of variance Var[Z], which is overestimated as per equation 7. Since the time‐varying probabilities of occurrence pj = 1/j are known a priori, we can assess in advance this bias as a function of n. Figure 2d shows that urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0111 overestimates urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0112 of the 23% at most for n = 5 (17% for n = 100, and 12% for n = 1,000). Even if convergence in terms of standard deviation is slow, Figure 2c highlights that the difference of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0113 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0114 models is very small in terms of cdf shape and probabilities associated to a given value of Z, and the approximation is better than that of the asymptotic one, thus making urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0115 an additional tool for practical applications.

wrcr23461-fig-0002
(a) Comparison between urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0090 and Stirling distributions of the number of record‐breaking events, Z, over n experiments (time steps). For n = 1,000, Stirling distribution is not available as it involves prohibitive combinatorial calculations. (b) Comparison between urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0091 distribution and the asymptotic approximation reported by Glick (1978). (c) urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0092 distribution versus the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0093 approximation based on urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0094. (d) The ratio of the standard deviations of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0095 distribution and the corresponding urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0096 approximation as a function of the record length n. cdf = cumulative distribution function.

4 The Number of POT Events Under Spatiotemporal Dependence: Introducing the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0116 Distribution

In the previous section, we discussed the case of i/nid binary processes, namely, processes comprising independent Bernoulli trials with (deterministic) time‐varying probability of occurrence pj. In this section, we introduce basic results concerning the distribution of the number of occurrences (failures or successes) over n trials, under the assumption of constant p and mutually dependent trials, that is, the ni/id case. In this context, the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0117 distribution plays a key role. Denoting urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0118, the probability of success/failure in n trials, the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0119 distribution is a compound distribution resulting from the ordinary urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0120 distribution urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0121, when ψ is assumed to be a random variable Ψ following a beta distribution urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0122 with mean E[Ψ] = p, where B denotes beta function and α and β are two positive shape parameters. The urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0123 pmf can be written as (Skellam, 1948)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0124(15)
while mean and variance are given by the formulas (Ahn & Chen, 1995)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0125(16)
and
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0126(17)
where p = α/(α + β), and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0127 is known as the intraclass or intracluster correlation. If the random variable Ψ has a degenerate distribution with probability 1 at a single point (or urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0128 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0129), then Var[Ψ] = 0 and Z becomes binomial with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0130 (Ahn & Chen, 1995). Being positive by definition, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0131 produces overdispersion as it inflates the variance np(1 − p) of the original urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0132 distribution with constant p. On the other hand, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0133 does not affect the expected value, which is identical for urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0134 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0135. For correlated experiments, we have
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0136(18)
where ρjl = Corr[Yj,Yl] denotes the pairwise correlation of experiment j and l. The indices j and l can refer to two different time steps in a temporal process evolving over n time steps or two locations in a spatial process over n locations. For a spatiotemporal process over n time steps and m locations, ρji,lk = Corr[Yji,Ylk] are the element of the q × q = nm × nm space‐time correlation matrix and equation 18 reads as
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0137(19)
Note that the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0138 distribution can arise in a number of ways (Hisakado et al., 2006; Moran, 1968) and is also known as the Polya or negative hypergeometric distribution (Griffiths, 1973). The above derivation (compounding urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0139 and beta distributions) is analogous to the derivation of the negative binomial distribution by compounding the Poisson with a gamma distribution (Moran, 1968, pp. 87–91). The Poisson model is the limiting form of the binomial distribution when n is large and p is small, while the negative binomial is the limiting form of the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0140 distribution when n and α + β are large (Hughes & Madden, 1993). The urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0141 distribution has been used in several fields for various purposes such as the modeling of correlated failures and reliability of multiversion software (Nicola & Goyal, 1990), the description of plant disease incidence data (Hughes & Madden, 1993), the estimation of false discovery rates in multiple testing for significance with gene expression data from DNA microarray experiments (Tsai et al., 2003), or the estimation of the rejection rate in multiple trend testing for correlated stream flow data (Serinaldi et al., 2018).
For completeness, we also recall a simple and asymptotically efficient estimator of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0142 parameters based on the ratio of the first two factorial moments (Tripathi et al., 1994):
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0143(20)
where urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0144 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0145 denotes the ith sample factorial moment. Adopting the convention that urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0146, the estimators in equation 20 require only the computation of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0147 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0148. Tripathi et al. (1994) showed that such estimators outperform other options, such as maximum likelihood, in terms of asymptotic relative efficiency.

In order to show the accuracy of the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0149 distribution, we performed a Monte Carlo experiment by simulating binary time series with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0150, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0151, and two different ACFs, namely, the one‐parameter fGn and Markov dependence structures characterized by parameters H ∈ (0.5,1) and ρ1 ∈ (0,1), respectively (see the appendix for further details). The range of p values represents relatively rare events, while the intervals of H and ρ1 cover the admissible values for positively correlated stationary processes, where the lower limits correspond to independence (i.e., the i/id case). Since hydroclimatic variables generally show positive dependence, empirical estimates of H and ρ1 usually fall in those intervals. For the sake of space, hereinafter, some results are reported only for representative values of H and ρ1 (i.e., low, medium, and high positive correlation). Binary correlated samples are generated by BetaBit algorithm (Serinaldi & Lombardo, 2017). For each parameter configuration, we generated 5,000 replications, thus counting the number of occurrences z for each replication. This set of 5,000 z values was used to obtain the empirical cdf (ecdf) of Z. Hereinafter, results for fGn processes are reported in the text, while those for Markov models in the supporting information. Figures 3 and S1 compare ecdfs and fitted urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0152 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0153 cdfs for some representative parameter configurations, highlighting how the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0154 distribution can strongly underestimate the probability of z values smaller/higher than the expected values np. On the other hand, the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0155 distribution correctly describes the overdispersion introduced by the temporal dependence structures.

wrcr23461-fig-0003
Each panel shows the empirical distribution function of the number of peak‐over‐threshold events computed on binary time series with fractional Gaussian noise correlation structure and rate of occurrence p simulated by BetaBit, along with the corresponding urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0156 distribution and the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0157 distribution corresponding to i/id conditions. Each panel refers to a specific combination of Hurst parameter H, rate of occurrence p, and sample size n. cdf = cumulative distribution function.

We also checked the accuracy of the estimators in equation 20 by comparing the parameters urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0158 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0159 estimated on the simulated sequences with the theoretical values p and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0160, where urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0161 is computed from equation 18 by using the pairwise correlation terms ρjl resulting from the theoretical ACFs in equations A1 and A2. Figures 4 and S2 show the agreement of theoretical and estimated parameters, confirming (i) the suitability of the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0162 distribution and its parametrization in equation 18 to describe the distribution of Z under ni/id hypothesis, and (ii) the performance of the estimators in equation 20. Moreover, these results further validate the performance of BetaBit in terms of a by‐product variable, Z, which is not used in the algorithm structure.

wrcr23461-fig-0004
Each panel shows the theoretical parameters of the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0163 distribution of the number of peak‐over‐threshold events for binary processes with fractional Gaussian noise dependence structure and a specific combination of p and n as a function of H. Theoretical values are compared with estimates obtained by applying the factorial moment method (equation 20) on simulated samples.

5 The Number of RB Events Under Spatiotemporal Dependence

5.1 Introducing Nonindependent and Nonidentically Distributed (ni/nid) Binary Variables

In sections 3 and 4, we discussed respectively the cases i/nid (independent binary processes with time‐varying rate of occurrence pj) and ni/id (dependent binary processes with constant rate of occurrence p) focusing on temporal processes. However, in practical applications, we are more often interested in spatiotemporal processes, such as the number of POT or RB events occurring over multiple locations in a given time window under possible spatiotemporal dependence (e.g., Benestad, 2003, 2004; Renard & Lang, 2007). For POT events, the focus can be, for instance, on the number of flood events exceeding the at‐site value corresponding to a given annual exceedance probability p (or return period) in agreement with specified standard of protection (e.g., Interagency Committee on Water Data, 1982; Kjeldsen et al., 2008). The urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0164 model provides the exact distribution of Z even though, to the best of our knowledge, it was never considered before for these type of problems, where simulation approaches have generally been preferred (e.g., Renard & Lang, 2007).

On the other hand, the case of RB events over several locations under spatiotemporal dependence is more challenging as it corresponds to nonindependent and nonidentically distributed (ni/nid) hypothesis for Y. As mentioned above, our review of theoretical literature highlights that some attempts have been made in the past, considering extensions devised for specific problems and resulting in rather complex models that are also difficult to apply/adapt to different applications. Therefore, we propose a general purpose approximation of the distribution of Z under ni/nid that derives from conceptual reasoning by merging the results and arguments reported sections 3 and 4. Our deduction is not analytical but consists of a working hypothesis to be validated ex post by simulating binary processes with specified properties, namely, prescribed time‐varying pj and spatiotemporal dependence structure. This approach requires some conceptual elaboration and the availability of a generator of binary random fields with given spatiotemporal correlation structure, which we introduce in the following. We aim to provide a simple model approximating the distribution of Z under ni/nid reasonably well, thus avoiding extensive Monte Carlo simulations.

5.2 urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0165: A urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0166 Distribution for an Equivalent ni/id Process

If the rate of occurrence p is constant, the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0167 distribution can account for persistence by the parameter urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0168 representing the average of the correlation matrix. On the other hand, the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0169 model can account for time‐varying pj but not for dependence. However, an i/nid binary process can be approximated by an equivalent i/id process with parameter urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0170 equal to the average of pj. Therefore, our assumption is to approximate the distribution of Z under ni/nid by a urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0171 model with parameters urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0172 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0173, such that
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0174(21)
and
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0175(22)
where ϕ is a correction factor to be defined according to the law of variation of pj. For pj = 1/j, extensive simulations based on the binary generator described in section 6 yield ϕ = 0.75. The urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0176 model corresponds to an equivalent ni/id binary process with average rate of occurrence urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0177 and prescribed (spatiotemporal) correlation function.

For an RB process with pj = 1/j, we have j≥2 in equation 21 because the first observation of a time series is always an RB event by definition, and therefore, its occurrence is fully deterministic. Since the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0178 model only describes the stochastic part of the process with pj<1, the final distribution of Z over n time steps and m locations is a shifted urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0179 distribution urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0180.

We stress that the probability pj is generally different from 1/j under dependence because this property implies that large (small) values follows large (small) values. In this respect, Benestad (2003) showed that the weak dependence resulting from a first‐order AR model, namely, AR(0.3), yields pj values no significantly different from 1/j, while Newman et al. (2010) showed that increasing dependence yields pj>1/j when the parent process X is fGn or Brownian walk parametrized by the Hurst coefficient H. However, such differences are very small for H≤0.75 and become significant only for strong dependence (H≥0.875). Therefore, assuming pj = 1/j under persistence is acceptable in a wide range of practical applications where X exhibits weak or moderate dependence. Moreover, we focus on the dependence structure of the binary process Y rather than on that of X. Our aim is to show the potential impact of dependence when its nature and magnitude is potentially concealed by small sample sizes that commonly characterize hydrological time series. In fact, in some cases, observations x can appear approximately independent while z exhibits some anomalous behavior contrasting with the reasonable assumption pj = 1/j supported by the apparent independence. The urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0181 distribution approximates the distribution of Z for a binary process that retains the assumption pj=1/j (and therefore E[Z]) but shows inflated variability and clustering of events due to dependence. This process is therefore an alternative to i/id option to be used as a benchmark to check the behavior Z without introducing demanding assumptions related, for instance, to the nonstationarity of the parent process X (Serinaldi et al., 2018).

Table 1 summarizes the relationships between the type of the parent process X (i.e., i/id and ni/id) and that of the corresponding POT and RB occurrence processes Y, along with the rates of occurrence, and distributions FZ of the number of occurrences Z over generic spatiotemporal windows. It should be noted that the results derived in this study strictly refer to POT and RB cases corresponding to i/id and ni/id parent processes X spanning n time steps over m locations for which the rate of occurrence is constant or 1/j (i.e., the cases in the first two columns of Table 1). When X are i/nid, ni/nid, general results are not available as the temporal evolution of the rate of occurrence is case specific and can be different for each location. We also stress that this type of POT and RB spatiotemporal analysis makes sense if we have simultaneous observations over all time steps and locations. Our results are not devised for i/nid, ni/nidX processes with arbitrarily evolving rates of occurrence. In this circumstances, we do not recommend the use of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0182 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0183 models without the support of accurate and extensive preliminary Monte Carlo simulations starting from the process X. However, in these cases, analytical models lose their usefulness as the necessary Monte Carlo experiments already provide the required information about the (case‐specific) distributions of Z.

Table 1. Relationships Between the Nature of the Parent Process X (i.e., i/id and ni/id) and That of the Corresponding POT and RB Occurrence Processes Y, Along With the Rates of Occurrence, and Distributions FZ of the Number of Occurrences Z Over Generic Spatiotemporal Windows
Parent X i/id ni/id i/nid ni/nid
Y i/id ni/id i/nid ni/nid
POT Rate p p pj pj
FZ urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0184 urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0185 urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0186 or urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0187 urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0188
Y i/nid ni/nid i/nid ni/nid
RB Rate 1/j ≈1/j urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0189 urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0190
FZ urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0191 or urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0192 urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0193 urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0194 or urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0195 urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0196
  • Note. For i/nid and ni/nidX, pj and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0197 denote generic time‐varying and generally different rates of occurrence of POT and RB events, respectively. Parametrization (correction factor) is case specific and should be assessed by preliminary simulation.

6 BetaBitST: An Extension of BetaBit Algorithm to Simulate Binary Processes With Given Spatial and Temporal Dependence

To check the performance of the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0198 distribution and assess the correction factor ϕ, we need a generator of binary time series and random fields preserving the required pj and spatiotemporal dependence structure. To accomplish this task, we propose an extension of BetaBit (Serinaldi & Lombardo, 2017), which is an efficient and relatively simple generator of binary time series based on parent Gaussian bivariate distributions (see also Papalexiou, 2018, for a detailed discussion of this class of models). We briefly recall the rationale of BetaBit to introduce its spatiotemporal version denoted as BetaBitST.

BetaBit generates a correlated sequence of random numbers urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0199, for simplicity urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0200, taking values 1 and 0 with probability p and 1 − p, respectively, by generating a sequence of n random numbers urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0201 for an auxiliary process X with the desired ACF (e.g., exponential or power law) and standard Gaussian marginal cdf urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0202, and then transforming the marginal distributions into Bernoulli marginals by
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0203(23)
where urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0204 denotes the inverse of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0205, that is, the quantile function. Even though this dichotomization does not preserve linear correlation, the ACF terms of the process X,ρX(τ) = Corr[Xj,Xj + τ], are related to those of the process Y, ρY(τ) = Corr[Yj,Yj + τ], by an implicit analytical relationship, ρX = ζ(ρY), that can be approximated with negligible error by a beta cdf Fβ with parameters αp and βp depending on p (Serinaldi & Lombardo, 2017)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0206(24)
where urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0207 and
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0208(25)
Therefore, BetaBit algorithm generates a correlated binary sequence urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0209 as follows:
  1. Compute αp and βp from equation 25 based on the desired value of p;
  2. Use equation 24 to inflate the terms of the ACF of the auxiliary process X, ρX(τ), in order to obtain the target process Y with the desired ACF, ρY(τ);
  3. Generate a standard Gaussian time series urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0210 with the inflated ACF by algorithms allowing the explicit use of the ACF in the simulation process, such as the method proposed by Davies and Harte (1987) and used in this study;
  4. Apply the dichotomization in equation 23.

The BetaBit structure allows a straightforward spatiotemporal extension by using cross‐correlated innovations to simulate standard Gaussian time series with given ACF. In particular, cross‐correlated (but temporally independent) innovations used in Davies‐Harte's algorithm are drawn from a multivariate Gaussian distribution with desired cross‐correlation function. This approach is quite standard (e.g., Podgórski & Wegener, 2012) and yields Gaussian processes with separable STCF ρX,st(u,τ) = ρX,s(uρX,t(τ), where ρX,s(u) is the cross‐correlation function and u is the distance between two generic locations. The hypothesis of STCF separability means that the STCF can be expressed as the product of the spatial and temporal correlation functions. Intuitively, this hypothesis corresponds to assume that the cross correlation (spatial correlation) can be studied independently of the autocorrelation (temporal correlation; see, e.g., Genton, 2007; Gneiting et al., 2006, and references therein for a technical discussion). Even though more general types of STCFs do exist (Genton & Kleiber, 2015; Gneiting et al., 2006), separable STCFs are the simplest way to account for spatiotemporal dependence especially in cases, such as the occurrence of rare events, where there are often not enough data to justify the use of more complex dependence structures.

Recalling that ρX = ζ(ρY), the STCF of the binary process Y resulting from BetaBitST reads as
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0211(26)

Since ζ is a nonlinear transformation, the spatiotemporal binary process Y shows the desired ACF, as ρY,st(0,τ) = ζ−1(1·ζ(ρY,t(τ))) = ρY,t(τ), and lag‐0 cross correlation, as ρY,st(u,0) = ζ−1(ζ(ρY,s(u))·1) = ρY,s(u), while the lagged cross‐correlation terms ρY,st(u,τ) ≠ ρY,s(uρY,t(τ), thus meaning that the STCF of Y is not separable. Figure 5a shows an example of the separable STCF of X resulting from the product ρX,s(uρX,t(τ) of spatial and temporal Markovian correlation functions (equation A2) with parameters ρ1,s = ρ1,t=0.8, while Figure 5b illustrates the actual STCF (from equation 26) of a binary process Y (with p = 0.05) resulting from BetaBitST by using the same (marginal) spatial and temporal Markovian correlation functions. Even though both STCFs share the same (marginal) spatial and temporal correlation functions, the mixed terms (corresponding to u0 and τ ≠ 0) are affected by the nonlinear backward transformation ζ−1, which slightly strengthens the lagged cross‐correlation terms. However, this discrepancy is not a shortcoming for our purposes, as we are not interested in the separability of the STCF of Y but in knowing its exact form to compute the correlation matrices in equations 19 and 22. Note that the only condition required by BetaBitST is that the STCF is positive definite in the range of space and time lags m and n of interest.

wrcr23461-fig-0005
(a and b) An example of the theoretical spatiotemporal correlation functions (STCFs) of the parent Gaussian process X and the corresponding STCF of the binary process Y, respectively, assuming Markov dependence structures in space and time with parameters ρ1,s=ρ1,t=0.8 and rate of occurrence p = 0.05. (c) The empirical STCF of a binary random field simulated by BetaBitST using the theoretical STCF in panel (b). As the empirical STCF in panel (c) refers to a single simulated random field, discrepancies between empirical STCF in panel (c) and theoretical STCF in panel (b) are due to finite sample effects (i.e., sampling uncertainty). Note that the ensemble STCF matches the theoretical STCF in panel (b) as expected (figure not shown).

To confirm the correctness of equation 26, we simulated a binary sample spanning 10,000 time steps and 100 equally spaced locations and the corresponding empirical STCF (Figure 5c). Leaving aside fluctuations due to finite size effects, the empirical STCF matches the expected theoretical STCF in Figure 5b. Figure 6 provides some examples of binary random fields simulated by BetaBitST over a grid 20 × 30 and 20 time steps, with p = 0.1 and fGn spatial and temporal dependence with nine combinations of parameters Hs and Ht ranging in urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0214. For each value of Hs, the increasing value of Ht yields increasing persistence of the field structure across subsequent time steps. On the other hand, the scattering of the random fields decreases as Hs increases. For high values of both Hs and Ht, the random fields exhibit clustered spatial patterns (reflecting spatial persistence across adjacent grid boxes) preserving their structure in time due to temporal dependence.

wrcr23461-fig-0006
Sequences of 20 random fields simulated by BetaBitST with fractional Gaussian noise dependence structures, p = 0.1, and various combinations of spatial and temporal H parameters denoted as Hs and Ht, respectively. For a given Hs, subsequent snapshots are more and more similar as Ht increases, implying stronger temporal dependence, while spatial events tend to cluster more and more within each snapshot as Hs increases, denoting stronger spatial dependence.

BetaBitST enables the validation of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0215 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0216 as suitable distributions of Z under ni/id and ni/nid conditions, respectively, as well as the evaluation of the correction factor ϕ in equation 22 for spatiotemporal processes spanning n time steps over m locations. Figure 7 (S3) compares ecdfs of Z and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0217 cdfs corresponding to binary temporal processes with different sizes n and values of Ht (ρt), and pj = 1/j. Empirical and theoretical models show an excellent agreement for all combinations of parameters with ϕ = 0.75. This value of ϕ results from extensive Monte Carlo simulation. Specifically, we considered several combinations of values of Ht ∈ (0.5,1) (ρt ∈ (0,1)), Hs (ρs), n ∈ [10,100], and m ∈ [10,100]. For each combination, we simulated 5,000 random fields with pj = 1/j and computed the number of RB events for each sample. The resulting 5,000 values of Z are used to build the ecdf of Z and to estimate urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0218. After checking for substantial independence of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0219 estimates, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0220, from BetaBitST parameters, we computed the ϕ value minimizing the sum of squared relative errors urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0221. We stress that these results holds for pj = 1/j. Other cases, such as those listed in the last two columns of Table 1, must to be addressed by case‐specific Monte Carlo simulations. Figure 8 (S4) shows the perfect matching of ecdfs and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0222 distributions using different combinations of Ht (ρ1,t), Hs (ρ1,t), m, and n for spatiotemporal binary processes describing POT events with different rates of occurrence p. Figure 9 (S5) shows urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0223 cdfs approximating the ecdfs of Z corresponding to ni/nid binary processes with different spatiotemporal dependence structures and time‐varying pj = 1/j (with ϕ = 0.75).

wrcr23461-fig-0007
Each panel shows the empirical distribution function of the number of record‐breaking values computed on binary time series with fractional Gaussian noise correlation structure and rate of occurrence pj = 1/j simulated by BetaBit, along with the corresponding urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0212 distribution and the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0213 distribution corresponding to i/id conditions. Each panel refers to a specific combination of Hurst parameter H and sample size n. cdf = cumulative distribution function.
wrcr23461-fig-0008
Each panel shows the empirical distribution function of the number of peak‐over‐threshold events computed on sequences of binary random fields with fractional Gaussian noise spatial and temporal correlation structures and rate of occurrence p simulated by BetaBitST. The corresponding urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0224 distribution and the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0225 distribution corresponding to i/id conditions are also shown. Each panel refers to a specific combination of Hurst parameters Hs and Ht for the spatial and temporal dependence structures, rate of occurrence p, sample size n, and number of locations m = 50. cdf = cumulative distribution function.
wrcr23461-fig-0009
Each panel shows the empirical distribution function of the number of record‐breaking values computed on sequences of binary random fields with fractional Gaussian noise spatial and temporal correlation structures and rate of occurrence pj = 1/j simulated by BetaBitST. The corresponding urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0226 distribution is also shown. Each panel refers to a specific combination of Hurst parameters Hs and Ht for the spatial and temporal dependence structures, sample size n, and number of locations m = 50. cdf = cumulative distribution function.

All cases highlight that the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0227 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0228 distributions provide and exact or very accurate approximation of the distribution of Z under different assumptions concerning (in)dependence and (non)stationarity. These results also stress the variance‐inflating effect of spatiotemporal dependence on the distribution of Z. Compared with the i/id case, we have higher probabilities of observing a number of events smaller and greater than the theoretical average over n time steps and m locations. This inflated variability corresponds to greater uncertainty that should be taken into account when drawing conclusions about the surprising nature of POT and RB events or their clustering in space and time. These aspects are further discussed in the subsequent analysis of real world data.

7 Analysis of Temperature and Precipitation Data

7.1 Temperature and Precipitation Data Sets

As mentioned in section 1, previous studies attempted to study the rate of occurrence of RB events accounting for spatial and/or temporal correlation. However, they did not introduce a general framework allowing simulation and accurate description under fully specified spatiotemporal dependence structures, including short‐ and long‐range dependence. In this section, we show the impact of spatiotemporal dependence on the rate of RB and POT events by studying temperature and precipitation data across CONUS and reanalyzing the Mauna Loa temperature data previously studied by Newman et al. (2010).

The CONUS data set consists of monthly precipitation, and minimum and maximum temperature anomalies (R, Tmin, and Tmax, respectively), computed with respect to 1901–2000 average, from January 1895 to December 2017 (123 years) over the 344 divisions of the CONUS. This database was obtained from area‐weighted averages of grid point estimates resulting from station data gridded via climatologically aided interpolation (Karl & Koss, 1984; Vose et al., 2014). The divisional scale yields sufficiently refined spatially smoothed results that allow the recognition of physically coherent hydroclimatological patterns (e.g., McCabe & Wolock, 2002; Wolock & McCabe, 1999). Data are provided by the National Oceanic and Atmospheric Administration (NOAA) through the U.S. Climate Divisional Database (nClimDiv data set; http://www.ncdc.noaa.gov/cag/time‐series/us). Data were preliminarily deseasonalized by subtracting the calendar‐month average and dividing by the calendar‐month standard deviation. This procedure removes the effect of seasonal fluctuation of the first two marginal moments and allows for the study of the underlying homogeneous process, thus avoiding seasonal stratification and enabling better recognition of long‐range (low‐frequency) fluctuations.

The data set used by Newman et al. (2010) comprises 30 years of maximum and minimum daily temperatures at the NOAA Mauna Loa Observatory on the Big Island, Hawaii, recorded from 1 January 1977 to 31 December 2006. These time series are extracted from high‐quality hourly temperature data measured at 2 m above ground level (Malamud et al., 2011). Data are freely available from NOAA Earth System Research Laboratory Global Monitoring Division (https://www.esrl.noaa.gov/gmd/dv/data/). For the sake of comparison, data were preprocessed by removing 29 February of leap years and infilling missing values by the same methodology used by Newman et al. (2010): “if one to three successive days were missing, the values for the adjacent days were averaged. If more days were missing, the data for the adjacent years for that day were averaged.”

7.2 RB Analysis of CONUS Divisional Temperature and Precipitation

Our analysis focuses on the occurrence of RB events in monthly Tmax, Tmin, and R both globally at the CONUS scale and locally at divisional scale. The research question is: is the occurrence of observed RB events consistent with a spatially and temporally correlated random process? To answer this question, we studied the cumulative average number of records occurred across the 344 divisions of the CONUS, following the rationale of the analyses reported by Vogel et al. (2001) and Newman et al. (2010). The empirical values are compared with those expected from a stochastic process reproducing spatial and temporal correlations of precipitation and temperature measurements. In particular, we simulated 1,000 standard Gaussian random fields of size m × n equal 344 × (12·123) with separable spatiotemporal correlation in the same spirit of BetaBitST (see also Serinaldi & Kilsby, 2017, for more details on this modeling approach). The temporal dependence structure of divisional Tmax and Tmin was modeled by fGn processes whose parameter H was estimated by the least square based on variance method (Tyralis & Koutsoyiannis, 2011), while the ACF of divisional R by AR models of order up to 3, which were selected by an automatic selection procedure based on the Akaike information criterion (Akaike, 1974). The choice of using fGn for Tmax and Tmin and AR for R is based on preliminary exploratory analysis of empirical ACFs and diagnostic plots of variance versus aggregation scale. Since this analysis recognized a persistent behavior in temperature data and short‐range dependence in rainfall data, which is in agreement with the literature (e.g., Kantelhardt et al., 2006; Franzke, 2012), fGn and AR models provide reasonable and parsimonious descriptions of time series properties. To reproduce the spatial correlation, the divisional time series models were fed with temporally independent but spatially correlated innovations sampled for a multivariate Gaussian distribution with correlation matrix equal to the near positive definite version of the empirical correlation matrix (Higham, 2002) computed on the residuals of the divisional temporal models. It should be noted that we do not need to reproduce the exact marginal distributions of precipitation and temperature as the RB occurrences are rank‐based statistics not affected by the absolute values of the observations. Figures 10 and 11 show the cross‐correlation matrices for all variables and the spatial distribution of H parameter for Tmax and Tmin. It is worth noting that Tmax and Tmin exhibit a spatial correlation stronger than R (as expected), while the H values show coherent spatial patterns.

wrcr23461-fig-0010
Cross‐correlation matrices of the residuals of fractional Gaussian noise models fitted on the 344 divisional time series of Tmax (panel a), Tmin (panel b), and AR models fitted on precipitation R (panel c).
wrcr23461-fig-0011
Spatial patterns of Hurst parameter H for Tmax (panel a) and Tmin (panel b).

Using the simulated Gaussian fields, we computed the total (cumulative) number of records zji occurred at each division urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0229 until the time step urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0230, and then the average urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0231. We considered the RB high values for Tmax and Tmin, since warming is usually of major interest in hydroclimatic research, and both RB high and RB low values for R. Figure 12 shows the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0232 values estimated on the observed temperature and precipitation, along with the expected average number of records under independence, and the 95% pointwise CIs corresponding to independence and spatiotemporal dependence. Even though there is substantial discrepancy between the estimated and theoretical values of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0233 for Tmax and Tmin, the estimated urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0234 is still within the CIs when spatiotemporal dependence is accounted for. It should be noted the substantial difference between the width of CIs under independence and spatiotemporal dependence. Vogel et al. (2001) already discussed the effect of spatial correlation on the uncertainty of the rate of occurrences of annual maximum daily flow records; however, our results concerning temperature and precipitation emphasize how large can actually be the difference in cases where spatial and temporal dependence is not negligible, and it is therefore properly incorporated in the analysis. In this respect, we also note that focusing on extreme values, such as annual maximum daily flow, can lead to strong underestimation of the dependence characterizing the underlying process and therefore to too narrow CIs and different conclusions (see Serinaldi et al., 2018, for further discussion). For R, the estimated values of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0235 are closer to the theoretical expectation but still outside the i/id CIs for RB high values. Even though R shows spatial and temporal correlations weaker than that of Tmax and Tmin, also in this case, CIs under spatiotemporal dependence are much wider than those corresponding to independence, making observed patterns compatible with a stationary random process.

wrcr23461-fig-0012
Each panel shows the empirical cumulative average number of RB events across the conterminous United States, where the cumulative sum is taken over time, and the average across the 344 division at each time step. Empirical estimates on the observed data are complemented with the theoretical expectations and 95% CIs under independence and 95% CIs under spatial and temporal dependence. The latter CIs are obtained by Monte Carlo simulation described in the text. Results refer to record‐breaking high Tmax and Tmin (panels a and b), and record‐breaking low and high precipitation (panels c and d). CI = confidence interval; RB = record‐breaking.

Figures 12a and 12b also show that the estimated curves tend to depart from the theoretical ones at the beginning of the time series, thus indicating clustering of records in the first years of observation. To further investigate this aspect, we identified the divisions where the estimated zji fall outside the divisional 95% CIs, and the year in which the zji curves cross the CI limits. Figure 13 shows that the crossing points tend to occur in the first years in most of cases with few exceptions, where crossing occurs after 1950. As expected, crossing points are more clustered in space for Tmax and Tmin than for R because of the stronger spatial correlation.

wrcr23461-fig-0013
Maps show the year in which the empirical cumulative number of RB events in each division cross and fall outside the limits of the 95% Monte Carlo confidence intervals under spatiotemporal dependence. Results refer to record‐breaking high Tmax and Tmin (panels a and b) and record‐breaking low and high precipitation (panels c and d). RB = record‐breaking.

Finally, we used the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0236 to answer the following question: given the information available until 1980, what is the probability of the total number of records observed until 2017? In other words, we attempt to understand if the occurrence of the most recent records is consistent with a spatially and temporally correlated random process. To provide an answer, urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0237 is estimated on data from 1895 to 1979, and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0238 is used to compute the probability of the number of records observed until 2017. Figure 14 shows that these probabilities are generally below the the upper limit of the 95% prediction interval. In agreement with Figure 12, probabilities are closer to the upper limit for Tmax and Tmin than for R. For Tmax and Tmin, limit exceedances tend to cluster in space because of spatial correlation, reflecting the common behavior of groups of neighboring divisions. Moreover, when we integrate spatial correlation out, that is, we look at the average behavior over the entire area, the overall number of records does not exceed the confidence limits (as shown in Figures 12a and 12b). Therefore, even though Tmax and Tmin exhibit a larger number of local exceedances compared with R, this is consistent with and can be described by spatial dependence. It is worth noting that we do not say that the number of temperature RB events are not higher than expected (as this is obvious from Figures 12a and 12b) but that the departures started at the beginning of the observation period (i.e., the first decades of the nineteenth century), and they are still consistent with stationary correlated random processes.

wrcr23461-fig-0014
Maps show the spatial pattern of the nonexceedance probability of the total number of RB events observed from 1895 to 2017 in each division computed by the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0239 model with parameters estimated on data from 1895 to 1980. Divisions where the probability exceeds the value of 0.975, that is, the upper limit of the 95% prediction interval are highlighted by bold black boundaries. Results refer to record‐breaking high Tmax and Tmin (panels a and b), and record‐breaking low and high precipitation (panels c and d). RB = record‐breaking.

7.3 POT Analysis of CONUS Divisional Temperature and Precipitation

In this section, we complement the RB analysis of Tmax, Tmin, and R across CONUS divisions with a POT analysis focusing on the effect of spatial correlation on the occurrence of events with frequency exceeding a given percentage threshold. In more detail, we want to know how many simultaneous POT events we can expect to observe across the CONUS at each time step (month) and their distribution. Recalling that the data set comprises 123 years of (deseasonalized) monthly values of Tmax, Tmin, and R across 344 divisions, each value xji occurred at each division urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0240 until the time step urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0241 is transformed into yji, such that yji = 0 if xjixi,p and yji=1 otherwise, where xi,p is the empirical quantile of X with exceedance probability p for the division i. The resulting binary sequences are therefore summed up yielding urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0242, that is, the number of simultaneous POT events over the CONUS divisions at each time step. This sequence is used to build the ecdf of Z, which can be compared with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0243 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0244 distributions. We recall that the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0245 model describes the number of successes over 344 trials for an experiment with success rate equal to p under i/id assumption, while urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0246 model accounts for spatial correlation via the parameter urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0247. urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0248 is estimated by equation 18 where the correlation value ρY,ik of a pair of binary processes (Yi,Yk) is computed from the correlation ρX,ik of the parent processes (Xi,Xk) by the relationship (Emrich & Piedmonte, 1991)
urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0249(27)
where Φ2 denotes the bivariate Gaussian cdf of (Xi,Xk). It should be noted that a bivariate Gaussian cdf is a good approximation for monthly Tmax, Tmin, and R, since the marginal distributions of these variables are approximately Gaussian or bell shaped and weakly skewed. Figure 15 shows that the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0250 distributions fit the ecdfs of all variables for the thresholds urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0251. Accounting for spatial correlation enables the modeling of discrepancies between ecdfs and the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0252 distributions without introducing further demanding assumptions on nonstationarity, for instance. In this respect, is worth noting that the spatial correlation yields not only a larger number of simultaneous POT events over a given area but also a higher probability to observe no events over the same area. For example, this probability is close to 0 under i/id according to urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0253 model and is ≈40% (80%) for the observed Tmax and Tmin POT events with p = 0.05(0.01). This means that the alternation of periods with widespread POT events and no events should not be considered as exceptional but the rule under the ni/id assumption, which in turn is sufficient to properly describe this behavior.
wrcr23461-fig-0015
Each panel shows empirical cdfs of the number of simultaneous POT events recorded across the 344 divisions of the conterminous United States at each time step (month) for percentage thresholds urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0254. Theoretical urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0255 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0256 cdfs are also shown. cdf = cumulative distribution function; POT = peak‐over‐threshold.

7.4 RB Analysis of Mauna Loa Temperature

In this section, we replicate one of the analyses on RB events reported by Newman et al. (2010). Recalling that the data span 30 years from 1977 to 2006, this analysis considers the time series of 30 maximum and 30 minimum temperatures for each calendar day, resulting in 365 time series of size 30 (e.g., the first time series is the sequence of the 30 temperature values recorded on 1 January from 1977 to 2006). For each time series, we computed the cumulative numbers or RB temperatures, zmax,ji and zmin,ji, with urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0257 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0258. For each year j, the 365 values are averaged obtaining urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0259 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0260 as functions of year j from 1977 to 2006.

Since the sampling distribution of the number of RB values is approximately Gaussian with mean and variance, respectively, given by equations 8 and 9 under i/id hypothesis, Newman et al. (2010) used such a distribution to assess the agreement of the estimated urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0261 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0262 with theoretical results, accounting for sampling uncertainty. As urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0263 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0264 averages are taken over 365 values, their sampling standard deviation is expected to be urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0265 times smaller than that of zmin,ji, under i/id hypothesis. Under these assumptions, Newman et al. (2010) concluded that the observed values of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0266 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0267 are not consistent with an i/id random process.

However, some questions arise: Are data really independent? Is the i/id hypothesis defensible/credible? If not, what is its effect on results? As far as temporal dependence is concerned, inference on time series of size 30 is generally speculative, as much longer time series are required to obtain reliable estimates. For example, clustering of extreme events in time is a typical characteristic of temporally correlated stochastic processes. This property can result in observed sequences of low and high summary statistics (e.g., block minima, block maxima, and POT values) that appear, however, approximately uncorrelated because these statistics do not provide enough information to assess the actual dependence of the underlying process. Lack of apparent dependence can lead to interpret low and high regimes as lack of stationarity, while they can be explained by the underlying dependence, which is however concealed by the sampling procedure (Serinaldi & Kilsby, 2016b; Serinaldi et al., 2018). In this respect, focusing on monthly, seasonal, and annual values, as done in the literature mentioned above, and using resampling methods that preserve only approximately a fraction of the actual correlation can invalidate preliminary analysis and modeling efforts. This further explains why we used deseasonalized time series instead of seasonally stratified data in the analysis reported in section 7.2. Based on our analysis of CONUS data and previous studies on temporal dependence in temperature records (e.g., Koutsoyiannis, 2003; Maraun et al., 2004; Percival et al., 2001; Stephenson et al., 2000; Vyushin & Kushner, 2009), we cannot exclude that Mauna Loa temperatures are temporally dependent. However, we conservatively assume that the i/id hypothesis holds true in time for the year‐to‐year calendar‐day values.

Nonetheless, we cannot overlook the cross correlation of the 365 time series used to compute urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0268 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0269. In fact, the cross‐correlation matrices in Figure 16 show that the pairwise correlation is not negligible, especially for neighboring calendar days (values around the secondary diagonal). In other words, the 30 observations for a given calendar day (e.g., 1 January) are correlated with the 30 values of neighboring calendar days (e.g., 31 December and 2 January).

wrcr23461-fig-0016
Cross‐correlation matrices of size 365 × 365 showing the pairwise correlation values of 30‐year temperature time series recorded in each pair of calendar days at Mauna Loa from 1977 to 2006.

To assess the impact of the cross correlation, we simulated samples of size 30 from a 365‐dimensional standard multivariate Gaussian distribution with correlation matrix equal to the near positive definite version of the empirical correlation matrices. Figure 17 confirms the conclusions of Newman et al. (2010) under i/id hypothesis, showing that the values of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0270 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0271 fall outside the i/id CIs. However, conclusions dramatically change under cross correlation, which substantially inflates the variability of the expected values and the width of CIs. Accounting for dependence, the observed urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0272 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0273 are no longer incompatible with fluctuations of a stationary process. We do not claim that dependence caused the observed behavior of RB occurrences but only that such a behavior is consistent with a stationary process different from i/id. Therefore, the existence of an alternative plausible description of the observed patterns of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0274 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0275 calls into question conclusions and possible attributions previously reported in the literature mentioned above and indicates that a more careful investigation and use of statistical analyses are required.

wrcr23461-fig-0017
Each panel shows the empirical cumulative average number of RB events for Tmax and Tmin time series recorded at Mauna Loa, where the cumulative sum is taken over the 30 calendar years from 1977 to 2006 and the average across the 365 calendar days. Empirical estimates on the observed data are complemented with the theoretical expectations and 95% CIs under independence and 95% CIs preserving dependence between the values recorded at different calendar days. The latter CIs are obtained by Monte Carlo simulation described in the text. Results refer to record‐breaking low Tmax (panel a) and high Tmin (panel b). RB = record‐breaking; CI = confidence interval.

8 Discussion and Conclusions

8.1 Discussion

In this study, we have studied analytical distributions of the number of POT and RB events over specified temporal windows starting from the beginning of the period of record. While this is useful for exploratory analysis and general modeling, for design purposes it can be of interest to know the distribution of Z over a future design life given the information collected in the period of record. This requires the development of conditional distributions. Since POT and RB events are characterized by several properties, such as magnitude, differences of magnitude between consecutive observations, interarrival times, and number of events, different conditional distributions can be defined. For instance, defining the distribution of Z conditioned on the magnitude of the last recorded event requires information on the distributions of the absolute values of the process X. To our knowledge, in these cases analytical solutions are not straightforward even for relatively simple cases of i/idX. Of course, the problem can be addressed by case‐specific Monte Carlo simulations. Examples of this approach concerning conditional interarrival times of POT events and conditional block maxima under ni/id assumption are given by Eichner et al. (2011, and references therein).

Even though this work describes results for POT and RB processes referring to the case ni/id and specific dependence structures (i.e., fGn and Markov), we do not endorse any particular model or assumption. As explicitly stated at the end of the case studies, our message is that the empirical observations can be consistent with ni/id as well as i/nid models reported in the previous literature, where the latter models are justified by inconsistency between observations and i/id or alternative assumptions that do not properly account for correlation. Since rejection of i/id cannot lead to acceptance of i/nid because other options are available, and we cannot discriminate among alternative descriptions, attribution procedures based on the logic of mutually exclusive frameworks (in the spirit of common interpretation of null hypothesis statistical testing) might be inconclusive and should be taken with great care, if not discarded (e.g., Serinaldi et al., 2018, and references therein). In other words, when we observe discrepancies between observed and theoretical patterns, we should bear in mind that the theoretical values refer to a specific (not necessarily well‐devised) reference hypothesis (e.g., i/id) and that statements such as “a causes b” can make sense only if we can exclude any other reasonable explanation. Until we can say “a can be compatible with b, c, d,...,” we can only choose one of the possible options, bearing in mind parsimony, and other (common sense) criteria.

For a correct interpretation of our empirical results, it is worth recalling the two main step of the statistical inference well summarized by von Storch and Zwiers (2003):
  1. “A statistical model is adopted that supposedly describes both the stochastic characteristics of the observed process and the properties of the method of observation. It is important to be aware of the models implicit in the chosen statistical method and the constraints those models necessarily impose on the extraction and interpretation of information.”
  2. “The observations are analyzed in the context of the adopted statistical model.”

According to these principles, it is clear, for instance, that our estimates of H or ρ1 are based on the assumption that the underlying processes are fGn or Markov. More generally, every inferential procedure relies on an assumed model (e.g., i/id, ni/id, linear, and nonlinear). It is also obvious that the inferential results are generally biased if the assumed theoretical process is misspecified. For example, we can obtain a full range of H and ρ1 values for time series drawn from correlated processes different from fGn and Markov. These estimates make sense only under the assumption that fGn and Markov provide a reasonable description of the observed process. In this context, our emphasis on the ni/id framework versus i/nid relies on the fact that the latter requires much stronger assumptions, involves additional model uncertainties, and makes many results of classical statistical inference questionable, or even meaningless/undefined (e.g., Serinaldi & Kilsby, 2015; Serinaldi et al., 2018). Therefore, the ni/id should always be considered as a parsimonious alternative to i/nid to describe possible discrepancies from i/id, and when observations agree with both ni/id and i/nid descriptions, we can only conclude that the evidence is not enough for a clear choice and unambiguous attribution.

The remarks above are related to the problem of recognizing and identifying dependence, nonstationarity, and other properties from data. For example, it is well known that very large sample sizes are required to reliably identify long‐range persistence and estimate H from observed time series (e.g., Koutsoyiannis & Montanari, 2007). Inference becomes even more difficult when dealing with binary processes Y describing POT and RB occurrences, as the dichotomization removes the information of the absolute values of the parent process X. To mitigate this effect, we did not apply monthly or seasonal stratification in the analysis of CONUS temperature and precipitation, and spatiotemporal dependence structures were estimated on the parent process X and then suitably deflated by equation 27 to obtain those of Y. Moreover, it is known that correlation can influence the statistics of extreme values yielding, for instance, spatiotemporal clustering of RB, POT, or block maxima (e.g., Bogachev & Bunde, 2012; Eichner et al., 2011; Serinaldi & Kilsby, 2016b) . On the other hand, it is generally difficult to retrieve the underlying correlation structures only from extreme events, which often appear to be approximately independent because of downsampling effects of data selection and consequent removal of nonextreme data providing information on correlation (e.g., Serinaldi et al., 2018). In these cases, ni/id can be confused with i/nid and vice versa, as the data are not enough to draw conclusions, and further information does need to be collected. As shown in the case studies, in these circumstances, endorsing a particular model or assumption and their consequences can be questionable, while a more balanced approach implies the use of multiple schemes, bearing in mind their properties, shortcomings, and suitability for the problem at hand.

8.2 Concluding Remarks

The occurrence of RB events and, more generally, extreme values of hydroclimatic variables such as temperature, precipitation, or stream flow attract attention from the media (King, 2017), especially in light of the possible connection with anthropogenic influence. This explains the recent rapid evolution of methods devised to assess possible systematic changes of rate of extreme events. Among these methods, we can mention the so‐called event attribution, which “compares the occurrence probability of an event in the present, factual climate with its probability in a hypothetical, counterfactual climate without human‐induced climate change” (Hauser et al., 2017). Detection and attribution heavily rely on climate model simulations and are therefore subject to their limitations (King, 2017). In particular, the choice of model, counterfactual climate, and boundary conditions can lead to contradicting conclusions (Hauser et al., 2017). In this context, uncertainty and the nature of statistical methods used to summarize information play a key role. Uncertainty can refer to multiple aspects, going to sample uncertainty (i.e., limited size of sequences of reliable systematic observations, which call the assessment of the so‐called natural variability into question), to model uncertainty (including model structure and underlying assumptions).

In this study we have therefore attempted to propose a set of tools enabling an analysis of POT and RB occurrences overcoming too restrictive i/id hypothesis and accounting more consistently for spatial and temporal dependence. In particular, we showed that the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0276 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0277 distributions allow for readily assessing the probability of the number of POT and RB events, respectively, over n time steps and m locations, with minimal information and avoiding extensive simulation. On the other hand, the BetaBitST generator enables the simulation of binary random processes with general spatial and temporal correlation structures, resulting in binary sequences/fields characterized by spatial and temporal clustering, which is often attributed too superficially to external factors.

Our validation of urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0278 and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0279 showed that spatiotemporal dependence has very limited or no impact on the expected number of POT and RB events but strongly affects the shape of their distribution, resulting in so‐called overdispersion, that is, a substantial increase of the probability to observe a number of events larger (smaller) than the mean compared with the i/id reference. Therefore, for a given location, we can expect, for instance, periods showing clusters of POT and RB events followed by periods with no events. Similarly, for a fixed date, POT and RB events will tend to occur in nearby locations forming hot spot and event‐free areas.

As shown in the case studies, the use of different benchmark assumptions involving spatiotemporal dependence lead to conclusions completely different from those resulting from analyses based on i/id hypothesis or underrepresented dependence, thus making the departure from theoretical averages less surprising than expected. Referring to 2017 Houston flood caused by Hurricane Harvey, Montz (2017) posed the question “While the flooding may have been unprecedented, was it unexpected?” As mentioned above, the answer to this question depends on many factors and “uncertainty abounds and the generally available information…does not address the uncertainty well, if at all” (Montz, 2017). However, a key aspect is how we use the available information and what statistical models and underlying assumptions we use to quantify uncertainty. Inefficient use of the data can prevent the retrieval of valuable information, while the choice of models based on inappropriate or too restrictive hypotheses (e.g., i/id) can lead to incorrect conclusions. Following the rationale of Serinaldi and Kilsby (2015) and Serinaldi et al. (2018), this study showed the huge impact of these assumptions and provides statistical tools to analyze POT and RB occurrences under reasonable and more challenging reference hypotheses.

Acknowledgments

The authors were supported by the Engineering and Physical Sciences Research Council (EPSRC) grant EP/K013513/1“Flood MEMORY: Multi‐Event Modelling Of Risk & recoverY”, and Willis Research Network. CONUS data are provided by the National Oceanic and Atmospheric Administration (NOAA) through the U.S. Climate Divisional Database (nClimDiv data set; http://www.ncdc.noaa.gov/cag/timeseries/us), while Mauna Loa data are freely available from NOAA Earth System Research Laboratory Global Monitoring Division (ESRL‐GMD; https://www.esrl.noaa.gov/gmd/dv/data/). The authors wish to thank the eponymous reviewer Simon‐Michael Papalexiou (University of California, Irvine) and two anonymous reviewers for their insightful remarks and constructive criticisms. In particular, we are grateful to one of the reviewers for introducing us to the work of George L.S. Shackle and some seminal papers of Nicholas C. Matalas. The analyses were performed in R (R Development Core Team, 2018).

    Appendix A: Dependence Structures

    The performance of the urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0280 distribution is tested by generating binary sequences with ACF corresponding to two widely used stationary processes, that is, the fGn and the Markov process. The former, also known as Hurst‐Kolmogorov process (e.g., Koutsoyiannis, 2010), is characterized by the following ACF:
    urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0281(A1)
    which exhibits a power law decay urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0282. For 0.5 < H < 1the process is positively correlated and exhibits long‐range dependence, while it reduces to white noise for H = 0.5. As a second example, we consider a process with short‐range Markovian dependence, which is characterized by exponentially decaying ACF of the form
    urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0283(A2)
    where 1/γ is the correlation radius and urn:x-wiley:wrcr:media:wrcr23461:wrcr23461-math-0284 is the lag‐one autocorrelation coefficient.

      Number of times cited according to CrossRef: 17

      • Probability Distribution of Waiting Time of the kth Extreme Event under Serial Dependence, Journal of Hydrologic Engineering, 10.1061/(ASCE)HE.1943-5584.0001923, 25, 6, (04020025), (2020).
      • Change point detection of flood events using a functional data framework, Advances in Water Resources, 10.1016/j.advwatres.2020.103522, (103522), (2020).
      • Projecting the future of rainfall extremes: better classic than trendy, Journal of Hydrology, 10.1016/j.jhydrol.2020.125005, (125005), (2020).
      • A stochastic rainfall model that can reproduce important rainfall properties across the timescales from several minutes to a decade, Journal of Hydrology, 10.1016/j.jhydrol.2020.125150, (125150), (2020).
      • All in order: distribution of serially correlated order statistics with applications to hydrological extremes, Advances in Water Resources, 10.1016/j.advwatres.2020.103686, (103686), (2020).
      • Detecting Flood‐Rich and Flood‐Poor Periods in Annual Peak Discharges Across Europe, Water Resources Research, 10.1029/2019WR026575, 56, 7, (2020).
      • Analysis and Modeling of Precipitation Intermittency by Compound Markov‐DARMA Models, Water Resources Research, 10.1029/2019WR025522, 56, 2, (2020).
      • Trends, change points and spatial variability in extreme precipitation events from 1961 to 2017 in China, Hydrology Research, 10.2166/nh.2020.095, (2020).
      • Simulation of Non-Gaussian Correlated Random Variables, Stochastic Processes and Random Fields: Introducing the anySim R-Package for Environmental Applications and Beyond, Water, 10.3390/w12061645, 12, 6, (1645), (2020).
      • On the Exact Distribution of Correlated Extremes in Hydrology, Water Resources Research, 10.1029/2019WR025547, 55, 12, (10405-10423), (2019).
      • Future floods using hydroclimatic simulations and peaks over threshold: An alternative to nonstationary analysis inferred from trend tests, Advances in Water Resources, 10.1016/j.advwatres.2019.103463, (103463), (2019).
      • Revealing Hidden Climate Indices from the Occurrence of Hydrologic Extremes, Water Resources Research, 10.1029/2019WR024951, 55, 9, (7662-7681), (2019).
      • A simplified MEV formulation to model extremes emerging from multiple nonstationary underlying processes, Advances in Water Resources, 10.1016/j.advwatres.2019.04.002, (2019).
      • ‘How can we learn to live with floods? Challenges for science and management’: guest editorial introduction, Scottish Geographical Journal, 10.1080/14702541.2019.1667625, (1-4), (2019).
      • The mode of the climacogram estimator for a Gaussian Hurst-Kolmogorov process, Journal of Hydroinformatics, 10.2166/hydro.2019.038, (2019).
      • Hydrologic Record Events, Statistical Analysis of Hydrologic Variables, 10.1061/9780784415177, (491-536), (2019).
      • Revealing hidden persistence in maximum rainfall records, Hydrological Sciences Journal, 10.1080/02626667.2019.1657578, (2019).