Você está na página 1de 16

Environmental Modelling & Software 97 (2017) 243e258

Contents lists available at ScienceDirect

Environmental Modelling & Software


journal homepage: www.elsevier.com/locate/envsoft

A rainfall analysis and forecasting tool


Javier Diez-Sierra, Manuel del Jesus*
gico de Cantabria, 39011, Santander,
Environmental Hydraulics Institute, Universidad de Cantabria - Avda. Isabel Torres, 15, Parque Científico y Tecnolo
Spain

a r t i c l e i n f o a b s t r a c t

Article history: MENSEI-L is a stand-alone software tool for the automatic analysis of pluviometric networks, that also
Received 24 February 2017 provides three-day rainfall forecasts based on weather types. The software tool, implemented in Python
Received in revised form and R, is able to fill missing values in original daily data series and to generate synthetic pluviometers in
19 July 2017
ungauged locations, by means of kriging techniques. MENSEI-L also characterizes punctual and spatial,
Accepted 23 August 2017
average and extreme distributions of precipitation for the complete pluviometric network. Tenerife
(Canary Islands, Spain) is used as study site to evaluate the capabilities of MENSEI-L and the implicit
rainfall analysis methodology that it implements. MENSEI-L proves to be a useful tool to extract infor-
Keywords:
Rainfall modeling
mation from dense observation networks where manual analysis is not practical.
Weather types © 2017 Elsevier Ltd. All rights reserved.
Spatial analysis
Kriging
Rainfall extremes

1. Introduction into one of the most dangerous natural hazards (Blo €schl et al.,
2008).
Rainfall is the main driver of the continental portion of the Rainfall is measured by pluviometers, which only capture rain-
water cycle below parallel 50+ . Rainfall characterization is fall data for a very small portion of the territory, and thus, tend to be
important for an accurate modeling of other hydrological fluxes deployed in networks to cover larger areas. On top of collecting
such as runoff and evapotranspiration, as rainfall controls their rainfall data for a large area, pluviometric networks provide useful
statistical behavior, both average and extreme distributions. Even information for the spatio-temporal characterization of rainfall and
more, in small arid and semiarid watersheds, where runoff and are thus important to inform engineering design and decision-
discharge measurements may not be practical, an accurate spatio- making processes. In addition, the availability of dense pluvio-
temporal rainfall characterization becomes an essential part of the metric networks offers the opportunity to build short-term sto-
assessment of hydrological fluxes. These fluxes may be required chastic rainfall forecasting systems making predictions that cover
for infrastructure design or risk assessments, effectively turning the entire area of interest. Rainfall data collection and treatment,
rainfall into an important part of the design and the decision- however, presents a series of complications. Measurements for
making processes (Xu et al., 2010; Paixao et al., 2011; Arnbjerg- specific events may be missing, due to equipment malfunctions or
Nielsen et al., 2013; Renschler et al., 1999; Sugiyama et al., 1995). other reasons, creating discontinuities in the time series. Equip-
Furthermore, the incorporation of quantitative precipitation fore- ment upgrades may lead to differences in time resolution for a
casting (QPF) may also be required for many hydrological appli- given gauge for different time periods. Newer gauges, with better
cations especially for real-time flood forecasting, as it is a critical quality data, provide shorter time series, limiting their usability.
component of public safety and quality of life. The potential risk Data collection from pluviometric networks leads to generating
associated to flash floods is related to two main characteristics: large databases from which manually extracting and interpreting
their rapid occurrence and to the spatial dispersion of the areas information is complicated and time consuming. Moreover, the
affected by these floods. Both characteristics limit the ability to implementation of statistical short-term forecasting systems often
issue timely flood warnings (Borga et al., 2011) turning flash floods leads to deal with sophisticated techniques and large atmospheric
databases. For these reasons, statistical tools, machine learning
techniques and software applications become indispensable to
* Corresponding author.
convert rainfall and atmospheric data into information, ready to
E-mail address: manuel.deljesus@unican.es (M. del Jesus). support professional practitioners and managers.

http://dx.doi.org/10.1016/j.envsoft.2017.08.011
1364-8152/© 2017 Elsevier Ltd. All rights reserved.
244 J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258

Nowadays, different software applications exist to assist tech- the implementation details of MENSEI-L, to finally close with future
nicians in the process of rainfall analysis. STORMS 2010 (Sabourin developments, limitations and the conclusions of the work.
and Associates Inc., 2010) is a commercial application able to
analyze local rainfall information to fit average and extreme dis- 2. Description of the study area and the information sources
tributions. It computes Intensity-Duration-Frequency (IDF) curves,
performs intensity trend analysis and is able to create design 2.1. Description of the study area
storms. CLIMATOL (Guijarro, 2016) is a time series homogenization
software distributed as an R (R Core Team, 2015) package under an Tenerife is one of the seven islands that form the Spanish ar-
open source license. It was designed to deal with general climato- chipelago of the Canary Islands. Tenerife (see Fig. 1) is located in the
logical time series, not only with rainfall. Similar software appli- Atlantic Ocean, 300 Km west of the African coast, between parallels
cations are MASH (Szentimrey, 2006), SPLIDHOM (Mestre et al., 28+ N and 29+ N, and between meridians 16+ W and 17+ W. Tenerife
2011) and EXTRAQC (Aguilar and Prohom, 2011), tools designed covers a surface area of 2034:38 Km2 , with a population of 891; 111
for homogenizing and checking the internal coherence of daily time inhabitants. It is the largest, and the most populated of the Canary
series, and AnCli (Stepanek, 2005), which also provides statistical Islands and of the whole Macaronesia region.
characterization tools. Tenerife displays a strong topographic gradient from the coast to
Software libraries and packages have also been developed for the central part of the island. The central region of the island is a
the treatment of rainfall data. One interesting example is reddPrec mountain range, where El Teide (3:718 m) is the highest peak, that
(Serrano-Notivoli et al., 2017), an R package that contains a set of separates the island in two well differentiated climatic areas: a
functions for controlling the quality of rainfall time series, filling northern region, relatively wet due to the effect of the Trade winds,
missing values and creating virtual series (and gridded datasets) in and a southern region, drier due to the blocking induced by the
ungauged locations. Other softwares such as RainSim (Burton et al., orography. Spatial heterogeneities of climate are also present
2008) or MRS (Mehrotra et al., 2015a) are able to simulate sto- within each of the regions, configuring a widely varied range of
chastic rainfall at multiple locations preserving the most important climates within a relatively small area.
statistics (Burton et al., 2013; Kretzschmar et al., 2014; Mehrotra The precipitation in Tenerife is very scarce, with an annual
et al., 2015b). average of 233 mm for the whole island. However, its temporal and
To the best of the author's knowledge, there are not too many spatial distribution is very heterogeneous since a very high per-
tools specifically developed to perform short-term rainfall fore- centage of precipitation falls in short periods of time and with a
casting (Neiman et al., 2009; Wright et al., 2017). In most cases great spatial inhomogeneity. The singular orographic and climatic
rainfall forecasting is dealt with by means of numerical weather characteristics of the island combined with the urbanization pro-
prediction models or the extrapolation of remote sensing obser- cesses carried out in the last years have transformed flash floods
vations (radar data and satellite images) to the current weather into a serious problem. If fact, in the twentieth century, almost one
conditions. Unfortunately, while the first do not seem able to pro- hundred floods were recorded in Tenerife causing important losses
vide accurate rainfall forecasts at the temporal and spatial resolu- both materials and humans.
tion required by many hydrologic applications, neither the outputs
from satellite or radar images allow a satisfactory assessment of 2.2. Description of data sources
rainfall intensity yet (Toth et al., 2000; Krzysztofowicz, 1995; Liu
et al., 2013; Moser et al., 2015). Information regarding synoptic patterns of atmospheric vari-
All the reviewed applications and techniques serve to charac- ables is obtained from two sources. On the one hand, historical
terize, homogenize and interpolate punctual rainfall time series, to information is obtained from the global reanalysis NCEP Climate
generate stochastic rainfall series or to forecast rainfall. However, Forecast System Reanalysis (CFSR) (Saha et al., 2010), together with
no software package provides an integrated methodology to its continuation, the Climate Forecast System Version 2 (CFSV2),
automatically analyze the spatio-temporal characteristics of rainfall developed by the National Oceanic and Atmospheric Administra-
for a pluviometric network and generate short-term rainfall fore- tion (NOAA). In this paper, the term CFSR will be used to refer both
casting. This paper presents such methodology, implemented into databases, CFSR and CFSV2. CFSR is a third-generation reanalysis
an automated rainfall analysis software tool called MENSEI-L, that database, generated with a global high-resolution land-ocean-at-
allows to generate point and spatial characterizations of average mosphere model coupled with an ice sheet model. CFSR provides 6-
and extreme distributions of rainfall, as well as to make short-term hourly data. The spatial resolution of the atmospheric model is
stochastic rainfall forecasts using an analogs method based on 38 Km horizontally, with 64 vertical levels. The spatial resolution of
weather typing. An application to Tenerife (Canary islands, Spain) is the oceanic model is approximately 0:25+ near the Equator, and
used as a calibration and validation step for the methodology 0:5+ beyond the Tropics, with 40 vertical levels. The land model
developed in the present work. counts with 4 soil levels, and the ice model with 3 levels. CFSR and
In order to make the exposition clearer and easier to follow, the CFSv2 together cover the period 1979e2015.
theoretical basis of each section will be presented first and the On the other hand, forecasting information is obtained from the
specific implementation for the Tenerife study case will follow. For Global Forecast System (GFS), also developed by NOAA. GFS pro-
this reason, the paper begins with a description of the study area vides 16 days-into-the-future forecasts with a model similar to the
and the databases used in the study. Then, a description of the base one used to generate CFSR.
methods used to carry out the rainfall analysis is presented. These Rainfall data is provided by Consejo Insular de Agua de Tenerife
methods are grouped in three categories: the generation of (CIATF), the water planning and managing agency for Tenerife Is-
weather types, the characterization of point statistics of rainfall, land. CIATF maintains a database of rainfall observations (Melia n
and the implementation of a spatio-temporal interpolation scheme et al., 2011) that includes the observation network of Agencia
based on geostatistics. The paper continues presenting the work Estatal de Meteorología (AEMET), the Spanish national meteorolog-
flow followed to carry out the rainfall analysis, explaining how the ical agency, and AgroCabildo, a local agency for agriculture devel-
basic methods are used to construct robust rainfall analysis tools. opment. The CIATF database contains information for 125 gauges,
Afterwards, the scheme of the stochastic rainfall forecasting system combining daily and sub-daily values, with an average coverage of
is described. The main body of the paper ends with a description of 15 years, although the longest series start in 1890.
J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258 245

Fig. 1. Tenerife island location.

CIATF also provided a digital elevation model (DEM) of 5 m of the atmosphere that are selected in such a way that they
spatial resolution for the whole island. constitute an optimal basis to express the state of the atmosphere
and its natural variability through a reduced ensemble of situations.
3. Methodology For the purposes of this study, the state of the atmosphere at a
specific time is represented by the combined fields of sea level
The rainfall analysis methodology and MENSEI-L's imple- pressure (SLP), elevation of the 500 hPa geopotential surface
mentation are articulated around three major procedures. The first (GH500) and elevation of the 850 hPa geopotential surface
one is the generation of weather types, that serves to classify the (GH850). SLP, GH500 and GH850 fields, that cover the area between
time series of climatic conditions into a limited number of repre- 45+ W and 5+ E, and 20+ N and 50+ N, are extracted from the CFSR
sentative synoptic patterns. These patterns capture most of the and CFSv2 databases (6-hourly information over a global 0:5+ x
observed variability and serve to cluster together similar climatic 0:5+ mesh) and aggregated to a daily temporal resolution. The se-
situations (Huth et al., 2008). Weather types are computed for at- lection of variables, as well as their area coverage, were chosen
mospheric variables and for rainfall. based on analysis found in literature (Herrera et al., 2001; Tullot,
The second procedure is the computation of point rainfall sta- 1959).
tistics, i.e. the analysis of isolated rainfall time series. Average and Weather types are computed through a two-step process. In the
extreme distributions are both considered. An automatic procedure first place, a principal components analysis (PCA) (Abdi and
is developed to select the best fitting parametric distribution in Williams, 2010) is applied to reduce the dimensionality of the
each case. data. Afterwards, a clustering or a selection algorithm is applied to
The last procedure is spatio-temporal interpolation, that makes the reduced-dimensional dataset to determine the optimal
use of geostatistical methods. Rainfall measurements are taken at weather types. Several algorithms can be used to determine the
specific locations, but some data may be missing, or may be optimal weather types: k means (KMA), maximum dissimilarity
required at locations different from where the measuring devices algorithm (MDA) and self-organized maps (SOM). The different
are deployed. Both situations require an interpolation procedure algorithms may provide different weather types, all of them
able to make use of all the information in the database to provide optimal according to the criteria implicitly or explicitly established
robust estimates. by each specific algorithm. Further detail about clustering algo-
It is impotant to notice that the methodologies applied in rithms and their application to climate characterization can be
MENSEI-L have been developed and validated at a daily temporal found in Camus et al. (2011). In the present study, the KMA algo-
resolution; due to the natural heterogeneity of precipitation at rithm is used (as implemented in the scikit-learn python library,
small aggregation scales, it is likely that the higher the temporal Pedregosa et al. (2011)), as it best serves the purpose of grouping all
resolution, the less accurate the results will be. the observed situations into homogeneous groups for statistical
The following sections describe in detail these three major purposes.
procedures that constitute the basis of the analysis. Fig. 2 shows the SLP component of the weather types deter-
mined for the rainfall analysis in Tenerife. The total number of
3.1. Generating weather types weather types, 81 in this case, is selected by means of a sensitivity
analysis of the storm type statistics to the number of weather types.
Atmospheric weather types. Atmospheric weather types, weather The number of types is selected to minimize the total variance of
types from now on, are representative synoptic patterns of the state the rainfall clustering, as well as the intragroup variance.
246 J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258

Fig. 2. SLP Weather types. Image created with MENSEI-L.

Storm types. In the present work, storm types are the weather parametric distributions are fitted for all the data.
types corresponding to representative rainfall patterns. Storm Average distributions. Average distributions aim to characterize
types could be determined in the same way atmospheric weather the behavior of the complete population of a variable. The precip-
types were: by a combination of a dimensionality reduction and a itation amount distribution tends to be strongly skewed at a daily
clustering algorithm. However, to be able to make stochastic rain- time scale, and is commonly assumed to be gamma distributed
fall forecasts, storm types are defined as the representative rainfall (Ben Alaya et al., 2017; Stephenson et al., 1999; Yang et al., 2005).
patterns corresponding to every weather type. A storm type is the Nevertheless, some authors suggest that the gamma distribution
average rainfall probability and the average rainfall depth pattern can be unsuitable for modeling precipitation extremes since it is
induced by all the atmospheric situations contained within a very restrictive and cannot account for features like heavy tails.
weather type. It is assumed that with a sufficiently high number of Therefore, to treat this issue, average distributions are fitted to a
weather types, every weather type induces a narrow collection of collection of parametric distributions that includes: the lognormal
rainfall patterns that can be properly captured and represented by distribution, the Pearson-III distribution, the Generalized Extreme
their average. When storm types are defined this way, their gen- Value (GEV) distribution, the Rayleigh distribution, and the Gamma
eration does not involve additional classifications, it derives from distribution (Serinaldi and Kilsby, 2014).
the initial classification done for the atmospheric conditions. It is important to remark that GEV is fitted to the complete
Fig. 3 shows the spatial distribution of rainfall probability for population as a general three parameter distribution, without any
every storm type. Each colored point corresponds to the rainfall specific treatment of extremes. Distributions are fitted by the
probability (total depth larger than 0.1 mm) of a pluviometer. It can maximum likelihood method. The best-fit distribution is selected
be seen that there are storm types (6, 7, 8, 9, 41 and 66) with greater with the Akaike Information Criterion (AIC) (Akaike, 1974; Aho
rainfall probability. These storm types correspond to specific et al., 2014).
weather type such as cold depressions or low-pressure systems Extreme distributions. Extreme distributions aim to characterize
from temperate zones. The 81 weather types and their corre- the largest values of the population of a variable. They are impor-
sponding storm types, allow to explore the whole spectrum of tant in infrastructure design and risk assessment because they
synoptic patterns that affect Tenerife island. The figure also shows generate the most demanding loads or impacts on the systems.
that storm types represent different spatial patterns of rainfall, Extreme distributions are assumed to follow a generalized extreme
indicating that the total number of types considered is able to value (GEV) distribution. The GEV encompasses, in a single func-
properly capture the overall variability. tional form, the Weibull, the Gumbel and the Fre chet distributions
of extremes. GEV fitting is done for annual maxima by means of the
maximum likelihood method.
3.2. Computing point statistics

Rainfall data captured by pluviometers are kept in time series; a 3.3. Interpolating rainfall
collection of paired values that register the time of occurrence
together with the depth or intensity of rainfall. Time series data Spatio-temporal interpolation is an important part of charac-
may be directly summarized through some statistics (average, terizing rainfall patterns over a territory. Observations in time se-
standard deviation, lag-one correlation, etc). However, a complete ries may be missing due to multiple causes: equipment
statistical description of the data is normally useful and thus malfunction, transcription errors, information transfer errors,
J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258 247

Fig. 3. Storm types of rainfall probability. Image created with MENSEI-L.

discontinued observation locations, etc. Data may not even exist at In this context, geostatistic kriging techniques were selected for
a location where they are needed because no pluviometer was ever the present study because (i) several authors (Tabios and Salas,
deployed there. Spatio-temporal interpolation serves to complete 1985; Phillips et al., 1992) have shown that kriging provides bet-
missing value in time series and to create virtual pluviometers, to ter estimates of rainfall than conventional methods when there is a
improve data coverage over the territory, allowing a better statis- high density of stations (as it is the case of Tenerife) and (ii) there
tical characterization of the spatial distribution of rainfall. are software libraries specifically designed to be implemented in a
There are many statistical techniques in the literature that can simple way.
be used to interpolate precipitation such as: Kriging, Inverse Dis- Geostatistics, discipline that is based on the theory of region-
tance Weighting (IDW), Spline functions and Multiple regression alized variables (Journel and Huijbregts, 1978; Goovaerts, 1997,
with residual correction, among others (Ninyerola et al., 2007; New 2000), is increasingly preferred to other statistical techniques
et al., 2000; Wagner et al., 2012). Interpolation techniques are because it allows to account for the spatial correlation among
methods that are commonly used to predict precipitation at a given neighboring gauges to make predictions at ungauged locations.
point using the available adjacent information. One of the major Rainfall interpolation follows the two-step process presented in
drawbacks of interpolation techniques is that the results are Herrera et al. (2012): first, the interpolation predicts the rainfall
smoothed and that they are not able to capture the spatial rainfall state of the location, wet or dry; then, in the case that a wet state
heterogeneity associated to local processes. To deal with those is- has been predicted, the total amount of rainfall is interpolated.
sues, some variants of kriging, such as Cokriging or Universal Different flavors of kriging were tested to be used as the spatio-
kriging, multiple regression with residual correlations and other temporal interpolation method and the best one was selected
techniques like PRISM (Parameter-elevation Relationships on In- through cross-validation. Ordinary kriging, universal kriging using
dependent Slopes Models) (Isotta et al., 2014; Daly et al., 2008) use elevations as the drift term, universal kriging using precipitation
external information (elevation, coastal proximity, topographic averages as drift and universal kriging using both elevation and
facet orientation, hydrometeorological variables, etc.) to improve precipitation as drifts, were tested. Cross-validation for each of the
the results. methods used the leave-one-out approach: all the gauges but one
Cokriging is used to take advantage of the covariance between were used to reconstruct the information on the missing one. This
two or more regionalized variables that are related to rainfall; process was repeated for all the gauges and an average recon-
Universal kriging is a variant of the Ordinary Kriging with a local struction error was computed. Universal kriging using elevation
trend; Multiple regression with residuals is a combination of sta- and average rainfall information was selected as it minimized the
tistical (multiple regression) and spatial interpolation (splines and average reconstruction error.
inverse distance weighting) tools; and PRISM calculates a regres- The error scores were computed for every station by comparing
sion between the climate variable and the elevation for each point, the observed values with the corresponding gridded one in the
assigning a weight to every station that is based primarily on the period 1950e2016. In particular we used bias, root-mean-square
physiographic similarity of the stations. error (RMSE) and Pearson correlation coefficient (R) as validation
248 J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258

measures; both RMSE and R scores were computed on the subset of that particular time are used to interpolate, via kriging, the value at
wet days (Precip > 0.1 mm) in order to reduce the effects of the the location of the gauge missing the observation. This kriging
varying occurrence frequencies. It provided and average Pearson procedure makes use of elevation and average rainfall information,
correlation coefficient R of 0.80 for all the gauges, an average and therefore includes local information, about the specific rainfall
standardized bias of 0.02 mm and a RMSE of 6.7 mm. Fig. 4 shows event, and also global information, the relative amount of precipi-
the spatial distribution of the above scores for the kriging method tation that the interpolation location receives. The creation of a
over Tenerife island. In general, a very good agreement between the virtual pluviometer is equivalent to filling the missing values of a
interpolated and recorded series, including the correlations, is gauge that contains no observations.
observed. Fig. 4 also shows that bigger errors are registered in re-
gions with complex topography or in areas where the density of 3.4. Forecasting rainfall
stations is lower.
In recent years, time-series analysis, such as Generalized linear
(a) Bias/Std models (GLM), Stochastic auto-regressive moving-average models,
(b) Root Mean Square Error (pp > 0.1 mm) Logistic regression or Artificial intelligence methods (LogitBoosting,
(c) Correlation (pp > 0.1 mm) Random Forest, Support Vector Machine, Artificial neural networks,
K-nearest-neighbors or Clustering regression methods), have
Kriging requires a specific parametric form of the variogram for become some of the most popular techniques for short-term
the interpolation process. The gstat (Pebesma, 2004) and automap rainfall forecasting. Although these techniques present a limited
(Hiemstra, 2013) libraries used to implement the interpolation prediction skill owing to the low persistence in time that usually
procedure, select the optimal variogram for every interpolation characterizes rainfall, the moderate data availability and reduced
among 20 different ones. Most of the time the circular and expo- computer time required to use them, make their application very
nential variograms are selected as optimal. attractive in the context of real-time flood forecasting (Toth et al.,
The selected interpolation method serves both, as a temporal 2000; Monira et al., 2010). In this context, the analogs method
and a spatial interpolation method. Indeed, temporal interpolation based on weather types was selected for the forecasts due to the
is achieved through spatial interpolation. When a guage is missing reduced computational cost of application and the interpretability
a value for any given time, all the other gauges with information for of the results, which simplifies its application by non experts.

Fig. 4. Cross validation results for the period 1950 to 2016. Scores are the standardized bias of the full series and the RMSE and Pearson correlation coefficient R of the wet days
(precip > 0.1 mm).
J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258 249

The short-term rainfall forecasting system implemented in


MENSEI-L feeds on the atmospheric forecasts provided by NOAA's
GFS model. SLP, GH-500 and GH-850 forecasts are automatically
collected from NOAA for the three following days. The represen-
tative weather type corresponding to the forecast atmospheric
state is determined with the weather type classification developed
above. Forecasts for the rainfall probability and the average rainfall
depth for every period are computed by selecting the storm type
corresponding to the forecasted weather type.
The quality of the forecast is evaluated by a k-fold cross-
validation approach (Markatou et al., 2005), that is based on
dividing the observation period into k subperiods, and recon-
structing every subperiod using the information of the other ones
and evaluating the quality of the reconstruction. The k-fold cross-
validation approach avoids spurious effects of any particular
partition. The reconstruction performance is evaluated comparing
observed and CFSR-generated forecast maps of average rainfall
probability and average rainfall depth (see Fig. 2). It is important to Fig. 6. F-score index for the 17 gauges selected.
note that, as probabilities are involved the reconstructed time se-
ries are not unique. Therefore, reconstruction is repeated 100 times,
and the reconstruction performance is taken to be the average of The figure increases up to 50% for three-day accumulated and up to
the 100 realizations. As an example, Fig. 5 shows the reconstructed 70% when the observed rainfall intensity is greater than 30 mm.
series (red line), using the WT technique, and the observed series Fig. 6 shows the f-score index (Powers, 2011) for the 17 selected
for the year 2010 (blue line) of the “Santiago del Teide” station, gauges. The f-score index is very useful to compare the skill of
located in the northwest part of the island. The reconstructed series model predictions when working with imbalanced datasets
corresponds to one of the 100 simulations. The top panel shows the (datasets in which one class is much more frequent than the
daily series while the bottom panel corresponds to the three-day others), as in the case of Tenerife where rainfall occurrence is rare
moving average. It is possible to observe that the correlation be- (below 15%) and can be placed up to 25% in some gauges located in
tween the reconstructed series and the observed one is greater the north of the island and down to 5% in some gauges located in
once the moving mean is applied, which is expected since the the south. Fig. 6 shows that the skill of the forecast is substantially
reconstruction was performed with the mean values of each better in the north of the Island than in the south. This is probably
weather type. because weather types capture the frontal precipitation, which
In order to evaluate the skill of the forecasts, the 17 most usually occurs in the north of the island, better than the convective
complete gauges covering the period 1979e2014 along Tenerife processes prevailing in the south. A model that hits on average 30%
have been selected (see Fig. 6). Using the average conditions of the of the rainy days and with an average f-score of 0.28 may not seem
storm type to reconstruct rainfall makes bias (of number or rainy very skillful. If rainfall was forecast by a random model that only
days and average annual precipitation) almost zero, and minimizes preserved the probability of rainfall at each station, it would only
RMSE (Precip > 0.1 mm) to a value of 15.6 mm for all gauges. The correctly predict 13% of the days, with an f-score close to 0.13. That
daily rainfall forecasting procedure shows an average accuracy over is, it would perform only half as good as our model. If it was forecast
all pluviometers of 30%, combining rainfall occurrence and depth. by a Markov Chain preserving the transition probabilities, it would

Fig. 5. Reconstructed serie (red line), using WT technique, and the observed one (blue line) of the station know as “Santiago del Teide” located in the northwest part of the island for
the year 2010. The reconstructed series corresponds to one of the 100 simulations. The top panel shows the daily serie while the bottom panel corresponds to the three-day moving
average. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
250 J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258

correctly predict about 20% of the day, leaving an f-score of 0.23, Therefore, the forecasting system, although not accurately pre-
which is still one third worse than our model, and in this case the dicting rainfall occurrence, is still useful for early warning of
next day forecast is made based on the observed rainfall the pre- flooding events, which in Tenerife present flash flood
vious day, performance would decrease if computed for the three characteristics.
day forecast.
On the positive side, most flooding events coincided with a 4. Rainfall analysis work flow
reduced set of weather types. This is the case of the flash flood
events registered on the dates of 4-12-1991, 13-12-1995 and 31-03- The present section describes how the different tools presented
2002 which fell in the weather types 37, 27 and 12 (see Fig. 7) and in the previous section are combined to generate a work flow for
with 300, 220 and 230 mm/day of rainfall, respectively. All of them rainfall analysis. Fig. 8 depicts a schematic representation of the
corresponding to atmospheric systems known as cut-off low methodology. The description assumes the existence of a database
pressure system. This figure shows the 35 rainfall events with an of precipitation observations. The main steps of the work flow are:
intensity higher than 200 mm/day in the period between 1979 and (1) quality control of the rainfall series, (2) selection of the refer-
2014. Blue and orange bars correspond to the number of events ence series, (3) creation of a spatial map of average precipitation,
with an intensity greater than 200 mm/day and to the associated (4) reconstruction of time series, and (5) statistical analysis of
probability of occurrence of each weather type respectively. These rainfall.
35 events fell in just 15 weather types whose probability in all the Controlling the quality of time series. The spatio-temporal rainfall
cases, except two, is below the average 100=81 ¼ 1:23 (black hor- analysis begins with a quality control of all the rain gauges in the
izontal line). This means that special attention should be paid to the database. Length of record, number of missing values and number
weather types with a high number of extreme events and a low of outliers (defined as values more than five standard deviations
probability of occurrence (e.g. 18 and 27), since they will have great above the mean without concomitant occurrences on several
probability of generating torrential rains. gauges) were considered to evaluate the quality of each series. Only

Fig. 7. 35 rainfall events with an intensity higher than 200 mm/day in the period 1979e2014. Blue and orange bars correspond to the number of events with an intensity greater
than 200 mm/day and to the associated probability of occurrence of each weather type respectively. Black horizontal line correspond to the mean probability of occurrence for each
WT. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258 251

Fig. 8. Workflow of the methodology.

gauges with more than 10 years of information and less that 10 where missing values appear and new virtual gauge can be created
missing values per year were considered to compute the spatial wherever required. This process ensures complete rainfall time
map of average precipitations. series from the initial date of reconstruction of the database.
Creating the spatial map of average rainfall. The spatial map of It is important to note here that information from all gauges is
average precipitation is constructed using universal kriging with used to create the spatial maps of rainfall; even those not used to
elevation because this method minimized the average reconstruc- create the map of spatial rainfall averages. Therefore, the daily
tion error. Only the reference gauges, those with the best quality kriging reconstruction of rainfall incorporates all the information
data, are used for this reconstruction. available.
Reconstructing time series. After having constructed the spatial Analyzing rainfall statistics. With a filled rainfall database, point
map of rainfall averages, the database is inspected to detect missing rainfall characterization is a straightforward procedure using the
values. Observation time series may begin in different dates. Newer tools described in section 3.2. Spatial statistics are created by
time series would present missing values from the starting date of interpolation of the point analysis carried out for all the gauges in
the oldest time series until their starting date. The kriging inter- the domain.
polation scheme requires a minimum number of gauges for accu-
rate interpolation. Therefore, the database reconstruction 5. Software tool implementation
procedure starts from the earliest date for which a minimum
number of observations, homogeneously distributed in space, are The rainfall analysis described in the previous sections has been
consistently ensured. In the case of Tenerife island, the recon- implemented into a software tool called MENSEI-L (see Fig. 9) that
struction starts at 1950, the moment from which at least 25 ob- simplifies and automatizes the application of the methodology. It is
servations are always present. implemented in python (Python Software Foundation, 2016) and R
Database reconstruction is done through spatial interpolation as (R Core Team, 2015), making use of the wxWidgets library (Smart
described in section 3.3. For a given time, all the gauges with et al., 2005). The software is distributed in a way that Python
measurements are used to reconstruct the spatial field of rainfall for modules are frozen and embedded, so no python installation is
that particular day. The value from this rainfall field is introduced in needed. The R packages required for MENSEI-L are installed directly
every time series with a missing value for the day. Following this by the application once it is initialized. The software was developed
procedure, synthetic observations are provided at every location to be as easy to handle as possible.
252 J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258

Fig. 9. MENSEI-L main window.

Once the installation is complete, the main window of the Each of the modules is divided into a series of submodules through
interface is automatically showed by the application (see Fig. 9). On which all the analysis options are accessed. Fig. 11 shows a sche-
the left-hand panel of the main window the four main modules matic with the main options available in MENSEI-L; these options
explained in the methodology are found: weather types, point and will be described in-depth in the section number 6 Results.
space precipitation distributions, interpolation and forecasting. MENSEI-L needs two input CSV files to work, stations.csv and

Fig. 10. Punctual rainfall regime submodule.


J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258 253

Fig. 11. MENSEI-L structure. Modules and options available from MENSEI-L.

series.csv. The first one contains the identifier of each station (ID), its used in MENSEI-L. The results section is organized in such a way
name, its location (longitude and latitude) and its elevation; while that every figure summarizes the functionality of each of the main
the second corresponds to a CSV file in which each column contain modules: weather types, rainfall distributions module (punctual
the time series of each station. CSV files can be introduced by hand, and spatial), interpolation module and forecasting module.
however due to the technical specifications of the project, MENSEI- Weather types module. This module (see Fig. 11) makes use of the
L is designed to read the information directly from a SQL database. weather types classification to generate all the analysis. The clas-
Once the software is connected to the database, the two CSV files sification results are stored in the application reducing in this way,
are created locally by MENSEI-L. This way, it is possible to work both the space installation requirements and the computational
disconnected from the database, reducing the bandwidth required cost that would take perform this process every time. However, the
by the application. 81 storm types are calculated every time since the user might add
The first time MENSEI-L is used, it only allows the user to choose new additional information or generate virtual stations.
the Complete interpolation option (see Fig. 11). This option accesses The weather types module generates a visualization of the
the database, resamples the rainfall series to a daily temporal res- weather type classification for the atmospheric variables SLP,
olution and performs the interpolation day by day; filling the gaps GH500 and GH850 (see Fig. 2). Furthermore, it generates a visual-
and generating new series in those new stations created by the user ization of the storm types of average rainfall depth and average
in the stations.csv file. The rest of the modules use automatically the rainfall probability associated with each weather type (see Fig. 3).
interpolated information for their analysis. The Complete Interpo- In addition, the user has access to the continuous daily time series
lation option may take a couple of hours so once it has been of weather types which is very useful, for instance, when
completed the user will be able to use the Incremental interpolation comparing each pattern with the historical records of flooding,
(see Fig. 11) option, which directly identifies the new pluviometric droughts or infrastructures damages. The spatial fields of the 81
records added to the database, performs the interpolation for those weather types can be exported to PNG and GIS raster formats with
new records, generates the CSV files and saves the results in the the same spatial resolution of the CFSR database. Moreover the
database. In addition to the CSV files, the application contains the storm types of average rainfall depth and average rainfall proba-
results of the weather type classification, the DEM of the study area, bility can be exported to CSV format.
and a connection file for the database.
One of strengths of MENSEI-L, is that the software interface (a) Time series visualization for the gauge “Orotava Palmeras” in
adapts to the required analysis. For example, in the Punctual rainfall the north of the island. The blue line corresponds to the
regimen submodule (Punctual in Fig. 11), a canvas is shown where reconstructed time series while the dashed red line corre-
gauges can be selected graphically for analysis (see Fig. 10). The left- sponds to the observed one.
hand panel allows to access different analysis, such as time series (b) Medium value analysis of the “Orotava Palmeras” rain gauge.
plotting, average and extreme distribution fitting, and to select the The left-hand panel shows the empirical cumulative distri-
gauge by name. All analysis can be saved to disk for further pro- bution function (CDF), as well as the fitted parametric one
cessing outside MENSEI-L. The upper panel offers different options (Gumbel). On the right-hand panel, a Q-Q plot shows the
for visualization, for instance showing shape files to improve visual empirical data (red dots), as well as the parametric fit (black
aspects, and for image storage in different formats, PNG, CSV and line).
GIS raster. (c) Extreme value analysis of the “Orotava Palmeras” rain gauge.
The left-hand panel shows the empirical cumulative distri-
6. Results bution function (CDF), as well as the fitted parametric one
(GEV). On the right-hand panel, a return-level plot shows the
This section presents the main results and options that can be empirical data (red dots), as well as the parametric fit
254 J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258

(black line). It also shows the values of the fitted distribution (b) Return period rainfall map (T ¼ 100 years).
(GEV).
Interpolation module. As mention before, the Interpolation
Rainfall regime module. This module is divided in two sub- module has two options: Complete interpolation and Incremental
modules: punctual and spatial (see Fig. 11). The first one grants interpolation (see Fig. 11). Complete interpolation has no specific
access to the interface showed in Fig. 10, where any pluviometer display since it just interpolates missing data from time series from
can be selected to plot the complete daily time series, compute the 1950 to the current date. The first time the application is opened,
average distribution or carry an extreme value analysis, all of them the user can only select this option, since the rest of modules feed
for the complete daily series. Fig. 12 shows the resulting plot of on the results generated by the complete interpolation process. To
these principal functionalities of the punctual rainfall submodule. generate a synthetic gauge in a location where there is no real one,
Panel (a), located at the top of the figure, shows the time series of its location and name information must be included in the database
the “Orotava Palmeras” gauge, located in the north of the island. It (stations.csv file); then the software automatically takes it into ac-
can be seen that the original data of the gauge (dashed red line) just count in the interpolation process. In addition, in the case of new
counts with 5 years of records. The missing value estimation records being incorporated to the database, the Incremental
implemented in the interpolation module allows to generate a interpolation option can be carried out from the previous ending
continuous time series with more than 60 years of data (blue line), date in order save computational time. As discussed above the
significantly improving the statistical characterization of the series. Complete Interpolation submodule could take a couple of hours, but
The average distribution characterization is shown in panel (b). The it mainly depends on the number of stations and the interpolation
panel shows the average distribution for the “Orotava Palmeras” period.
rain gauge. On the left-hand side of the panel, the empirical cu- Forecasting module. In the Forecasting module (see Fig. 11) the
mulative distribution function (CDF), as well as the fitted para- user can visualize the spatial fields of SLP, GH500 and GH850,
metric one (Gumbel) are shown. On the right-hand panel, a Q-Q automatically downloaded from the GFS model for the next three
plot shows the empirical data (red dots) and the parametric fit days. From this information the most similar weather type to the
(black line). Some representative percentiles are shown as blue dots GFS data is found and their respective storm types of average
in the image, as well as the fitted parameters for the distribution. rainfall probability and average rainfall depth are shown. Fig. 14
Panel (c), at the bottom of the figure, shows an extreme value shows three-day GH-500 forecasts from the GFS model for the
analysis for the “Orotava Palmeras” rain gauge. On the left-hand dates 07-06-2017, 08-06-2017 and 09-06-2017 and their respective
side of the panel, the empirical cumulative distribution function storm types of average rainfall probability and average rainfall
(CDF), as well as the fitted parametric one (GEV) are shown. On the depth. When the Storm types submodule is selected two options are
right-hand side of the panel, a return-level plot shows the empirical available: (i) showing the rainfall probability and depth in the
data (red dots), as well as the parametric fit (black line). It also gauges like in Fig. 3, or (ii) constructing the spatial maps by using
shows the values of the fitted distribution (GEV). The results can be universal kriging with elevation and average rainfall information as
exported to PNG and CSV formats. On one side, the selected time drifts. Results of this module can be exported as follows: export the
series, the fixed distributional parameters of the average and images in PNG format, export the spatial precipitation forecast
extreme analysis and the empirical and parametric quantiles and fields in raster format and export the rainfall forecast for all the
return periods can be exported to CSV; on the other side the three stations in CSV format.
panels showed in Fig. 12 for the selected serie can be saved to PNG
format. (a) GH-500 (m)
The spatial rainfall submodule is designed to generate spatial (b) Average rainfall probability (%)
precipitation fields using universal kriging with elevation and (c) Average rainfall depth (mm/day)
mean precipitation as drifts. MENSEI-L contains a digital elevation
model, in this case of Tenerife island, which is directly stored in the
application. This submodule involves two different analysis: 7. Future developments, limitations and conclusions
computing the spatial field of average, cumulative or maximum
rainfall for a specific period of time since 1950, and computing the Universal kriging using elevation and average rainfall as drifts
spatial rainfall field for a return period between 2 and 500 years. In has proven to be a powerful technique to fill and reconstruct
this last option, the application uses the entire time series to rainfall in areas with a high density of gauges. Pluviometer filling
perform the extreme value analysis. These maps make use of the with information from neighboring gauges allows to extend the
information of all gauges, and not just the reference gauges used to length of time series, improving the stability of extreme value
generate the drift term for the kriging interpolation. Panel (a) of estimation. Regular and extreme analysis, both punctual and
Fig. 13 shows the cumulative rainfall field for the period 1986-01-01 spatial, are implemented in MENSEI-L in order to be applied to
to 1987-01-01. The map correctly captures the strong gradient of hydrological projects.
precipitation that exists between the north and the south of the One of the main limitations is its applicability to areas where
island. It also correctly captures the correlation that exists between there is a low number of gauges, or where they are far from each
precipitation and elevation; larger elevations receive larger other, since the quality of the reconstruction is strongly affected by
amounts of precipitation. Panel (b) shows the spatial rainfall map this lack of original information (Beguería et al., 2016). In such
for the 100 years return period. The spatial rainfall map for a spe- areas, interpolation techniques that integrate relationships be-
cific return period is generated by means of calculating the return tween the climatic variables and the topography must be applied in
period for every station individually and using universal kriging order to capture the underlying heterogeneity in rainfall (Isotta
with elevation and mean precipitation as drifts. Results of this et al., 2014; Daly et al., 2008).
submodule can be exported to PNG and GIS raster formats with the Rainfall forecasting requires more sophisticated models than
same spatial resolution of the DEM (5 m in the case of Tenerife). average predictions based on weather types. A 50% forecasting
accuracy on three-day periods is acceptable as a pre-warning sys-
(a) Accumulative rainfall map for the period 1986-01-01 to tem, but is still not good enough for measure implementation and
1987-01-01. mitigation means deployment. Different regression techniques
J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258 255

Fig. 12. Functionality of the Punctual distribution submodule. Images created with MENSEI-L.
256 J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258

Fig. 13. Functionality of the Spatial distribution submodule. Images created with MENSEI-L.

combined with weather type decomposition are already under potential as a flood warning system, since it identifies those
analysis to improve the forecasting procedure. It has been shown, weather types that have led to extreme rainfall and flooding events
however, that the rainfall forecasting system shows a great on Tenerife island in the past.

Fig. 14. GH-500 (m) three days forecast from GFS for the dates 2017-06-07, 2017-06-08 and 2017-06-09 ðaÞ average rainfall WT probability ðbÞ and average rainfall WT depth
selected for these days ðcÞ. The spatial maps are constructed using universal kriging with elevation and average rainfall information as drifts. The number placed in the lower left
corner corresponds to the storm type selected for each day. Images created with MENSEI-L.
J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258 257

Weather types constitute a good representation of optimal basis correlation for the United Kingdom. Environ. Model. Softw. 49, 22e33. http://
dx.doi.org/10.1016/j.envsoft.2013.06.001.
for hydroclimatic variables. Although storm types derived from ~ o, A.S., 2011. Analysis of clustering and
Camus, P., Mendez, F.J., Medina, R., Cofin
weather types show distinct rainfall patterns, further investigation selection algorithms for the study of multivariate wave climate. Coast. Eng. 58,
is needed to establish a strong relation between synoptic patterns 453e462.
of atmospheric conditions and rainfall patterns. In this study, a Daly, C., Halbleib, M., Smith, J., Gibson, W., Doggett, M., Taylor, G., Curtis, J.,
Pasteris, P., 2008. Physiographically sensitive mapping of climatological tem-
blind classification (without information about rainfall patterns) perature and precipitation across the conterminous United States. Int. J. Cli-
was carried out for the atmospheric conditions. However, for matol. 28, 2031e2064. http://dx.doi.org/10.1002/joc.1688.
optimal results, the classification should consider the final objec- Goovaerts, P., 1997. Geostatistics for Natural Resources Evaluation. Oxford Univer-
sity Press on Demand.
tive: to group atmospheric situations into homogeneous rainfall Goovaerts, P., 2000. Geostatistical approaches for incorporating elevation into the
patterns. spatial interpolation of rainfall. J. hydrology 228, 113e129.
MENSEI-L constitutes a powerful tool for the analysis of a plu- Guijarro, J.A., 2016. Climatol. URL. http://www.climatol.eu/.
Herrera, R.G., Puyol, D.G., Martín, E.H., Presa, L.G., Rodríguez, P.R., 2001. Influence of
viometric network. It simplifies the application of advanced the north atlantic oscillation on the canary islands precipitation. J. Clim. 14,
mathematical techniques (kriging interpolation) and automatizes 3889e3903.
the process, allowing non-specialized technical staff to have access Herrera, S., Gutie rrez, J.M., Ancell, R., Pons, M., Frías, M., Fern andez, J., 2012.
Development and analysis of a 50-year high-resolution daily gridded precipi-
to powerful rainfall analysis techniques. This advantage can be tation dataset over Spain (Spain02). Int. J. Climatol. 32, 74e85.
transferred into risk assessments and infrastructure design pro- Hiemstra, P., 2013. Automap. URL. https://cran.r-project.org/web/packages/
cesses, improving the overall value obtained by society. automap.
Huth, R., Beck, C., Philipp, A., Demuzere, M., Ustrnul, Z., Cahynova , M., Kyselỳ, J.,
Tveito, O.E., 2008. Classifications of atmospheric circulation patterns. Ann. N. Y.
Software and data availability Acad. Sci. 1146, 105e152.
Isotta, F., Frei, C., Weilguni, V., Percec Tadi gues, P., Rudolf, B., Pavan, V.,
c, M., Lasse
Cacciamani, C., Antolini, G., Ratto, S., Munari, M., Micheletti, S., Bonati, V.,
Software name: MENSEI-L.
Lussana, C., Ronchi, C., Panettieri, E., Marigo, G., Verta cnik, G., 2014. The climate
Developers: Environmental Hydraulics Institute, Universidad de of daily precipitation in the alps: development and analysis of a high-resolution
Cantabria. grid dataset from pan-alpine rain-gauge data. Int. J. Climatol. 34, 1657e1675.
http://dx.doi.org/10.1002/joc. 3794.
Contact information: manuel.deljesus@unican.es.
Journel, A.G., Huijbregts, C.J., 1978. Mining Geostatistics. Academic press.
Hardware required: 1 GB HD and 8 GB RAM. Kretzschmar, A., Tych, W., Chappell, N., 2014. Reversing hydrology: estimation of
Software required: 64-bit Windows. sub-hourly rainfall time-series from streamflow. Environ. Model. Softw. 60,
Program languages: python 2.7 and R. 290e301. http://dx.doi.org/10.1016/j.envsoft.2014.06.017.
Krzysztofowicz, R., 1995. Recent advances associated with flood forecast and
Availability: MENSEI-L rainfall analysis tools and graphical user warning systems. Rev. Geophys. 33, 1139e1147. http://dx.doi.org/10.1029/
interface (GUI) could be made available for research purposes upon 95RG00873.
request. Liu, J., Bray, M., Han, D., 2013. A study on wrf radar data assimilation for hydrological
rainfall prediction. Hydrology Earth Syst. Sci. 17, 3095e3110. http://dx.doi.org/
10.5194/hess-17-3095-2013.
Acknowledgments Markatou, M., Tian, H., Biswas, S., Hripcsak, G., 2005. Analysis of variance of cross-
validation estimators of the generalization error. J. Mach. Learn. Res. 6.
Mehrotra, R., Li, J., Westra, S., Sharma, A., 2015a. A programming tool to generate
The authors would like to thank Omar Castellanos for his work multi-site daily rainfall using a two-stage semi parametric model. Environ.
in software implementation and Pedro Delgado Melia n for his Model. Softw. 63, 230e239. http://dx.doi.org/10.1016/j.envsoft.2014.10.016.
comments to improve MENSEI-L. They also would like to thank Mehrotra, R., Li, J., Westra, S., Sharma, A., 2015b. A programming tool to generate
multi-site daily rainfall using a two-stage semi parametric model. Environ.
“Consejo Insular de Aguas de Tenerife (CIATF)” for granting Model. Softw. 63, 230e239. http://dx.doi.org/10.1016/j.envsoft.2014.10.016.
permission to use their rainfall database for this manuscript and for Melia n, P.D., Ruiz, J.J.B., Díez, A.M., Salete, E.G., 2011. “Mo dulo de gestio n de tor-
funding the implementation of the software in Tenerife. mentas” en la modelizacio n hidrolo gica de superficie de tenerife. In: II Jornadas
de Ingeniería del Agua. Fundacio  n para el Fomento de la Ingeniería del Agua,
Barcelona, Spain.
References Mestre, O., Gruber, C., Prieur, C., Caussinus, H., Jourdain, S., 2011. Splidhom: a
method for homogenization of daily temperature observations. J. Appl. Mete-
Abdi, H., Williams, L.J., 2010. Principal component analysis. Wiley Interdiscip. Rev. orology Climatol. 50, 2343e2358. http://dx.doi.org/10.1175/2011JAMC2641.1.
Comput. Stat. 2, 433e459. URL. http://dx.doi.org/10.1002/wics.101. Monira, S., Faisal, Z., Hirose, H., 2010. Comparison of Artificially Intelligent Methods
Aguilar, E., Prohom, M., 2011. Extraqc. Centre for Climate Change and Servei in Short Term Rainfall Forecast, pp. 39e44. http://dx.doi.org/10.1109/
Meteorolgic de Catalunya, Spain. URL. www.c3.urv.cat/data.html. ICCITECHN.2010.5723826.
Aho, K., Derryberry, D., Peterson, T., 2014. Model selection for ecologists: the Moser, B., Gallus, W.A.J., Mantilla, R., 2015. An initial assessment of radar data
worldviews of aic and bic. Ecology 95, 631e636. URL. http://dx.doi.org/10.1890/ assimilation on warm season rainfall forecasts for use in hydrologic models.
13-1452.1. Weather Forecast. 30, 1491e1520. http://dx.doi.org/10.1175/WAF-D-14-00125.1.
Akaike, H., 1974. A new look at the statistical model identification. IEEE Trans. Neiman, P., White, A., Ralph, F., Gottas, D., Gutman, S., 2009. A water vapour flux tool
automatic control 19, 716e723. for precipitation forecasting. Proceedings of the Institution of Civil Engineers.
Arnbjerg-Nielsen, K., Willems, P., Olsson, J., Beecham, S., Pathirana, A., Water Manag. 162, 83e94. http://dx.doi.org/10.1680/wama.2009.162.2.83.
Gregersen, I.B., Madsen, H., Nguyen, V.T.V., 2013. Impacts of climate change on New, M., Hulme, M., Jones, P., 2000. Representing twentieth-century space-time
rainfall extremes and urban drainage systems: a review. Water Sci. Technol. 68, climate variability. part ii: development of 1901-96 monthly grids of terrestrial
16e28. surface climate. J. Clim. 13, 2217e2238.
Beguería, S., Vicente-Serrano, S., Tom as-Burguera, M., Maneta, M., 2016. Bias in the Ninyerola, M., Pons, X., Roure, J., 2007. Monthly precipitation mapping of the iberian
variance of gridded data sets leads to misleading conclusions about changes in peninsula using spatial interpolation tools implemented in a geographic in-
climate variability. Int. J. Climatol. 36, 3413e3422. http://dx.doi.org/10.1002/ formation system. Theor. Appl. Climatol. 89, 195e209. http://dx.doi.org/
joc.4561. 10.1007/s00704-006-0264-2.
Ben Alaya, M., Ouarda, T., Chebana, F., 2017. Non-gaussian spatiotemporal simula- Paixao, E., Auld, H., Mirza, M.M.Q., Klaassen, J., Shephard, M.W., 2011. Regionaliza-
tion of multisite daily precipitation: downscaling framework. Clim. Dyn. 1e15. tion of heavy rainfall to improve climatic design values for infrastructure: case
http://dx.doi.org/10.1007/s00382-017-3578-0 (in press). study in southern ontario, Canada. Hydrological Sci. J. 56, 1067e1089.
€ schl, G., Reszler, C., Komma, J., 2008. A spatially distributed flash flood fore-
Blo Pebesma, E.J., 2004. Multivariable geostatistics in s: the gstat package. Comput.
casting model. Environ. Model. Softw. 23, 464e478. http://dx.doi.org/10.1016/ Geosciences 30, 683e691.
j.envsoft.2007.06.010. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O.,
Borga, M., Anagnostou, E., Blo € schl, G., Creutin, J.D., 2011. Flash flood forecasting, Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A.,
warning and risk management: the hydrate project. Environ. Sci. Policy 14, Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E., 2011. Scikit-learn: ma-
834e844. http://dx.doi.org/10.1016/j.envsci.2011.05.017. chine learning in Python. J. Mach. Learn. Res. 12, 2825e2830.
Burton, A., Kilsby, C., Fowler, H., Cowpertwait, P., O'Connell, P., 2008. Rainsim: a Phillips, D.L., Dolph, J., Marks, D., 1992. A comparison of geostatistical procedures for
spatial-temporal stochastic rainfall modelling system. Environ. Model. Softw. 23, spatial analysis of precipitation in mountainous terrain. Agric. For. Meteorology
1356e1369. http://dx.doi.org/10.1016/j.envsoft.2008.04.003. 58, 119e141.
Burton, A., Glenis, V., Jones, M., Kilsby, C., 2013. Models of daily rainfall cross- Powers, David M.W., 2011. Evaluation: from precision, recall and f-measure to roc,
258 J. Diez-Sierra, M. del Jesus / Environmental Modelling & Software 97 (2017) 243e258

informedness, markedness and correlation. J. Mach. Learn. Technol. 2, 37e63. Sugiyama, T., Okada, K., Muraishi, H., Noguchi, T., Samizo, M., 1995. Statistical
Python Software Foundation, 2016. Python language Reference. URL:, version 2.7. rainfall risk estimating method for a deep collapse of a cut slope. Soils Found.
http://www.python.org. 35, 37e48.
R Core Team, 2015. R: a language and environment for statistical computing. URL. Szentimrey, T., 2006. Development of mash homogenization procedure for daily
https://www.R-project.org. data. In: Proceedings of the Fifth Seminar for Homogenization and Quality
Renschler, C., Mannaerts, C., Diekkrüger, B., 1999. Evaluating spatial and temporal Control in Climatological Databases, Budapest, Hungary, pp. 123e130.
variability in soil erosion riskrainfall erosivity and soil loss ratios in andalusia, Tabios, G.Q., Salas, J.D., 1985. A comparative analysis of techniques for spatial
Spain. Catena 34, 209e225. interpolation of precipitation. JAWRA J. Am. Water Resour. Assoc. 21, 365e380.
Sabourin, J.F., Associates Inc, 2010. Storms 2010. URL. http://www.jfsa.com/rainfall- Toth, E., Brath, A., Montanari, A., 2000. Comparison of short-term rainfall prediction
analysis-storms.php. models for real-time flood forecasting. J. Hydrology 239, 132e147. http://
Saha, S., Moorthi, S., Pan, H.L., Wu, X., Wang, J., Nadiga, S., Tripp, P., Kistler, R., dx.doi.org/10.1016/S0022-1694(00)00344-9.
Woollen, J., Behringer, D., et al., 2010. The ncep climate forecast system rean- nticos 1, 57e103.
Tullot, I.F., 1959. El clima de las islas canarias. Anu. Estud. atla
alysis. Bull. Am. Meteorological Soc. 91, 1015e1057. Wagner, P., Fiener, P., Wilken, F., Kumar, S., Schneider, K., 2012. Comparison and
Serinaldi, F., Kilsby, C., 2014. Simulating daily rainfall fields over large areas for evaluation of spatial interpolation schemes for daily rainfall in data scarce re-
collective risk estimation. J. Hydrology 512, 285e302. http://dx.doi.org/10.1016/ gions. J. Hydrology 464e465, 388e400. http://dx.doi.org/10.1016/
j.jhydrol.2014.02.043. j.jhydrol.2012.07.026.
Serrano-Notivoli, R., de Luis, M., Beguería, S., 2017. An r package for daily precipi- Wright, D., Mantilla, R., Peters-Lidard, C., 2017. A remote sensing-based tool for
tation climate series reconstruction. Environ. Model. Softw. 89, 190e195. http:// assessing rainfall-driven hazards. Environ. Model. Softw. 90, 34e54. http://
dx.doi.org/10.1016/j.envsoft.2016.11.005. dx.doi.org/10.1016/j.envsoft.2016.12.006.
Smart, J., Csomor, S., et al., 2005. Cross-platform GUI programming with Xu, Y.P., Booij, M., Tong, Y.B., 2010. Uncertainty analysis in statistical modeling of
wxWidgets. Prentice Hall Professional. extreme hydrological events. Stoch. Environ. Res. Risk Assess. 24, 567e578.
Stepanek, P., 2005. Anclim-software for Time Series Analysis. Dept. of Geography, http://dx.doi.org/10.1007/s00477-009-0337-8.
Fac. of Sciences, MU, Brno. Yang, C., Chandler, R., Isham, V., Wheater, H., 2005. Spatial-temporal rainfall
Stephenson, D., Rupa Kumar, K., Doblas-Reyes, F., Royer, J.F., Chauvin, F., Pezzulli, S., simulation using generalized linear models. Water Resour. Res. 41, 1e13. http://
1999. Extreme daily rainfall events and their impact on ensemble forecasts of dx.doi.org/10.1029/2004WR003739.
the indian monsoon. Mon. Weather Rev. 127, 1954e1966.

Você também pode gostar