Escolar Documentos
Profissional Documentos
Cultura Documentos
www.elsevier.com/locate/advwatres
a,2
a
DISTART, Universita` di Bologna, Viale Risorgimento 2, 40136 Bologna, Italy
Enel S.p.A., Produzione Area Energ. Rinnovabili, Via Torino 105/E, 30172 Mestre (VE), Italy
Received 12 February 2004; received in revised form 16 August 2004; accepted 18 August 2004
Abstract
A ow-duration curve (FDC) illustrates the relationship between the frequency and magnitude of streamow. Applications of
FDC are of interest for many hydrological problems related to hydropower generation, river and reservoir sedimentation, water
quality assessment, water-use assessment, water allocation and habitat suitability. This study addresses the problem of FDC estimation for ungauged river basins, assessing the eectiveness and reliability of several regional approaches. The study refers to a wide
region of eastern central Italy and adopts a jack-knife cross-validation procedure to evaluate the uncertainty of regional FDCs,
comparing it with the uncertainty of empirical FDCs constructed from short samples of streamow data. The results (a) provide
an evaluation of the reliability of the regional FDCs for ungauged sites, (b) show that the reliability of the three best performing
regional models are similar to one another, and (c) demonstrate that empirical FDCs based on limited data samples generally provide a better t of the long-term FDCs than regional FDCs.
2004 Elsevier Ltd. All rights reserved.
Keywords: Streamow regime; Regional analysis; Jack-knife cross-validation; Eastern-central Italy
1. Introduction
A ow-duration curve (FDC) provides the percentage
of time (duration) a daily or monthly (or some other
time interval) streamow is exceeded over a historical
period for a particular river basin (see e.g., [28]). FDC
may also be viewed as the complement of the cumulative
distribution function of the considered streamows (e.g.,
[15,28]). The earliest use of FDCs is attributed to Clemens Herschel and dates back to 1880 [10]. Neverthe*
954
FDC is probably one of the most informative methods of displaying the complete range of river discharges,
from low ows to ood events [24]. Its traditional interpretation is reported in the scientic literature as periodof-record FDC (e.g., [24,28]), from now on simply
referred to as FDC, which consists of the complement
of the cumulative distribution function of the daily (or
some other time interval of) streamows over the whole
available period of record.
The construction of a FDC using the streamow
observations can be performed through non-parametric
procedures consisting of two main steps: (a) the observed streamows qi, i = 1, 2, . . . , N, are ranked to produce a set of ordered streamows q(i), i = 1, 2, . . . , N,
where N is the sample length, and q(1) and q(N) are the
largest and the smallest observations, respectively; (b)
each ordered observation q(i) is then plotted against its
corresponding duration Di, which is generally dimensionless and coincides with an estimate, pi, of the exceedance probability of q(i). If the Weibull plotting position
(WPP) is used, pi reads
pi P Q > qi
i
N 1
955
2b
Quimpo et al. [19] propose a regional parametric approach for estimating FDCs at ungauged potential
small hydropower sites in the Philippines. The authors
suggest the following two-parameter exponential equation to represent FDCs of daily streamows:
QD QA expcD
where the parameters a, b, c and d have to be nonnegative. In order to estimate the FDCs at ungauged
956
4. Study overview
Our study focuses on the estimation of daily FDCs.
Traditional period-of-record ow-duration curves
(FDCs) are commonly used in Italy, and unlike
AFDCs can be used directly for lling gaps and for
extending daily streamow series, or generating streamow series at ungauged river basins (see e.g., [8,14]). In
particular, hydropower engineering motivates our study,
and therefore we do not consider the top end of the
FDC. Instead we refer to the duration range
D 2 [0.30, 0.99]. The study analyses the statistical procedures by Fennessey and Vogel [9] and Singh et al. [23],
the parametric procedures by Quimpo et al. [19], Mimikou and Kaemaki [17] and Franchini and Suppo [11]
and the graphical procedure by Smakhtin et al. [25].
The analysis applies and compares the procedures as
reported in the cited studies, as well as several modications of the original procedures (see Section 6), aiming
at identifying the procedure that oers the best tradeo between an accurate representation of the physical
processes controlling the streamow regime and the
overall simplicity of the regional model (e.g., number
of parameters and amount of information required for
the application). The analysis of each regional procedure
consisted of three main steps: (a) implementation of the
regional model using the available daily streamow
observations; (b) analysis of the goodness-of-t of the
regional FDCs estimated with the model for a rst
assessment of the suitability of the procedure to the
study region; (c) cross-validation of the regional model
through a jack-knife resampling procedure (see e.g.,
[26,3]).
The second objective of the study is to compare the
regional FDCs with the empirical FDCs constructed
using short streamow records (i.e., record length equal
to 1, 2 and 5 years). This part of the analysis aims at
identifying the minimum record length for which empirical FDCs reproduce long-term (i.e., period-of-record)
FDCs more accurately than the best performing regional models. The results of this analysis might guide us
through the selection between empirical and regional
FDCs for gauged river basins with a limited amount
of hydrometric information.
^
qs;j qs;j
qs;j
wj es;j
N D j1
re;s
v
u
ND
X
u 1
2
t
wj es;j es
N D 1 j1
957
N
1 X
wi ei;j
N
i1
qs;j qs;j
j1 ^
Es 1 P
10
2
PN D
ND
j1 qs;j
j1 qs;j
where ND, qs,j and ^qs;j have the meaning illustrated before. The index dened in (10) is analogous to the
Nash-Sutclie [18] eciency criterion and varies between 1, perfect t, and 1. The values of Es, computed
for s = 1, 2, . . . , N, were used to calculate three further
descriptors of the overall quality of the jack-knifed
FDCs, P1, P2 and P3, dened as the percentages of
cases, over the N possibilities, for which Es > 0.75 (P1,
good to fair t), 0.75 P Es > 0.50 (P2, fair to poor t)
and Es 6 0.50 (P3, poor t).
5. Study region
N
j1 j
D ). q* represents a minimum streamow value (i.e., q* = 0.1 m3/s) below which the utilisation of the
water resource is unfeasible or impacts signicantly on
958
The signicant dierences existing between the minimum, average and maximum values of many geomorphoclimatic indexes reported in Table 1 (see e.g., A, P,
L, MANP) are representative of the high hydrological
complexity of the study region.
The record length at the gauges varies from a minimum of ve years to a maximum of 67 years with a mean
value of 24 years. Several geomorphological and climatic
characteristics were determined for the 51 basins, such
as the basin area A, an estimate of the permeable portion of the basin area P, maximum, Hmax, mean, Hmean,
and minimum, Hmin, elevations in metres above the sea
level (a.s.l.), DH = Hmean Hmin, and the main channel
length L.
An extensive collection of hydrological information
allowed us to characterise the 51 river basins from a climatic and litho-pedologic point of view. The climatic
information was retrieved from an evenly spaced network of 88 thermometric sensors and 337 raingauges.
The monthly series of areal temperature and rainfall
depth were evaluated for each river basin through the
Thiessen polygons method by referring to the thermopluviometric data collected in the same time span of
the streamow observations. These measures were then
utilised to derive the 51 values of the mean annual temperature, MAT, mean annual precipitation, MAP, mean
annual potential evapotranspiration, MAPET, and
mean annual net precipitation, MANP = MAP
MAPET.
Table 1
Minimum, average and maximum values of the geomorphological and climatic indexes for 51 unregulated basins in the study region
Index
A
(km2)
P
(%)
Hmax
(m a.s.l.)
Hmed
(m a.s.l.)
Hmin
(m a.s.l.)
DH
(m)
L
(km)
MAT
(C)
MAP
(mm)
MAPET
(mm)
MANP
(mm)
Minimum
Average
Maximum
31.6
357.7
3082.0
0.1
48.3
99.0
279.5
2077.9
2914.0
178.0
948.6
1950.0
3.0
361.6
1103.1
158.3
587.0
1543.2
9.6
36.8
159.9
8.3
11.6
15.3
824.2
1095.3
1505.4
581.7
692.1
826.0
13.9
403.0
923.7
959
and the estimates of Ai, with i = 0, 1, . . . , n, for both regional models were identied through multivariate stepwise regression analyses, implemented using an ordinary
least-squares (OLS) algorithm. The stepwise procedure
adopted in our study begins by considering a simple
model that estimates the dependent variable, e.g. ^h of
(11), as a constant value. At each step the procedure
adds a further explanatory variable on to the model,
choosing the variable that minimises Mallows Cp statistics (see e.g., [30]). If we suppose that the addition leads
to a model with p explanatory variables, the procedure
then tests the performance of all the p 1 models
including p 1 explanatory variables that can be obtained from the p-variable model by dropping one variable at a time. In case none of the simpler models
performs better than the p-variable model, the procedure will search for the best multivariate model with
p + 1 explanatory variables. The stepwise procedure
continues in the same way and comes to an end when
no further reductions in Mallows Cp can be obtained.
The identied models read,
^ and r
^,
0.967 and 0.403 for the regression models for l
respectively. The scatter plot of Fig. 2 reports the opti^ (panel a) and r
^ (panel b) against the estimal values l
mates obtained from the regression models, applied
with the jack-knife procedure or without, i.e. models
(12) and (13). The E values and the scatter plots of
Fig. 2 indicate that the application of the jack-knife procedure does not signicantly inuence the performances
of the models, implying a strong robustness of the models themselves [3].
^ and r
^, the 51
Using the jack-knifed estimates of l
jack-knifed FDCs for the statistical model were constructed. The jack-knifed FDCs were then compared
with the empirical FDCs using the indexes of reliability
described at Section 4.2, where ND = 691 (i.e.,
D = 0.300, 0.301, . . . , 0.990). Table 2 reports the performance indexes, whereas Fig. 3 reports the relative error
bands for the jack-knifed FDCs, providing a comprehensive description of the model overall reliability for
ungauged river basins located within the study area.
0:108 lnMANP #
E 0:968
All analytical expressions with two or more parameters reproduced the empirical 51 FDCs extremely well.
By contrast, the regionalisation and the application of
the jack-knife validation to most of the parametric approaches resulted in controversial outcomes. For several
analytical expression we were unable to identify by
means of multiple regression analyses regional models
for all parameters, especially for the expressions in
which the physical meaning of the parameters is unclear
(see e.g., Eq. (4)).
The approach by Franchini and Suppo [11], unlike
other parametric approaches, regionalises streamow quantiles instead of parameters (see Section 3.2).
12
and
^ 3:155 0:238 lnP 0:419 lnDH
r
0:209 lnMANP #00
E 0:528
13
3
^ and r
^ are expressed in ln(m /s), A in km , DH in
where l
m, MANP in mm, and P in %.
The models (12) and (13) were then cross-validated
through a jack-knife procedure considering in turn each
of the 51 stations. The jack-knifed E resulted equal to
3
2
1
0
-1
-2
-1
-2
(a)
-1
0
1
2
3
3
Optimal values [ln(m /s)]
-1
(b)
1
2
3
3
Optimal values [ln(m /s)]
960
Table 2
Jack-knife cross-validation: indexes of performances
Regional approach
e
re
P1 (%)
P2 (%)
P3 (%)
Statistical
Parametric
Graphical
0.104
0.109
0.134
0.175
0.314
0.141
29.4
31.4
21.6
9.8
9.8
21.6
60.8
58.8
56.9
14
15
16
17
18
3
where Q(D) is expressed in m /s. The jack-knife crossvalidation of models from (15)(18) produced E values
equal to 0.952, 0.967, 0.932 and 0.900, respectively.
The scatter plots of Fig. 4 illustrate on the natural scale,
and for Q(D) 2 [0.1, 10.0] on the log scale, the performance of the best and the worst performing models, that is
models (16) and (18), respectively, along with the results
of their jack-knife cross-validation. Fig. 4 indicates that
the application of the jack-knife procedure does not signicantly inuence the performances of the models and
conrms the robustness of the models.
The jack-knifed estimates of the four reference daily
b 30 , Q
b 70 , Q
b 90 and Q
b 95 , were utilised to constreamows, Q
struct the 51 jack-knifed FDCs. The jack-knifed parameters of (5), ^
a, ^
b, and ^c, were estimated by an OLS
b 30 , 0:7; Q
b 70 ,
algorithm using the four pairs 0:3; Q
b 90 , 0:95; Q
b 95 and the resulting FDCs were then
0:9; Q
compared with the corresponding empirical FDCs, producing the statistical indexes set out in Table 2 and the
relative errors depicted in Fig. 3.
10
20
15
0.1
0.1
10
20
25
961
10
10
0.1
15
0.1
10
10
(a)
10
15
3
Empirical Q 70 (m /s)
20
25
(b)
10
15
3
Empirical Q 95 (m /s)
20
25
Fig. 4. Parametric approach; estimates of Q(D) obtained by applying the regional models and the jack-knife procedure: (panel a) Q70 (best
performing model); (panel b) Q95 (worst performing model).
1.2
Region 1
Region 4
1.0
Region 2
Region 5
Region 3
Region 6
0.8
0.6
0.4
0.2
0.0
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Duration
962
100
Estimated Q (m /s)
Sub-sample length
e
re
P1 (%)
P2 (%)
P3 (%)
1 year
2 years
5 years
0.127
0.095
0.070
0.327
0.258
0.186
53.9
66.3
78.2
21.7
16.4
9.2
24.4
17.3
12.6
0.1
0.1
1
10
Empirical Q (m3/s)
100
19
3
Fig. 7. Distribution of the relative errors for the resampling experiment: mean, median and bands containing 75% and 90% of the relative
errors.
the L years of daily ows of the historical sample, we extracted the m = L l + 1 consecutive sub-samples with
length equal to l years; (b) using the procedure described
in Section 2, the empirical FDCs of each sub-sample
were constructed and compared with the empirical
FDC of the entire period of record (long-term FDC).
The comparison of step (b) was performed for 70 arbitrarily selected durations: D = 0.30, 0.31, . . . , 0.99.
For each streamgauge s = 1, 2, . . . , 14, the comparison
produced m values of es and re,s, computed as in (8) and
(9). The average of the m values obtained for each statistical index was regarded as representative for station s
and length l, and the average of the 14 mean values
was considered to be representative for the entire region.
These indexes are reported in Table 3 for l equal to 1, 2
and 5 years. Analogously, for each streamgauge
s = 1, 2, . . . , 14 and each sub-sample length l = 1, 2 and
5, the E index dened in (10) was computed. The average of the m values of E was regarded as representative
for station s and used to compute the percentages of the
14 cases for which E > 0.75 (P1), 0.75 P E > 0.50 (P2)
and E 6 0.50 (P3), reported in Table 3.
Fig. 7 shows for l = 1, 2 and 5 the average over the m
cases and 14 streamgauges of the mean and median of
the distribution of relative errors, along with the 75%
and 90% bands as functions of the duration, and can
be compared with the diagrams of Fig. 3.
Fig. 7 shows a generalised underestimation of the
FDCs for short durations and an extensive overestimation of the FDCs for long durations. This was expected,
as long-term FDCs are aected by the observation of
abnormally wet or dry periods during the period-of-record [28]. A sub-sample does not necessarily include
all extreme oods or low-ows that are present in the entire period-of-record, and therefore the sub-sample FDC
tends to underestimate the corresponding long-term
FDC for short durations and to overestimate it for long
durations.
8. Discussion of results
Before discussing the results of the study, it is worth
commenting on the multivariate regression models identied herein, that is Eqs. (12), (13) and (15) through (19).
Only a few of the many parameters available to describe
the basin morphology and climate were found to be useful as explicative variables of the regional models (i.e.,
A, P, DH, L, MAP, MANP, see Table 1). This indicates
that, due to the close statistical correlation between
these parameters it is sucient to consider a very small
number of them in order to represent the geomorphoclimatic eects on the streamow regime (e.g., [3]).
Table 4 reports the coecients of correlation between
the six catchment attributes utilised by the nal regional
regression models. Table 4 shows a high degree of corre-
963
Table 4
Correlation coecients for the geomorphoclimatic indexes utilised in
the regional regression models
A
P
DH
L
MAP
MANP
0.016
0.219
0.917
0.262
0.245
0.016
0.064
0.206
0.316
0.337
DH
0.219
0.064
0.117
0.149
0.035
MAP
MANP
0.917
0.206
0.117
0.270
0.280
0.262
0.316
0.149
0.270
0.966
0.245
0.337
0.035
0.280
0.966
964
9. Conclusions
The main aim of this study was to analyse the reliability of several regional procedures for the estimation of
ow-duration curves (FDCs) at ungauged sites. Achieving this aim is fundamental as (a) the estimation of FDCs
at ungauged sites is a crucial and recurrent hydrological
task for addressing problems related to hydropower generation, river and reservoir sedimentation water-use
assessment, water allocation and habitat suitability, (b)
the literature on the regionalisation of FDCs is surprisingly sparse and (c) the regional models themselves lack
validation or are validated for relatively small geographical regions, (d) for an eective use of the streamow
information available at gauged river basins with limited
record lengths, it is fundamental to know whether empirical FDCs based on small samples can represent a better
estimate of the long-term FDCs than the regional FDCs.
Several regional models for the estimation of FDCs
at ungauged sites within the duration range
D 2 [0.30, 0.99], based on three dierent theoretical approaches, were applied to a large geographical region
of eastern-central Italy including 51 unregulated river
basins. All regional models were cross-validated by
means of a jack-knife resampling procedure. The results
of the cross-validation were then compared with the results of a resampling experiment designed to asses the
reliability of empirical FDCs constructed from small
samples. The two main results of our study may be summarised as follows:
1. The cross-validation shows that the three regional
models (i.e., one model for each theoretical approach)
that provide the best performances over the study
region are characterised by similar degrees of reliability and robustness.
Acknowledgments
The work presented here has been partially supported
by the Italian National Research CouncilGroup for
the Prevention of Hydro-geological Disasters (CNRGNDCI). The authors wish to thank Dr. V.Y. Smakhtin
and an anonymous reviewer for helpful suggestions
which led to considerable improvements in the
manuscript.
References
[1] Beard LR. Statistical analysis in hydrology. Trans ASCE
1943;108:111060.
[2] Box GEP, Cox DR. An analysis of transformations. J R Stat Soc
Ser B 1964;26:21152.
[3] Brath A, Castellarin A, Franchini M, Galeati G. Estimating the
index ood using indirect methods. Hydrol Sci J
2001;46(3):399418.
[4] Castellarin A, Burn DH, Brath A. Assessing the eectiveness of
hydrological similarity measures for regional ood frequency
analysis. J Hydrol 2001;241(34):27085.
[5] Claps P, Fiorentino M. Probabilistic ow duration curves for use
in environmental planning and management. In: Harmancioglu
NB, et al, editors. Integrated approach to environmental data
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
965