Escolar Documentos
Profissional Documentos
Cultura Documentos
, 18
Brad Luen
Department of Mathematics
Reed College
Portland, OR, USA
Philip B. Stark
Department of Statistics
University of California
Berkeley, CA, 94720-3860 USA
Accepted 2011 ???? ??; Received 2011 January ??; in original form 2011 January 29
SUMMARY
We present strong statistical evidence thatcontrary to previous claimsspatially in-
homogeneous, temporally homogeneous Poisson process (SITHP) models do not fit
declustered Southern California seismicity catalogs. Earlier claims were based on tests
that divide time into intervals, count the events in those intervals, and apply a chi-
square test to the counts, approximating the expected number of intervals with each
event count by estimating the rate of the hypothesized Poisson process from the data.
This test lacks a sound statistical basis and can have low power, for at least two rea-
sons: (1) It ignores earthquake locations. (2) It ignores the order of the time intervals.
For instance, if the test does not reject for a given set of counts, it would not reject if
those counts had occurred in decreasing order with time, which would be implausible
for a SITHP. We evaluate declustered catalogs derived from the 19321971 Southern
California Earthquake Center catalog of 1,556 events with magnitude 3.8 and above
by applying four window declustering methods. A Kolmogorov-Smirnov temporal test
finds P -values of 0.006 to 0.022 for the hypothesis that the declustered catalogs fol-
low a homogeneous temporal Poisson processpoor fit. Since homogeneous Poisson
processes do not fit the marginal temporal distribution, SITHPs cannot not fit the
spatio-temporal distribution. The hypothesis that declustered catalogs follow a SITHP
implies the weaker hypothesis that event times and locations in declustered catalogs
are conditionally exchangeable. For three of the declustered catalogs, a nonparametric
spatio-temporal test finds P -values of 0.005 or below for this weaker hypothesis. For
the fourth declustered catalog, the P -value is 0.069, above 0.05 but still small.
Key words: Earthquake interaction, forecasting, and prediction; Probabilistic fore-
casting; Statistical seismology
1 INTRODUCTION of the chi-square test fail when seismicity really does follow
a SITHP, casting some doubt on the conclusion. Moreover,
Earthquake catalogs are often declustered as a precursor to not rejecting the null hypothesis does not imply that the
modeling the remaining events as a realization of a spatially null hypothesis is true: Failure to reject could be a Type II
inhomogeneous, temporally homogeneous Poisson process error, especially if the test has little power against plausible
(SITHP). Tests of the null hypothesis that declustered cat- alternativeswhich we show to be the case. (And if declus-
alogs are realizations of a SITHP have not rejected the null tering removes enough events, no test can have much power:
hypothesis, leading some studies to conclude that declus- The few remaining events will always pass a test for Poisson
tered catalogs are Poisson. For instance, the title of Gard- behavior.)
ner and Knopoff (1974) is Is the sequence of earthquakes in The Kolmogorov-Smirnov (KS) test, described in sec-
Southern California, with aftershocks removed, Poissonian? tion 3.3, can be also used to test the hypothesis that declus-
The abstract: Yes. tered seismicity follows a SITHP. Unlike the assumptions of
The claim in Gardner and Knopoff (1974) is based on the chi-square test, the assumptions of the KS test are sat-
a chi-square test described in section 3.1. The assumptions isfied when declustered seismicity actually follows a SITHP.
2 B. Luen and P.B. Stark
The KS test does not have good power against every alterna- In a SITHP, event locations are independent of event
tive, but it has better power than the chi-square test against times, so the space-time rate is the product of the marginal
some kinds of clustering. The KS test finds small P -values spatial rate and the uniform temporal rate. The total num-
for the hypothesis that the 19321971 Southern California ber of events, N , is random. We denote the random loca-
Earthquake Center (SCSC) catalog of events with magni- tions and times of the N events by {(Xi , Yi , Ti )}N i=1 . The
tude 3.8 and above, declustered with four methods, follow a times between successive events are (marginally) indepen-
SITHP. dent and identically distributed (iid) exponential random
The chi-square and KS tests use only the times of variables. The number of events in disjoint time intervals
events, not event locations. Section 4 gives an abstract non- are independent Poisson random variables with means pro-
parametric permutation test that uses locations as well as portional to the durations of the intervals; this is the basis of
times to test the hypothesis that event times in declustered the chi-square test described in section 3.1. Conditional on
catalogs are conditionally exchangeable given event loca- N = n, the times {Ti }n i=1 are (marginally) iid uniform ran-
tions, a hypothesis implied by the hypothesis that declus- dom variables; this is the basis of the Kolmogorov-Smirnov
tered catalogs are a realization of a SITHP. The test, based test described in section 3.3.
on ideas in Romano (1988, 1989), finds small P -values for
this weaker hypothesis, again using the 19321971 SCEC
catalog declustered in four ways. We believe this test is new
to seismology.
We study only main-shock window and linked-window 3 TEMPORAL TESTS
declustering methods. There are also stochastic declustering 3.1 The Chi-square test
methods, which use chance to decide whether to remove a
particular event (Zhuang et al. 2002; Vere-Jones 1970); the We believe that the test of the hypothesis that declustered
waveform similarity approach (Barani et al. 2007); and catalogs are realizations of a homogeneous temporal Poisson
others. See Davis and Frohlich (1991) and Zhuang et al. process used by Gardner and Knopoff (1974) and Barani
(2002) for taxonomies. et al. (2007) works as follows:
Main-shock window methods remove the earthquakes
in a space-time window around every main shock, suit- (i) Pick K 1. Partition the study period into K time
ably defined. Main-shock window methods can be thought intervals of length T /K. Count the events in each interval:
of as punching a hole in the catalog after each main Nk #{i : ti ((k 1)T /K, kT /K]}, k {1, . . . , K}.
shock. The hole is the window. Gardner and Knopoffs win- (1)
dows (Knopoff and Gardner 1972; Gardner and Knopoff (ii) Pick C 2, the number of categories. For k
1974) are common in main-shock declustering. They are {1, . . . , K}, interval k is in category c {0, . . . , C 2} if
larger in space and time the larger the shock is. it contains c events; interval k is in category C 1 if it
Linked-window methods calculate a space-time window contains C 1 or more events. Let Oc be the number of
for every event in the catalog, not just main shocks. In intervals observed to be in category c.
linked-window methods, an event is in a cluster if and only (iii) Estimate the theoretical rate of events per interval.
if it falls within the window of at least one other event in We believe Gardner and Knopoff (1974); Barani et al. (2007)
that cluster. Linked-window declustering replaces each clus- used the estimate
ter with a single eventfor instance, the first, the largest, or
an equivalent event. The most widely used linked-window = n/K. (2)
method is Reasenbergs (Reasenberg 1985). Reasenbergs
windows are larger in space but shorter in time the larger (iv) Calculate the expected number of intervals in cate-
the shock is. gory c on the assumption that events follow a Poisson pro-
Below, we examine temporal and spatio-temporal tests cess with rate :
of the hypotheses that the 19321971 SCEC catalog of ( c
events with magnitude 3.8 and above, declustered using four Ke c! , c = 0, 1, . . . , C 2
Ec PC2 (3)
main-shock window and linked-window methods, follows K j=0 Ej , c = C 1.
a temporally homogeneous, spatially heterogenous Poisson
process. The resulting P -values are generally quite small: (v) Calculate the chi-square statistic:
The declustered catalogs are not statistically compatible C1
with SITHP models. X (Oc Ec )2
2 . (4)
c=0
Ec
Process KS Chi-square
ilar results for 19321971 Southern California seismicity for
their windows. Method 2 is a main-shock window method.
Inhomogeneous Poisson 1 0.1658 Gardner and Knopoff (1974) performed a variety of
Gamma renewal 0.0009 1 chi-square tests on a number of catalogs declustered using
Method 1a. Among other things, they report results for a
Table 1. Estimated power of significance level 0.05 tests that catalog of earthquakes with magnitude at least 3.8 occurring
two temporal point processes are homogeneous Poisson processes, in the Southern California Local Area from 19321971.
based on 10,000 40-year simulations of each process. In the In- That raw catalog had 1,751 events; the declustered catalog
homogeneous Poisson process, events occur at expected rate had 503 events. They divided the forty-year period into ten-
0.25 per ten days for twenty years, then at expected rate 0.5 per day intervals, found Oc and estimated Ec for some range of
ten days for another twenty years. In the Gamma renewal c, and compared the chi-square statistic to a chi-square dis-
process, the times between events are independent and follow tribution with 2 degrees of freedom. They did not state C,
a gamma distribution with shape 2 and rate 1. The KS test is how they estimated , nor whether they used d = C 1 or
described in section 3.3. The chi-square test, which uses ten-day
d = C 2 in their tests. They found a P -value of 0.0599 and
intervals, C = 4 categories, and d = C 2 = 2 degrees of free-
hence did not reject the hypothesis that declustered catalogs
dom, is described in section 3.1. For the inhomogeneous Poisson
process, the KS test rejects in all simulations and the chi-square are Poisson.
test usually does not reject. For the gamma renewal process, the We do not have the catalog used by Gardner and
chi-square test rejects in all simulations and the KS test rarely Knopoff (1974), so we used the SCEC catalog? for the same
rejects. time period, covering approximately the same region. The
SCEC catalog contained 1,556 events with magnitude at
least 3.8 between 1932 and 1971. We declustered the cat-
clear departure from temporally homogeneous Poisson be- alog using Methods 1a, 1b, and 2 with the Gardner and
havior. The KS test rejected in all 10,000 simulations. But Knopoff (1974) windows and using Method 3. (We used
the chi-square test rejected in only 1658 of the 10,000 sim- Stefan Wiemers ZMAP package for MATLAB to apply
ulations: It has little power because it ignores the order of Reasenbergs method.) The declustered catalogs contained
the intervals. 437, 424, 544, and 985 events, respectively. Figure 1 maps
In the gamma renewal process, inter-event times are in- the events in the original catalog and the events that remain
dependent with a Gamma(2, 1) distribution (the time unit after declustering using methods 1b, 2, and 3. The maps for
is ten days). The expected rate is constant, but the distri- methods 1a and 1b are visually indistinguishable.
bution of inter-event times is not exponential, as it would We tested the SITHP hypothesis using the KS test and
be in a homogeneous Poisson process. The chi-square test the chi-square test with C = 4 and d = C 2 = 2. We
rejected in all 10,000 simulations: More intervals have one rejected the null hypothesis if the KS test had a P -value
event and fewer have two or more events than would be ex- less than 0.025 or the chi-square test had a nominal P -value
pected if the process were Poisson. The KS test rejected in less than 0.025. If the null hypothesis is true, the chance of a
only 9 of the 10,000 simulations. Type I error is no greater than 0.05, by Bonferronis inequal-
The KS test is sensitive to rate variations across the ity (ignoring the difference between the nominal and true P -
study period, but not to local departures from exponential value for the chi-square test). We also compared chi-square
inter-event times. The chi-square test is sensitive to depar- tests using = n/k to tests that estimate by solving (6);
tures from exponential inter-event times in short intervals, differences were negligible. Table 2 shows the results. None of
but not to long-term rate variations. Neither test uses spa- the declustered catalogs appears to be Poisson, contradict-
tial information, which we address in section 4. ing Gardner and Knopoff (1974). For all four declustering
methods, the KS test rejects the Poisson hypothesis at level
0.025. The chi-square test rejects at nominal level 0.025 for
3.5 Tests on declustered catalogs
methods 2 and 3.
We consider the following four window-based declustering
algorithms:
Method 1: Variant (a) Remove every event that is 4 SPATIO-TEMPORAL TESTS
in the window of some other event (Gardner and Knopoff
1974). Variant (b) Divide the catalog into clusters as follows: 4.1 A weaker null hypothesis: conditionally
An event is in a given cluster if and only if it is in the window exchangeable times
of at least one other event in the cluster. In every cluster,
The KS test rejects the hypothesis that the marginal distri-
remove all events except the largest (Gardner and Knopoff
bution of times in the declustered SCEC catalogs is Poisson.
1974).
The marginal distribution of times for a SITHP is Poisson.
Method 2: Consider the events in chronological order.
Hence the test in the previous section also rejects the hy-
If the ith event is in the window of a preceding larger shock
pothesis that declustered 19321971 SCEC catalogs are a
that has not already been deleted, delete it. If a larger shock
realizations of a SITHP.
is in the window of the ith event, delete the ith event. Oth-
erwise, retain the ith event (Knopoff and Gardner 1972).
Method 3: Reasenbergs method (Reasenberg 1985). ? http://www.data.scec.org/catalog_search/data_mag_loc.
Methods 1a, 1b, and 3 are linked-window methods. Gardner php
and Knopoff (1974) found that methods 1a and 1b gave sim- http://www.earthquake.ethz.ch/software/zmap
Are declustered earthquake catalogs Poisson? 5
(a) (b)
(c) (d)
Figure 1. (a): 19321971 SCEC catalog of 1,556 events of magnitude 3.8 or greater in Southern California. (b): The 424 events that
remain after declustering using method 1b. (c): The 544 events that remain after declustering using method 2. (d): The 985 events that
remain after declustering using method 3.
Moreover, SITHPs can have events arbitrarily close to- ally exchangeable given event locations. Since event times in
gether. But catalogs declustered with window methods have SITHPs are conditionally iid uniform given event locations,
a minimum spacing between events: If a catalog contains two SITHPs have conditionally exchangeable times given event
events very close in space and time, the later event will fall locations.
within the window of the former, and one or both of them
Under the hypothesis of conditionally exchangeable
will be deleted. Hence, catalogs declustered using window
times, given the set of locations {(xi , yi )} and (separately)
methods cannot be realizations of SITHPs.
the set of times {ti }, all one-to-one assignments of times
To try to salvage part of the SITHP hypothesis, we test
to locations have the same chance. If events close in space
a much weaker condition implied by SITHP: the hypothesis
tend to be close in timethe kind of clustering real seis-
that times are conditionally exchangeable given event loca-
micity exhibitstimes are not conditionally exchangeable.
tions. Let be the set of all n! permutations of {1, . . . , n}.
If events close in space tend to be distant in timewhich
We say a process has conditionally exchangeable times if,
can result from deleting events in windowstimes are not
conditional on the locations,
conditionally exchangeable.
d
{T1 , . . . , Tn } = {T(1) , . . . , T(n) } (10) We test the hypothesis that times are conditionally
exchangeable by adapting abstract methodology developed
d
for all permutations . (The notation = means has by Romano (1988, 1989). Let Pn be the empirical distribu-
the same probability distribution as.) If event times are tion of the times and locations of the n observed events. Let
conditionally iid given event locations, they are condition- Pn be the distribution that has equal mass at every ele-
6 B. Luen and P.B. Stark
Table 3. P -values for tests of the null hypotheses that the 1932
Table 2. P -values for tests of the null hypotheses that the 1932 1971 SCEC catalog of 1,556 events with magnitude 3.8 or larger,
1971 SCEC catalog of 1,556 events with magnitude 3.8 or larger, declustered using Methods 1a, 1b, 2, and 3 of section 3.5, have
declustered using Methods 1a, 1b, 2, and 3 of section 3.5, has a conditionally exchangeable times given the locations of the events.
homogeneous Poisson distribution in time. The number of events The number of events that remain after declustering is n. The
that remain after declustering is n. KS is an upper bound on the P -values are estimated from 10, 000 random permutations. CI
P -value for the Kolmogorov-Smirnov test. Chi-square plug-in is a 99% confidence interval for the P -value that would result
is the nominal P -value for a chi-square test that estimates by from examining all permutations instead of a random sample of
n/K. Chi-Square MLE is the nominal P -value for a chi-square permutations.
test that estimates by maximum likelihood from the category
counts using equation (6). Reject is yes if the P -value for the
KS test or the nominal P -value for the chi-square test is less than
0.025.