Escolar Documentos
Profissional Documentos
Cultura Documentos
Ordination Methods
Correspondence Analysis (CA) Detrended Correspondence Analysis (DCA) Factor Analysis (FA) Principles of Canonical Analysis Redundancy Analysis (RA) Canonical Correspondence Analysis (CCA) Canonical Correlation Analysis (CCorA) Discriminant Analysis (DA)
1. Direct Gradient Analysis................................2 2. Few species...........................................4 4. Monotonic responses to gradients (low beta).......Linear regression 4. Nonmonotonic responses to gradients.(high beta)......Generalized linear models 2. Many species..........................................5 5. Monotonic responses ............................RDA 5. Nonmonotonic responses.............................6 6. concerned about arch effect..................DCCA 6. not concerned about arch effect...............CCA 1. Indirect Gradient Analysis..............................3 3. Only distance values are available....................7 7. Monotonic responses ............................PCoA 7. Nonmonotonic responses..........................NMDS 3. Raw data available....................................8 8. Monotonic responses ...............................9 9. Variables noncommensurate......PCA - corr. matrix 9. Variables commensurate..........PCA - cov. matrix 8. Nonmonotonic responses............................10 10. Feel OK about prespecifying number of dimensions, not worried about local optima, not interested in species scores..............NMDS 10. Not as above, but willing to accept either arch effect or detrending/rescaling................11 11. Don't like arch, detrending OK ..........DCA 11. Arch OK, or only interested in axis 1.....CA
Dichotomous Key for Ordination Methods Not 100% accurate, but a good place to start. (see Palmer 1998)
6/1/10
6/1/10
CA
- Example -
Let's use an example other than the stand " species situation we have been looking at (although we could do this here too) and consider the relative abundance (0, +, ++) of a particular species observed at 100 sites. The temperature at each site was recorded and coded (1,2,3): (Descr.-2) Sp. is: Row Sums
Temp. (Descr.-1) Cold (1) Med. (2) Warm (3) Col. Sums
Rare (0) 10 10 15 35
Abund. (+) 10 15 5 30
40 35 25 100
CA
- Example -
Matrix Q contains the proportions pij and the marginal totals pi+ and p+j of the rows and columns, respectively. Identifiers of the rows and columns are given outside the matrix brackets in parentheses:
CA
- Example -
The eigenvalues of Q'Q are: #1 = 0.09613 (70.1%) and #2 = 0.04094 (29.9%) and #3 = 0 (because of centering)
6/1/10
CA
- Example The normalized eigenvectors of Q'Q are then:
CA
- Example In Scaling Type-1, F and V are determined to produce a CA joint plot:
Now, to put the rows (matrix F) at the centroids of the columns, the position of each row along an ordination axis is computed as the mean of the column positions, weighted by the relative frequencies of the observations in the various columns of that row...
CA
- Example -
Consider the first row of the original data. The relative frequencies of that row are 0.25, 0.25, 0.50. Multiplying matrix V by that vector provides the coordinates of the first row of the ordination diagram:
continuing:
6/1/10
CA
- Example -
CA using R
CA.csv
CA using R
6/1/10
CA using R
CA using R
Data Tables
Correspondence analysis has been applied to many types of data tables other than contingency tables. However, as a caveat, recognize again that in order for CA to work correctly, the data table must be dimensionally homogeneous (i.e., in the same physical units) and nonnegative ($ 0). If the data do not meet these assumptions, they may be transformed or recoded. This is a critical step in CA.
6/1/10
Arch Effect
Let's return to notion of coenocline distortion that we first considered in PCA. Recall that most of the these procedures require linear (or at least monotonic) responses. Species data, in particular, is usually unimodally distributed across a gradient. Recall that this problem usually manifests itself in the form of an arch or horseshoe in the data projection. Some ecologists are willing to tolerate this distortion while others feel that an attempt should be made to recover the original gradient via detrending.
Arch Effect
The most extreme form of the arch effect usually occurs while attempting to apply a Euclidean distance measure to species abundance data. A horseshoe is formed because the ends actually contract and fold inwards at the ends of Axis-1 and bend along Axis-3. This is because ED considers the extreme sites to be very near each other. In most instances, CA does not exhibit such a dramatic folding towards the terminal portions, but rather just bends along Axis-1 to form an arch.
6/1/10
6/1/10
Detrending by Segments
When detrending by segments (Hill and Gauch 1980), axis-I is divided in to a number of "segments" and, within each one, the mean of the scores along axis II is made equal to zero; in other words, data points in each segment are moved along axis-II to make their mean coincide with the abcissa. Proximities among points should in no case be interpreted as meaningful! Segments can generate large differences in scores for points that are near each other in the original ordination but happen to be on either side of a segment division. The number of segments is arbitrary. Different numbers of segments lead to different ordinations.
Detrending by Segments
Various software packages use 10 as a minimum number of segments and 46 as a maximum; 26 being a recommended starting or default value. This of course necessitates data sets with considerably more observations than 26. There are no empirical rules for the "correct" number of detrending segments. After detrending by segments, the DCA ordination has the interesting property that the axes are scaled in units of the average standard deviation (SD) of species turnover (Gauch 1982). Along a regular gradient, a species typically appears, rises to a modal value, and disappears in 4 SD; similarly, a complete turnover in species composition often occurs over 4 SD. Thus, the length of axis-I is often used as a measure of the length of the ecological gradient.
Detrending by Polynomials
Detrending by polynomials (Hill and Gauch 1980) directly follows from the fact that an arch is produced when a gradient of sufficient length is present in the data. When a sufficient number of species are present and replace each other along the gradient, the second axis of the CA approaches a quadratic function of the first one (i.e., a second degree polynomial). When detrending is sought, detrending by polynomials is an attractive method because it results in a continuous function of the previous axes, without the discontinuities generated by detrending-bysegments. However, the downside of this method is that it imposes a very specific polynomial model that the data must correspond to. It also does not solve terminal gradient compressions at the ends of the ordination axes.
6/1/10
2 1
DCA Axis 1
10
6/1/10
Factor Analysis
In the social sciences, analysis of the relationships among the descriptors of a multidimensional data matrix is frequently carried out via Factor Analysis (FA). Recall that the goal of PCA is to account for a maximum amount of the variance in the data, whereas the goal of factor analysis is to account for the covariance among descriptors. Put another way, PCA is directed towards reducing the diagonal elements of R. Factor analysis is directed more towards reducing the off-diagonal elements of R. Since reducing the diagonal elements reduces the off-diagonal elements and vice versa, both methods achieve much the same thing.
Factor Analysis
To do this, FA assumes that the observed descriptors are linear combinations of hypothetical underlying variables (i.e., the factors). Originally FA was used to evaluate such things as intelligence. Many variables could be measured such as age, parental education, family income, etc. Multiple variables might play out to show that Factor-1 was determined by all variables related to education and Factor-2 to socio-economic conditions (for example). There are few applications of FA in EEB, so I will not cover it in depth. An excellent treatment can be found in Tabachnick and Fidell (1996).
11
6/1/10
Canonical Analysis
Canonical analysis is the simultaneous analysis of two, or eventually several data tables. It permits biologists to do a direct comparison of two data matrices. Hence, canonical analysis and its derivatives are known as direct ordination methods. Often, in ecology, one is interested in the relationship between a first table describing species composition and a second table of environmental descriptors, observed at the same locations (i.e., objects or samples).
Canonical Analysis
Previous to this, we have considered indirect ordination methods (PCA, PCO, NMDS, CA, DCA) in that we would ordinate a species " stand matrix and then conduct some form of correlation or regression analysis on the ordination vectors to relate objects or descriptors to externally obtained environmental information. This procedure is performed a posteriori. In canonical analysis, with two matrices (X and Y), one is constrained by the other, and both are examined simultaneously. This permits one to directly test a priori hypotheses by bringing all of the variance of Y that is directly related to X and allowing formal tests of the hypotheses.
Canonical Form
In mathematics, a canonical form is the simplest and most comprehensive form to which certain functions, relations, or expressions can be reduced without loss of generality. For example, the canonical form of a covariance matrix is its matrix of eigenvalues. In general, most methods of canonical analysis employ eigenanalysis (some extensions have been described using NMDS). Canonical analysis combines the concepts of ordination and regression. It involves a response matrix Y and an explanatory matrix X. (See next slide.) Like previous ordination methods, canonical analysis produces orthogonal axes from which scatter diagrams may be plotted.
12
6/1/10
Variables y1...yp
Objects 1 to n
Var. y
Objects 1 to n
Variables x1...xp
Variables y1...yp
Objects 1 to n
Variables x1...xp
Ordination of Y under the constraint of X: Redundancy Analysis (RDA) or Canonical Correspondence Analysis (CCA)
Problems of canonical analysis can be represented via a partitioned covariance matrix resulting from the fusion of Y and X data sets and producing a joint dispersion matrix SY+X...
Submatrices SYY (order p ! p) and SXX (m ! m) concern each of two sets of descriptors, respectively, where SYX (p ! m) and its transpose S'YX = SXY (m ! p) account for the covariances among the descriptors of the two groups.
Redundancy Analysis
In redundancy analysis (RDA), each canonical ordination axis corresponds to a direction, in the multivariate scatter of objects (Y), which is maximally related to a linear combination of the explanatory variables X. A canonical axis is thus similar to a principal component. Two ordinations of the objects are obtained: (1) linear combinations of the Y variables (matrix F in PCA), (2) linear combinations of the fitted Y-hat variables (matrix Z), which are thus also linear combinations of the X variables. RDA preserves the Euclidean distance among objects in matrix Y-hat containing values of Y fitted by regression to the explanatory variables X.
13
6/1/10
14
6/1/10
Canonical Analysis
Unfortunately, we do not have the time to develop the details of the algebra of each of the 4 methods of canonical analysis previously described. But, you have now gained all of the necessary skills necessary to interpret the details on your own should you need to pursue one of these analyses. Two excellent sources of of information on these methods can be found in Legendre and Legendre (1998), ter Braak and !milauer (1998), and Lep" and !milauer (2003).
Canonical Analysis
As an alternative to a detailed treatment of mathematics behind each method, I would like to develop a worked example. Let's develop a data set using the number of fish observed at 10 sites along a transect running from the beach of a Caribbean island, with water depths going from 1 to 10 m. The first three sites are on sand and the others alternate between coral and "other substrate" (coded as 0/1).
1 2 3 4 5 6 7 8 9 10
0 0 0 0 1 0 1 0 1 0
1 1 1 0 0 0 0 0 0 0
0 0 0 1 0 1 0 10 1 0
15
6/1/10
RDA using R
- Tropical Fish Data Set -
RDA using R
- Tropical Fish Data Set -
16
6/1/10
RDA using R
- Tropical Fish Data Set -
RDA using R
- Tropical Fish Data Set -
RDA using R
- Tropical Fish Data Set -
17
6/1/10
RDA using R
- Tropical Fish Data Set -
Discriminant Analysis
A common situation arises in EEB applications where one starts with an already known grouping of objects, and one wishes to assess how well a group of quantitative descriptors can explain the object groups. Thus, the problem is no longer how to define or delineate groups, but rather how to interpret them. This is the realm of discriminant analysis. Discriminant analysis is a method of linear modeling, like analysis of variance, multiple regression, and canonical correlation analysis. DA is frequently used in systematics.
Discriminant Analysis
DA proceeds in two steps: (1) It first tests for the differences in the explanatory variables (X), among the predefined groups. This part of the analysis is identical to the overall test performed in the MANOVA. (2) If the test supports the alternative hypothesis of significant differences among groups in the X variables, the analysis proceeds to find the linear combinations (called discriminant functions) of the X variables that best discriminate the groups.
18
6/1/10
Discriminant Analysis
Like one-way ANOVA, discriminant analysis considers a single classification criterion (i.e., division of the objects into groups) and allows one to test whether the explanatory variables can discriminate among the groups. Testing for differences among group means in DA is identical to ANOVA for a single explanatory variable and to MANOVA for multiple explanatory variables. When it comes to modeling, i.e., finding the linear combinations of the variables (X) that best discriminate among the groups, DA is a form of "inverse analysis" where the classification criterion is considered to be the response variable (y) whereas the quantitative variables are explanatory (matrix X).
Discriminant Analysis
Note that discriminant analysis (DA) is also called canonical variates analysis (CVA). This method was first proposed by Fisher (1936) where he published the now famous data set where he described the morphology of 150 specimens of irises (Iridaceae) using 4 measured flower characters (lengths and widths of sepals and petals) belonging to three species. Again, in the interest of time, we will bypass the mathematical treatment of DA and work through the iris data set using a software application (NCSS).
Discriminant Analysis
Shown here are Fisher's data for the 150 plants (first 31 shown), four variables, and three species (coded 1,2,3: 1 = Iris setosa, 2 = Iris versicolor, and 3 = Iris virginica . Note that I. versicolor is actually a polyploid hybrid of the other two species. Data were originally collected by the botanist Edgar Anderson of the Missouri Botanical garden and used with permission.
19
6/1/10
This chart plots the values of the first and second discriminant function scores. By looking at this plot you can see what the classification rule would be. The first function appears to be the most important in separating the three species.
20