|
PC-ORD
Version 5 Review
Journal of Vegetation Science 17: 843-844, 2006, reproduced with
permission
by Grandin, Ulf
Department of Environmental Assessment
Swedish University of Agricultural Sciences
Box 7050
SE 75007 Uppsala
PC-ORD5 review JVS.pdf |
 |
PC-ORD version 5: A user-friendly toolbox for ecologists
Abstract
Recently, version 5 of PC-ORD, one of the major commercial software packages for
multivariate ecological community data analyses, was released. The new version offers a
whole range of techniques and methods for analyses of ecological data. It includes modules
for different types of ordination and classification, as well as other exploratory
techniques such as species-area curve analysis and indicator species analysis. Data are
stored in spreadsheets and can be easily manipulated in various ways. In essence, version
5 of PC-ORD offers the user a full toolbox for exploration and analysis of ecological
data, packed in a user-friendly environment.
Description
Recently, a new version of PC-ORD, a software package for multivariate analysis of
ecological data has been released. This package, developed by Bruce McCune and others
(McCune & Grace 2002) is one of the major commercial software packages for
multivariate ecological community data analyses. The new version 5 includes both
enhancements of existing analyses as well as new features. Among the new features is an
extended graph module with possibilities for 3D ordination plots, two-way cluster
dendrograms, dominance-diversity curves and frequency-abundance plots, and frequency
distributions. The main improvements to the previous graph module are better options for
editing graphs, and increased export options. The previous tray of analyses is extended
with permutation-based MANOVA with one-way, factorial, nested, and blocked designs,
two-way cluster analysis, smoothed univariate frequency distributions, and a function that
displays the most important summary features of a data set. The previous analyses are
enhanced with randomization tests for PCA, cluster analysis directly from a distance
matrix, writing of a distance matrix to spreadsheet or text file, and an option to break
down row and column summaries by a variable in the second matrix. To help users to select
the appropriate analysis, an advisor wizard, based on a decision tree, is added. Data
management and import/export has been improved. Version 5 allows for example simultaneous
adjustment of main and second matrices, and filtering rows by a criterion variable.
Review
Once the new user has become acquainted with the somewhat antiquated way of entering data,
PC-ORD version 5 offers a wide variety of tools for exploring data and testing hypotheses
in community ecology. The software is a collection of classical as well as more novel
statistics, used in numerical ecology. In addition to a variety of ordination and
classification techniques, the program also includes modules for testing group identity,
constructing species-area curves, Mantel tests and non-parametric MANOVA.
The interface is intuitive and easy to understand. It is easy to keep track of different
datasets and variables through complex analyses in several steps. There are a number of
possibilities for data transformation, manipulation and permutation. In all analyses,
results from intermediate calculations as well as final results are written to a results
window that can be saved. Additionally, ordination scores are written to a separate file,
which facilitates export.
For ecologists, multivariate statistical methods may be divided into hypothesis generating
(i.e. exploratory), and hypothesis testing methods (Økland 1996). Version 5 of PCORD
offers a wide variety of both types. The exploratory, or indirect, type of methods
includes traditional analyses such as principal components analysis, correspondence
analysis, and detrended correspondence analysis. In addition, there is an array of methods
for summarising and inspecting data, including e.g. calculation of diversity indices and
outlier analysis. Interesting and useful additional exploratory techniques include
species-area curves analysis and indicator species analysis (Dufrêne & Legendre
1997).
The hypothesis testing, or constrained, methods include both multidimensional scaling as
well as X2-based methods such as canonical correspondence analysis.
There are options for permutation tests of group identity but there is no option for
testing the significance of individual explanatory variables prior to a constrained
ordination. However, the graph module offers an elegant way of inspecting the contribution
of the individual explanatory variables. In ordination, PC-ORD can plot the relationship
between an ordination axis and individual species as well as explanatory variables.
For classification, PC-ORD offers a wide variety of tools. In the modules for both one-
and two-way hierarchical classification, a user may choose among many combinations of
distance measures and agglomeration techniques. The classical method TWINSPAN (Hill 1979)
is also included.
A new feature in the current version is a dichotomous decision tree for helping users to
select an appropriate method. The intentions behind this tree are obvious, but to be able
to answer the sometimes quite complex questions, the user has to be very familiar with
multivariate methods. My feeling is that a user who has the experience to be able to
answer the questions probably does not need the decision tree. Anyhow, for a user that has
just started using these techniques, the tree may be of great help, given that the user
knows the nomenclature. A more advanced user may use the tree to explore the capabilities
of the program.
Another interesting feature is the possibility of including your own programs as add-in
tools. In the standard installation, a program for calculating degree of nested-ness
(sensu Patterson & Atmar 1986) is included. This option may not be the most important
feature for a new or intermediate user, but is a means for the more advanced user to
personalise the program.
The graph module is easy to use and offers a user to view ordination results in both two
and three dimensions. An interesting feature is the possibility of drawing successional
vectors in ordination diagrams. Results of classifications are illustrated with
dendrograms in one or two dimensions, with scales showing distance, and remaining
information along a hierarchical tree. Produced graphs are of publication quality and can
be saved in a number of formats. There are numerous options for personalizing a graph,
including varying symbol sizes, labels, vectors, grids, and construction of joint plots.
Documentation of the program is only provided as comprehensive help files obtained from
within the program. The content of the help files is sufficient, with both examples as
well as theoretical background for the different techniques included in the program.
However, many users would probably prefer the documentation as a printed hardcopy.
PC-ORD can only be run under the operating system Windows, version Win98 or higher. The
program can accept data matrices with more than 500 million elements, or a maximum of
32000 columns or rows. This is probably larger than most ecological datasets. The price
for a single user licence is competitive compared to other similar commercial software. A
site licence is on the other hand relatively expensive as the cost increases with the
number of users. The website (www.pcord.com) offers online ordering, but the program
cannot be downloaded.
Many of the techniques and modules included in PCORD can also be found on the Internet as
self-standing freeware. VEGAN (Oksanen 2006) and Ginkgo (Font et al. 2006; see Bouxin
2005) are examples of free software for multivariate techniques, written for ecologists.
The PC-ORD module for species-area relationships is a light version of the freeware
EstimateS (Colwell 1997). TWINSPAN and IndVal which both are included in PCORD are also
available for free. However, in PC-ORD most necessary techniques for exploring and
analysing ecological data are collected in one common frame, with no need for repeated and
time-consuming data preparation for several programs.
In summary, PC-ORD offers a wide range of tools for analysing ecological data in a
user-friendly environment.
References
Bouxin, G. 2005. Review of Ginkgo, a multivariate analysis package. J. Veg. Sci. 16:
355-359.
Colwell, R.K. 1997. EstimateS: Statistical estimation of species richness and shared
species from samples. Version 5. User's guide and application. Published at:
http://viceroy.eeb.uconn.edu/estimates.
Dufrêne, M. & Legendre, P. 1997. Species assemblages and indicator species: the need
for a flexible asymmetrical approach. Ecol. Monogr. 67: 345-366.
Font, X., de Cáceres, M. & García, M. 2006. Ginkgo, a multivariate analysis tool.
See http://biodiver.bio.ub.es/vegana/index.html
Hill, M.O. 1979. TWINSPAN A FORTRAN program for arranging multivariate data in an
ordered two-way table by classification of the individuals and attributes. Cornell
University, Ithaca, NY, US.
Oksanen, J. 2006. Vegan: R functions for vegetation ecologists. Available at:
http://cc.oulu.fi/~jarioksa/softhelp/vegan.html
Patterson, B.D. & Atmar, W. 1986. Nested subsets and the structure of insular
mammalian faunas and archipelagos. Biol. J. Linn. Soc. 28: 65-82.
McCune, B. & Grace, J.B. (with Urban, D.L.) 2002. Analysis of ecological communities.
Mjm Software Design, Gleneden Beach, OR, US.
Økland, R.H. 1996. Are ordination and constrained ordination alternative or complementary
strategies in general ecological studies? J. Veg. Sci. 7: 289-292.
PC-ORD
Version 5 Testimonials
Marlin L. Bowles
Plant Conservation Biologist
The Morton Arboretum, USA
I have published four other papers that used PCORD-generated graphics, as well as
another that used TWINSPAN on PCORD to identify ecologically related groups. I have
also used the Bray/Curtis program on PCORD to generate similarity indices for several
papers. Needless to say, I cant say enough about how useful PCORD has been.
The new version should help even more.
Ethan Bright, Ph.D. Candidate
School of Natural Resources and Environment
The University of Michigan
Ann Arbor, Michigan, USA
I predict PC-ORD 5 will be a well-received improvement on the previous version.
Besides improving the program's statistical and graphical routines, the addition of an
"analytical wizard" and its ability to keep track (with a text file) of the
decision-making process make this an invaluable resource for both student and professional
alike.
PC-ORD
Version 4 Review
Bulletin of the Ecological Society of America 81:127-128. (2000)
by Aaron M. Ellison
Department of Biological Sciences
Mount Holyoke College
South Hadley, MA |
 |
PC-ORD is a software package for multivariate analysis and
classification of ecological data. The DOS version (version 2) was reviewed in the January
1996 ESA Bulletin, and the first Windows (16-bit) version (version 3) was
reviewed in the April 1998 ESA Bulletin. In early 1999, MjM released the 32-bit
product (version 4), reviewed here, which is no longer compatible with Windows 3.x, and
like most new releases, demands more memory and disk space than earlier versions. If
you're no longer using Windows 3.x, upgrading to PC-ORD version 4 has significant
advantages over version 3.
Version 4 of PC-ORD requires an 80486 or better CPU, which means
it could run on the new computers in the Hubble Telescope, but it's unlikely you could run
Windows 95/98/NT efficiently on an 80486 CPU. The software occupies about 5.5 Mb of hard
disk space and uses a minimum of 8 Mb RAM. PC-ORD will use all available memory for matrix
operations, so the previous 16 Mb limit on matrix size has been removed. The only
remaining constraint to matrix size is that the default format for matrices, *.wk1 (Lotus
version 2.0), allows matrices no larger than 32,000 rows x 32,000 columns.
Available analysis routines fall into two broad groups: ordination and
classification. Of the routines in Table 1, blocked multiresponse
permutation procedure (MRBP) and weighted averaging are new to version4. Nonmetric
multidimensional scaling (NMS) has been significantly enhanced to include an
"autopilot mode" that speeds through multiple runs and significance tests, and a
"predictive-mode" NMS that calculates scores for new data points based on prior
ordinations.
Plotting of species in ordination space, by using weighted averaging to
calculate their scores, is now available in NMS, Principal Components Analysis (PCA), and
Bray-Curtis ordinations. Distance measures available include Euclidean (raw, squared, and
relativized), Sorenson (raw and relativized), Jaccard, correlation, and chi-squared. In
addition, data summaries (mean, SD, sum, minimum, maximum, skewness, kurtosis, CV, species
richness (S), Shannon-Weiner diversity (H'), Shannon-Weiner evenness (H'/ln[S]),
and Simpson's index of diversity (D) can be calculated for rows (sites) or
columns (species). Identification of outliers (matrix rows or columns) based on all
distance measures is accomplished by a separate routine. Basic species-area analysis for
determining adequacy of sampling is also included.
Beginning with version 3, PC-ORD produced publication-quality graphs
from most routines. These have been rounded out in version 4, which includes
publication-quality graphs for cluster analysis (dendrograms), species-area curves (with
confidence bands), and NMS scree plots. Graphics files are output as *.emf
(windows-enhanced metafiles) or *.bmp (bitmapped). Data management has also improved in
version 4: spreadsheets can be edited (albeit without full Windows capabilities), data
transformed or relativized, matrices transposed or multiplied, rows or columns deleted
(based on user-defined criteria, such as emptiness or sparseness), shuffled (randomized),
or smoothed. Acceptable formats for input data files remain small (*.wk1 spreadsheet,
PC-ORD compact format, PC-ORD version 1 format, DECORANA/TWINSPAN condensed format, list
format, and comma-separated values (CSV) format), but are easily created with ASCII text
editors or spreadsheet programs. Finally, like many new statistical packages, PC-ORD saves
work as a "project" (*.prj) file, which is really a set of associated
files (options, settings, matrices, results, graphics) produced by PC-ORD. This
facilitates organization of a set of analyses and increases efficiency, because options
and settings do not have to be re-entered at the start of each session. Individual files
can still be saved one at a time.
PC-ORD is still one of the most easily used, comprehensive packages for
multivariate analysis of ecological data. Many of the routines are unavailable in standard
statistical packages (which at best usually provide only PCA and cluster analysis). The
version 4 user's manual provides somewhat more information on the pitfalls of different
techniques and options than earlier manuals, but still assumes general familiarity with
the literature on multivariate methods. Routines in PC-ORD are current, and the authors
are quick to correct bugs and revise algorithms as new ideas are published. Incremental updates and patches are available free from their web site <http://:www.pcord.com>. The package is reasonable priced and
should be considered strongly for research and teaching applications.
Literature cited
Beals, E. W. 1984. Bray-Curtis ordination: an effective strategy for
analysis of multivariate ecological data. Advances in Ecological Research 14:1-55.
Bray, J. R. and J. T. Curtis. 1957. An ordination of upland forest
communities of southern Wisconsin. Ecological Monographs 27:325-349.
Grieg-Smith, P. 1983. Quantitative plant ecology. Third edition.
Blackwell Scientific, Oxford, UK.
Hill, M. O. 1979a. DECORANA--A FORTRAN program for detrended
correspondence analysis and reciprocal averaging. Section of Ecology and Systematics,
Cornell University, Ithaca, New York, USA.
Hill, M. O. 1979b. TWINSPAN--A FORTRAN program for arranging
multivariate data in an ordered two-way table by classification of the individuals and
attributes. Section of Ecology and Systematics, Cornell University, Ithaca, New York, USA.
Mantel, N. 1967. The detection of disease clustering and generalized
regression approach. Cancer Research 27:209-220.
Matcher, P. M. 1976. Computational methods of multivariate analysis in
physical geography. John Wiley and Sons, London, UK.
Mielke, P. W., Jr. 1984. Meteorological applications of permutation
techniques based on distance functions. Pages 813-830 in P. R. Krishnaiah and P.
K. Sen, editors. Handbook of statistics. Volume 4. Elsevier Science, The Hague, The
Netherlands.
Okansen, J., and P. R. Minchin. 1997. Instability of ordination results
under changes in input data order: explanations and remedies. Journal of Vegetation
Science 8:447-454.
ter Braak, C. J. F. 1986. Canonical correspondence analysis: a new
eigenvector technique for multivariate direct gradient analysis. Ecology 67:1167-1179.
Whittaker, R. H. 1967. Gradient analysis of vegetation. Biological
Reviews 42:207-264.
Table 1. Analytical methods available in PC-ORD
version 4 for multivariate ordination and classification
| Type and method |
Algorithm |
| Ordination |
|
| Bray-Curtis |
Bray and Curtis (1957), Beals (1984) |
| Canonical Correspondence Analysis (CCA) |
ter Braak (1986) with corrections of Okasanen and Minchin
(1997) |
| Detrended Correspondence Analysis (DCA) |
Hill (1979a) with corrections of Okasanen and
Minchin (1997) |
| Nonmetric Multidimensional Scaling (NMS) |
Mather (1976) |
| Principal Components Analysis (PCA) |
Grieg-Smith (1983) |
| Reciprocal Averaging |
Hill (1979a) |
| Weighted Averaging Classification |
Whittaker (1967) |
| Classification |
|
| Cluster Analysis |
|
| Multiresponse Permutation Procedures (MRPP) |
Mielke (1984) |
| Blocked MRPP (MRPP) |
Mielke (1984) |
| Two-way Indicator Species Analysis (TWINSPAN) |
Hill (1979b) |
| Indicator Species Analysis |
Durêne and Legendre (1997) |
| Mantel test |
Mantel (1967) |
|