Escolar Documentos
Profissional Documentos
Cultura Documentos
Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at
http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless
you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you
may use content in the JSTOR archive only for your personal, non-commercial use.
Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at
http://www.jstor.org/action/showPublisher?publisherCode=astata.
Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed
page of such transmission.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
American Statistical Association is collaborating with JSTOR to digitize, preserve and extend access to Journal
of the American Statistical Association.
http://www.jstor.org
1320 Journalof the AmericanStatisticalAssociation,December 2000
'simple' methodsfor solving maximumlikelihoodequa- these authorswere the firstto put fortha unifiedframe-
tions,especiallyforuse whenthereare singleobservations workshowingthesimilarities betweenseeminglydisparate
at each x value,so thatthelinearizingtransformation
is not methods,such as probitregression, linearmodels,and con-
applicable."Note the need forsimplemethodsdespitethe tingencytables.Theyrecognizedthatfitting a probitregres-
factthat"computers"in 1966 are now machines. sion by iterativefitsusing the "workingprobits,"namely
For thelogisticregressionmodel akin to (2), namely (4), couldbe generalizedin a straightforwardwayto unifya
wholecollectionof maximumlikelihoodproblems.Replac-
Y indep.Bernoulli(p) ing q (.) witha general"link"function, g(.), and defining
and a "workingvariate"via
whereY1*is equal to 3 fora success and -3 fora failure. 1. What is the distribution of the data (forfixedvalues
That is, we can approximatethe logisticregressioncoeffi- of the predictorsand possibly aftera transformation)?
cientsin a crudewaybyan ordinary leastsquaresregression 2. What function of the mean will be modeledas linear
on a coded Y. in the predictors?
3. What will thepredictorsbe?
Logistic regressionis oftenused as an alternatemethod
fortwo-groupdiscriminant analysis(Cox and Snell 1989), What advantagesdoes thisapproachhave? First,it unifies
by usingthe(binary)groupidentifier as the"response"and whatappearto be verydifferent methodologies, whichhelps
themultivariate vectorsas the"predictors." This is a useful to understand, use, and teach the techniques.Second, be-
alternativewhenthe usual multivariate normalityassump- cause therightside of the equationis a linearmodel after
tionforthemultivariate vectorsis questionable;forexam- applyingthe link,manyof the conceptsof linear models
ple, whenone or morevariablesare binaryor categorical. carryover to GLMs. For example,the issues of full-rank
When it is reasonableto assume multivariate normality, versusoverparameterized models are similar.
theusual Fisherdiscriminant functionis givenby S- 1(X1 - The applicationof GLMs became a realityin the mid
X2), whereXi is themeanof thevectorsfortheithgroup. 1970s, when GLMs were incorporatedinto the statistics
If we code the successes and failuresas 1 and -1, then package GENSTAT and made availableinteractively in the
Xi - X2 = X'Y. Thus we see thatthedifference between GLIM software.Users of thesepackages could thenhan-
logisticregressionand discriminant functionanalysisis 2, dle linearregression, logisticand probitregression, Poisson
in absolutevalue. regression,log-linearmodels,and regressionwithskewed
continuousdistributions, all in a consistentmanner.Both
2. ORIGINS packages are stillwidelyused and are currently distributed
GLMs appeared on the statisticalscene in the path- by the NumericalAlgorithmsGroup (www.nag.com).Of
breakingarticleof Nelder and Wedderburn(1972). Even course,by now,mostmajor statisticalpackages have facil-
thoughvirtuallyall of the pieces had previouslyexisted, itiesforGLMs; forexample,SAS Proc GENMOD.
Theoryand Methodsof Statistics 1323