Re: Logistic Regression
- From: clemenr@xxxxxxxxxx
- Date: 30 May 2005 13:36:31 -0700
Thanks. As a general wrap-up to this thead.
(i) Using linear regression discriminating two authors labelled 0/1
performs better than logistic regression on the same data.
(ii) Putting maximum limits on weights improves performance for
logistic regression applied to, say, 12 variables. Unchanged logistic
regression on 5 variables still gives higher performance.
(iii) On small numbers of variables, this considerably out-performs a
naive Bayes classifier I had been using.
(iv) On larger numbers of variables, the regression approaches are
extremely poor. The naive Bayes classifier performs much better.
(v) There has been no need for authorship attribution approaches that
optimise performance on small numbers of variables for at least 40
years.
(vi) The information fusion approach I described in a previous posting
where a regression is fitted to the first N variables, and then another
is fitted to the next N variables, and so on, is worthless.
(vii) Converting my compositional data to non-compositional data did
not help solve the problems of (vi) in the slightest.
(viii) Trying a few seat-of-the-pants non-linear transformations of my
data into higher numbers of dimensions did not help in the slightest.
Conclusion: I'd previously played with classifiers based on linear
regression, and obtained results that were so poor, that I assumed that
my code must have bugs and never mentioned any results to anybody.
Given that I have new code in a different language (C/gretl versus
Java/Jama) and have found that I get good fits in lower dimensions, I
think that these approaches are unsuitable for my data.
I may revive these programs when I get around to doing some more
information fusion experiments.
Thanks to everyone who posted on this thread.
Next on the list of things to do: Support Vector Machines. Sadly I've
already concluded that it will not be reasonable for me to write my own
quadratic programming code :-(
Cheers,
Ross-c
.
- References:
- Logistic Regression
- From: clemenr
- Re: Logistic Regression
- From: Phil Sherrod
- Re: Logistic Regression
- From: clemenr
- Re: Logistic Regression
- From: Ray Koopman
- Re: Logistic Regression
- From: G Robin Edwards
- Re: Logistic Regression
- From: clemenr
- Re: Logistic Regression
- From: G Robin Edwards
- Logistic Regression
- Prev by Date: Re: a principal component analysis question
- Next by Date: Re: a principal component analysis question
- Previous by thread: Re: Logistic Regression
- Next by thread: Ian Stewart's Web Pages?
- Index(es):
Relevant Pages
|