Comparing two programs for same data



Hi,

I have the following problem. I have a text corpus in which each word
is tagged with a semantic concept. I have two different algorithms
that both try to assign the correct tag to each word. After running
each algorithm, I can measure the performance of each algorithm by
looking at the number of tokens that it tagged correctly.
For example, suppose there are 3000 words, algorithm1 tags 2115 of
them correctly, and algorithm2 tags 2200 of them correctly.
Now my question is: if one of the algorithms does better than the
other, which test can I apply to check if the difference is
statistically significant?
My knowledge of statistics is superficial. For an expert, this
question is probably not too difficult. Thanks in advance,

Frederik

.



Relevant Pages

  • Re: Comparing two programs for same data
    ... For example, suppose there are 3000 words, algorithm1 tags 2115 of ... if one of the algorithms does better than the ... My knowledge of statistics is superficial. ...
    (sci.stat.math)
  • Re: Comparing two programs for same data
    ... is tagged with a semantic concept. ... For example, suppose there are 3000 words, algorithm1 tags 2115 of ... if one of the algorithms does better than the ...
    (sci.stat.math)