25 lines
891 B
Plaintext
25 lines
891 B
Plaintext
Return-Path: barry@python.org
|
|
Delivery-Date: Fri Sep 6 17:59:49 2002
|
|
From: barry@python.org (Barry A. Warsaw)
|
|
Date: Fri, 6 Sep 2002 12:59:49 -0400
|
|
Subject: [Spambayes] test sets?
|
|
References: <LNBBLJKPBEHFEDALKOLCAEHLBCAB.tim.one@comcast.net>
|
|
<LNBBLJKPBEHFEDALKOLCKEHPBCAB.tim.one@comcast.net>
|
|
Message-ID: <15736.57093.811682.371784@anthem.wooz.org>
|
|
|
|
|
|
TP> A false positive *really* has to work hard then, eh? The long
|
|
TP> quote of a Nigerian scam letter is one of the two that made
|
|
TP> it, and spamprob() looked at all this stuff before deciding it
|
|
TP> was spam:
|
|
|
|
Here's an interesting thing to test: discriminate words differently if
|
|
they are on a line that starts with `>' or, to catch styles like
|
|
above, that the first occurance on a line of < or > is > (to eliminate
|
|
html).
|
|
|
|
Then again, it may not be worth trying to un-false-positive that
|
|
Nigerian scam quote.
|
|
|
|
-Barry
|