« Notes from the Stanford Spectrum Conference | Main | It's a Supernova kind of day! »

March 4, 2003

Popfile Update #4

I cut down the number of "buckets" into which messages are classified, and Popfile's accuracy in filtering spam has gone up to about 95%. However, I still get false positives, roughly one per day. Classifying real mail as spam is much worse than letting a few spams through. This is an area where Bayesian filters like Popfile do better than rule-based systems, but better may not be good enough. After spending hours training Popfile on over 3,000 messages, I'm at the point where I think it's worth using, though just barely.

Posted by Kevin Werbach at March 4, 2003 9:44 AM

Trackback Pings

TrackBack URL for this entry:

Comments

Post a comment

Thanks for signing in, . Now you can comment. (sign out)

(If you haven't left a comment here before, you may need to be approved by the site owner before your comment will appear. Until then, it won't appear on the entry. Thanks for waiting.)


Remember me?