Gentoo Forums
Gentoo Forums
Gentoo Forums
Quick Search: in
What spamfilter do you recommend?
View unanswered posts
View posts from last 24 hours

Goto page Previous  1, 2  
Reply to topic    Gentoo Forums Forum Index Networking & Security
View previous topic :: View next topic  
Author Message
bisho
n00b
n00b


Joined: 02 May 2003
Posts: 69
Location: Madrid (Spain)

PostPosted: Tue Sep 02, 2003 11:20 pm    Post subject: Reply with quote

For a multiuser system the best one is DSPAM. It uses bayesian filter improved (not only analizes word statistics but sentences as well), each user has its own data, the data is cleaned with unused words to keep the databases clean, provides a reply to an address based training, etc.

For one person I use bogofilter, because its really fast and I have a bunch of mail & spam samples to teach the filter and get it working well fast.

Spamassasain could be a good option to start, to catch emails with the rules until the bayesian filter is trained. At first it would be slow, but at the end you could disable all the rules but bayesian when is trained enought.
_________________
-=[ bisho [=-
Back to top
View user's profile Send private message
Regor
Guru
Guru


Joined: 06 Aug 2002
Posts: 545
Location: 39° 2' 48" N, 120° 59' 2" W

PostPosted: Wed Sep 03, 2003 5:59 am    Post subject: Reply with quote

eyevee99 wrote:
I've just decided to install spambayes... How do I set it up to run when I log in? Does it have to run as root etc?

Wouldn't each user have to run their own version?

Cheers,
ryan


How you run it depends on what your needs are. Personally, I invoke it via procmail since I'm already using it to parse mail into different inboxes.

There are examples for different situations on the Spambayes webpage, use them as a guide.

And although I imagine you could set it up to run system-wide with one spam database, it would be difficult to train since what one user might consider legitimate email might well look like spam to another so it's unlikely that one ruleset would fit all.
_________________
Sometimes the appropriate response to reality is to go insane.
-Philip K. Dick, Valis
Back to top
View user's profile Send private message
neenee
Veteran
Veteran


Joined: 20 Jul 2003
Posts: 1786

PostPosted: Wed Sep 03, 2003 11:18 am    Post subject: Reply with quote

i run spamassassin in combination with razor,
which seems to ctach 100% over a persiod
of a week so far (30+ mails per day, about
10% is spam).

my only problem with it is, is that it takes
10+ seconds to verify one email.
Back to top
View user's profile Send private message
gOA-pSY
n00b
n00b


Joined: 20 Jun 2003
Posts: 6

PostPosted: Wed Sep 03, 2003 12:26 pm    Post subject: problem with 30sec solved! Reply with quote

simply add "skip_rbl_checks 1" to your /etc/mail/spamassassin/local.cf and it only needs ~1sec for each mail!
Back to top
View user's profile Send private message
neenee
Veteran
Veteran


Joined: 20 Jul 2003
Posts: 1786

PostPosted: Wed Sep 03, 2003 1:46 pm    Post subject: Reply with quote

thank you for the suggestion :)

(i will first check what that setting
does though).

*update*

ok. that skips the real time black-
list.
Back to top
View user's profile Send private message
pfft
n00b
n00b


Joined: 04 Sep 2003
Posts: 22
Location: nowhere

PostPosted: Thu Sep 04, 2003 12:38 am    Post subject: rblsmtpd Reply with quote

/var/qmail/supervise/qmail-smtpd/run

#!/bin/sh
QMAILDUID=`id -u qmaild`
NOFILESGID=`id -g qmaild`
exec /usr/bin/softlimit -m 8000000 \
/usr/bin/tcpserver -v -p -R -x /etc/tcp.smtp.cdb \
-u $QMAILDUID -g $NOFILESGID 0 smtp \
/usr/bin/rblsmtpd -b -r relays.ordb.org \
-r dev.null.dk \
-r dnsbl.njabl.org \
-r qmail.bondedsender.org \
-r cbl.abuseat.org \
-r dnsbl.delink.net \
-r blackholes.easynet.nl \
-r dnsbl.sorbs.net \
-r bl.technovision.dk \
-r vox.schpider.com \
/var/qmail/bin/qmail-smtpd \
/bin/checkpasword /bin/true 2>&1

;=)
Back to top
View user's profile Send private message
Mirrorball
Apprentice
Apprentice


Joined: 25 Apr 2003
Posts: 235
Location: Brazil

PostPosted: Mon Oct 27, 2003 8:31 pm    Post subject: Reply with quote

I used to use spamassassin but the program wasn't able to detect some spam I was receiving a lot of lately, about viagra and other drugs, because the spammers were using bad HTML and wrong spelling to avoid detection. I now use POPFile and it's doing a very good job. It's the program I recommend at the moment.
Back to top
View user's profile Send private message
nsahoo
l33t
l33t


Joined: 17 Jul 2003
Posts: 618

PostPosted: Mon Dec 01, 2003 6:00 am    Post subject: Reply with quote

what's wrong with using thunderbird or mozilla-mail instead? it has a bayesian filter.
Back to top
View user's profile Send private message
Chris W
l33t
l33t


Joined: 25 Jun 2002
Posts: 972
Location: Brisbane, Australia

PostPosted: Mon Dec 01, 2003 10:48 am    Post subject: Reply with quote

Just for those wondering why SpamAssassin (SA) misses quite a few. After install SA uses only the rules to determine spam-ness. The Bayesian filter in SA doesn't kick in until it has seen enough spam. It trains itself based on the spammiest stuff it sees (based only on the rules) and deliberately avoids anything that could be a false positive (i.e. low scoring spam). You can accelerate the process by using the sa-learn utility on stuff you deliberately categorise as spam or ham. Once the Bayesian stuff kicks in the false negatives largely dry up. I get one or two false negatives, and no false positives, from every week (approx 200 spams).

Looking at my logs for today: the longest spam processing took 11 seconds, with all the others (50 odd) under 4 seconds. This runs on a Pentium II 300MHz with all thr RBL and other checks.

KMail hanging during the filtering process is not SA's fault. If the program were suitably multithreaded the UI responsiveness would remain.[/quote]
_________________
Cheers,
Chris W
"Common sense: The collection of prejudices acquired by age 18." -- Einstein
Back to top
View user's profile Send private message
neenee
Veteran
Veteran


Joined: 20 Jul 2003
Posts: 1786

PostPosted: Mon Dec 01, 2003 11:01 am    Post subject: Reply with quote

that's good info :wink:

i would like to add to that by giving the tip of creating
a cronjob for sa-learn, such as this for evolution:

0 0 * * * sa-learn --mbox --spam /home/neenee/evolution/local/Spam

which will learn spam from your evolution Spam
folder in mbox format every day at midnight.

i used to do this manually from my window-managers'
popup window, but this works just fine as well.

perhaps a more space-efficient way would be to
have sa-learn learn spam once every week and
then cleaning out the spam folder since it has al-
ready learnt that spam.

good luck :wink:
Back to top
View user's profile Send private message
asimon
l33t
l33t


Joined: 27 Jun 2002
Posts: 979
Location: Germany, Old Europe

PostPosted: Thu Mar 18, 2004 4:34 pm    Post subject: Reply with quote

I too like SpamAssassin very much. It was easy to setup (much more easier than for example dspam or crm114) and works very good here. I don't use the online tests, only local stuff and Bayesian filter. With the Bayesian filter SpamAssassin is very efficient (most comparisions I found on the web are based on rather old SA versions which did'nt had the bayesian filter, thus it's no wonder why in those tests bogofilter, dspam, etc. are much better than SA). Looking at the report header I see that every spam mail I get gets alone from the BAYES_99 rule alone enough points to be classified as spam.

But as with every Bayesian filter it's very important to constantly train SA with your spam and ham mails. From what I see autolearning (with the default parameters) alone is not enough because I at least don't get much spam with very high scores, thus for most spam the autolearner actually doesn't learn. But with a cron job doing sa-learn this is no problem. Sometimes some spam mails get's through (like lately some spam mails with many misspelled words or some fake mail from MS with some securiy update for Windows) but after feeding them to sa-learn the next ones will not get through.

What I appreciate especially is that SA after around 6000 (I use it since January) spam mails it never classified a ham mail as spam (at least as far as I can tell ;-) ). This is very important.

The future of SA also looks very bright. Version 3 will include an API for plugins. Thus we will have plugins for other filters like dspam (already exists), crm114, bofofilter, etc.

On the SA mailing list there is a post where someone did some small benchmark to compare the speed of SA's bayesian filter and dspam (it's usually said that SA is very slow because it's in Perl and dspam very fast). In this test SA was only 10% slower than dspam.
Back to top
View user's profile Send private message
mallchin
l33t
l33t


Joined: 21 Jan 2003
Posts: 655
Location: United Kingdom

PostPosted: Sun Aug 08, 2004 4:28 pm    Post subject: Reply with quote

Well, I'm in a quandry.

I use amavisd-new which supports spamassassin, dpsma and razor and don't know which to use, or if it'll let me use any or all of them together. I guess just SA would be ideal, I've been using that for a while okay.

My other wonder is how I can use neenee's sa-learn cronjob to teach SA on my mail server, but my evo mail is on my workstation. How can I use SA to learn on my workstation and port the rules it's learnt to my mail server?

/me throws stones at nasty spammers, shoo shoo
_________________
6700 @ 2.66GHz, 4Gb RAM, 2 x 500Gb, 8800 GTX, PhysX, X-Fi, 24" Widescreen, Tux mascot
Back to top
View user's profile Send private message
BlinkEye
Veteran
Veteran


Joined: 21 Oct 2003
Posts: 1046
Location: Gentoo Forums

PostPosted: Fri Aug 27, 2004 11:31 am    Post subject: Reply with quote

as you have a mail server your mail first gets to it. so there should be no problem of filtering mails there. bad thing is if you use pop and spam gets through, though, which you might be using. hmm
_________________
Easily backup up your system? klick
Get rid of SSH Brute Force Attempts / Script Kiddies klick
Back to top
View user's profile Send private message
xiangzi
n00b
n00b


Joined: 11 Mar 2004
Posts: 24
Location: San Francisco, CA

PostPosted: Fri Sep 03, 2004 12:26 am    Post subject: bogofilter performance Reply with quote

Have been using bogofilter with evolution for a week or 2 now as described here:

http://www.ime.usp.br/~rsilva/bogo-and-evo/

Trained on about 1000 spam and a bit more non-spam, but the results so far are pretty disappointing -- it's only getting 1/2 to 2/3 of the spam, and doesn't seem to be improving over time (in fact today has been it's worst day ever, more like 1/4). But other people sound happy with it. Did you have to tweak it to get good results, and if so, what was your recipe?
Back to top
View user's profile Send private message
BlinkEye
Veteran
Veteran


Joined: 21 Oct 2003
Posts: 1046
Location: Gentoo Forums

PostPosted: Fri Sep 03, 2004 10:29 am    Post subject: Reply with quote

the really important thing is that you don't put bad mails in the .Ham folder and train it with these. i made this mistake the first time and it didn't work out. i'd say: put less mails in you .Ham folder - but mails you're really sure of and you're set. btw.: what about the configs? i'm not sure at all how to set them (i'm using bogofilter only for some days now) but maybe you don't have them set correctly?!? i've looked for examples and explanation but haven't found anything.
_________________
Easily backup up your system? klick
Get rid of SSH Brute Force Attempts / Script Kiddies klick
Back to top
View user's profile Send private message
xiangzi
n00b
n00b


Joined: 11 Mar 2004
Posts: 24
Location: San Francisco, CA

PostPosted: Sat Sep 04, 2004 6:27 pm    Post subject: Reply with quote

Yeah, I was careful to expunge all the spam from the ham folder first.
Back to top
View user's profile Send private message
BlinkEye
Veteran
Veteran


Joined: 21 Oct 2003
Posts: 1046
Location: Gentoo Forums

PostPosted: Sat Sep 04, 2004 6:49 pm    Post subject: Reply with quote

well, then i guess your config file's the bug. i'm really satisfied with bogofilter yet - i had only about 20 spam mails left to train but it works flawlessly
_________________
Easily backup up your system? klick
Get rid of SSH Brute Force Attempts / Script Kiddies klick
Back to top
View user's profile Send private message
Kick
n00b
n00b


Joined: 23 Jul 2003
Posts: 13

PostPosted: Wed Oct 20, 2004 6:04 pm    Post subject: Reply with quote

Chris W wrote:
Looking at my logs for today: the longest spam processing took 11 seconds, with all the others (50 odd) under 4 seconds. This runs on a Pentium II 300MHz with all thr RBL and other checks.


Really? How much RAM do you have?
My PII-300, take between 30s and 1mn to check any single message, with network tests *disabled*!!

Well, after further analysis, it seems that SA 3.0 is a true memory hogger. Perhaps you're talking about SA 2.60, which is indeed much more faster on my own box.
Back to top
View user's profile Send private message
Display posts from previous:   
Reply to topic    Gentoo Forums Forum Index Networking & Security All times are GMT
Goto page Previous  1, 2
Page 2 of 2

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum