From: Andrea Arcangeli <andrea@suse.de>
To: Matt Heler <lkml@lpbproductions.com>
Cc: "Johnson, Richard" <rjohnson@analogic.com>, linux-kernel@vger.kernel.org
Subject: offtopic (Re: Horiffic SPAM)
Date: Tue, 23 Sep 2003 21:06:56 +0200
Message-ID: <20030923190656.GF1269@velociraptor.random>
In-Reply-To: <200309231153.09298.lkml@lpbproductions.com>

On Tue, Sep 23, 2003 at 11:53:04AM -0700, Matt Heler wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
> 
> I've been living in a mail hole these past few years. Where does one get this
> Bayesian algorithm?

www.spamassassin.org

~/bin/Mail-SpamAssassin-2.60/sa-learn --mbox --spam ~/mail/spam
~/bin/Mail-SpamAssassin-2.60/sa-learn --mbox --spam ~/mail/spam-bad

spam-bad is kept separate because it holds mail scoring >15 points, so it
gets deleted immediately after learning (see the docs in the package).

but make sure to teach the Bayesian filter about your regular email first:
the number of "ham" messages must be >= the number of "spam" messages, or
you risk losing legitimate email. I use my inbox as "ham" (that's around
10000 messages).

this is the status of my db

0.000          0        688          0  non-token data: nspam
0.000          0       9722          0  non-token data: nham
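
A rough sketch of checking the "ham >= spam" rule against those two status
lines. The column layout is my assumption (third column is the count, last
word is the counter name), and parse_counts/enough_ham are made-up helper
names, not part of SpamAssassin:

```python
# Sketch: check the "ham >= spam" rule against the two status lines above.
# Assumption: the third whitespace-separated column holds the count and
# the last word names the counter (nspam / nham).

STATUS = """\
0.000          0        688          0  non-token data: nspam
0.000          0       9722          0  non-token data: nham
"""


def parse_counts(text):
    """Return {counter_name: count} for the nspam/nham lines in text."""
    counts = {}
    for line in text.splitlines():
        fields = line.split()
        if len(fields) >= 5 and fields[-1] in ("nspam", "nham"):
            counts[fields[-1]] = int(fields[2])
    return counts


def enough_ham(counts):
    """True when at least as many ham as spam messages were learned."""
    return counts["nham"] >= counts["nspam"]


counts = parse_counts(STATUS)
print(counts, "ok" if enough_ham(counts) else "learn more ham first")
```

With 9722 ham against 688 spam the check passes comfortably.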

see now what it returns for these >100k viruses (Bayesian spam
probability is 99 to 100%)

-------- cut and paste begin ---------
 pts rule name              description
---- ---------------------- --------------------------------------------------
 0.1 HTML_MESSAGE           BODY: HTML included in message
 1.7 HTML_RELAYING_FRAME    BODY: Frame wanted to load outside URL
 5.4 BAYES_99               BODY: Bayesian spam probability is 99 to 100%
                            [score: 1.0000]
 0.3 MIME_HTML_ONLY         BODY: Message only has text/html MIME parts
 0.1 HTML_50_60             BODY: Message is 50% to 60% HTML
 5.6 IFRAME                 BODY: IFRAME virus
 3.0 MICROSOFT_EXECUTABLE   RAW: Message includes Microsoft executable program
 0.6 MIME_HTML_NO_CHARSET   RAW: Message text in HTML without charset
 0.1 MIME_SUSPECT_NAME      RAW: MIME filename does not match content
 1.1 MIME_HTML_ONLY_MULTI   Multipart message only has text/html MIME parts

The original message was not completely plain text, and may be unsafe to
open with some email clients; in particular, it may contain a virus,
or confirm that your address can receive spam.  If you wish to view
it, it may be safer to save it to a file and open it with an editor.


[-- Attachment #2: original message before SpamAssassin --]
[-- Type: message/rfc822, Encoding: 8bit, Size: 142K --]

Date: Tue, 23 Sep 2003 13:30:38 -0500
From: "microsoft net message system" <mailerrobot@america.com>
To: "network recipient" <client@yourdomain.com>
SUBJECT: Bug Advice
X-Virus-Information: Please visit http://enap.wt.net for more
information
X-Virus-Scanner: Found to be clean

[-- Autoview using lynx -dump '/tmp/mutt.html' --]

   IFRAME: [1]cid:mccexrrgkte

   Hi.
   Undeliverable to mxwxeztble@america.com
[..]
-------- cut and paste end ---------
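
For what it's worth, the points in that report can be added up and compared
with the >15 mark used for spam-bad. This is only a sketch: the regex and
the names (RULE_LINE, hits, SPAM_BAD_THRESHOLD) are illustrative
assumptions, and how mail actually gets filed into spam-bad is not shown
here.

```python
import re

# Rule lines copied from the report above, including the continuation
# line under BAYES_99, which carries no points of its own.
REPORT = """\
 0.1 HTML_MESSAGE           BODY: HTML included in message
 1.7 HTML_RELAYING_FRAME    BODY: Frame wanted to load outside URL
 5.4 BAYES_99               BODY: Bayesian spam probability is 99 to 100%
                            [score: 1.0000]
 0.3 MIME_HTML_ONLY         BODY: Message only has text/html MIME parts
 0.1 HTML_50_60             BODY: Message is 50% to 60% HTML
 5.6 IFRAME                 BODY: IFRAME virus
 3.0 MICROSOFT_EXECUTABLE   RAW: Message includes Microsoft executable program
 0.6 MIME_HTML_NO_CHARSET   RAW: Message text in HTML without charset
 0.1 MIME_SUSPECT_NAME      RAW: MIME filename does not match content
 1.1 MIME_HTML_ONLY_MULTI   Multipart message only has text/html MIME parts
"""

# A rule line starts with a decimal score followed by an upper-case name.
RULE_LINE = re.compile(r"^\s*(-?\d+\.\d+)\s+([A-Z][A-Z0-9_]*)\s")

SPAM_BAD_THRESHOLD = 15.0  # the ">15 marks" figure from this message

hits = {}
for line in REPORT.splitlines():
    match = RULE_LINE.match(line)
    if match:
        hits[match.group(2)] = float(match.group(1))

# Round to one decimal to hide floating-point noise in the sum.
total = round(sum(hits.values()), 1)
print(f"{len(hits)} rules, total {total}, "
      f"spam-bad: {total > SPAM_BAD_THRESHOLD}")
```

The ten rules add up to 18.0 points, so this message lands above the 15
mark.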

Andrea - If you prefer relying on open source software, check these links:
	    rsync.kernel.org::pub/scm/linux/kernel/bkcvs/linux-2.[45]/
	    http://www.cobite.com/cvsps/
	    svn://svn.kernel.org/linux-2.[46]/trunk
