public inbox for linux-kernel@vger.kernel.org
From: "Martin J. Bligh" <mbligh@mbligh.org>
To: Lee Revell <rlrevell@joe-job.com>
Cc: Matti Aarnio <matti.aarnio@zmailer.org>,
	linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: Spam, bogofilter, etc
Date: Mon, 02 Oct 2006 08:24:41 -0700
Message-ID: <45212F39.5000307@mbligh.org>
In-Reply-To: <1159802486.4067.140.camel@mindpipe>

Lee Revell wrote:
> On Mon, 2006-10-02 at 13:03 +0300, Matti Aarnio wrote:
>> I do think that Markov Chains combined with Bayes Statistics 
>> might do a wee bit better.  (Except with very short emails.)
>> However all that these things are able to do is essentially
>> grow the key database when spammers are producing new mutated
>> (mis-spelled) texts by mixing in spaces, punctuations, and even
>> occasional characters.
>>
>> For recognizing those pill merchants one needs complex software
>> to read the site at the URL, and to read texts out of the IMAGES
>> at the site.  Captcha to get thru spam filters...
>>
> 
> Could a heuristic be added to reject messages with wildly incorrect
> dates?  I notice that the last 5-10 messages in my LKML folder every
> morning are spam with a date that's ~24 hours in the future.

If you got rid of "slut" and "schoolgirl" that'd get rid of half of it.

M.
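
The date heuristic Lee suggests in the quoted text could be sketched roughly as follows. This is only an illustration, not anything the list's filters are known to run: the function name `date_looks_bogus` and the 12-hour tolerance `MAX_FUTURE_SKEW` are assumptions made up for the example, and a real filter would likely score rather than reject outright.

```python
from datetime import datetime, timedelta, timezone
from email import message_from_string
from email.utils import parsedate_to_datetime

# Hypothetical tolerance: how far ahead of the receiving clock a Date: header may be.
MAX_FUTURE_SKEW = timedelta(hours=12)


def date_looks_bogus(raw_message, now=None, max_skew=MAX_FUTURE_SKEW):
    """Return True if the Date: header is missing, unparseable,
    or further in the future than max_skew relative to `now`."""
    now = now or datetime.now(timezone.utc)
    header = message_from_string(raw_message)["Date"]
    if header is None:
        return True
    try:
        sent = parsedate_to_datetime(header)
    except (TypeError, ValueError):
        # Older Pythons raise TypeError, newer ones ValueError, on a bad date.
        return True
    if sent.tzinfo is None:
        # A "-0000" zone parses to a naive datetime; treat it as UTC.
        sent = sent.replace(tzinfo=timezone.utc)
    return sent - now > max_skew
```

Called with the receiving server's clock as `now`, a message dated about 24 hours ahead would be flagged, while ordinary clock skew and time-zone differences stay under the tolerance.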
