From: "Martin J. Bligh" <mbligh@mbligh.org>
To: Lee Revell <rlrevell@joe-job.com>
Cc: Matti Aarnio <matti.aarnio@zmailer.org>,
	linux-kernel <linux-kernel@vger.kernel.org>
Subject: Re: Spam, bogofilter, etc
Date: Mon, 02 Oct 2006 08:24:41 -0700
Message-ID: <45212F39.5000307@mbligh.org>
In-Reply-To: <1159802486.4067.140.camel@mindpipe>

Lee Revell wrote:
> On Mon, 2006-10-02 at 13:03 +0300, Matti Aarnio wrote:
>> I do think that Markov Chains combined with Bayes Statistics 
>> might do a wee bit better.  (Except with very short emails.)
>> However all that these things are able to do is essentially
>> grow the key database when spammers are producing new mutated
>> (mis-spelled) texts by mixing in spaces, punctuations, and even
>> occasional characters.
>>
>> For recognizing those pill merchants one needs complex software
>> to read the site at the URL, and to read texts out of the IMAGES
>> at the site.  Captcha to get thru spam filters...
>>
> 
> Could a heuristic be added to reject messages with wildly incorrect
> dates?  I notice that the last 5-10 messages in my LKML folder every
> morning are spam with a date that's ~24 hours in the future.

If you got rid of "slut" and "schoolgirl" that'd get rid of half of it.

M.
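
A rough sketch of the date heuristic Lee asks about above, for illustration
only. It is not something vger or bogofilter is known to run. The function
name has_implausible_date and the 12-hour cutoff are made up for this
example, and treating a missing or unparsable Date: header as suspicious is
an assumption as well. Only Python's standard email module is used.

```python
from datetime import datetime, timedelta, timezone
from email import message_from_string
from email.utils import parsedate_to_datetime

# Hypothetical cutoff: how far into the future a Date: header may point
# before the message is treated as suspicious. Purely a tuning assumption.
MAX_FUTURE_SKEW = timedelta(hours=12)


def has_implausible_date(raw_message, now=None, max_future=MAX_FUTURE_SKEW):
    """Return True if the Date: header is missing, unparsable, or lies
    more than max_future ahead of the receiver's clock."""
    if now is None:
        now = datetime.now(timezone.utc)

    header = message_from_string(raw_message)["Date"]
    if header is None:
        return True  # assumption: no Date: header counts as implausible

    try:
        sent = parsedate_to_datetime(header)
    except (TypeError, ValueError):
        return True  # assumption: an unparsable date counts as implausible

    # A "-0000" zone yields a naive datetime; treat it as UTC.
    if sent.tzinfo is None:
        sent = sent.replace(tzinfo=timezone.utc)

    return sent - now > max_future


if __name__ == "__main__":
    received = datetime(2006, 10, 2, 15, 26, tzinfo=timezone.utc)
    ok = "Date: Mon, 02 Oct 2006 08:24:41 -0700\nSubject: test\n\nbody\n"
    future = "Date: Tue, 03 Oct 2006 15:24:41 +0000\nSubject: test\n\nbody\n"
    print(has_implausible_date(ok, now=received))
    print(has_implausible_date(future, now=received))
```

With a 12-hour cutoff, mail dated ~24 hours ahead (the pattern Lee describes)
would be flagged, while ordinary clock skew and timezone mistakes would pass.
The right cutoff is a tuning question.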
