|
FastMail Forum All posts relating to FastMail.FM should go here: suggestions, comments, requests for help, complaints, technical issues etc. |
|
Thread Tools |
28 Dec 2012, 08:06 PM | #1 |
Member
Join Date: Aug 2005
Posts: 35
|
Where's my Spam Bayes DB?
Hi,
I've been using Fastmail for years and I'm completely satisfied with the service. However, something happened to my Spam Bayes database: I'm quite sure I **WAS** "using" the "personal" Db as it had a lot of "learned" messages, and there's a threshold of messages to begin using the "personal" Db instead of the global. This is an automatic process. Something strange happened, I think, a couple of weeks ago. While quickly reviewing the Spam folder, I noticed a few messages down there... that shouldn't have been there as they all belonged to folders where similar messages had been filed for years, and those folders had spam learning enabled. Then for a few days I noticed some spam messages not caught in the spam filters. NOW I was playing around with settings and I see my Bayes Db contains just a few pieces of spam samples, while many more non spam samples have been learned by non-spam learing folders. I didn't clear my Db using the clear button, I'm quite sure... so ... has anybody seen a similar behaviour? |
28 Dec 2012, 09:32 PM | #2 |
The "e" in e-mail
Join Date: Feb 2006
Location: EU
Posts: 4,944
|
It seems that something similar happened in the past. I'd submit a support ticket, may be your database can be restored....
|
29 Dec 2012, 04:19 AM | #3 |
Intergalactic Postmaster
Join Date: May 2004
Location: Irving, Texas
Posts: 8,927
|
Searching for user or global BAYES tags in messages
It's easy to determine if Bayes filtering changed over time. When you look at the full headers of a message (such as by reading the raw message contents), you will see the following header near the top (usually about the 6th header): X-Spam-hits: (spam indicator list returned by SpamAssassin)
There are usually two entries in the spam hit list which start with BAYES:
Last edited by n5bb : 30 Dec 2012 at 12:07 AM. Reason: Corrected BAYES_USED tags and added global_overload |
29 Dec 2012, 04:31 AM | #4 |
Essential Contributor
Join Date: Jun 2006
Posts: 369
|
|
29 Dec 2012, 05:02 AM | #5 |
Intergalactic Postmaster
Join Date: May 2004
Location: Irving, Texas
Posts: 8,927
|
It has not changed for my account. If the original poster searches their headers as I have suggested, we will be working with a verified issue starting on a certain date, rather than guesses.
|
29 Dec 2012, 03:03 PM | #6 |
Member
Join Date: Aug 2005
Posts: 35
|
Thanks for the suggestion. I took one of the folders, and searched the strings.
The results are interesting. Of the (around) 8000 messages, approx 2000 have a "global" tag, 4000 and more have "user" and some 2000 are really old (recvd before June 2007) and have BAYES_00 The strange thing is that I discovered I've been switched more than one time to the "global" then back to "user" - now I vaguely remember that a couple of times, when I went to the Spam folder, I noticed an increase of non spam messages landing there, I selected them all and marked as non spam. I never remeber resetting the DB, but I've read about a couple of Bayes DB incidents It seems the last event happened this August, but I may have not noticed or remember the right date (it seemed to me it happened more recently). Consider that I CAREFULLY keep my address book updated, with 3300 contacts with multiple emails each, and contacts have a -30 spam score in the headers. So when I switch to "global", I don't see many legitimate messages going to the spam folder; I usually notice a few more "localized" spam messages getting through, probably because the global filter has not collected many "Italian sourced" spam samples So when this happened it usually caused just a few badly managed newsletters and some device alerts (ouch! I added them to the address book!) to end up in the Spam folder. However, this is a bit worrisome! |
30 Dec 2012, 12:11 AM | #7 |
Intergalactic Postmaster
Join Date: May 2004
Location: Irving, Texas
Posts: 8,927
|
To my knowledge, the operating system upgrade about two years ago (Dec 2010) mentioned in the link from janusz in this thread was the last large system user Bayes database failure. Other things you may see include:
Bill |
30 Dec 2012, 03:34 AM | #8 |
Member
Join Date: Aug 2005
Posts: 35
|
It doesn't seem a message size or system overload issue. Messages are "marked" as "globally" o "user" filtered in very large blocks, just as if global has been used from date a to date b and then user has been used from date b to date c and so on.
However, I DID feel something was wrong when I found (just a few, to say the truth) messages in the wrong place but, again, as the address book rule is always valid, so this was not a big problem to me. Anyway, gloabl or user based, the filtering is excellent. Now that I'm in global mode, I do see a few messages slip through the filter and get to my folders, but I started training the db again. |
30 Dec 2012, 07:48 AM | #9 |
Intergalactic Postmaster
Join Date: May 2004
Location: Irving, Texas
Posts: 8,927
|
You can train your user Bayes database on existing messages. So it should be easy to get the non-spam messages. If you don't empty your Spam (Junk Mail) very often, you might have enough there also. I have my Junk Mail folder set to auto-delete after 31 days, and currently 200 spam messages have accumulated over that interval. I have my spam filter set to auto-discard at a spam level greater than 5, so if I remove that setting it will accumulate spam faster.
Bill |
30 Dec 2012, 09:37 AM | #10 | |
The "e" in e-mail
Join Date: Dec 2004
Location: a virtually impossible but finitely improbable position
Posts: 2,320
|
Quote:
|
|
30 Dec 2012, 09:50 AM | #11 |
Intergalactic Postmaster
Join Date: May 2004
Location: Irving, Texas
Posts: 8,927
|
I only have my discard setting set that low because I own my own domain and also use a hobby forwarding service, both of which get a large amount of spam. I was tired of my Junk Mail box getting filled with several messages an hour. When you realize that the vast majority of spam connections are refused by the Fastmail incoming server without being accepted, it's amazing how many spam are being sent.
|