Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Replaced dead link to rules list with references to SVN and sa-update

How are the scores assigned?

SA's The scores are assigned using a genetic algorithm (GA), to optimise their efficiency and minimise neural network trained with error back propagation (Perceptron). Both systems attempt to optimize the efficiency of the rules that are run in terms of minimizing the number of false positives and false negatives.

You can find all of the currently active rules and their scores in the Subversion repository under /trunk/rules or by downloading the latest published set using the sa-update tool. 

You More information can be found on the 'Tests' page. Note that you can help this system by providing statistics on your mail spool via NightlyMassCheck and RescoreMassCheck.

Some DNS blacklist rules are distributed with scores of 0. These generally request or require payment, and as such are disabled by default. Feel free to enable the lookups, if you've paid for them.

A score of 0 will stop a rule from being run.

Confusing scores

Note: Scores for "learn" rules , such as BAYES_*, that rate the probability that a message is spam, (example the various BAYES_?? rules) are scored using the same method. This can produce "confusing" scores, for instance, that have scores which seem incorrect (example BAYES_80 with a higher score than BAYES_99. There are a few reasons for this. 1) The GA does not understand that BAYES_* are ). This is due to the fact that rules are not related to one another, they're seperate rules that need seperate scores. 2) More importantly, the higher the separate rules have separate scores.

Messages with high probability from a "learn" rule , the higher likelihood that the message also hit a bunch of will most likely match other rules. This lets the GA score generation system lower the "learn" rule score due to the inevitable false positive, while also still marking the message as spam via the other rule scorespreventing false positives. The message still is recognized as spam due to the sum of all rule scores.

Some DNS blacklist rules are distributed with scores of 0. These generally request or require payment are disabled by default. Feel free to enable the lookups, if you've paid for them.

A score of 0 will stop a rule from being run.

In version 2.x, the scores are assigned using a genetic algorithm (GA).