Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: converted to 1.6 markup

...

Task Proposals

Subject ID: spamassassin-easy-mass-check

Title: Nightly Mass Check for Normal People

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: perl, email, corpora, distributed, community

Description:

We need a way to make nightly mass check easily accessible to normal users. They need easy to use software to do mass checks and submit results. They must be properly trained on the sorting rules. Our project then needs some way of tracking the level of trust of these growing number of submitters. see SocNightlyMassCheck

Possible Mentors: Justin Mason (jm at jmason.org)

Status: -

Subject ID: spamassassin-persistent-db-conns

Title: Persistent database connections

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: perl, databases, sql

Description:

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=2037 :
persistent database connections for SpamAssassin's Bayes subsystem. Michael:
'This exists, but is not an ASL friendly license. So a "clean room" implementation might be cool.'

Possible Mentors: -

Status: -

Subject ID: spamassassin-separate-expiry

Title: Helper process for Bayes expiry

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: perl, bayes, spamd, processes

Description:

Theo said: 'I also suggested having things like Bayes expiry and such being passed back to the parent who can spawn a helper process to do the work. That way the children processes will be able to accept, process, return the result, notify parent for bayes work, go back to listening. Right now we do: accept, process, do bayes work, return result, go back to listening, which ends up causing timeouts and possibly eats up all processing children.'

Possible Mentors: Theo Van Dinter (felicity at apache.org)

Status: -

Subject ID: spamassassin-arf-plugin

Title: ARF plugin

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: arf, plugins, reporting, perl

Description:

...

Possible Mentors: Justin Mason (jm at jmason.org)

Status: -

Subject ID: spamassassin-httpd-spamd

Title: Finish up Apache::SpamD httpd module

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: apache, httpd, modules, perl

Description:

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4603 :
Finish up and polish the Apache::SpamD httpd module.

Possible Mentors: -

Status: -

Subject ID: spamassassin-quarantine-config-ui

Title: Quarantine / user-configuration web UI

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: web, ui, quarantine, user-configuration, cgi, perl

Description:

Create a web application for message quarantine or user configuration, as part of the SpamAssassin project.

Possible Mentors: -

Status: -

Subject ID: spamassassin-corpus

Title: Maintain a SpamAssassin corpus of messages

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: corpora, mail, collection, perl

Description:

Theo said: 'I'd almost rather we shift this around and make a "SpamAssassin Corpora", have all of us focus on making that the best it can be, and use that for mass-checks, etc.'

This could be a good possibility. Contributors can upload their own mail corpora to a central web app where the mass-check occurs. The mail collections could be quickly checked for validity, and tagged based on how much privacy the user wants for their mails (therefore controlling further redistribution of those mails).

Related to 'spamassassin-easy-mass-check' above.

Possible Mentors: Justin Mason (jm at jmason.org), Theo Van Dinter (felicity at apache.org)

Status: -

Subject ID: spamassassin-rules-db

Title: Rules explanation database

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: rules, wiki, web, perl

Description:

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4771 - It'd be nice to have a way for users/admins/interested parties to have an easy way to look up a human-readable description of rules, based on the rule name; using the wiki as part of that would be the best solution.

Possible Mentors: Justin Mason (jm at jmason.org)

Status: -

Subject ID: spamassassin-better-reload

Title: Better way to reload the spamd configuration

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: reload, spamd, sighup, restart, perl

Description:

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4774 :
we currently have a very heavyweight configuration-rereading system where the entire process restarts. This is too heavyweight, and can be improved.

Possible Mentors: Justin Mason (jm at jmason.org)

Status: -

Subject ID: spamassassin-message-test-suite

Title: a message-parser test suite

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: testing, testsuite, parsing, mail, perl

Description:

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4559 : Every now and again, we come up against bugs in our message parser (MIME, HTML, headers, base64/qp decoding, etc. etc.) We fix them, but occasionally there's regressions. I envisage it as using a vast collection of message files, something like a mass-check corpus, and a set of tests to ensure the parser sees what it should be seeing.

Possible Mentors: Justin Mason (jm at jmason.org), Theo Van Dinter (felicity at apache.org)

Status: -

Subject ID: spamassassin-reduce-memory-usage

Title: Reduce memory footprint of spamd

...

Description:

...

Possible Mentors: -

Status: -

Subject ID: spamassassin-improved-chi

Title: Implement 'Improved Chi' in the BAYES rules

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: bayes, chi, robinson, perl

Description:

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=3460 :
Handling Redundancy in Email Token Probabilities, Gary Robinson. http://www.garyrobinson.net/2004/04/improved_chi.html . Has shown good results. Implement in SpamAssassin and benchmark.

Possible Mentors: Justin Mason (jm at jmason.org)

Status: -

Subject ID: spamassassin-spamd-unix-and-tcp

Title: spamd should support both UNIX domain and TCP sockets

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: spamd, unix-domain, sockets, networking, perl

Description:

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=3991 : currently spamd supports either UNIX-domain or TCP/IP sockets for incoming scan requests. It should support both simultaneously, in the one set of daemon processes.

Possible Mentors: Justin Mason (jm at jmason.org)

Status: -

Subject ID: spamassassin-dobly

Title: Benchmark and implement "Dobly" Noise Reduction

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: dobly, bayes, classifiers, perl

Description:

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=3078 : investigate "Dobly" noise reduction a la http://bnr.nuclearelephant.com/ , in a form that can be incorporated into SpamAssassin. Benchmark results using 10-fold cross-validation.

Possible Mentors: Justin Mason (jm at jmason.org)

Status: -

Subject ID: spamassassin-secure-user-auth

Title: Secure user authentication in the spamd protocol

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: spamd, protocol, tls, perl

Description:

http://issues.apache.org/SpamAssassin/show_bug.cgi?id=4550 :
a secure method to authenticate users over a spamc/spamd connection.

Possible Mentors: Justin Mason (jm at jmason.org)

Status: -

Subject ID: spamassassin-mua-learner-plugins

Title: Thunderbird/Outlook/etc Plugins for Learning Messages via Spamd

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: spamd, protocol, bayesian, perl, thunderbird, outlook, mua

Description:

The spamd protocol is well defined and allows for learning/reporting of messages. What is currently lacking is the ability to learn/report those messages via popular mail applications (Thunderbird, Outlook, etc). Many of these applications have a well established API for creating add-on plugins that would allow for this type of activity.

Possible Mentors: -

Status: -

Subject ID: spamassassin-distributed-mass-check

Title: Distributed mass-check

ASF Project: SpamAssassin - http://SpamAssassin.apache.org/

Keywords: corpora, perl

Description:

mass-check currently makes use of a single system to process a number of messages. However, in larger organizations, or for people with multiple machines, it would be nice if multiple machines could all process a single mass-check run, preferably without needing to share the same filesystem, paths, etc. It would also be useful if we ended up with a single large corpus (see the spamassassin-corpus project above), so that multiple people could run the messages through over the Internet.

Possible Mentors: Theo Van Dinter (felicity at apache.org)

Status: -

Template

Please use this for further project suggestions...

Subject ID: spamassassin-xxxx

Title: xxxxx

...

Description:

xxxx

Possible Mentors: -

Status: -

...