Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: [Original edit by Dave]

...

Hostname

Function

Software

Configs/Location

Resource/URL

SVN Location

incoming.apache.org

Donated by Sonic

 

 

 

 

sa-vm1.apache.org

DNS Hidden Master

PowerDNS

/etc/powerdns/pdns.d/pdns.local.conf

spamassassin.org

dns (webserver API key redacted)

 

Rsync Mirrors

rsyncd

/etc/rsyncd.conf

rsync.spamassassin.org

trunk/build/automc/etc-rsyncd.conf

 

Web Server

apache2

/etc/apache2/sites-available/automc.conf

updates.spamassassin.org

trunk/build/automc/automc-apache2.conf

 

 

apache2

/etc/apache2/sites-available/automc.conf

ruleqa.spamassassin.org

trunk/build/automc/ruleqa.cgi

<ac:structured-macro ac:name="unmigrated-wiki-markup" ac:schema-version="1" ac:macro-id="8da42a5bc608e48b-de989978-42c045df-9c8fa96c-23f7ce06844fcb0d94bf780a"><ac:plain-text-body><![CDATA[

 

sa-update mirrors

svn

rsyncd [updates] for mirrors

spamassassin.apache.org/updates

[site/updates

http://svn.apache.org/repos/asf/spamassassin/site/updates/]/MIRRORED.BY

]]></ac:plain-text-body></ac:structured-macro>

 

Nightly Masscheck

cron/scripts

/usr/local/spamassassin/automc/html

ruleqa.spamassassin.org

trunk/backend/nitemc/README

...

The sa-vm1 server TZ is US PDT -7 so cron entries will be 7 hours ahead for UTC. Scripts should have TZ=UTC so they think they are running in UTC. If they are run from cron and detect they are in DST, they should sleep for 1 hour to stay on UTC. All times below are in UTC.

mkupdates

...

This section of scripts publishes new ruleset updates to the mirrors. There are currently (June 2017) two different rule daily updates. Both do lint tests against the latest version of SpamAssassin but the first one updates the 72_scores.cf based on the masscheck contributions while the second one is a "blind" rule promotion and tagged build of SVN rules for the masscheck area setup later.

Wiki Markup
 *25 2 \* \* \* automc *~/svn/trunk/build/mkupdates/do-stable-update-with-scores*
  **~/svn/masses/rule-update-score-gen/do-nightly-rescore-example.sh*
   **~/svn/masses/rule-update-score-gen/generate-new-scores.sh*
    *uses ~/tmp/generate-new-scores for SVN work area
    *sorts out the usable corpus from the latest 'SVN revision' at the top of the submitter's log file which should match the latest tagged build of SVN rules
    *checks the sorted corpus for a minimum number of valid contributors and ham/spam
  **~/svn/trunk/build/mkupdates/mkupdate-with-scores*
   *uses ~/tmp/sa-mkupdate for SVN working area
   *gets latest SVN $\{REVISION\} from rulesrc/scores/score-set\*
   *masses \-> perl Makefile.PL && make (complete build of SA and test)
    *perl hit-frequencies
    *garescorer - compiles and runs it, requires build/pga
  *sends email if not enough masscheck submitters or usuable ham/spam for the latest SVN revision
  *creates $\{REVISION\}.tar.gz $\{REVISION\}.tar.gz.sha1 and $\{REVISION\}.tar.gz.asc in /var/www/automc.spamassassin.org/updates for mirrors to pull
  *updates DNS entries \[0-3\].3.3.updates.spamassassin.org - newer 3.4.x versions have a CNAME to 3.3.3.updates.spamassassin.org
Wiki Markup

  *30 8 \* \* \* automc *~/svn/trunk/build/mkupdates/run_nightly* > /var/www/automc.spamassassin.org/mkupdates/mkupdates.txt
  *creates new rules/active.list
  *commits new rules/active.list
  *runs spamassassin lint against the updated rules and checks in a tagged version of 'sa-update_$\{SA_VERSION\}_$\{TSTAMP\}'
  *creates $\{REVISION\}.tar.gz $\{REVISION\}.tar.gz.sha1 and $\{REVISION\}.tar.gz.asc in /var/www/automc.spamassassin.org/updates for mirrors to pull
  *updates DNS entries \[0-3\].3.3.updates.spamassassin.org - newer 3.4.x versions have a CNAME to 3.3.3.updates.spamassassin.org

...

Script rewrite notes:
   *Make each primary step modular since these steps are commmon in other scripts
   *Should check for minimum contributors of ham/spam up front and not waste resources if requirements not met
   *These 3 scripts above all share the same temp working dir.  This should be determined from config file or relative path of user's home dir for flexibility.
   *Should be able to run the ham/spam processing in parallel and merge the results together to cut this time in half
   *Temp working dir for the corpus should be persistent so the rsync copy will be faster.
   *Usuable corpus symlink setup could be improved.  Invalid stale corpus should be removed into an archive/excluded dir.

Wiki Markup
 *30 8 \* \* \* automc *~/svn/trunk/build/mkupdates/run_nightly* > /var/www/automc.spamassassin.org/mkupdates/mkupdates.txt
  *creates new rules/active.list
  *commits new rules/active.list
  *runs spamassassin lint against the updated rules and checks in a tagged version of 'sa-update_$\{SA_VERSION\}_$\{TSTAMP\}'
  *creates $\{REVISION\}.tar.gz $\{REVISION\}.tar.gz.sha1 and $\{REVISION\}.tar.gz.asc in /var/www/automc.spamassassin.org/updates for mirrors to pull
  *updates DNS entries \[0-3\].3.3.updates.spamassassin.org - newer 3.4.x versions have a CNAME to 3.3.3.updates.spamassassin.org
  *Script rewrite notes:
   *Uses many of the same primary steps previous section so reuse the code and not have to maintain multiple versions
   *Should be turned into generic script that can be run on demand via SVN trigger/polling

nitemc

These run shortly after the build/mkupdates/run_nightly to setup the masscheck download area based on the latest tagged build of SVN rules.
34 8 * * 0-5 automc *~/svn/nitemc/corpora_runs >> ~/rsync/corpus/nightly-versions.txt
36 8 * * 0-5 automc *~/svn/nitemc/extract_to_rsync_dir nightly ~/rsync/corpus/nightly-versions.txt
34 8 * * 6 automc *~/svn/nitemc/corpora_runs >> ~/rsync/corpus/weekly-versions.txt
36 8 * * 6 automc *~/svn/nitemc/extract_to_rsync_dir weekly ~/rsync/corpus/weekly-versions.txt