Collecting
From ASSPSMTP
On This Page |
How It Works
Details
Collecting FAQ
Collecting Configuration
based upon version 1.3.5(9.18)
- Spam Collect Addresses* (spamaddresses)
- Mail to any of these addresses are always spam and will contribute to the spam-collection unless from someone on the whitelist. Accepts specific addresses (user@domain.com), user parts (user) or entire domains (@domain.com). The addresses are not validated, they are readdressed to ccallspam, however you can superced this by putting a valid address into sendAllCollect below.
Catchall Address for Collect Addresses (sendAllCollect)
- ASSP will readdress messages addressed to Collect Addresses to this address.
- Example: collect@mydomain.com
Use Collect Addresses for Testing Your Environment (DoNotBlockCollect)
- If set ASSP will block messages from Collect Addresses after other checks are performed. That may help to test and control activated filters.
Do Not Collect Messages from/to these Addresses* (noCollecting)
- Accepts specific addresses (user@domain.com), user parts (user) or entire domains (@domain.com).
Use Subject as Maillog Names (UseSubjectsAsMaillogNames)
- You can turn this on to help you manually identify mail in your spam and non-spam collections. This will prevent ASSP from controlling the number of files in your collections(->MAXFILES). If your collections grow over 1000 items run move2num.pl and turn this off.
Do Not Collect Redlisted Mails (DoNotCollectRed)
- Mails (Spam/Ham) matching Red Regex or Redlist will not be stored.
Do Not Collect Bounced Mails (DoNotCollectBounces)
- Mails matching <Bounce Senders> will not be collected.
Don't Collect Mail (NoMaillog)
- Check this if you're using Whitelist-Only and don't care to save mail to build the Bayesian database.
Max Files (MaxFiles)
- If you're not using subjects as file names, this is the maximum number of files to keep in each collection (spam & nonspam)
It's actually less than this -- files get a random number between 1 and $MaxFiles.
Files Distribution (FilesDistribution)
- This defines how file names are chosen in each collection. If set to 1, names are uniformly distributed. If set between 0.01 and 0.99, names distribution is exponential -- files get lower numbers more frequently. This prevents from corpus being refreshed too quickly, especially when MaxFiles is set to low value
- Example: 3000
"Recommended: 0.5, Default: 1
Max Bytes (MaxBytes)
- How many bytes of the message will ASSP look at? Mails stored in the collecting folders will be truncated to this size. The average of Ham messages is 8K, the average of Spam messages is 4K. Usually the spam folder will be filled quicker than the notspam folder, therefore set this value to 8000 to get more wordpairs per Ham Message. When both folders are close to the maxfiles limit, reduce it to 4000.
Error Max Bytes (ErrorMaxBytes)
- How many bytes of an error report message will ASSP look at.
- Example: 10000.
Do Norm Optimizing (DoNormOptimizing)
- The relation between collected Spam and Ham wordpairs is called the 'Norm'(Normality). Ideally you want to keep the Norm as close to '1.0' as possible - which means an equal number of ham to spam wordpairs - but anything between '0.5'(more ham) and '1.5'(more spam) is considered healthy. The main thing to understand is the relationship: the lower the Norm, the greater the chance a message will be determined to be spam, or, the higher the Norm, the less chance a message will be determined to be spam. If this option is set, ASSP will automatically manage the number of collected ham or spam messages in an effort to keep the Norm in balance. Use Subject as Maillog Names must be off
Whitelisted rejected Attachments (wlAttachLog)
- Where to store whitelisted rejected mail+attachments. Recommended : 5
- 1 = spamfolder
- 2 = notspam folder
- 3 = spamfolder & ccallspam
- 4 = mailok folder
- 5 = attachment folder
- 6 = discard
- 7 = discard & ccallspam.
NoProcessing rejected Attachments (npAttachLog)
- Where to store noprocessing rejected mail+attachments. Recommended: 6
External rejected Attachments (extAttachLog)
- Where to store external rejected mail+attachments. Recommended: 6
Virus Infected (SpamVirusLog)
- Where to store virus infected messages. Recommended: 6
Spam Bombs (spamBombLog)
- Where to store spam bombs. Recommended: 6 or 7
Scripts (scriptLog)
- Where to store scripted messages. Recommended: 3
OK Mail (baysNonSpamLog)
- Where to store non spam (message ok) messages. These are messages which are considered as HAM, but are not stored in the standard HAM folder because of our policy to use only confirmed HAM messages (whitelisted or local) for SpamDB. Put 4 here if you want them to be stored in okmail-folder (outside the spamdb-collection). Put 2 here if you want them to be stored in the HAM folder.
Non Spam (NonSpamLog)
- Where to store whitelisted/local non spam messages. Recommended: 2.
Blacklisted Domains (blDomainLog)
- Where to store blacklisted domain messages. Recommended: 3
Blacklisted Helos (spamHeloLog)
- Where to store spam helo messages. Recommended: 6 or 7.
Forged Helos (forgedHeloLog)
- Where to store forged helo messages. Strongly recommended: 6
Invalid Helos (invalidHeloLog)
- Where to store invalid helo messages. Recommended: 6
Spam Collect Addresses (spamBucketLog)
- Where to store emails addressed to Spam Collect Addresses. Recommended: 1
Bayesian Spams (baysSpamLog)
- Where to store Bayesian spam messages. Recommended: 3
SPF Failures (SPFFailLog)
- Where to store SPF Failure spam messages. Recommended: 3
DNSBL Failures (RBLFailLog)
- Where to store DNSBL Failure spam messages. Recommended: 3
URIBL Failures (URIBLFailLog)
- Where to store URIBL Failure spam messages. Recommended: 3
SRS Failures (SRSFailLog)
- Where to store SRS Failure (not signed bounces) spam messages. Recommended: 3
Missing/Invalid Pointer (spamPTRLog)
- Where to store Missing/Invalid Pointer rejected messages. Recommended: 3
Missing MX Record (spamMXALog)
- Where to store Missing MX record rejected messages. Recommended: 3
Invalid Local Sender (spamISLog)
- Where to store messages from a local domain with an unknown userpart. Recommended: 6
Message Limit Blocks (spamMSLog)
- Where to store Message Scoring Limit rejected messages. Recommended: 3
PenaltyBox Blocks (spamPBLog)
- Where to store PB rejected messages. Recommended: 6 or 7
- 1 = spamfolder
- 2 = notspam folder
- 3 = spamfolder & ccallspam
- 4 = mailok folder
- 5 = attachment folder
- 6 = discard
- 7 = discard & ccallspam.
Non Spam Collection Frequency (freqNonSpam)
- Store every n'th non spam message. If you set the value to 10 then every 10th message is logged. These frequency settings are for ASSP users with a mature installation who experience heavy mail or spam volumes. Enter a larger value if the non spam corpus is being refreshed too quickly. Default Value = 1, log every message.
Spam Collection Frequency (freqSpam)
- Store every n'th spam message. The same as for non spam but helps prevent spam corpuses being skewed by flooding. It is recommended that this be set depending on spam volume. Default value = 1, log every message.


