From 5958c6541e9dc0e83831411bcbb5c330c72e6531 Mon Sep 17 00:00:00 2001 From: Vsevolod Stakhov Date: Thu, 13 May 2010 19:34:15 +0400 Subject: [PATCH] * Add chapters about workers and factors configuration parameters --- doc/rspamd.texi | 133 ++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 133 insertions(+) diff --git a/doc/rspamd.texi b/doc/rspamd.texi index 2a78b1192..125c01a3e 100644 --- a/doc/rspamd.texi +++ b/doc/rspamd.texi @@ -562,4 +562,137 @@ testing. more information about ip lists look at @ref{config atoms}. @end multitable +@section Factors configuration. + +Setting of rspamd factors is the main way to change rules' weights. Factors set +up weights for all rules: for those that have static weights (for example simple +regexp rules) and for those that have dynamic weights (for example statistic +rules). In all cases the base weight of rule is multiplied by factor value. For +static rules base weight is usually 1.0. So we have: +@itemize @bullet +@item @math{w_{symbol} = w_{static} * factor} - for static rules +@item @math{w_{symbol} = w_{dynamic} * factor} - for dynamic rules +@end itemize +Also there is an ability to add so called "grow factor" - additional multiplier +that would be used when we have more than one symbol in metric. So for each +added symbol this factor would increment its power. This can be written as: +@math{w_{total} = w_1 * gf ^ 0 + w_2 * gf ^ 1 + ... + w_n * gf ^ {n - 1}} +Grow multiplier is used to increment weight of rules when message got many +symbols (likely spammy). Note that only rules with positive weights would +increase grow factor, those with negative weights would just be added. Also note +that grow factor can be less than 1 but it is uncommon use (in this case we +would have weight lowering when we have many symbols for this message). Factors +can be set up with config section @emph{factors}: +@example + + 0.1 + 1.1 + +@end example + +Note that you basically need to add factor when you add additional rules. The +decision of weight of newly added rule basically depends on its importance. For +example you are absolutely sure that some rule would add a symbol on only spam +messages, so you can increase weight of such rule so it would filter such spam. +But if you increase weight of rules you should be more or less sure that it +would not increase false positive errors rate to unacceptable level (false +positive errors are errors when good mail is treated as spam). Rspamd comes with +a set of default rules and default weights of that rules are placed in +rspamd.xml.sample. In most cases it is reasonable to change them for your mail +system, for example increase weights of some rules or decrease for others. Also +note that default grow factor is 1.0 that means that weights of rules do not +depend on count of added symbols. For some situations it useful to set grow +factor to value more than 1.0. Also by modifying factors it is possible to +manage static multiplier for dynamic rules. + +@section Workers configuration. + +Workers are rspamd processes that are doing specific jobs. Now are supported 4 +types of workers: +@enumerate 1 +@item Normal worker - a typical worker that process messages. +@item Controller worker - a worker that manages rspamd, get statistics and do +learning tasks. +@item Fuzzy storage worker - a worker that contains a collection of fuzzy +hashes. +@item LMTP worker - experimental worker that acts as LMTP server. +@end enumerate + +These types of workers has some common parameters: +@multitable @columnfractions .2 .8 +@headitem Parameter @tab Mean +@item type +@tab Type of worker (normal, controller, lmtp or fuzzy) +@item bind_socket +@tab Socket credits to bind this worker to. Inet and unix sockets are supported: +@example +localhost:11333 +/var/run/rspamd.sock +@end example +@noindent +Also for inet sockets you may specify @code{*} as address to bind to all +available inet interfaces: +@example +*:11333 +@end example +@noindent +@item count +@tab Number of worker processes of this type. By default this number is +equialent to number of logical processors in system. +@item maxfiles +@tab Maximum number of file descriptors available to this worker process. +@item maxcore +@tab Maximum size of core file that would be dumped in cause of critical errors +(in mega/kilo/giga bytes). +@end multitable + +Also each of workers types can have specific parameters: +@itemize @bullet +@item Normal worker: +@itemize @bullet +@item @var{custom_filters} - path to dynamically loaded plugins that would do real +check of incoming messages. These modules are described further. +@item @var{mime} - if this parameter is "no" than this worker assumes that incoming +messages are in non-mime format (e.g. forum's messages) and standart mime +headers are added to them. +@end itemize +@item Controller worker: +@itemize @bullet +@item @var{password} - a password that would be used to access to contorller's +privilleged commands. +@end itemize +@item Fuzzy worker: +@itemize @bullet +@item @var{hashfile} - a path to file where fuzzy hashes would be permamently stored. +@item @var{use_judy} - if libJudy is present in system use it for faster storage. +@item @var{frequent_score} - if judy is not turned on use this score to place hashes +with score that is more than this value to special faster list (this is designed +to increase lookup speed for frequent hashes). +@item @var{expire} - time to expire of fuzzy hashes after their placement in storage. +@end itemize +@end itemize + +These parameters can be set inside worker's definition: +@example + + fuzzy + *:11335 + 1 + 2048 + 0 + + yes + /spool/rspamd/fuzzy.db + 10d + +@end example +@noindent + +The purpose of each worker's type would be described later. The main parameters +that could be defined are bind sockets for workers, their count, password for +controller's commands and parameters for fuzzy storage. Default config provides +reasonable values of this parameters (except password of course), so for basic +configuration you may just replace controller's password to more secure one. + + @bye -- 2.39.5