Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Fix for libevent.0.4.0 | Vsevolod Stakhov | 2011-07-22 | 1 | -0/+4 |
| | |||||
* | Write error about too few tokens during learning. | Vsevolod Stakhov | 2011-07-22 | 1 | -0/+5 |
| | |||||
* | Increase buffer for output. | Vsevolod Stakhov | 2011-07-22 | 1 | -1/+1 |
| | |||||
* | Added tag 0.4.0 for changeset c52f190b0592 | Vsevolod Stakhov | 2011-07-22 | 1 | -0/+1 |
| | |||||
* | Add workaround for clang under linux. | Vsevolod Stakhov | 2011-07-21 | 17 | -37/+45 |
| | | | | Fix problems found by static analyzing. | ||||
* | Another fix to avoid race with settings - add reference counter. | Vsevolod Stakhov | 2011-07-21 | 2 | -22/+50 |
| | |||||
* | * Add start script for red hat compatible systems | Vsevolod Stakhov | 2011-07-20 | 25 | -57/+656 |
| | | | | | | | Add descriptions for some rspamd API functions (no functional changes). --HG-- rename : linux/rspamd => linux/rspamd_debian.in | ||||
* | Rework http chunked encoding parsing. | Vsevolod Stakhov | 2011-07-20 | 5 | -97/+115 |
| | |||||
* | Do not try to use information about dispatcher after callback fails (found ↵ | Vsevolod Stakhov | 2011-07-19 | 1 | -1/+0 |
| | | | | by valgrind). | ||||
* | * Add classifiers pre-selection script | Vsevolod Stakhov | 2011-07-19 | 2 | -0/+52 |
| | |||||
* | Fix coredumps on some specific messages with specific urls. | Vsevolod Stakhov | 2011-07-19 | 3 | -11/+24 |
| | | | | | Fix coredumps while closing log file. Fix parsing of chunked HTTP replies. | ||||
* | Ref hash table at settings loading. | Vsevolod Stakhov | 2011-07-18 | 1 | -1/+1 |
| | |||||
* | Fix statfiles class determination euristic. | Vsevolod Stakhov | 2011-07-18 | 2 | -12/+12 |
| | | | | Fix call of classifier pre-callback. | ||||
* | Fix textpart:get_language lua function. | Vsevolod Stakhov | 2011-07-18 | 1 | -0/+1 |
| | |||||
* | Create statfiles using learn_spam function for bayes classifier. | Vsevolod Stakhov | 2011-07-18 | 2 | -1/+19 |
| | | | | Fix call of pre callbacks for a classifier. | ||||
* | Use event_set correctly after event_del. | Vsevolod Stakhov | 2011-07-18 | 1 | -0/+1 |
| | |||||
* | Fix bug with data corruption during settings application. | Vsevolod Stakhov | 2011-07-18 | 1 | -1/+8 |
| | |||||
* | Begin to write normal and updated default configuration. | Vsevolod Stakhov | 2011-07-15 | 2 | -20/+30 |
| | | | | | --HG-- rename : rspamd.xml.sample => conf/rspamd-basic.xml.in | ||||
* | Another fix for comparing parts without content - two empty parts are equal. | Vsevolod Stakhov | 2011-07-14 | 1 | -1/+6 |
| | |||||
* | Fix coredump on messages with one url only. | Vsevolod Stakhov | 2011-07-14 | 1 | -1/+1 |
| | |||||
* | Fix core dumps when no symbols are found in a message. | Vsevolod Stakhov | 2011-07-14 | 1 | -1/+1 |
| | |||||
* | * Add learn_spam/learn_ham interface to librspamdclient and to rspamc | Vsevolod Stakhov | 2011-07-14 | 9 | -47/+332 |
| | | | | | * Improve logic of io dispatcher restoration Remove correction factor from bayes as it leads to classify errors. | ||||
* | Adjust interval. | Vsevolod Stakhov | 2011-07-14 | 1 | -2/+2 |
| | |||||
* | * Remove completion logic from controller: it is hardly used but breaks new ↵ | Vsevolod Stakhov | 2011-07-14 | 1 | -35/+26 |
| | | | | commands logic | ||||
* | Fix multiply compare_parts_distance calls. | Vsevolod Stakhov | 2011-07-14 | 1 | -2/+12 |
| | |||||
* | Ignore arguments order in compare_parts_distance function. | Vsevolod Stakhov | 2011-07-14 | 1 | -2/+2 |
| | |||||
* | Change logic of params inside compare parts distance. | Vsevolod Stakhov | 2011-07-14 | 2 | -11/+51 |
| | | | | | During learning and classifying compare parts using new algorithm. Raise similarity factor. | ||||
* | * Add new algorithm based on diff algorithm to compare relatively short text ↵ | Vsevolod Stakhov | 2011-07-13 | 14 | -11/+498 |
| | | | | parts | ||||
* | Add validity detector for statfiles inside classifier. | Vsevolod Stakhov | 2011-07-13 | 3 | -2/+59 |
| | | | | Add euristic to detect spam/ham classes based on statfile symbol. | ||||
* | * Add second argument to compare_parts_distance function so it can be used ↵ | Vsevolod Stakhov | 2011-07-13 | 1 | -21/+37 |
| | | | | as interval: arg2 <= distance <= arg1 | ||||
* | * Add ability to get difference between two parts from lua code | Vsevolod Stakhov | 2011-07-12 | 2 | -0/+50 |
| | |||||
* | * First commit to implement multi-statfile filter system with new learning ↵ | Vsevolod Stakhov | 2011-07-12 | 17 | -82/+649 |
| | | | | mechanizm (untested yet) | ||||
* | Cache data of parts distance function to speed up multiply rules with such ↵ | Vsevolod Stakhov | 2011-07-12 | 2 | -1/+24 |
| | | | | function. | ||||
* | * Make fuzzy hashes utf8 compatible. | Vsevolod Stakhov | 2011-07-12 | 2 | -35/+75 |
| | |||||
* | * Add a simple logic of language detection for text parts (unicode script based) | Vsevolod Stakhov | 2011-07-11 | 4 | -2/+118 |
| | |||||
* | Fix phishing detection with img flag. | Vsevolod Stakhov | 2011-07-11 | 4 | -32/+61 |
| | | | | | | Handle unclosed HTML tags properly. Remove warnings for types on 32 bit archs. Do not touch grow factor many times when one shot mode is turned on. | ||||
* | * Improve performance of settings lookup | Vsevolod Stakhov | 2011-06-30 | 6 | -106/+122 |
| | |||||
* | * Add correcting factor to statistics. | Vsevolod Stakhov | 2011-06-28 | 6 | -36/+105 |
| | | | | | | Now learning increments version of a statfile. Avoid learning and classifying of similar text parts if a message has 2 text parts. Several fixes to statistics. | ||||
* | * Add ability to specify noip option for uribl suffix to avoid checking urls ↵ | Vsevolod Stakhov | 2011-06-28 | 2 | -3/+29 |
| | | | | with ip addresses on such lists. | ||||
* | Fix statshow utility. | Vsevolod Stakhov | 2011-06-27 | 2 | -4/+5 |
| | |||||
* | Remove debug. | Vsevolod Stakhov | 2011-06-27 | 1 | -1/+0 |
| | |||||
* | Fix incorrect calculating of token length. | Vsevolod Stakhov | 2011-06-27 | 2 | -2/+3 |
| | |||||
* | * Welcome 0.4.0 | Vsevolod Stakhov | 2011-06-24 | 21 | -184/+276 |
| | | | | | | | | | | | | | | | | | | Uncompatible changes: - Statistics is uncompatible in utf8 mode Major changes: - Improved utf8 mode - Convert all characters to lowercase in statistics - Skip URL's in statistics - Improve speed of bayes classifier by using integer arithmetics - Fixed statfiles synchronization that was broken for a long time - Synchronization is now configurable Minor changes: - Bugfixes - Removed some of legacy code - Types polishing | ||||
* | Oops, remove debug. | Vsevolod Stakhov | 2011-06-23 | 1 | -7/+0 |
| | |||||
* | * Fixes to fuzzy hashing logic, skip urls while estimating fuzzy hash | Vsevolod Stakhov | 2011-06-23 | 12 | -69/+213 |
| | | | | | Fix tags stripping. Fix phishing checks (ignore img tags). | ||||
* | Another fix with reload command. | Vsevolod Stakhov | 2011-06-20 | 1 | -3/+4 |
| | |||||
* | Fix reload command. | Vsevolod Stakhov | 2011-06-20 | 2 | -1/+2 |
| | |||||
* | Handle files with zero lenght properly. | Vsevolod Stakhov | 2011-06-17 | 2 | -0/+22 |
| | | | | Reported by: Andrej Zverev | ||||
* | Fix rspamc client to handle multiply files properly. | Vsevolod Stakhov | 2011-06-17 | 1 | -8/+9 |
| | |||||
* | Actually all times are in GMT already, so avoid conversion to prevent dst ↵ | Vsevolod Stakhov | 2011-06-15 | 2 | -3/+1 |
| | | | | loosing. |