Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | Explicitly set in_class field while learning. | Vsevolod Stakhov | 2013-12-01 | 1 | -0/+1 |
| | |||||
* | Fix bayes algorithm. | Vsevolod Stakhov | 2013-11-15 | 1 | -4/+4 |
| | |||||
* | Parse classifiers and statfiles in ucl. | Vsevolod Stakhov | 2013-11-07 | 2 | -2/+2 |
| | |||||
* | Detect exp2l as well. Update to 0.5.6. | Vsevolod Stakhov | 2013-06-10 | 1 | -0/+2 |
| | |||||
* | Test for expl function presence. | Vsevolod Stakhov | 2013-06-09 | 1 | -0/+4 |
| | |||||
* | Try to avoid exp overflowing. | Vsevolod Stakhov | 2013-05-27 | 1 | -3/+3 |
| | |||||
* | Fix probability calculations. | Vsevolod Stakhov | 2013-05-23 | 1 | -1/+1 |
| | |||||
* | Fix call to get statfile's revision. | Vsevolod Stakhov | 2013-05-23 | 1 | -3/+4 |
| | |||||
* | Use revisions to get total learn numbers. | Vsevolod Stakhov | 2013-05-23 | 1 | -2/+2 |
| | |||||
* | Fix calculations. | Vsevolod Stakhov | 2013-05-23 | 1 | -3/+8 |
| | |||||
* | New chi2square based bayes normalizer. | Vsevolod Stakhov | 2013-05-23 | 1 | -160/+110 |
| | |||||
* | * Use murmur hash for all hashes as it is more efficient and provides more ↵ | Vsevolod Stakhov | 2012-10-08 | 1 | -9/+29 |
| | | | | | | uniform distribution as glib's default one. * Fix probability renormalization while using advanced classification. | ||||
* | * Add meta-classification example. | Vsevolod Stakhov | 2012-10-04 | 1 | -4/+7 |
| | | | | | Many changes to advanced statistic and meta-classification logic. Add example of complex meta-classification. | ||||
* | Another debian license fix. | Vsevolod Stakhov | 2012-09-10 | 3 | -3/+3 |
| | | | | | Add apache license for regexp that were delivered from SpamAssassin project. Fix debian/copyright for src/dns.c. | ||||
* | Update copyright (required by debian). | Vsevolod Stakhov | 2012-09-04 | 3 | -9/+9 |
| | |||||
* | * Rework thread pools locking logic to avoid global lua mutex usage. | Vsevolod Stakhov | 2012-08-22 | 3 | -35/+15 |
| | | | | | | Fixed several memory leaks with modern glib. Fixed memory leak in dkim code. Fixed a problem with static global variables in shared libraries. | ||||
* | * More things to be thread-safe: | Vsevolod Stakhov | 2012-02-13 | 2 | -0/+21 |
| | | | | | | | - pool allocator is now thread-safe - lua subsystem now holds lock to avoid lua stack corruption - events subsystem now using conditional variables to wait for async_threads - insert_result is thread-safe now | ||||
* | * Add configuration utils for kvstorage | Vsevolod Stakhov | 2011-10-17 | 1 | -3/+3 |
| | |||||
* | Fix signness in arithmetic operations. | Vsevolod Stakhov | 2011-08-04 | 2 | -5/+5 |
| | |||||
* | * Fix build under CentOS 5 with old glib 2.12 | Vsevolod Stakhov | 2011-07-29 | 4 | -18/+15 |
| | | | | | * Fix build of rspamd with CMAKE_BINARY_DIR differs from CMAKE_SOURCE_DIR Rework include style. | ||||
* | * Add max_tokens options to avoid classifying and learning with too much ↵ | Vsevolod Stakhov | 2011-07-26 | 1 | -7/+47 |
| | | | | | | tokens from one message. Fix stupid memory leakage on client's timeout. | ||||
* | Write error about too few tokens during learning. | Vsevolod Stakhov | 2011-07-22 | 1 | -0/+5 |
| | |||||
* | Create statfiles using learn_spam function for bayes classifier. | Vsevolod Stakhov | 2011-07-18 | 1 | -0/+18 |
| | | | | Fix call of pre callbacks for a classifier. | ||||
* | * Add learn_spam/learn_ham interface to librspamdclient and to rspamc | Vsevolod Stakhov | 2011-07-14 | 1 | -1/+1 |
| | | | | | * Improve logic of io dispatcher restoration Remove correction factor from bayes as it leads to classify errors. | ||||
* | * First commit to implement multi-statfile filter system with new learning ↵ | Vsevolod Stakhov | 2011-07-12 | 4 | -18/+103 |
| | | | | mechanizm (untested yet) | ||||
* | * Add correcting factor to statistics. | Vsevolod Stakhov | 2011-06-28 | 1 | -15/+33 |
| | | | | | | Now learning increments version of a statfile. Avoid learning and classifying of similar text parts if a message has 2 text parts. Several fixes to statistics. | ||||
* | Fix statshow utility. | Vsevolod Stakhov | 2011-06-27 | 1 | -2/+2 |
| | |||||
* | * Welcome 0.4.0 | Vsevolod Stakhov | 2011-06-24 | 1 | -16/+15 |
| | | | | | | | | | | | | | | | | | | Uncompatible changes: - Statistics is uncompatible in utf8 mode Major changes: - Improved utf8 mode - Convert all characters to lowercase in statistics - Skip URL's in statistics - Improve speed of bayes classifier by using integer arithmetics - Fixed statfiles synchronization that was broken for a long time - Synchronization is now configurable Minor changes: - Bugfixes - Removed some of legacy code - Types polishing | ||||
* | * Major cleanup of cmake build system | Vsevolod Stakhov | 2011-05-06 | 2 | -1/+7 |
| | | | | | | * Add initial version of statshow utility for statfiles debugging * Add debugging for statistics * Remove unused utilities | ||||
* | Fixes in classifying for small messages. | Vsevolod Stakhov | 2011-01-25 | 2 | -6/+24 |
| | |||||
* | Implement checking options for modules | Vsevolod Stakhov | 2010-12-13 | 1 | -0/+1 |
| | | | | | | Implement checking for classifier options Fix redirector to handle timeouts and invalid replies properly Fix surbl module not to check each url | ||||
* | Remove G_INLINE_FUNC definitions as I misunderstood its purposes. | Vsevolod Stakhov | 2010-10-15 | 2 | -2/+2 |
| | |||||
* | Fixes bugs found with clang-static analyser. | Vsevolod Stakhov | 2010-10-11 | 2 | -2/+14 |
| | | | | | Strictly follow c99 standart. Turn on pedantic c99 checks. | ||||
* | * Fix races in fuzzy storage | Vsevolod Stakhov | 2010-08-26 | 1 | -1/+3 |
| | |||||
* | * Fix normalization for systems that have not tanhl function | Vsevolod Stakhov | 2010-08-18 | 1 | -2/+2 |
| | |||||
* | * Remove normalizer as it is winnow specific thing, so all statistic ↵ | Vsevolod Stakhov | 2010-08-13 | 1 | -4/+18 |
| | | | | algorithms now returns value from 0 to 1 | ||||
* | * Add bayesian classifier (initial version) | Vsevolod Stakhov | 2010-08-13 | 4 | -11/+409 |
| | |||||
* | * One more try to improve accuracy of winnow algorithm | Vsevolod Stakhov | 2010-08-06 | 1 | -10/+19 |
| | |||||
* | * Fixes to winnow learning | Vsevolod Stakhov | 2010-08-05 | 2 | -47/+114 |
| | |||||
* | * Fix some logic errors in learning | Vsevolod Stakhov | 2010-08-03 | 1 | -6/+8 |
| | |||||
* | * Improve logic of learning messages: do not learn more than specific threshold | Vsevolod Stakhov | 2010-08-02 | 1 | -8/+82 |
| | | | | * Fix inserting results for symbols that were incorrectly (for example more than 1 time) defined in config file | ||||
* | * Change metric logic | Vsevolod Stakhov | 2010-06-16 | 1 | -1/+1 |
| | | | | | | | * Completely remove lex/yacc readers for config * Make common sense of metric/action and symbols * Sync changes with all plugins TODO: add this to documentation | ||||
* | * Fix strict aliasing while compiling with optimization | Vsevolod Stakhov | 2010-05-31 | 1 | -1/+1 |
| | | | | | * Fix tanhl detection for platforms that have not implementation of it * Remove several compile warnings | ||||
* | * Fix order | Vsevolod Stakhov | 2010-05-27 | 1 | -3/+3 |
| | |||||
* | * In classify normalize result after comparing, not before | Vsevolod Stakhov | 2010-05-27 | 1 | -3/+3 |
| | |||||
* | * Convert statistic sums to use long double for counters | Vsevolod Stakhov | 2010-05-27 | 2 | -24/+18 |
| | | | | * Use hyperbolic tangent for internal normalizer | ||||
* | * Implement new learning system, now rspamd should be much more intelligent ↵ | Vsevolod Stakhov | 2010-05-27 | 1 | -32/+125 |
| | | | | while learning messages | ||||
* | * Fix awfull bug in classifying when first statfile has twice weight than ↵ | Vsevolod Stakhov | 2010-05-14 | 1 | -2/+2 |
| | | | | | | second... * Fix undisclosed recipients detection | ||||
* | * Bugfixes: | Vsevolod Stakhov | 2010-04-20 | 1 | -1/+1 |
| | | | | | | | | | | - handle '\' characters in lua strings correctly - fix lua initialization - avoid of using global lua state (global L) - fix listen sockets hash to allow multiply workers of same type but on different listen sockets - fix modules options inserting to allow multiply options of the same name - fix parsing of lua options - fix lua rules | ||||
* | * Add option min_tokens to classifier that allows to skip too short messages ↵ | Vsevolod Stakhov | 2010-03-22 | 1 | -1/+34 |
| | | | | | | from statistic check, format: min_tokens = "10"; (for 10 words minimum) |