summaryrefslogtreecommitdiffstats
path: root/src/tokenizers/tokenizers.c
Commit message (Expand)AuthorAgeFilesLines
* Reorganize statfiles and classifiers into libstat.Vsevolod Stakhov2015-01-161-260/+0
* Fix some leaks.Vsevolod Stakhov2014-12-301-7/+10
* Rework text tokenizaton.Vsevolod Stakhov2014-12-231-21/+16
* Rework words tokenization.Vsevolod Stakhov2014-12-231-72/+33
* Make words from text parts.Vsevolod Stakhov2014-12-171-2/+2
* Refactor function names.Vsevolod Stakhov2014-11-291-8/+8
* Fix buffer overrun when HTML exceptions are used.Vsevolod Stakhov2014-08-121-3/+4
* Unify style without sorting headers.Vsevolod Stakhov2014-07-231-57/+74
* Revert "Unify code style."Vsevolod Stakhov2014-07-231-75/+58
* Unify code style.Vsevolod Stakhov2014-07-231-58/+75
* Refactor worker task structure and API.Vsevolod Stakhov2014-04-211-2/+2
* Refactor memory pool naming.Vsevolod Stakhov2014-04-201-5/+5
* Parse classifiers and statfiles in ucl.Vsevolod Stakhov2013-11-071-1/+1
* Another debian license fix.Vsevolod Stakhov2012-09-101-1/+1
* Update copyright (required by debian).Vsevolod Stakhov2012-09-041-3/+3
* * Add configuration utils for kvstorageVsevolod Stakhov2011-10-171-3/+0
* Fix signness in arithmetic operations.Vsevolod Stakhov2011-08-041-1/+1
* * Fix build under CentOS 5 with old glib 2.12Vsevolod Stakhov2011-07-291-1/+1
* * Add correcting factor to statistics.Vsevolod Stakhov2011-06-281-1/+1
* Fix incorrect calculating of token length.Vsevolod Stakhov2011-06-271-2/+2
* * Welcome 0.4.0Vsevolod Stakhov2011-06-241-16/+48
* * Skip short utf words in statisticsVsevolod Stakhov2011-06-031-2/+2
* * Major cleanup of cmake build systemVsevolod Stakhov2011-05-061-2/+2
* * Rewrite URL storage systemVsevolod Stakhov2011-02-241-32/+0
* * Write Emails: header in outputVsevolod Stakhov2011-02-111-1/+1
* * Tokenize subject using osb tokenizer.Vsevolod Stakhov2011-02-111-13/+5
* Fixes in classifying for small messages.Vsevolod Stakhov2011-01-251-1/+1
* * Many fixes to fuzzy hashes logic and tokenization.Vsevolod Stakhov2011-01-241-4/+33
* * Add Subject header to statisticsVsevolod Stakhov2010-12-241-0/+38
* * Fix shared usage of statfilesVsevolod Stakhov2010-09-161-2/+4
* * Retab, no functional changesVsevolod Stakhov2009-10-021-48/+48
* * Make autolearn workingVsevolod Stakhov2009-07-091-5/+3
* * Rework url parsing algorithmsVsevolod Stakhov2009-06-021-1/+5
* * Add functions to parse headers and urls into statfile tokensVsevolod Stakhov2009-03-161-0/+116
* * Prepare to migrate to cmake (still need to write install target and working...Vsevolod Stakhov2009-02-161-1/+1
* * Add BSD license textVsevolod Stakhov2009-02-161-0/+24
* * Rewrite message parserVsevolod Stakhov2009-01-211-2/+4
* * Rewrite perl client for rspamd, now it allows access to both normal and con...Vsevolod Stakhov2009-01-191-2/+4
* * Fix errors in learning implementationVsevolod Stakhov2009-01-111-3/+7
* * Use binary tree in tokenizers, that would provide us fast checking for uniq...Vsevolod Stakhov2008-12-041-0/+12
* * Add learning interface to rspamd (still requires classifier to work)Vsevolod Stakhov2008-12-021-0/+18
* * Add simple implementation of OSB tokenizerVsevolod Stakhov2008-11-071-0/+45