aboutsummaryrefslogtreecommitdiffstats
path: root/src/libstat/tokenizers
Commit message (Expand)AuthorAgeFilesLines
* [Feature] Add unigramms support in bayesVsevolod Stakhov2017-04-131-0/+12
* [Minor] More strict boundaries checks and composites policies fixVsevolod Stakhov2017-04-091-0/+2
* [Fix] Fix processing of small tokens vectorsVsevolod Stakhov2017-04-041-3/+8
* [Rework] Set token data as uint64_t instead of chars arrayVsevolod Stakhov2017-04-042-17/+3
* [Minor] Some fixes for displaying tokens infoVsevolod Stakhov2017-03-311-2/+3
* [Feature] Store text tokens inside bayes tokensVsevolod Stakhov2017-03-312-11/+23
* [Minor] Fix various style issuesVsevolod Stakhov2017-03-231-1/+0
* [Minor] Use libicu for tokenizersVsevolod Stakhov2017-02-251-18/+22
* [Rework] Use a special structure for stats tokensVsevolod Stakhov2017-02-143-13/+26
* [Rework] Rework exceptions and newlines processingVsevolod Stakhov2016-07-131-9/+13
* [Fix] Switch hashes to mumhashVsevolod Stakhov2016-07-131-9/+12
* [Feature] New abstract hashing API in cryptoboxVsevolod Stakhov2016-05-101-3/+4
* Refactor UCL APIVsevolod Stakhov2016-02-161-5/+5
* Switch the rest to apache 2Vsevolod Stakhov2016-02-042-42/+24
* Fix tokenizationVsevolod Stakhov2016-01-052-119/+89
* Some more fixes to OSB algorithmVsevolod Stakhov2015-11-231-1/+4
* Implement words decaying for text parts.Vsevolod Stakhov2015-11-122-6/+65
* Fix format issues found by static analysisVsevolod Stakhov2015-11-112-2/+2
* Allow conditional build of snowball.Vsevolod Stakhov2015-10-231-1/+0
* Fix statistics.Vsevolod Stakhov2015-10-063-17/+16
* Rename main.h and main.c to `rspamd.X`Vsevolod Stakhov2015-09-222-2/+2
* More logging updates.Vsevolod Stakhov2015-08-291-7/+7
* Fix sqlite3 backend initialization.Vsevolod Stakhov2015-07-271-0/+1
* Some more fixes to tokenizator init.Vsevolod Stakhov2015-07-271-0/+6
* Fix issues with compatibility tokenization.Vsevolod Stakhov2015-07-271-1/+9
* Fix tokenizers and mmapped file.Vsevolod Stakhov2015-07-272-22/+57
* Fix stat processing.Vsevolod Stakhov2015-07-271-0/+4
* More changes to tokenization.Vsevolod Stakhov2015-07-271-2/+4
* Start tokenizers rework.Vsevolod Stakhov2015-07-271-4/+8
* Use new tokenization by default for created statfiles.Vsevolod Stakhov2015-07-271-1/+1
* Allow adding of prefix for tokenizers.Vsevolod Stakhov2015-07-262-4/+18
* Disable signatures detection as it breaks stuff.Vsevolod Stakhov2015-07-141-1/+1
* Implement skipping of signatures in text messages.Vsevolod Stakhov2015-07-142-13/+35
* Use not common name for tokenization exceptions.Vsevolod Stakhov2015-05-211-2/+2
* More fixes to tokenization.Vsevolod Stakhov2015-05-211-4/+7
* Fix critical bug in tokenization logic.Vsevolod Stakhov2015-05-201-1/+1
* Save OSB window index inside token.Vsevolod Stakhov2015-04-131-0/+2
* Use new siphash implementation.Vsevolod Stakhov2015-04-081-5/+6
* Fix tokenization of the last token in a message.Vsevolod Stakhov2015-04-021-1/+1
* Fix normalization and tokenization.Vsevolod Stakhov2015-04-021-1/+3
* Update remain on tokenization.Vsevolod Stakhov2015-04-011-0/+1
* Add new UTF8 tokenizer.Vsevolod Stakhov2015-04-012-23/+142
* Add compatibility layer for tokenization.Vsevolod Stakhov2015-04-013-5/+76
* Rework osb configuration.Vsevolod Stakhov2015-04-011-56/+112
* Save classifier configuration inside statfile config.Vsevolod Stakhov2015-04-012-4/+1
* Rework tokenization:Vsevolod Stakhov2015-02-233-43/+124
* Allow configurable tokenizers.Vsevolod Stakhov2015-02-222-3/+3
* Rework tokenization invocation.Vsevolod Stakhov2015-01-232-40/+0
* Add initial processing routines.Vsevolod Stakhov2015-01-232-7/+6
* Rework types for tokenizers functions.Vsevolod Stakhov2015-01-233-27/+17