index
:
rspamd.git
external-maps
libev-migration
log_json
master
mime-rework
rdns-tcp-rework
rework-symcache
rspamd-0.5
rspamd-0.6
rspamd-0.7
rspamd-0.8
rspamd-0.9
rspamd-1.0
rspamd-1.1
rspamd-1.2
rspamd-1.3
rspamd-1.4
rspamd-1.5
rspamd-1.6
rspamd-1.9
rspamd-3.10
rspamd-3.7
rspamd-3.8
rspamd-3.9
torch-removal
vstakhov-anonymize-mime
vstakhov-another-grow-factor-fix
vstakhov-ci-try
vstakhov-conf-reorg
vstakhov-cpu-detection
vstakhov-cumulative-tcp-timeout
vstakhov-fasttext-langdet
vstakhov-fix-2047-encode
vstakhov-fix-dcc
vstakhov-fuzzy-cxx
vstakhov-fuzzy-limits-display
vstakhov-fuzzy-noop
vstakhov-fuzzy-tcp
vstakhov-gpt-ollama
vstakhov-keypair-encoding
vstakhov-known-senders
vstakhov-llm-anonymize
vstakhov-llm-embeddings
vstakhov-lua-shingles
vstakhov-lua-text-api
vstakhov-new-hiredis
vstakhov-openssl-provider-message
vstakhov-redis-pool-fixes
vstakhov-remove-control-block
vstakhov-some-build-fixes
vstakhov-ssl-fixes
vstakhov-stringzilla
vstakhov-strip-attachments
vstakhov-surbl-conf-fix
vstakhov-universal-hashing-lua
vstakhov-utf8-mime
vstakhov-zstd-headers
Rapid spam filtering system: https://github.com/rspamd/rspamd
www-data
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
src
/
libstat
/
tokenizers
/
tokenizers.c
Commit message (
Expand
)
Author
Age
Files
Lines
...
*
[Minor] Use libicu for tokenizers
Vsevolod Stakhov
2017-02-25
1
-18
/
+22
*
[Rework] Use a special structure for stats tokens
Vsevolod Stakhov
2017-02-14
1
-8
/
+15
*
[Rework] Rework exceptions and newlines processing
Vsevolod Stakhov
2016-07-13
1
-9
/
+13
*
[Fix] Switch hashes to mumhash
Vsevolod Stakhov
2016-07-13
1
-9
/
+12
*
Switch the rest to apache 2
Vsevolod Stakhov
2016-02-04
1
-21
/
+12
*
Implement words decaying for text parts.
Vsevolod Stakhov
2015-11-12
1
-4
/
+63
*
Fix format issues found by static analysis
Vsevolod Stakhov
2015-11-11
1
-1
/
+1
*
Fix statistics.
Vsevolod Stakhov
2015-10-06
1
-14
/
+13
*
Rename main.h and main.c to `rspamd.X`
Vsevolod Stakhov
2015-09-22
1
-1
/
+1
*
Disable signatures detection as it breaks stuff.
Vsevolod Stakhov
2015-07-14
1
-1
/
+1
*
Implement skipping of signatures in text messages.
Vsevolod Stakhov
2015-07-14
1
-12
/
+33
*
Use not common name for tokenization exceptions.
Vsevolod Stakhov
2015-05-21
1
-2
/
+2
*
More fixes to tokenization.
Vsevolod Stakhov
2015-05-21
1
-4
/
+7
*
Fix critical bug in tokenization logic.
Vsevolod Stakhov
2015-05-20
1
-1
/
+1
*
Fix tokenization of the last token in a message.
Vsevolod Stakhov
2015-04-02
1
-1
/
+1
*
Fix normalization and tokenization.
Vsevolod Stakhov
2015-04-02
1
-1
/
+3
*
Update remain on tokenization.
Vsevolod Stakhov
2015-04-01
1
-0
/
+1
*
Add new UTF8 tokenizer.
Vsevolod Stakhov
2015-04-01
1
-22
/
+141
*
Add compatibility layer for tokenization.
Vsevolod Stakhov
2015-04-01
1
-1
/
+1
*
Save classifier configuration inside statfile config.
Vsevolod Stakhov
2015-04-01
1
-1
/
+1
*
Rework tokenization:
Vsevolod Stakhov
2015-02-23
1
-13
/
+0
*
Rework tokenization invocation.
Vsevolod Stakhov
2015-01-23
1
-37
/
+0
*
Add initial processing routines.
Vsevolod Stakhov
2015-01-23
1
-4
/
+3
*
Rework types for tokenizers functions.
Vsevolod Stakhov
2015-01-23
1
-1
/
+1
*
Reorganize libstat API.
Vsevolod Stakhov
2015-01-23
1
-18
/
+0
*
Rework statistics runtime structures.
Vsevolod Stakhov
2015-01-23
1
-1
/
+1
*
New statistics token definition.
Vsevolod Stakhov
2015-01-18
1
-3
/
+3
*
Start refactoring of statistics in rspamd.
Vsevolod Stakhov
2015-01-18
1
-3
/
+2
*
Reorganize statfiles and classifiers into libstat.
Vsevolod Stakhov
2015-01-16
1
-0
/
+260
[prev]