path: root/src/libstat/tokenizers/osb.c
Commit message | Author | Age | Files | Lines
* [Project] Finish basic tasks in new unicode project | Vsevolod Stakhov | 2018-11-25 | 1 | -10/+20
* [Project] Rework stemming | Vsevolod Stakhov | 2018-11-24 | 1 | -3/+2
* [Feature] Skip stop words in statistics | Vsevolod Stakhov | 2018-11-15 | 1 | -8/+20
* [Fix] Rework bayes calculations... | Vsevolod Stakhov | 2018-11-14 | 1 | -1/+1
* [Minor] Further g_slice cleanup | Vsevolod Stakhov | 2017-10-28 | 1 | -2/+2
* [Feature] Add unigramms support in bayes | Vsevolod Stakhov | 2017-04-13 | 1 | -0/+12
* [Fix] Fix processing of small tokens vectors | Vsevolod Stakhov | 2017-04-04 | 1 | -3/+8
* [Rework] Set token data as uint64_t instead of chars array | Vsevolod Stakhov | 2017-04-04 | 1 | -5/+3
* [Minor] Some fixes for displaying tokens info | Vsevolod Stakhov | 2017-03-31 | 1 | -2/+3
* [Feature] Store text tokens inside bayes tokens | Vsevolod Stakhov | 2017-03-31 | 1 | -11/+22
* [Minor] Fix various style issues | Vsevolod Stakhov | 2017-03-23 | 1 | -1/+0
* [Rework] Use a special structure for stats tokens | Vsevolod Stakhov | 2017-02-14 | 1 | -4/+10
* [Feature] New abstract hashing API in cryptobox | Vsevolod Stakhov | 2016-05-10 | 1 | -3/+4
* Refactor UCL API | Vsevolod Stakhov | 2016-02-16 | 1 | -5/+5
* Switch the rest to apache 2 | Vsevolod Stakhov | 2016-02-04 | 1 | -21/+12
* Fix tokenization | Vsevolod Stakhov | 2016-01-05 | 1 | -94/+79
* Some more fixes to OSB algorithm | Vsevolod Stakhov | 2015-11-23 | 1 | -1/+4
* Fix format issues found by static analysis | Vsevolod Stakhov | 2015-11-11 | 1 | -1/+1
* Allow conditional build of snowball. | Vsevolod Stakhov | 2015-10-23 | 1 | -1/+0
* Fix statistics. | Vsevolod Stakhov | 2015-10-06 | 1 | -2/+2
* More logging updates. | Vsevolod Stakhov | 2015-08-29 | 1 | -7/+7
* Fix sqlite3 backend initialization. | Vsevolod Stakhov | 2015-07-27 | 1 | -0/+1
* Some more fixes to tokenizator init. | Vsevolod Stakhov | 2015-07-27 | 1 | -0/+6
* Fix issues with compatibility tokenization. | Vsevolod Stakhov | 2015-07-27 | 1 | -1/+9
* Fix tokenizers and mmapped file. | Vsevolod Stakhov | 2015-07-27 | 1 | -18/+49
* Use new tokenization by default for created statfiles. | Vsevolod Stakhov | 2015-07-27 | 1 | -1/+1
* Allow adding of prefix for tokenizers. | Vsevolod Stakhov | 2015-07-26 | 1 | -2/+14
* Save OSB window index inside token. | Vsevolod Stakhov | 2015-04-13 | 1 | -0/+2
* Use new siphash implementation. | Vsevolod Stakhov | 2015-04-08 | 1 | -5/+6
* Add compatibility layer for tokenization. | Vsevolod Stakhov | 2015-04-01 | 1 | -2/+63
* Rework osb configuration. | Vsevolod Stakhov | 2015-04-01 | 1 | -56/+112
* Rework tokenization: | Vsevolod Stakhov | 2015-02-23 | 1 | -29/+123
* Allow configurable tokenizers. | Vsevolod Stakhov | 2015-02-22 | 1 | -1/+1
* Add initial processing routines. | Vsevolod Stakhov | 2015-01-23 | 1 | -3/+3
* Rework types for tokenizers functions. | Vsevolod Stakhov | 2015-01-23 | 1 | -14/+7
* Rework statistics runtime structures. | Vsevolod Stakhov | 2015-01-23 | 1 | -3/+6
* New statistics token definition. | Vsevolod Stakhov | 2015-01-18 | 1 | -11/+6
* Reorganize statfiles and classifiers into libstat. | Vsevolod Stakhov | 2015-01-16 | 1 | -0/+122