path: root/src/fuzzy.c
Commit message | Author | Age | Files | Lines
* Another debian license fix. | Vsevolod Stakhov | 2012-09-10 | 1 | -1/+1
    Add apache license for regexp that were delivered from SpamAssassin project.
    Fix debian/copyright for src/dns.c.
* Update copyright (required by debian). | Vsevolod Stakhov | 2012-09-04 | 1 | -3/+3
* More cleanups and fixes for compiler warnings. (tag: 0.4.6) | Vsevolod Stakhov | 2011-12-20 | 1 | -3/+3
* Fix signness in arithmetic operations. | Vsevolod Stakhov | 2011-08-04 | 1 | -5/+5
* Fix coredumps when learning message with one empty part. | Vsevolod Stakhov | 2011-07-25 | 1 | -8/+10
* Another fix for comparing parts without content - two empty parts are equal. | Vsevolod Stakhov | 2011-07-14 | 1 | -1/+6
* Fix coredump on messages with one url only. | Vsevolod Stakhov | 2011-07-14 | 1 | -1/+1
* * Add new algorithm based on diff algorithm to compare relatively short text parts | Vsevolod Stakhov | 2011-07-13 | 1 | -6/+18
* * Add ability to get difference between two parts from lua code | Vsevolod Stakhov | 2011-07-12 | 1 | -0/+5
* * Make fuzzy hashes utf8 compatible. | Vsevolod Stakhov | 2011-07-12 | 1 | -34/+74
* * Welcome 0.4.0 | Vsevolod Stakhov | 2011-06-24 | 1 | -11/+11
    Uncompatible changes:
    - Statistics is uncompatible in utf8 mode
    Major changes:
    - Improved utf8 mode
    - Convert all characters to lowercase in statistics
    - Skip URL's in statistics
    - Improve speed of bayes classifier by using integer arithmetics
    - Fixed statfiles synchronization that was broken for a long time
    - Synchronization is now configurable
    Minor changes:
    - Bugfixes
    - Removed some of legacy code
    - Types polishing
* Oops, remove debug. | Vsevolod Stakhov | 2011-06-23 | 1 | -7/+0
* * Fixes to fuzzy hashing logic, skip urls while estimating fuzzy hash | Vsevolod Stakhov | 2011-06-23 | 1 | -4/+132
    Fix tags stripping.
    Fix phishing checks (ignore img tags).
* * Many fixes to fuzzy hashes logic and tokenization. | Vsevolod Stakhov | 2011-01-24 | 1 | -3/+27
* Fixes types (use glib ones) no functional change. | Vsevolod Stakhov | 2010-10-06 | 1 | -23/+23
    Now all comments in commit logs beginning with '*' would be included in changelog, so important changes would be separated from small ones.
* * Retab, no functional changes | Vsevolod Stakhov | 2009-10-02 | 1 | -49/+49
* * Make fuzzy storage working (tested checking, adding and deleting of fuzzy hashes from storage) | Vsevolod Stakhov | 2009-07-31 | 1 | -2/+2
    * Fix stupid bug in fuzzy distance calculations
* * Fix indexes in fuzzy hashes calculations | Vsevolod Stakhov | 2009-07-16 | 1 | -1/+1
    * Add optimization flags support
    * Enable -O by default
* * Avoid division by zero while comparing fuzzy hashes | Vsevolod Stakhov | 2009-04-28 | 1 | -0/+5
* * Add 3 functions: | Vsevolod Stakhov | 2009-03-30 | 1 | -1/+1
    - is_recipients_sorted
    - has_only_html_part
    - compare_recipients_distance
    * Update documentation
    * Fix build
* * Fix many compile time warnings and polish code | Vsevolod Stakhov | 2009-03-24 | 1 | -1/+1
* * Fix error in expression parser that causes bad errors with expressions that have regexp at the end | Vsevolod Stakhov | 2009-03-23 | 1 | -1/+12
    * Improve test for fuzzy hashes
    * Add new object - TextPart to perl XS library that allows access to stripped parts and fuzzy hashes
    * Add documentation for expressions parser and fot Mail::Rspamd::TextPart
    * Allways calculate fuzzy hash for text parts
    * Store text parts separately from other parts
    * Add compare_parts_distance for expressions that calculates difference in 2 parts messages
    * Do not try to substitute variables in empty strings
* * Add fuzzy hashes implementation (with adler-32 roll hash and FNV-32 hash) | Vsevolod Stakhov | 2009-03-20 | 1 | -0/+273
    * Add test case for fuzzy hashing check