Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | [Project] Finish rework of passthrough actions | Vsevolod Stakhov | 2018-10-11 | 1 | -13/+11 |
| | |||||
* | [Rework] Always create result for a task | Vsevolod Stakhov | 2018-10-11 | 1 | -1/+1 |
| | |||||
* | [Minor] Fix CMA calculations | Vsevolod Stakhov | 2018-10-06 | 1 | -6/+11 |
| | |||||
* | [Minor] Insert spaces only if text part is not suspected to be a link | Mikhail Galanin | 2018-09-18 | 1 | -2/+14 |
| | | | | | | | sa_body requires newlines to be replaces with spaces. However, some email clients highlight URL in triangular braces (<>) even if it's broken by a newline. Let's just remove \n if current text part potentially can be such URL. | ||||
* | [Minor] Plug utexts leak | Vsevolod Stakhov | 2018-09-11 | 1 | -0/+5 |
| | |||||
* | [Fix] Fix various corner cases for language detection | Vsevolod Stakhov | 2018-09-08 | 1 | -1/+5 |
| | |||||
* | [Minor] Check language detector prior to usage | Vsevolod Stakhov | 2018-09-07 | 1 | -1/+1 |
| | |||||
* | [Rework] Rework language detector | Vsevolod Stakhov | 2018-09-07 | 1 | -32/+3 |
| | |||||
* | [Feature] Implement new text tokenizer based on libicu | Vsevolod Stakhov | 2018-09-06 | 1 | -15/+8 |
| | |||||
* | [Minor] Add UText wrapper for stripped content | Vsevolod Stakhov | 2018-09-06 | 1 | -1/+14 |
| | |||||
* | [Minor] Sigh, fix non-empty parts this time | Vsevolod Stakhov | 2018-09-05 | 1 | -3/+2 |
| | |||||
* | [Minor] Fix processing of empty parts | Vsevolod Stakhov | 2018-09-05 | 1 | -16/+23 |
| | |||||
* | [Rework] Rework utf content processing in text parts | Vsevolod Stakhov | 2018-09-05 | 1 | -127/+148 |
| | | | | | | - Store unicode in UTF parts - Store unicode for HTML parts - Rename struct fields and split them into unicode/utf components | ||||
* | [Project] Further changes in unicode operations | Vsevolod Stakhov | 2018-08-23 | 1 | -9/+6 |
| | | | | | | | * Normalise unicode * Add normality flag for text parts * Store UCS in text parts * Rework unicode conversions and operations | ||||
* | [Project] Start unicode rework | Vsevolod Stakhov | 2018-08-23 | 1 | -4/+12 |
| | |||||
* | [Minor] Include subject in task digest | Vsevolod Stakhov | 2018-08-14 | 1 | -0/+1 |
| | |||||
* | [Fix] Fix messages insertion | Vsevolod Stakhov | 2018-07-31 | 1 | -3/+7 |
| | |||||
* | [Fix] Fix HTML part detection heuristic | Vsevolod Stakhov | 2018-07-28 | 1 | -2/+4 |
| | |||||
* | [Fix] Detect empty text part as text, not HTML | Mikhail Galanin | 2018-07-27 | 1 | -5/+10 |
| | | | | Issue: #2367 | ||||
* | [Minor] Fix gtube detection if nothing else is presented | Vsevolod Stakhov | 2018-06-26 | 1 | -1/+1 |
| | |||||
* | [Feature] Split message parsing and processing | Vsevolod Stakhov | 2018-06-06 | 1 | -39/+45 |
| | |||||
* | [Minor] Improve performance of language detector | Vsevolod Stakhov | 2018-04-17 | 1 | -2/+1 |
| | |||||
* | [Minor] Fix condition | Vsevolod Stakhov | 2018-04-06 | 1 | -76/+75 |
| | |||||
* | [Feature] Do not detect language twice for relative parts | Vsevolod Stakhov | 2018-04-06 | 1 | -117/+177 |
| | |||||
* | [Minor] Preserve more information in the artificial header | Vsevolod Stakhov | 2018-04-04 | 1 | -0/+9 |
| | | | | Issue: #2143 | ||||
* | [Minor] Fix out-of-boundary access | Vsevolod Stakhov | 2018-03-27 | 1 | -1/+5 |
| | |||||
* | [Fix] Rework logic of actions setting | Vsevolod Stakhov | 2018-03-22 | 1 | -2/+0 |
| | |||||
* | [Fix] Deal with deeply nested messages more aggressively | Vsevolod Stakhov | 2018-03-12 | 1 | -1/+17 |
| | |||||
* | [Feature] Try to detect HTML messages better | Vsevolod Stakhov | 2018-02-03 | 1 | -1/+17 |
| | |||||
* | [Minor] Improve language detection debug logging | Vsevolod Stakhov | 2018-01-15 | 1 | -1/+2 |
| | |||||
* | [Fix] Various improvements in language detection | Vsevolod Stakhov | 2018-01-13 | 1 | -5/+7 |
| | |||||
* | [Project] Preliminary version of ngramms based language detector | Vsevolod Stakhov | 2018-01-13 | 1 | -1/+12 |
| | |||||
* | [Project] Remove old language detector | Vsevolod Stakhov | 2018-01-13 | 1 | -176/+36 |
| | |||||
* | [Project] Add unigramms to language detection as well | Vsevolod Stakhov | 2018-01-01 | 1 | -2/+16 |
| | |||||
* | [Minor] Check gtube in larger messages | Vsevolod Stakhov | 2017-12-29 | 1 | -1/+1 |
| | |||||
* | [Feature] Add more gtube like patterns to test other spam actions | Vsevolod Stakhov | 2017-12-24 | 1 | -15/+74 |
| | |||||
* | [CritFix] Fix newlines detection | Vsevolod Stakhov | 2017-10-21 | 1 | -0/+7 |
| | | | | MFH: rspamd-1.6 | ||||
* | [Fix] Further tokenization fixes | Vsevolod Stakhov | 2017-10-21 | 1 | -2/+2 |
| | | | | MFH: rspamd-1.6 | ||||
* | [CritFix] Another portion of tokenization fixes | Vsevolod Stakhov | 2017-10-18 | 1 | -4/+14 |
| | | | | MFH: rspamd-1.6 | ||||
* | [Fix] Lowercase words | Vsevolod Stakhov | 2017-10-18 | 1 | -2/+0 |
| | | | | MFH: rspamd-1.6 | ||||
* | [Minor] Avoid out of boundary reads | Vsevolod Stakhov | 2017-10-14 | 1 | -2/+2 |
| | |||||
* | [Feature] Add more text attributes | Vsevolod Stakhov | 2017-09-23 | 1 | -0/+10 |
| | |||||
* | [Fix] Fix get_content method | Vsevolod Stakhov | 2017-07-18 | 1 | -1/+1 |
| | | | | Issue: #1759 | ||||
* | [Minor] Fix element added | Vsevolod Stakhov | 2017-06-22 | 1 | -4/+4 |
| | |||||
* | [Feature] Add text_part:get_stats function | Vsevolod Stakhov | 2017-06-21 | 1 | -2/+2 |
| | |||||
* | [Minor] Add some more data to the spaces strip algorithm | Vsevolod Stakhov | 2017-06-21 | 1 | -0/+12 |
| | |||||
* | [Feature] Store more data when stripping newlines | Vsevolod Stakhov | 2017-06-21 | 1 | -25/+70 |
| | |||||
* | [Feature] Rework newlines strip parser one more time | Vsevolod Stakhov | 2017-06-21 | 1 | -0/+146 |
| | | | | Issue: #1687 | ||||
* | [Feature] Support MTA name header | Vsevolod Stakhov | 2017-05-31 | 1 | -1/+3 |
| | |||||
* | [Feature] Reuse stemmers in the cache | Vsevolod Stakhov | 2017-05-13 | 1 | -8/+21 |
| |