Vsevolod Stakhov
14c13854d3
[Rework] Further types conversion (no functional changes)
vor 2 Monaten
Vsevolod Stakhov
6b2b416718
[Rework] Remove some of the GLib types in lieu of standard ones
This types have constant conflicts with the system ones especially on OSX.
vor 2 Monaten
Vsevolod Stakhov
1d0ff299f5
[Fix] Do not save multipatterns to FS in certain cases
vor 2 Monaten
Vsevolod Stakhov
75327b3889
[Project] Convert the remaining
vor 2 Monaten
Vsevolod Stakhov
8238dc0a23
[Project] Start optimization of userdata hashing
vor 2 Monaten
Vsevolod Stakhov
e6d7103b9e
[Minor] Fix unit tests
vor 6 Monaten
Vsevolod Stakhov
c203493c90
[Feature] Reiterate on gtube patterns
vor 6 Monaten
Vsevolod Stakhov
537a7180a0
[Rework] Use clang-format to unify formatting in all sources
No meaningful changes.
vor 10 Monaten
Vsevolod Stakhov
5fd7a90fda
[Feature] Add order to urls structure
vor 10 Monaten
Vsevolod Stakhov
c82c2ccc54
[Minor] Oops: I have written too long in C++
vor 10 Monaten
Vsevolod Stakhov
f0d4093ec0
[Feature] Process HTML parts before text ones
vor 10 Monaten
Vsevolod Stakhov
360bba643a
[Minor] Slightly change the text processing logic
vor 10 Monaten
Vsevolod Stakhov
60e234c400
[Minor] Compare parts with many words by just using their length
vor 11 Monaten
Vsevolod Stakhov
d46799f287
[Minor] Always report multipattern compile error
Issue: #4487
vor 1 Jahr
Vsevolod Stakhov
1a7b7d7076
[Feature] Add html parsing limit
vor 1 Jahr
Vsevolod Stakhov
bb5e5fb414
[Rework] Allow to set a different behaviour for actions from settings
Issue: #4025
vor 2 Jahren
Vsevolod Stakhov
513b6c6ab9
[Project] Start using of the new received structure
vor 2 Jahren
Vsevolod Stakhov
229cf1c94b
[Fix] Add a special logic for text part with no text extraction
vor 2 Jahren
Vsevolod Stakhov
86e3a152fb
[Minor] Do not go into utf8 checks on non utf8 leading characters
vor 2 Jahren
Vsevolod Stakhov
118b5de7b9
[Minor] Fix spaces state in newlines removal SM
vor 2 Jahren
Vsevolod Stakhov
6c3399e7b9
[Minor] Remove bogus likely
vor 2 Jahren
Vsevolod Stakhov
d4871e22d6
[Minor] Create a standard io loop in rspamadm
vor 2 Jahren
Vsevolod Stakhov
b6be849d7d
[Minor] Enable css parser by default
vor 2 Jahren
Vsevolod Stakhov
0e21fac34d
[Minor] Lemove legacy
vor 2 Jahren
Vsevolod Stakhov
66fd827632
[Minor] Fix format string for size_t
vor 3 Jahren
Vsevolod Stakhov
1d3c9379b9
[Rework] Html: Deal with the utf_content part
vor 3 Jahren
Vsevolod Stakhov
c017695895
[Minor] Move html code to a separate subdir (no functional changes)
vor 3 Jahren
Vsevolod Stakhov
24f85d1c06
[Minor] Fix broken tests suite
vor 3 Jahren
Vsevolod Stakhov
40db5f6260
[Project] Css: Enable conditional css parsing support from the HTML parser
vor 3 Jahren
Vsevolod Stakhov
70c783d20f
[Minor] Fix oob read
vor 3 Jahren
Vsevolod Stakhov
e170d0f7d9
[Minor] Avoid extra sort
vor 3 Jahren
Vsevolod Stakhov
60173e2236
[Minor] Sort urls for mime parts
vor 3 Jahren
Vsevolod Stakhov
87f9e50500
[Minor] Move urls find in subject to task process stage
vor 3 Jahren
Vsevolod Stakhov
95106e8145
[Minor] Fix rare case of off-by-one read
vor 3 Jahren
Vsevolod Stakhov
4aee16e53d
[Minor] Fix rspamd_has_only_html_part function + refactor macro names
vor 3 Jahren
Vsevolod Stakhov
8b18d032b3
[Fix] Fix html attachments checks
vor 3 Jahren
Vsevolod Stakhov
be2dc6559f
[Minor] Do not count skipped words
vor 3 Jahren
Vsevolod Stakhov
ffdde528bb
[Minor] Do not count empty words
vor 3 Jahren
Vsevolod Stakhov
4d1cd21282
[Minor] Avoid code duplication
vor 4 Jahren
Vsevolod Stakhov
5dd1ccbb33
[Minor] Allow attaching of urls to the mime parts
vor 4 Jahren
Vsevolod Stakhov
c6822d66ed
[Minor] Try to process images before text part heuristic
vor 4 Jahren
Vsevolod Stakhov
b80e8b8477
[Project] Rework some more functions to work with shadow results
vor 4 Jahren
Vsevolod Stakhov
f605d67050
[Rework] URL: Another update for urls extraction logic
URL extraction from HTML parts should look like this:
1. Extract href links
2. Convert HTML to plain text and extract:
a) (http|https|ftp)://foo.bar and www.foo
b) email like strings \bfoo@bar.baz\b .
For all extracted strings check if we have host with a domain from the public suffix.
vor 4 Jahren
Vsevolod Stakhov
9e7bf606f9
[Rework] Change the way to extract URLs when dealing with alternative parts
vor 4 Jahren
Vsevolod Stakhov
5cb7a60c65
[Rework] Urls: adopt html related stuff
vor 4 Jahren
Vsevolod Stakhov
50a043a7cb
[Rework] Urls: more rework of the urls sets
vor 4 Jahren
Vsevolod Stakhov
cfdf17ea1a
[Fix] Avoid collisions in mempool variables by changing fuzzy caching logic
vor 4 Jahren
Vsevolod Stakhov
7e4aa922ca
[Fix] Fix smtp message on passthrough result
Issue: #3269
vor 4 Jahren
Vsevolod Stakhov
5ebcabcff3
[Minor] Remove non-reject gtube patterns unless enable_test_patterns is enabled
vor 4 Jahren
Vsevolod Stakhov
24f07eac1d
[Project] Track more memory allocations in a task
vor 4 Jahren