Vsevolod Stakhov
14c13854d3
[Rework] Further types conversion (no functional changes)
2 months ago
Vsevolod Stakhov
d5e76f0a91
[Minor] Fix pointers operations
10 months ago
Vsevolod Stakhov
537a7180a0
[Rework] Use clang-format to unify formatting in all sources
No meaningful changes.
10 months ago
Vsevolod Stakhov
5fd7a90fda
[Feature] Add order to urls structure
10 months ago
Vsevolod Stakhov
eef2f3cac7
[Feature] Reorganise struct rspamd_url to be 64 bytes size
10 months ago
Vsevolod Stakhov
1496f881cf
[Minor] Parse remaining content when reaching html limit
11 months ago
Vsevolod Stakhov
0babcc489b
[Test] Fix unit tests
1 year ago
Vsevolod Stakhov
1a7b7d7076
[Feature] Add html parsing limit
1 year ago
Heiko Becker
0a916c632e
Fix build with gcc 13 by including <cstdint>
Like other versions before, gcc 13 moved some includes around and as a
result <cstdint> is no longer transitively included. Explicitly include
it for uint8_t.
1 year ago
Vsevolod Stakhov
acb8e2cd80
[Fix] Try to fix parsing of the unencoded `>` characters in html attributes
1 year ago
Vsevolod Stakhov
969902f045
[Rework] Use another version of hash table from the same author
1 year ago
Vsevolod Stakhov
ef699ea4d7
[Rework] Switch minimum C++ standard version to C++20
1 year ago
Vsevolod Stakhov
48297adf96
[Minor] Simplify array_of using types deduction
2 years ago
Josh Soref
2b8e6958f4
Spelling (#4086)
[Rework] Massive spelling fix from @jsoref
2 years ago
Vsevolod Stakhov
b1bef4702c
[Fix] When checking for phishing, we need to convert punycode -> UTF8, not vice versa
Previously, Rspamd checked for the `xn--` signature only to convert it back to the `xn--` form.
According to the expected results, it should do the opposite conversion.
Issue: #4065
2 years ago
Vsevolod Stakhov
246e62c22b
[Minor] Damn C++ implicit conversions
2 years ago
Vsevolod Stakhov
420a261aa4
[Minor] Set special flag for non A generated html urls
2 years ago
Vsevolod Stakhov
84aef0280c
[Minor] Really set a sane font size
2 years ago
Vsevolod Stakhov
0983f59671
[Minor] Add type safety and fix default font size
2 years ago
Vsevolod Stakhov
ada2d8000b
[Minor] Another ubsan catch
2 years ago
Vsevolod Stakhov
03f73692a9
[Minor] Fix enum handling
2 years ago
Vsevolod Stakhov
5aac0be13a
[Minor] Add safe-guard for the root tag
2 years ago
Vsevolod Stakhov
40101edb0e
[Fix] Fix logic in HTML processing FSM
2 years ago
Vsevolod Stakhov
90dfe4e87a
[Minor] Fix triggering of the suspicious url rule on percent encoding
2 years ago
Vsevolod Stakhov
d4863f6e33
[Minor] Fix for old compilers
2 years ago
Vsevolod Stakhov
708530514c
[Fix] Find suspicious url encodings that could break url extraction
2 years ago
Vsevolod Stakhov
e4621c415e
[Minor] Add explicit cast
2 years ago
Vsevolod Stakhov
52783f990b
[Minor] Fix one more issue
2 years ago
Vsevolod Stakhov
15701e9dc2
[Fix] Html: Fix the case where only bgcolor is explicitly set
2 years ago
Vsevolod Stakhov
b5cdc5c6b3
[Minor] Do not append style and title tags in invisible content
2 years ago
Vsevolod Stakhov
4f254839f8
[Fix] Fix parsing of the unquoted attributes in HTML
2 years ago
Vsevolod Stakhov
a26db85d6a
[Minor] Add a special state to tags content parsing to avoid illegal lookahead
2 years ago
Vsevolod Stakhov
5e8c168fcc
[Minor] Oops, fix emplacement
2 years ago
Vsevolod Stakhov
416da198db
[Minor] Some minor cleanups
Found by: coverity scan
2 years ago
Vsevolod Stakhov
6f5881e0bb
[Minor] Escape \uFFFD in the sources
2 years ago
Vsevolod Stakhov
4c6234a1a0
[Fix] Fix some complicated case with the closing tags parsing
2 years ago
Vsevolod Stakhov
909a594f8d
[Rework] Save invisible content to a separate buffer
2 years ago
Vsevolod Stakhov
c33af58a53
[Minor] Fix stupid memory leak
2 years ago
Vsevolod Stakhov
2e6f1ebd70
[Minor] Store multiple base tags (but use only the first one, as usual)
2 years ago
Vsevolod Stakhov
e966950fb9
[Minor] Fix small href values when a base url is present
2 years ago
Vsevolod Stakhov
ac1c49b2e8
[Minor] Set invisible flag on invisible urls
2 years ago
Vsevolod Stakhov
67e271be5c
[Minor] Restore the old behaviour
2 years ago
Vsevolod Stakhov
db00be0b4a
[Minor] Restore old hacks for flags propagation
2 years ago
Vsevolod Stakhov
4e02182eae
[Minor] Improve readability
Suggested by: @citrin
2 years ago
Vsevolod Stakhov
e947bbb495
[Minor] One more fix for checking displayed urls
2 years ago
Vsevolod Stakhov
c4ec1883f7
[Minor] Remove redundant cycle
2 years ago
Vsevolod Stakhov
6c2d6e771a
[Minor] Oops, fix misprint
2 years ago
Vsevolod Stakhov
572071dd6d
[Fix] Fix phishing flag set
2 years ago
Vsevolod Stakhov
b369727b44
[Minor] Further fixes to the html tags content methods
2 years ago
Vsevolod Stakhov
7d3e0fc85e
[Minor] Some fixes in content extraction for html tags
2 years ago