index : rspamd.git
Rapid spam filtering system: https://github.com/rspamd/rspamd
path: root/src/libserver/html.c
| Commit message | Author | Age | Files | Lines |
|---|---|---|---|---|
| [Minor] Move html code to a separate subdir (no functional changes) | Vsevolod Stakhov | 2021-05-20 | 1 | -3423/+0 |
| [Rework] Use C++ version for unicode normalisation | Vsevolod Stakhov | 2021-05-17 | 1 | -1/+1 |
| [Rework] Use C++ utf8 library with unit tests to trim whitespaces | Vsevolod Stakhov | 2021-05-14 | 1 | -37/+3 |
| [Minor] Strip visible parts of urls using utf rules | Vsevolod Stakhov | 2021-05-14 | 1 | -2/+37 |
| [Minor] Do not treat unnormalised urls as obscured | Vsevolod Stakhov | 2021-05-14 | 1 | -4/+0 |
| [Minor] One more fix in the ZW spaces urls processing logic | Vsevolod Stakhov | 2021-05-13 | 1 | -9/+2 |
| [Fix] Fix normalisation flags propagation | Vsevolod Stakhov | 2021-05-11 | 1 | -15/+4 |
| [Rework] Rename phished url to a linked url | Vsevolod Stakhov | 2021-04-19 | 1 | -1/+1 |
| [Minor] Avoid FP when a protocol prefix is implicitly added | Vsevolod Stakhov | 2021-04-15 | 1 | -2/+2 |
| [Minor] Propagate images flag | Vsevolod Stakhov | 2021-04-14 | 1 | -3/+16 |
| [Project] Css: Implement styles merging | Vsevolod Stakhov | 2021-03-29 | 1 | -1/+2 |
| [Project] Css: Enable conditional css parsing support from the HTML parser | Vsevolod Stakhov | 2021-03-26 | 1 | -3/+42 |
| [Minor] Fix urls count tracking logic | Vsevolod Stakhov | 2021-03-24 | 1 | -0/+2 |
| [Fix] Urls: Fix processing of html urls when it comes to the flags (Issue: #3664) | Vsevolod Stakhov | 2021-03-06 | 1 | -3/+8 |
| [Minor] Try to find some obfuscation attemtps (Issue: #3637) | Vsevolod Stakhov | 2021-03-05 | 1 | -2/+22 |
| [Minor] Various fixes for display link detection | Vsevolod Stakhov | 2021-03-05 | 1 | -6/+18 |
| [Minor] HTML: Extract urls from `action` attribute | Vsevolod Stakhov | 2021-02-02 | 1 | -1/+6 |
| [Minor] Temporary workaround (should be fixed properly at some point) | Vsevolod Stakhov | 2021-01-20 | 1 | -1/+2 |
| [Fix] Html: Attach inline tags to the structure | Vsevolod Stakhov | 2021-01-19 | 1 | -3/+11 |
| [Fix] Html: Do not treat empty tags as block tags | Vsevolod Stakhov | 2021-01-12 | 1 | -1/+1 |
| [Fix] Do not process links in ignored html tags | Vsevolod Stakhov | 2021-01-06 | 1 | -2/+2 |
| [Feature] Extract text from img alt attributes | Vsevolod Stakhov | 2021-01-06 | 1 | -3/+20 |
| [Fix] Html: Add entities collisions prevention logic (e.g. for mathml entities) | Vsevolod Stakhov | 2020-10-13 | 1 | -1/+58 |
| [Minor] Oops, fix crash | Vsevolod Stakhov | 2020-07-16 | 1 | -0/+2 |
| [Fix] Exclude damaged urls from html parser | Vsevolod Stakhov | 2020-07-16 | 1 | -1/+1 |
| [Minor] Add link tag basic processing | Vsevolod Stakhov | 2020-07-16 | 1 | -0/+32 |
| [Minor] Ignore data urls | Vsevolod Stakhov | 2020-07-16 | 1 | -3/+7 |
| [Minor] Fix data images processing in html links | Vsevolod Stakhov | 2020-07-16 | 1 | -2/+6 |
| [Minor] Add one more boundary check | Vsevolod Stakhov | 2020-06-08 | 1 | -1/+1 |
| [Minor] Fix corner case in html escaping | Vsevolod Stakhov | 2020-06-03 | 1 | -6/+12 |
| [Minor] Allow attaching of urls to the mime parts | Vsevolod Stakhov | 2020-05-05 | 1 | -9/+27 |
| [Fix] One more fix to skip images that are not urls | Vsevolod Stakhov | 2020-05-01 | 1 | -7/+10 |
| Revert "[Minor] Do not append unbalanced closing tags" (This reverts commit e1339c646f9a910f4cc1805020af35a7c1f82a1d.) | Vsevolod Stakhov | 2020-04-30 | 1 | -14/+15 |
| [Minor] Use more strict checks for image urls | Vsevolod Stakhov | 2020-04-30 | 1 | -4/+10 |
| [Minor] Do not append unbalanced closing tags | Vsevolod Stakhov | 2020-04-27 | 1 | -15/+14 |
| [Minor] Oops, forgot to fill struct field | Vsevolod Stakhov | 2020-03-23 | 1 | -0/+1 |
| [Rework] Urls: process query urls in HTML urls correctly | Vsevolod Stakhov | 2020-03-22 | 1 | -40/+39 |
| [Minor] Oops, fix html urls processing | Vsevolod Stakhov | 2020-03-12 | 1 | -1/+1 |
| [Minor] Fix bitset size | Vsevolod Stakhov | 2020-03-11 | 1 | -1/+1 |
| [Rework] Urls: adopt html related stuff | Vsevolod Stakhov | 2020-03-09 | 1 | -97/+45 |
| [Rework] Rework URL structure: adjust tld part | Vsevolod Stakhov | 2020-03-09 | 1 | -6/+6 |
| [Rework] Rework URL structure: more structure optimisations | Vsevolod Stakhov | 2020-03-09 | 1 | -2/+2 |
| [Rework] Rework URL structure: host field | Vsevolod Stakhov | 2020-03-09 | 1 | -7/+7 |
| [Fix] Another brain damage html standard adoptions | Vsevolod Stakhov | 2020-03-02 | 1 | -3/+29 |
| [Fix] Fix parsing of the html tags with no spaces after attributes | Vsevolod Stakhov | 2020-03-02 | 1 | -0/+5 |
| [Minor] Fix stupid email clients entities 'guessing' | Vsevolod Stakhov | 2020-02-18 | 1 | -4/+36 |
| [CritFix] Fix html entities decoding | Vsevolod Stakhov | 2020-02-03 | 1 | -2/+2 |
| [Fix] Fix white on white rule and add is_leaf flag | Vsevolod Stakhov | 2020-01-23 | 1 | -6/+12 |
| [Fix] More fixes in html tag content calculations | Vsevolod Stakhov | 2020-01-09 | 1 | -7/+40 |
| [Rework] Rework HTML tags content attachment | Vsevolod Stakhov | 2020-01-06 | 1 | -25/+45 |