index
:
rspamd.git
external-maps
libev-migration
log_json
master
mime-rework
rdns-tcp-rework
rework-symcache
rspamd-0.5
rspamd-0.6
rspamd-0.7
rspamd-0.8
rspamd-0.9
rspamd-1.0
rspamd-1.1
rspamd-1.2
rspamd-1.3
rspamd-1.4
rspamd-1.5
rspamd-1.6
rspamd-1.9
rspamd-3.10
rspamd-3.7
rspamd-3.8
rspamd-3.9
torch-removal
vstakhov-anonymize-mime
vstakhov-another-grow-factor-fix
vstakhov-ci-try
vstakhov-conf-reorg
vstakhov-cpu-detection
vstakhov-cumulative-tcp-timeout
vstakhov-fasttext-langdet
vstakhov-fix-2047-encode
vstakhov-fix-dcc
vstakhov-fuzzy-cxx
vstakhov-fuzzy-limits-display
vstakhov-fuzzy-tcp
vstakhov-gpt-ollama
vstakhov-keypair-encoding
vstakhov-known-senders
vstakhov-llm-anonymize
vstakhov-llm-embeddings
vstakhov-lua-text-api
vstakhov-new-hiredis
vstakhov-openssl-provider-message
vstakhov-remove-control-block
vstakhov-some-build-fixes
vstakhov-ssl-fixes
vstakhov-stringzilla
vstakhov-strip-attachments
vstakhov-surbl-conf-fix
vstakhov-universal-hashing-lua
vstakhov-utf8-mime
vstakhov-zstd-headers
Rapid spam filtering system: https://github.com/rspamd/rspamd
www-data
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
src
/
libserver
/
html.c
Commit message (
Expand
)
Author
Age
Files
Lines
*
[Minor] Oops, forgot to fill struct field
Vsevolod Stakhov
2020-03-23
1
-0
/
+1
*
[Rework] Urls: process query urls in HTML urls correctly
Vsevolod Stakhov
2020-03-22
1
-40
/
+39
*
[Minor] Oops, fix html urls processing
Vsevolod Stakhov
2020-03-12
1
-1
/
+1
*
[Minor] Fix bitset size
Vsevolod Stakhov
2020-03-11
1
-1
/
+1
*
[Rework] Urls: adopt html related stuff
Vsevolod Stakhov
2020-03-09
1
-97
/
+45
*
[Rework] Rework URL structure: adjust tld part
Vsevolod Stakhov
2020-03-09
1
-6
/
+6
*
[Rework] Rework URL structure: more structure optimisations
Vsevolod Stakhov
2020-03-09
1
-2
/
+2
*
[Rework] Rework URL structure: host field
Vsevolod Stakhov
2020-03-09
1
-7
/
+7
*
[Fix] Another brain damage html standard adoptions
Vsevolod Stakhov
2020-03-02
1
-3
/
+29
*
[Fix] Fix parsing of the html tags with no spaces after attributes
Vsevolod Stakhov
2020-03-02
1
-0
/
+5
*
[Minor] Fix stupid email clients entities 'guessing'
Vsevolod Stakhov
2020-02-18
1
-4
/
+36
*
[CritFix] Fix html entities decoding
Vsevolod Stakhov
2020-02-03
1
-2
/
+2
*
[Fix] Fix white on white rule and add is_leaf flag
Vsevolod Stakhov
2020-01-23
1
-6
/
+12
*
[Fix] More fixes in html tag content calculations
Vsevolod Stakhov
2020-01-09
1
-7
/
+40
*
[Rework] Rework HTML tags content attachment
Vsevolod Stakhov
2020-01-06
1
-25
/
+45
*
[Project] Track more memory allocations in a task
Vsevolod Stakhov
2019-12-23
1
-0
/
+2
*
[Fix] Fix base tag processing according to stupid HTML renderer behaviour
Vsevolod Stakhov
2019-12-16
1
-16
/
+8
*
[Minor] Fix some corner cases in HTML parsing
Vsevolod Stakhov
2019-10-11
1
-2
/
+5
*
[Minor] Fix compile warnings
Vsevolod Stakhov
2019-10-10
1
-3
/
+1
*
[Minor] Slightly improve debug logging
Vsevolod Stakhov
2019-09-04
1
-1
/
+4
*
[Rework] Rework image urls processing
Vsevolod Stakhov
2019-08-29
1
-2
/
+17
*
[Rework] Drop url tags
Vsevolod Stakhov
2019-08-21
1
-3
/
+0
*
[Minor] Rework utf8 lowercasing
Vsevolod Stakhov
2019-08-13
1
-1
/
+1
*
[Minor] Slight types improvement
Vsevolod Stakhov
2019-06-28
1
-2
/
+2
*
[Fix] Html: Fix processing of fjlig entity
Vsevolod Stakhov
2019-06-14
1
-5
/
+7
*
[Minor] HTML: Allow to extract base url from the tag
Vsevolod Stakhov
2019-05-13
1
-4
/
+8
*
[Fix] HTML: Fix `size` attribute processing
Vsevolod Stakhov
2019-04-27
1
-5
/
+4
*
[Fix] Fix processing of embedded urls
Vsevolod Stakhov
2019-04-09
1
-12
/
+14
*
[Rework] Rework HTML content urls extraction
Vsevolod Stakhov
2019-04-02
1
-2
/
+4
*
[Feature] Treat all tags with HREF as a potential hyperlinks
Vsevolod Stakhov
2019-03-20
1
-10
/
+7
*
[Minor] Improve style from the previous merge
Vsevolod Stakhov
2019-03-09
1
-22
/
+4
*
Merge pull request #2771 from miecio45/feat_url_visible_part
Vsevolod Stakhov
2019-03-09
1
-0
/
+26
|
\
|
*
[Fix] Fix memor leaks and whitespace processing
Miecio Za
2019-03-07
1
-5
/
+22
|
*
[Feature] Add flag to url object when visible part is url_like
Miecio Za
2019-02-27
1
-0
/
+3
|
*
[Feature] Export visible part of url to lua
Miecio Za
2019-02-27
1
-0
/
+6
*
|
[Minor] Ignore completely damaged urls
Vsevolod Stakhov
2019-03-04
1
-1
/
+3
*
|
[Minor] Fix crash when tld is absent
Vsevolod Stakhov
2019-03-04
1
-1
/
+2
*
|
[Rework] Rework telephone urls parsing logic
Vsevolod Stakhov
2019-03-01
1
-1
/
+2
|
/
*
[Minor] Fix logic of finding slashless urls
Vsevolod Stakhov
2019-02-25
1
-5
/
+8
*
[Fix] HTML: Another HTML comments exception fix
Vsevolod Stakhov
2019-02-25
1
-3
/
+26
*
[Minor] More heuristics in HTML urls detection
Vsevolod Stakhov
2019-02-21
1
-33
/
+39
*
[Minor] Oops, more crap filtering
Vsevolod Stakhov
2019-02-21
1
-0
/
+4
*
[Fix] Add filter for absurdic URLs
Vsevolod Stakhov
2019-02-21
1
-3
/
+19
*
[Fix] HTML: Fix some more SGML tags issues
Vsevolod Stakhov
2019-02-04
1
-2
/
+3
*
[Fix] HTML: Fix HTML comments with many dashes
Vsevolod Stakhov
2019-01-30
1
-1
/
+1
*
[Minor] Core: Improve url findings in queries
Vsevolod Stakhov
2019-01-29
1
-1
/
+1
*
[Minor] HTML: More corner cases in entities decoding
Vsevolod Stakhov
2019-01-24
1
-20
/
+49
*
[Fix] HTML: Another entities decoding logic fix
Vsevolod Stakhov
2019-01-24
1
-7
/
+12
*
[Fix] HTML: Fix entities in HTML attributes
Vsevolod Stakhov
2019-01-24
1
-9
/
+36
*
[CritFix] Html: Entities are not valid within tag params values
Vsevolod Stakhov
2019-01-23
1
-20
/
+10
[next]