index
:
rspamd.git
external-maps
libev-migration
log_json
master
mime-rework
rdns-tcp-rework
rework-symcache
rspamd-0.5
rspamd-0.6
rspamd-0.7
rspamd-0.8
rspamd-0.9
rspamd-1.0
rspamd-1.1
rspamd-1.2
rspamd-1.3
rspamd-1.4
rspamd-1.5
rspamd-1.6
rspamd-1.9
rspamd-3.10
rspamd-3.7
rspamd-3.8
rspamd-3.9
torch-removal
vstakhov-anonymize-mime
vstakhov-another-grow-factor-fix
vstakhov-ci-try
vstakhov-conf-reorg
vstakhov-cpu-detection
vstakhov-cumulative-tcp-timeout
vstakhov-fasttext-langdet
vstakhov-fix-2047-encode
vstakhov-fix-dcc
vstakhov-fuzzy-cxx
vstakhov-fuzzy-limits-display
vstakhov-fuzzy-tcp
vstakhov-gpt-ollama
vstakhov-keypair-encoding
vstakhov-known-senders
vstakhov-llm-anonymize
vstakhov-llm-embeddings
vstakhov-lua-text-api
vstakhov-new-hiredis
vstakhov-openssl-provider-message
vstakhov-remove-control-block
vstakhov-some-build-fixes
vstakhov-ssl-fixes
vstakhov-stringzilla
vstakhov-strip-attachments
vstakhov-surbl-conf-fix
vstakhov-universal-hashing-lua
vstakhov-utf8-mime
vstakhov-zstd-headers
Rapid spam filtering system: https://github.com/rspamd/rspamd
www-data
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
src
/
html.c
Commit message (
Expand
)
Author
Age
Files
Lines
*
* First commit to implement multi-statfile filter system with new learning me...
Vsevolod Stakhov
2011-07-12
1
-25
/
+2
*
* Make fuzzy hashes utf8 compatible.
Vsevolod Stakhov
2011-07-12
1
-1
/
+1
*
Fix phishing detection with img flag.
Vsevolod Stakhov
2011-07-11
1
-16
/
+32
*
* Fixes to fuzzy hashing logic, skip urls while estimating fuzzy hash
Vsevolod Stakhov
2011-06-23
1
-7
/
+5
*
* Fix phishing detector to find phished urls with tags inside
Vsevolod Stakhov
2011-04-19
1
-4
/
+27
*
* Add ability to extract urls from subject field
0.3.10
Vsevolod Stakhov
2011-03-23
1
-1
/
+7
*
Fix error with parsing phishing urls.
Vsevolod Stakhov
2011-03-17
1
-1
/
+1
*
Fix phishing check for special cases like http://host.com and http://www.host...
Vsevolod Stakhov
2011-03-14
1
-5
/
+37
*
Small fix.
Vsevolod Stakhov
2011-03-14
1
-1
/
+1
*
Make phishing checks working.
Vsevolod Stakhov
2011-03-05
1
-6
/
+7
*
Try to fix memory issues.
Vsevolod Stakhov
2011-03-02
1
-4
/
+12
*
Fix stupid bug in url parser.
Vsevolod Stakhov
2011-02-25
1
-1
/
+1
*
* Rewrite URL storage system
Vsevolod Stakhov
2011-02-24
1
-1
/
+3
*
Fix error with tags like <? xml ?>
Vsevolod Stakhov
2011-01-25
1
-1
/
+3
*
* Many fixes to fuzzy hashes logic and tokenization.
Vsevolod Stakhov
2011-01-24
1
-1
/
+7
*
Detect mailto: inside <a> and <img> tags.
Vsevolod Stakhov
2010-12-01
1
-1
/
+2
*
Make own strlcpy that does not calculate remaining string length (faster and ...
Vsevolod Stakhov
2010-11-16
1
-1
/
+1
*
* Add ability to obtain phished url from lua
Vsevolod Stakhov
2010-11-12
1
-0
/
+1
*
Urgent fixes.
Vsevolod Stakhov
2010-11-03
1
-1
/
+1
*
* Add phishing detector (now just compares <a href> with tag's data).
Vsevolod Stakhov
2010-11-02
1
-19
/
+55
*
Fixes types (use glib ones) no functional change.
Vsevolod Stakhov
2010-10-06
1
-22
/
+22
*
* Make improvements to HTML entites decoder: now it replaces entities with co...
Vsevolod Stakhov
2010-07-16
1
-263
/
+283
*
* Fix compatibility issues
Vsevolod Stakhov
2010-06-23
1
-2
/
+25
*
* Introduce new logging system:
Vsevolod Stakhov
2009-12-22
1
-3
/
+3
*
* Retab, no functional changes
Vsevolod Stakhov
2009-10-02
1
-512
/
+508
*
* Small fixes in task construction
Vsevolod Stakhov
2009-09-23
1
-1
/
+0
*
* Add decoding entities as it is specified in w3c recommendations
Vsevolod Stakhov
2009-09-16
1
-21
/
+336
*
* Decode all html entitles in html parts
Vsevolod Stakhov
2009-09-16
1
-5
/
+16
*
* Fix html decoding when '/' are encoded too
Vsevolod Stakhov
2009-08-28
1
-2
/
+7
*
* Strip urls from space characters
Vsevolod Stakhov
2009-08-05
1
-1
/
+0
*
* Strip url line from spaces
Vsevolod Stakhov
2009-07-30
1
-0
/
+1
*
* Fix entitles decoding for hex and oct characters
Vsevolod Stakhov
2009-07-28
1
-4
/
+21
*
* Use g_ascii_isalnum for more strict decoding
Vsevolod Stakhov
2009-07-20
1
-3
/
+4
*
* Decode html entitles in urls while extracting urls values from html tags
Vsevolod Stakhov
2009-07-20
1
-0
/
+46
*
* Handle <?xml> tags correctly
Vsevolod Stakhov
2009-07-06
1
-1
/
+1
*
* Check return value from evdns_resolve
Vsevolod Stakhov
2009-07-03
1
-1
/
+1
*
* Add hack to disallow misformed urls
Vsevolod Stakhov
2009-07-03
1
-1
/
+1
*
* If tag attribute value empty do not assume it as url
Vsevolod Stakhov
2009-07-03
1
-0
/
+4
*
* Fix html urls processing
Vsevolod Stakhov
2009-07-03
1
-7
/
+34
*
* Add autolearn config options
Vsevolod Stakhov
2009-07-03
1
-1
/
+1
*
* Extract url encoded urls from html texts
Vsevolod Stakhov
2009-07-03
1
-2
/
+59
*
* Fix issue with <?xml> tag
Vsevolod Stakhov
2009-05-21
1
-1
/
+1
*
* Remove unused debug
Vsevolod Stakhov
2009-05-19
1
-6
/
+0
*
* Add simple html parser and tag balancing detector
Vsevolod Stakhov
2009-05-15
1
-0
/
+319