aboutsummaryrefslogtreecommitdiffstats
path: root/src/libserver/url.c
Commit message (Collapse)AuthorAgeFilesLines
...
* [Minor] Do not try to check numeric url in utf contentVsevolod Stakhov2020-09-111-0/+4
|
* [Minor] Allow attaching of urls to the mime partsVsevolod Stakhov2020-05-051-2/+12
|
* [Fix] One more fix to skip images that are not urlsVsevolod Stakhov2020-05-011-1/+8
|
* [Rework] Rework url flags handling APIVsevolod Stakhov2020-04-211-0/+79
|
* [Minor] Remove unused-but-set variablesChristian Göttsche2020-04-111-3/+2
| | | | | | | | | | | | ../src/libserver/url.c: In function ‘rspamd_url_host_set_add’: ../src/libserver/url.c:3808:11: warning: variable ‘k’ set but not used [-Wunused-but-set-variable] 3808 | khiter_t k; | ^ ../src/lua/lua_task.c: In function ‘lua_task_has_urls’: ../src/lua/lua_task.c:2406:11: warning: variable ‘need_emails’ set but not used [-Wunused-but-set-variable] 2406 | gboolean need_emails = FALSE, ret = FALSE; | ^~~~~~~~~~~
* [Rework] URL: Another update for urls extraction logicVsevolod Stakhov2020-03-231-7/+7
| | | | | | | | | URL extraction from HTML parts should look like this: 1. Extract href links 2. Convert HTML to plain text and extract: a) (http|https|ftp)://foo.bar and www.foo b) email like strings \bfoo@bar.baz\b . For all extracted strings check if we have host with a domain from the public suffix.
* [Rework] Urls: process query urls in HTML urls correctlyVsevolod Stakhov2020-03-221-4/+0
|
* [Rework] Urls: Improve query urls handlingVsevolod Stakhov2020-03-221-30/+37
|
* [Minor] Replace g_abort with abortVsevolod Stakhov2020-03-201-1/+1
|
* [Rework] Fix various url extraction issuesVsevolod Stakhov2020-03-191-6/+8
|
* [Rework] Use multiple search tries for different url extraction typesVsevolod Stakhov2020-03-191-45/+139
|
* [Fix] Fix some corner cases in urls parsing, add limitsVsevolod Stakhov2020-03-191-5/+42
|
* [Minor] Firther relax href parsingVsevolod Stakhov2020-03-161-3/+3
|
* [Rework] Urls: update lua librariesVsevolod Stakhov2020-03-091-1/+1
|
* [Rework] Urls: adopt html related stuffVsevolod Stakhov2020-03-091-13/+45
|
* [Rework] Urls: more rework of the urls setsVsevolod Stakhov2020-03-091-107/+84
|
* [Rework] Urls: rework urls hash structureVsevolod Stakhov2020-03-091-6/+47
|
* [Rework] Rework URL structure: adjust tld partVsevolod Stakhov2020-03-091-5/+5
|
* [Rework] Rework URL structure: more structure optimisationsVsevolod Stakhov2020-03-091-45/+69
|
* [Rework] Rework URL structure: host fieldVsevolod Stakhov2020-03-091-34/+44
|
* [Rework] Rework URL structure: user fieldVsevolod Stakhov2020-03-091-11/+12
|
* [Minor] Show long compilation warningVsevolod Stakhov2020-02-271-0/+5
|
* [Minor] Fix some more tld issues and tld testsVsevolod Stakhov2020-02-251-25/+26
|
* [Fix] Fix tld compositon rulesVsevolod Stakhov2020-02-251-4/+3
| | | | Issue: #3248, #3268
* [Minor] Fix boundary detection in emails checksVsevolod Stakhov2020-02-181-6/+7
|
* [Minor] Distinguish max_lua_urls and max_urlsVsevolod Stakhov2020-01-271-2/+2
|
* [Minor] Another try to fix shiftingVsevolod Stakhov2020-01-171-3/+3
|
* [Minor] Another try to fix urls shifting when decodingVsevolod Stakhov2020-01-171-4/+13
|
* [Fix] Fix urls shifting when doing decode to include separatorsVsevolod Stakhov2020-01-151-3/+3
|
* [Fix] Fix urls encode functionVsevolod Stakhov2020-01-131-2/+2
|
* [Minor] Add trailing slash when reconstructing numeric urlVsevolod Stakhov2020-01-111-0/+5
|
* [Minor] Fixes in urls fragments/queries parsingVsevolod Stakhov2020-01-101-0/+16
|
* [Fix] Fix processing of numeric url'sVsevolod Stakhov2020-01-101-28/+61
|
* [Minor] Fix format stringVsevolod Stakhov2019-12-231-2/+2
|
* [Minor] Stop extracting urls on limit reachedVsevolod Stakhov2019-12-231-0/+11
|
* [Minor] Fix urls adding limitVsevolod Stakhov2019-12-231-5/+8
|
* [Minor] Normalise backslashes to slashes in urlsVsevolod Stakhov2019-12-201-0/+15
|
* [Fix] Another fix for numeric urls parsingVsevolod Stakhov2019-10-221-18/+35
|
* [Fix] Add another safe-guard in urls processingVsevolod Stakhov2019-10-091-5/+23
|
* [Fix] Another fix to deal with bad URLsVsevolod Stakhov2019-10-091-12/+30
|
* [CritFix] Fix procesing of urlsVsevolod Stakhov2019-10-091-2/+15
| | | | | | | | Rspamd tried to process urls by end of match in hyperscan disregarding the fact that a particular URL has been already processed. It might cause high memory consumption or crash due to OOM when some particular URLs being processed.
* [Minor] Fix telephones parsingVsevolod Stakhov2019-10-021-4/+7
|
* [Minor] Fix OOB readVsevolod Stakhov2019-09-261-1/+1
|
* [Minor] Fix protocol encodingVsevolod Stakhov2019-09-251-5/+12
|
* [Rework] Drop url tagsVsevolod Stakhov2019-08-211-28/+0
|
* [Minor] Rework utf8 lowercasingVsevolod Stakhov2019-08-131-1/+2
|
* [Minor] Fix more alignment and ubsan issuesVsevolod Stakhov2019-08-121-1/+1
|
* [Minor] Fix some ubsan related issuesVsevolod Stakhov2019-08-121-1/+4
|
* [Project] Libserver migration processVsevolod Stakhov2019-07-121-11/+13
|
* [Minor] Slight types improvementVsevolod Stakhov2019-06-281-1/+1
|