aboutsummaryrefslogtreecommitdiffstats
path: root/src/libserver/html.c
Commit message (Expand)AuthorAgeFilesLines
* [Minor] Move html code to a separate subdir (no functional changes)Vsevolod Stakhov2021-05-201-3423/+0
* [Rework] Use C++ version for unicode normalisationVsevolod Stakhov2021-05-171-1/+1
* [Rework] Use C++ utf8 library with unit tests to trim whitespacesVsevolod Stakhov2021-05-141-37/+3
* [Minor] Strip visible parts of urls using utf rulesVsevolod Stakhov2021-05-141-2/+37
* [Minor] Do not treat unnormalised urls as obscuredVsevolod Stakhov2021-05-141-4/+0
* [Minor] One more fix in the ZW spaces urls processing logicVsevolod Stakhov2021-05-131-9/+2
* [Fix] Fix normalisation flags propagationVsevolod Stakhov2021-05-111-15/+4
* [Rework] Rename phished url to a linked urlVsevolod Stakhov2021-04-191-1/+1
* [Minor] Avoid FP when a protocol prefix is implicitly addedVsevolod Stakhov2021-04-151-2/+2
* [Minor] Propagate images flagVsevolod Stakhov2021-04-141-3/+16
* [Project] Css: Implement styles mergingVsevolod Stakhov2021-03-291-1/+2
* [Project] Css: Enable conditional css parsing support from the HTML parserVsevolod Stakhov2021-03-261-3/+42
* [Minor] Fix urls count tracking logicVsevolod Stakhov2021-03-241-0/+2
* [Fix] Urls: Fix processing of html urls when it comes to the flagsVsevolod Stakhov2021-03-061-3/+8
* [Minor] Try to find some obfuscation attemtpsVsevolod Stakhov2021-03-051-2/+22
* [Minor] Various fixes for display link detectionVsevolod Stakhov2021-03-051-6/+18
* [Minor] HTML: Extract urls from `action` attributeVsevolod Stakhov2021-02-021-1/+6
* [Minor] Temporary workaround (should be fixed properly at some point)Vsevolod Stakhov2021-01-201-1/+2
* [Fix] Html: Attach inline tags to the structureVsevolod Stakhov2021-01-191-3/+11
* [Fix] Html: Do not treat empty tags as block tagsVsevolod Stakhov2021-01-121-1/+1
* [Fix] Do not process links in ignored html tagsVsevolod Stakhov2021-01-061-2/+2
* [Feature] Extract text from img alt attributesVsevolod Stakhov2021-01-061-3/+20
* [Fix] Html: Add entities collisions prevention logic (e.g. for mathml entities)Vsevolod Stakhov2020-10-131-1/+58
* [Minor] Oops, fix crashVsevolod Stakhov2020-07-161-0/+2
* [Fix] Exclude damaged urls from html parserVsevolod Stakhov2020-07-161-1/+1
* [Minor] Add link tag basic processingVsevolod Stakhov2020-07-161-0/+32
* [Minor] Ignore data urlsVsevolod Stakhov2020-07-161-3/+7
* [Minor] Fix data images processing in html linksVsevolod Stakhov2020-07-161-2/+6
* [Minor] Add one more boundary checkVsevolod Stakhov2020-06-081-1/+1
* [Minor] Fix corner case in html escapingVsevolod Stakhov2020-06-031-6/+12
* [Minor] Allow attaching of urls to the mime partsVsevolod Stakhov2020-05-051-9/+27
* [Fix] One more fix to skip images that are not urlsVsevolod Stakhov2020-05-011-7/+10
* Revert "[Minor] Do not append unbalanced closing tags"Vsevolod Stakhov2020-04-301-14/+15
* [Minor] Use more strict checks for image urlsVsevolod Stakhov2020-04-301-4/+10
* [Minor] Do not append unbalanced closing tagsVsevolod Stakhov2020-04-271-15/+14
* [Minor] Oops, forgot to fill struct fieldVsevolod Stakhov2020-03-231-0/+1
* [Rework] Urls: process query urls in HTML urls correctlyVsevolod Stakhov2020-03-221-40/+39
* [Minor] Oops, fix html urls processingVsevolod Stakhov2020-03-121-1/+1
* [Minor] Fix bitset sizeVsevolod Stakhov2020-03-111-1/+1
* [Rework] Urls: adopt html related stuffVsevolod Stakhov2020-03-091-97/+45
* [Rework] Rework URL structure: adjust tld partVsevolod Stakhov2020-03-091-6/+6
* [Rework] Rework URL structure: more structure optimisationsVsevolod Stakhov2020-03-091-2/+2
* [Rework] Rework URL structure: host fieldVsevolod Stakhov2020-03-091-7/+7
* [Fix] Another brain damage html standard adoptionsVsevolod Stakhov2020-03-021-3/+29
* [Fix] Fix parsing of the html tags with no spaces after attributesVsevolod Stakhov2020-03-021-0/+5
* [Minor] Fix stupid email clients entities 'guessing'Vsevolod Stakhov2020-02-181-4/+36
* [CritFix] Fix html entities decodingVsevolod Stakhov2020-02-031-2/+2
* [Fix] Fix white on white rule and add is_leaf flagVsevolod Stakhov2020-01-231-6/+12
* [Fix] More fixes in html tag content calculationsVsevolod Stakhov2020-01-091-7/+40
* [Rework] Rework HTML tags content attachmentVsevolod Stakhov2020-01-061-25/+45