aboutsummaryrefslogtreecommitdiffstats
path: root/src/html.c
Commit message (Collapse)AuthorAgeFilesLines
* * Introduce new logging system:Vsevolod Stakhov2009-12-221-3/+3
| | | | | | | | | - independent and customizeable buffering - line buffering - errors handling support - custom (ip based) debug - append function name automaticaly (based on __FUNCTION__) - add some logic to logs system
* * Retab, no functional changesVsevolod Stakhov2009-10-021-512/+508
|
* * Small fixes in task constructionVsevolod Stakhov2009-09-231-1/+0
|
* * Add decoding entities as it is specified in w3c recommendationsVsevolod Stakhov2009-09-161-21/+336
|
* * Decode all html entitles in html partsVsevolod Stakhov2009-09-161-5/+16
|
* * Fix html decoding when '/' are encoded tooVsevolod Stakhov2009-08-281-2/+7
|
* * Strip urls from space charactersVsevolod Stakhov2009-08-051-1/+0
|
* * Strip url line from spacesVsevolod Stakhov2009-07-301-0/+1
|
* * Fix entitles decoding for hex and oct charactersVsevolod Stakhov2009-07-281-4/+21
|
* * Use g_ascii_isalnum for more strict decodingVsevolod Stakhov2009-07-201-3/+4
| | | | | * Keep undecoded entitles undecoded * Fix log message
* * Decode html entitles in urls while extracting urls values from html tagsVsevolod Stakhov2009-07-201-0/+46
| | | | NOTE: works only for ascii symbols
* * Handle <?xml> tags correctlyVsevolod Stakhov2009-07-061-1/+1
|
* * Check return value from evdns_resolveVsevolod Stakhov2009-07-031-1/+1
| | | | * Do not parse html parts double time while extracting urls, just parse tags attributes
* * Add hack to disallow misformed urlsVsevolod Stakhov2009-07-031-1/+1
|
* * If tag attribute value empty do not assume it as urlVsevolod Stakhov2009-07-031-0/+4
|
* * Fix html urls processingVsevolod Stakhov2009-07-031-7/+34
|
* * Add autolearn config optionsVsevolod Stakhov2009-07-031-1/+1
| | | | | * Fix parsing of invalid urls in html parser * Add ability to specify symbols in view parameter as comma-separated list
* * Extract url encoded urls from html textsVsevolod Stakhov2009-07-031-2/+59
|
* * Fix issue with <?xml> tagVsevolod Stakhov2009-05-211-1/+1
|
* * Remove unused debugVsevolod Stakhov2009-05-191-6/+0
|
* * Add simple html parser and tag balancing detectorVsevolod Stakhov2009-05-151-0/+319
* Add function for searching html tag