Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | * Introduce new logging system: | Vsevolod Stakhov | 2009-12-22 | 1 | -3/+3 |
| | | | | | | | | | - independent and customizable buffering - line buffering - error handling support - custom (IP-based) debug - append function name automatically (based on __FUNCTION__) - add some logic to the logging system | ||||
* | * Retab, no functional changes | Vsevolod Stakhov | 2009-10-02 | 1 | -512/+508 |
| | |||||
* | * Small fixes in task construction | Vsevolod Stakhov | 2009-09-23 | 1 | -1/+0 |
| | |||||
* | * Add decoding of entities as specified in W3C recommendations | Vsevolod Stakhov | 2009-09-16 | 1 | -21/+336 |
| | |||||
* | * Decode all html entities in html parts | Vsevolod Stakhov | 2009-09-16 | 1 | -5/+16 |
| | |||||
* | * Fix html decoding when '/' characters are encoded too | Vsevolod Stakhov | 2009-08-28 | 1 | -2/+7 |
| | |||||
* | * Strip space characters from urls | Vsevolod Stakhov | 2009-08-05 | 1 | -1/+0 |
| | |||||
* | * Strip spaces from url line | Vsevolod Stakhov | 2009-07-30 | 1 | -0/+1 |
| | |||||
* | * Fix entity decoding for hex and octal characters | Vsevolod Stakhov | 2009-07-28 | 1 | -4/+21 |
| | |||||
* | * Use g_ascii_isalnum for stricter decoding | Vsevolod Stakhov | 2009-07-20 | 1 | -3/+4 |
| | | | | | * Keep undecoded entities undecoded * Fix log message | ||||
* | * Decode html entities in urls while extracting url values from html tags | Vsevolod Stakhov | 2009-07-20 | 1 | -0/+46 |
| | | | | NOTE: works only for ASCII symbols | ||||
* | * Handle <?xml> tags correctly | Vsevolod Stakhov | 2009-07-06 | 1 | -1/+1 |
| | |||||
* | * Check return value from evdns_resolve | Vsevolod Stakhov | 2009-07-03 | 1 | -1/+1 |
| | | | | * Do not parse html parts twice while extracting urls, just parse tag attributes | ||||
* | * Add hack to disallow malformed urls | Vsevolod Stakhov | 2009-07-03 | 1 | -1/+1 |
| | |||||
* | * If tag attribute value is empty, do not treat it as a url | Vsevolod Stakhov | 2009-07-03 | 1 | -0/+4 |
| | |||||
* | * Fix html url processing | Vsevolod Stakhov | 2009-07-03 | 1 | -7/+34 |
| | |||||
* | * Add autolearn config options | Vsevolod Stakhov | 2009-07-03 | 1 | -1/+1 |
| | | | | | * Fix parsing of invalid urls in html parser * Add ability to specify symbols in view parameter as comma-separated list | ||||
* | * Extract url-encoded urls from html texts | Vsevolod Stakhov | 2009-07-03 | 1 | -2/+59 |
| | |||||
* | * Fix issue with <?xml> tag | Vsevolod Stakhov | 2009-05-21 | 1 | -1/+1 |
| | |||||
* | * Remove unused debug | Vsevolod Stakhov | 2009-05-19 | 1 | -6/+0 |
| | |||||
* | * Add simple html parser and tag balancing detector | Vsevolod Stakhov | 2009-05-15 | 1 | -0/+319 |
* Add function for searching html tag |