path: root/src/url.c
Commit message [Author, Age, Files, Lines]
* Use DB_HASH access method for bdb backend. [Vsevolod Stakhov, 2012-03-01, 1 file, -1/+1]
|   Fix signed and unsigned comparison while I'm here.
* Make compiler happy in several cases, remove warnings. [Vsevolod Stakhov, 2011-12-20, 1 file, -1/+2]
|
* Ignore something that looks like email but has no username. [Vsevolod Stakhov, 2011-12-01, 1 file, -2/+4]
|
* * Support for IDN urls [Vsevolod Stakhov, 2011-08-23, 1 file, -4/+4]
|   * DNS resolver now can handle international domains correctly
* Fix signedness in arithmetic operations. [Vsevolod Stakhov, 2011-08-04, 1 file, -4/+4]
|
* Add workaround for clang under linux. [Vsevolod Stakhov, 2011-07-21, 1 file, -2/+4]
|   Fix problems found by static analysis.
* Fix coredumps on some specific messages with specific urls. [Vsevolod Stakhov, 2011-07-19, 1 file, -7/+13]
|   Fix coredumps while closing log file.
|   Fix parsing of chunked HTTP replies.
* * Welcome 0.4.0 [Vsevolod Stakhov, 2011-06-24, 1 file, -5/+7]
|   Incompatible changes:
|   - Statistics is incompatible in utf8 mode
|   Major changes:
|   - Improved utf8 mode
|   - Convert all characters to lowercase in statistics
|   - Skip URLs in statistics
|   - Improve speed of bayes classifier by using integer arithmetic
|   - Fixed statfiles synchronization that was broken for a long time
|   - Synchronization is now configurable
|   Minor changes:
|   - Bugfixes
|   - Removed some of legacy code
|   - Types polishing
* * Fixes to fuzzy hashing logic, skip urls while estimating fuzzy hash [Vsevolod Stakhov, 2011-06-23, 1 file, -15/+26]
|   Fix tags stripping.
|   Fix phishing checks (ignore img tags).
* Parse urls with '_' sign in domain name (0.3.13) [Vsevolod Stakhov, 2011-06-03, 1 file, -1/+1]
|
* Fix phishing check for special cases like http://host.com and http://www.host.com [Vsevolod Stakhov, 2011-03-14, 1 file, -9/+9]
* Try to fix memory issues. [Vsevolod Stakhov, 2011-03-02, 1 file, -9/+9]
|
* Do not write warning about normal urls. (0.3.7) [Vsevolod Stakhov, 2011-02-25, 1 file, -1/+1]
|
* * Rewrite URL storage system [Vsevolod Stakhov, 2011-02-24, 1 file, -3/+8]
|
* Fix '_' in emails. [Vsevolod Stakhov, 2011-02-17, 1 file, -1/+1]
|
* Fix parsing emails with '_' symbol. [Vsevolod Stakhov, 2011-02-17, 1 file, -1/+1]
|   Fix adding metric symbol.
* Detect mail addresses at the beginning of message. [Vsevolod Stakhov, 2011-02-11, 1 file, -0/+4]
|
* * Write Emails: header in output [Vsevolod Stakhov, 2011-02-11, 1 file, -3/+3]
|
* * New module for checking emails inside messages (rules based, like multimap) [Vsevolod Stakhov, 2011-02-03, 1 file, -16/+35]
|   * Emails now are separated from urls and urls checks
|   * Add ability to check text attachments if the option is present in the configuration
|   Version is 0.3.6 now
* Fix decoding of escaped URLs [Vsevolod Stakhov, 2010-11-25, 1 file, -4/+4]
|
* Performance critical fix (0.3.3) [Vsevolod Stakhov, 2010-11-03, 1 file, -1/+1]
|
* Urgent fixes. [Vsevolod Stakhov, 2010-11-03, 1 file, -1/+5]
|
* * Add phishing detector (now just compares <a href> with tag's data). [Vsevolod Stakhov, 2010-11-02, 1 file, -30/+53]
|
* Fixes bugs found with clang-static analyser. [Vsevolod Stakhov, 2010-10-11, 1 file, -2/+2]
|   Strictly follow c99 standard.
|   Turn on pedantic c99 checks.
* Fixes types (use glib ones) no functional change. [Vsevolod Stakhov, 2010-10-06, 1 file, -87/+87]
|   Now all comments in commit logs beginning with '*' would be included in changelog,
|   so important changes would be separated from small ones.
* * Extract domains from emails to check via surbl [Vsevolod Stakhov, 2010-09-29, 1 file, -4/+31]
|   * Fixes to url extracter
|   * Check urls in redirector even if they are listed in surbl whitelist
* * New trie based url scanner (based on libcamel) [Vsevolod Stakhov, 2010-09-21, 1 file, -141/+363]
|   * Small fixes to rspamd perl client
|   * Write fuzzy hashes info to log
* * Reopen log file by USR1 signal [Vsevolod Stakhov, 2010-07-19, 1 file, -1/+6]
|   * Add reopenlog method to FreeBSD rc script
* * Fix parsing of some broken urls [Vsevolod Stakhov, 2010-07-05, 1 file, -1/+1]
|
* * Fix detection of numeric urls (reported by citrin) [Vsevolod Stakhov, 2010-07-05, 1 file, -2/+13]
|   * Write real time of message's scan to log (not only virtual)
* * Introduce new logging system: [Vsevolod Stakhov, 2009-12-22, 1 file, -11/+11]
|   - independent and customizable buffering
|   - line buffering
|   - errors handling support
|   - custom (ip based) debug
|   - append function name automatically (based on __FUNCTION__)
|   - add some logic to logs system
* * Retab, no functional changes [Vsevolod Stakhov, 2009-10-02, 1 file, -243/+227]
|
* * Handle base10 encoded ip addresses correctly [Vsevolod Stakhov, 2009-08-31, 1 file, -1/+2]
|
* * Strip urls from space characters [Vsevolod Stakhov, 2009-08-05, 1 file, -0/+18]
|
* * Strip url line from spaces [Vsevolod Stakhov, 2009-07-30, 1 file, -0/+1]
|
* * Fix 'cc' urls parsing [Vsevolod Stakhov, 2009-07-16, 1 file, -1/+1]
|   * Add function has_fake_html to detect fake html parts (without any tags)
* * Fix url unescaping [Vsevolod Stakhov, 2009-07-06, 1 file, -3/+26]
|
* * Extract url encoded urls from html texts [Vsevolod Stakhov, 2009-07-03, 1 file, -4/+7]
|
* * Forgot to merge [Vsevolod Stakhov, 2009-06-02, 1 file, -2/+2]
|\
| * * Fix url grabbing regexp to match urls like <a href=http://blah.com> [Vsevolod Stakhov, 2009-06-01, 1 file, -2/+2]
| |
* | * Rework url parsing algorithms [Vsevolod Stakhov, 2009-06-02, 1 file, -12/+19]
|/    * Adopt all parts of rspamd for new url parser
|     * Improve url-extracter utility by avoiding cut&paste of mime parsing
|     * Small fixes to rspamc client
|     * Bump version to 0.1.3
* * Fix urls extracting, avoid code repeating [Vsevolod Stakhov, 2009-04-17, 1 file, -78/+30]
|
* * Fix many compile time warnings and polish code [Vsevolod Stakhov, 2009-03-24, 1 file, -7/+9]
|
* * Avoid extra requests to surbls saving them into requests cache [Vsevolod Stakhov, 2009-03-10, 1 file, -2/+2]
|   * Lower debug level for some errors (INFO to DEBUG)
* * Fix 2 memory issues: [Vsevolod Stakhov, 2009-03-10, 1 file, -0/+10]
|   - NULL string when trying to check url regexp
|   - double free of message byte array as mime_stream frees memory in array
|     if it thinks that stream is owner of array's memory
* * Add some broken urls to check list, in fact skip only urls without host part and empty urls (though empty urls are totally impossible in that code) [Vsevolod Stakhov, 2009-03-06, 1 file, -2/+2]
* * Fix bad memory leaks and memory corruption in url detecting module [Vsevolod Stakhov, 2009-03-06, 1 file, -10/+51]
|
* * Fix bug in url parser with freeing memory allocated by memory_pool allocator [Vsevolod Stakhov, 2009-03-04, 1 file, -4/+4]
|   * Do not write to log if we don't do any read operation in dispatcher library
* * Prepare to migrate to cmake (still need to write install target and working with XS implicitly) [Vsevolod Stakhov, 2009-02-16, 1 file, -12/+2]
|   * Move all system includes to one file where we detect availability of all that includes
|   * Fix license misprint
|   * Fix some issues with perl initializing
* * Add BSD license text [Vsevolod Stakhov, 2009-02-16, 1 file, -0/+24]
|