path: root/src/url.c
Commit message [Author, Age, Files, Lines]
* Add workaround for clang under linux. [Vsevolod Stakhov, 2011-07-21, 1, -2/+4]
    Fix problems found by static analyzing.
* Fix coredumps on some specific messages with specific urls. [Vsevolod Stakhov, 2011-07-19, 1, -7/+13]
    Fix coredumps while closing log file. Fix parsing of chunked HTTP replies.
* * Welcome 0.4.0 [Vsevolod Stakhov, 2011-06-24, 1, -5/+7]
    Uncompatible changes:
    - Statistics is uncompatible in utf8 mode
    Major changes:
    - Improved utf8 mode
    - Convert all characters to lowercase in statistics
    - Skip URL's in statistics
    - Improve speed of bayes classifier by using integer arithmetics
    - Fixed statfiles synchronization that was broken for a long time
    - Synchronization is now configurable
    Minor changes:
    - Bugfixes
    - Removed some of legacy code
    - Types polishing
* * Fixes to fuzzy hashing logic, skip urls while estimating fuzzy hash [Vsevolod Stakhov, 2011-06-23, 1, -15/+26]
    Fix tags stripping. Fix phishing checks (ignore img tags).
* Parse urls with '_' sign in domain name [0.3.13] [Vsevolod Stakhov, 2011-06-03, 1, -1/+1]
* Fix phishing check for special cases like http://host.com and http://www.host.com [Vsevolod Stakhov, 2011-03-14, 1, -9/+9]
* Try to fix memory issues. [Vsevolod Stakhov, 2011-03-02, 1, -9/+9]
* Do not write warning about normal urls. [0.3.7] [Vsevolod Stakhov, 2011-02-25, 1, -1/+1]
* * Rewrite URL storage system [Vsevolod Stakhov, 2011-02-24, 1, -3/+8]
* Fix '_' in emails. [Vsevolod Stakhov, 2011-02-17, 1, -1/+1]
* Fix parsing emails with '_' symbol. [Vsevolod Stakhov, 2011-02-17, 1, -1/+1]
    Fix adding metric symbol.
* Detect mail addresses at the begining of message. [Vsevolod Stakhov, 2011-02-11, 1, -0/+4]
* * Write Emails: header in output [Vsevolod Stakhov, 2011-02-11, 1, -3/+3]
* * New module for checking emails inside messages (rules based, like multimap) [Vsevolod Stakhov, 2011-02-03, 1, -16/+35]
    * Emails now are separated from urls and urls checks
    * Add ability to check text attachements if option is presented in a configuration
    Version is 0.3.6 now
* Fix decoding of escaped URL's [Vsevolod Stakhov, 2010-11-25, 1, -4/+4]
* Performance critical fix [0.3.3] [Vsevolod Stakhov, 2010-11-03, 1, -1/+1]
* Urgent fixes. [Vsevolod Stakhov, 2010-11-03, 1, -1/+5]
* * Add phishing detector (now just compares <a href> with tag's data). [Vsevolod Stakhov, 2010-11-02, 1, -30/+53]
* Fixes bugs found with clang-static analyser. [Vsevolod Stakhov, 2010-10-11, 1, -2/+2]
    Strictly follow c99 standart. Turn on pedantic c99 checks.
* Fixes types (use glib ones) no functional change. [Vsevolod Stakhov, 2010-10-06, 1, -87/+87]
    Now all comments in commit logs beginning with '*' would be included in changelog, so important changes would be separated from small ones.
* * Extract domains from emails to check via surbl [Vsevolod Stakhov, 2010-09-29, 1, -4/+31]
    * Fixes to url extracter
    * Check urls in redirector even if they are listed in surbl whitelist
* * New trie based url scanner (based on libcamel) [Vsevolod Stakhov, 2010-09-21, 1, -141/+363]
    * Small fixes to rspamd perl client
    * Write fuzzy hashes info to log
* * Reopen log file by USR1 signal [Vsevolod Stakhov, 2010-07-19, 1, -1/+6]
    * Add reopenlog method to FreeBSD rc script
* * Fix parsing of some broken urls [Vsevolod Stakhov, 2010-07-05, 1, -1/+1]
* * Fix detection of numeric urls (reported by citrin) [Vsevolod Stakhov, 2010-07-05, 1, -2/+13]
    * Write real time of message's scan to log (not only virtual)
* * Introduce new logging system: [Vsevolod Stakhov, 2009-12-22, 1, -11/+11]
    - independent and customizeable buffering
    - line buffering
    - errors handling support
    - custom (ip based) debug
    - append function name automaticaly (based on __FUNCTION__)
    - add some logic to logs system
* * Retab, no functional changes [Vsevolod Stakhov, 2009-10-02, 1, -243/+227]
* * Handle base10 encoded ip addresses correctly [Vsevolod Stakhov, 2009-08-31, 1, -1/+2]
* * Strip urls from space characters [Vsevolod Stakhov, 2009-08-05, 1, -0/+18]
* * Strip url line from spaces [Vsevolod Stakhov, 2009-07-30, 1, -0/+1]
* * Fix 'cc' urls parsing [Vsevolod Stakhov, 2009-07-16, 1, -1/+1]
    * Add function has_fake_html for determine fake html parts (without any tags)
* * Fix url unescaping [Vsevolod Stakhov, 2009-07-06, 1, -3/+26]
* * Extract url encoded urls from html texts [Vsevolod Stakhov, 2009-07-03, 1, -4/+7]
* * Forgot to merge [Vsevolod Stakhov, 2009-06-02, 1, -2/+2]
|\
| * * Fix url grabbing regexp to match urls like <a href=http://blah.com> [Vsevolod Stakhov, 2009-06-01, 1, -2/+2]
* | * Rework url parsing algorithms [Vsevolod Stakhov, 2009-06-02, 1, -12/+19]
|/  * Adopt all parts of rspamd for new url parser
    * Improve url-extracter utility by avoiding cut&paste of mime parsing
    * Small fixes to rspamc client
    * Bump version to 0.1.3
* * Fix urls extracting, avoid code repeating [Vsevolod Stakhov, 2009-04-17, 1, -78/+30]
* * Fix many compile time warnings and polish code [Vsevolod Stakhov, 2009-03-24, 1, -7/+9]
* * Avoid extra requests to surbls saving them into requests cache [Vsevolod Stakhov, 2009-03-10, 1, -2/+2]
    * Lower debug level for some errors (INFO to DEBUG)
* * Fix 2 memory issues: [Vsevolod Stakhov, 2009-03-10, 1, -0/+10]
    - NULL string when trying to check url regexp
    - double free of message byte array as mime_stream frees memory in array if it thinks that stream is owner of array's memory
* * Add some broken urls to check list, in fact skip only urls without host part and empty urls (thought empty urls are totally impossible in that code) [Vsevolod Stakhov, 2009-03-06, 1, -2/+2]
* * Fix bad memory leaks and memory corruption in url detecting module [Vsevolod Stakhov, 2009-03-06, 1, -10/+51]
* * Fix bug in url parser with freeing memory allocated by memory_pool allocator [Vsevolod Stakhov, 2009-03-04, 1, -4/+4]
    * Do not write to log if we don't do any read operation in dispatcher library
* * Prepare to migrate to cmake (still need to write install target and working with XS implicitly) [Vsevolod Stakhov, 2009-02-16, 1, -12/+2]
    * Move all system includes to one file where we detect availability of all that includes
    * Fix license misprint
    * Fix some issues with perl initializing
* * Add BSD license text [Vsevolod Stakhov, 2009-02-16, 1, -0/+24]
* * Reorganize structure of source files [Vsevolod Stakhov, 2008-11-01, 1, -0/+886]
    * Adopt build system for new structure
    --HG--
    rename : cfg_file.h => src/cfg_file.h
    rename : cfg_file.l => src/cfg_file.l
    rename : cfg_file.y => src/cfg_file.y
    rename : cfg_utils.c => src/cfg_utils.c
    rename : controller.c => src/controller.c
    rename : filter.c => src/filter.c
    rename : filter.h => src/filter.h
    rename : fstring.c => src/fstring.c
    rename : fstring.h => src/fstring.h
    rename : main.c => src/main.c
    rename : main.h => src/main.h
    rename : mem_pool.c => src/mem_pool.c
    rename : mem_pool.h => src/mem_pool.h
    rename : memcached-test.c => src/memcached-test.c
    rename : memcached.c => src/memcached.c
    rename : memcached.h => src/memcached.h
    rename : perl.c => src/perl.c
    rename : perl.h => src/perl.h
    rename : plugins/regexp.c => src/plugins/regexp.c
    rename : plugins/surbl.c => src/plugins/surbl.c
    rename : protocol.c => src/protocol.c
    rename : protocol.h => src/protocol.h
    rename : upstream.c => src/upstream.c
    rename : upstream.h => src/upstream.h
    rename : url.c => src/url.c
    rename : url.h => src/url.h
    rename : util.c => src/util.c
    rename : util.h => src/util.h
    rename : worker.c => src/worker.c