aboutsummaryrefslogtreecommitdiffstats
path: root/src/url.c
Commit message (Collapse)AuthorAgeFilesLines
* Refuse empty emails in urls detection as well.Vsevolod Stakhov2013-11-261-1/+1
|
* Reduce false positive rate in urls detection.Vsevolod Stakhov2013-11-261-4/+11
|
* Do not search plaintext email objects in html parts.Vsevolod Stakhov2013-06-251-1/+1
|
* Remove ambiguity from url parser.Vsevolod Stakhov2013-06-241-3/+0
|
* Fix urls detection for strings with multiply dots.Vsevolod Stakhov2013-06-241-0/+10
|
* Another fix for tld urls.Vsevolod Stakhov2013-06-171-2/+2
|
* Fix url detection at the beginning of a message.Vsevolod Stakhov2013-06-171-0/+4
|
* Improve urls parsing.Vsevolod Stakhov2013-06-171-346/+318
| | | | | | | | For tld matches use more strict constraint to avoid non-url objects detection by a suffix trie. For example, aaa.rum@mail.com could be previously treated as m@mail.com email. This patch tries to resolve this issue.
* Add webui handler for rspamd (skeleton).Vsevolod Stakhov2013-01-051-2/+2
| | | | | | Fix url detector. Add group option for modules options. Some fixes in controller and rrd code.
* Get rid of some false-positive urls detections.Vsevolod Stakhov2012-10-121-1/+1
|
* Another debian license fix.Vsevolod Stakhov2012-09-101-1/+1
| | | | | Add apache license for regexp that were delivered from SpamAssassin project. Fix debian/copyright for src/dns.c.
* Update copyright (required by debian).Vsevolod Stakhov2012-09-041-3/+3
|
* Fix url detector.Vsevolod Stakhov2012-08-221-1/+10
|
* Do not try to detect tld urls inside HTML texts as it generates too much ↵Vsevolod Stakhov2012-08-211-289/+295
| | | | | | false positive matches. Add some prototypes for lua.
* Add detection of simple urls with tld detection algorithm.Vsevolod Stakhov2012-08-061-13/+333
|
* Use DB_HASH access method for bdb backend.Vsevolod Stakhov2012-03-011-1/+1
| | | | Fix signed and unsigned comparasion while I'm here.
* Make compiler happy in several cases, remove warnings.Vsevolod Stakhov2011-12-201-1/+2
|
* Ignore something that looks like email but has no username.Vsevolod Stakhov2011-12-011-2/+4
|
* * Support for IDN urlsVsevolod Stakhov2011-08-231-4/+4
| | | | * DNS resolver now can handle international domains correctly
* Fix signness in arithmetic operations.Vsevolod Stakhov2011-08-041-4/+4
|
* Add workaround for clang under linux.Vsevolod Stakhov2011-07-211-2/+4
| | | | Fix problems found by static analyzing.
* Fix coredumps on some specific messages with specific urls.Vsevolod Stakhov2011-07-191-7/+13
| | | | | Fix coredumps while closing log file. Fix parsing of chunked HTTP replies.
* * Welcome 0.4.0Vsevolod Stakhov2011-06-241-5/+7
| | | | | | | | | | | | | | | | | | Uncompatible changes: - Statistics is uncompatible in utf8 mode Major changes: - Improved utf8 mode - Convert all characters to lowercase in statistics - Skip URL's in statistics - Improve speed of bayes classifier by using integer arithmetics - Fixed statfiles synchronization that was broken for a long time - Synchronization is now configurable Minor changes: - Bugfixes - Removed some of legacy code - Types polishing
* * Fixes to fuzzy hashing logic, skip urls while estimating fuzzy hashVsevolod Stakhov2011-06-231-15/+26
| | | | | Fix tags stripping. Fix phishing checks (ignore img tags).
* Parse urls with '_' sign in domain name0.3.13Vsevolod Stakhov2011-06-031-1/+1
|
* Fix phishing check for special cases like http://host.com and ↵Vsevolod Stakhov2011-03-141-9/+9
| | | | http://www.host.com
* Try to fix memory issues.Vsevolod Stakhov2011-03-021-9/+9
|
* Do not write warning about normal urls.0.3.7Vsevolod Stakhov2011-02-251-1/+1
|
* * Rewrite URL storage systemVsevolod Stakhov2011-02-241-3/+8
|
* Fix '_' in emails.Vsevolod Stakhov2011-02-171-1/+1
|
* Fix parsing emails with '_' symbol.Vsevolod Stakhov2011-02-171-1/+1
| | | | Fix adding metric symbol.
* Detect mail addresses at the begining of message.Vsevolod Stakhov2011-02-111-0/+4
|
* * Write Emails: header in outputVsevolod Stakhov2011-02-111-3/+3
|
* * New module for checking emails inside messages (rules based, like multimap)Vsevolod Stakhov2011-02-031-16/+35
| | | | | | | * Emails now are separated from urls and urls checks * Add ability to check text attachements if option is presented in a configuration Version is 0.3.6 now
* Fix decoding of escaped URL'sVsevolod Stakhov2010-11-251-4/+4
|
* Performance critical fix0.3.3Vsevolod Stakhov2010-11-031-1/+1
|
* Urgent fixes.Vsevolod Stakhov2010-11-031-1/+5
|
* * Add phishing detector (now just compares <a href> with tag's data).Vsevolod Stakhov2010-11-021-30/+53
|
* Fixes bugs found with clang-static analyser.Vsevolod Stakhov2010-10-111-2/+2
| | | | | Strictly follow c99 standart. Turn on pedantic c99 checks.
* Fixes types (use glib ones) no functional change.Vsevolod Stakhov2010-10-061-87/+87
| | | | | Now all comments in commit logs beginning with '*' would be included in changelog, so important changes would be separated from small ones.
* * Extract domains from emails to check via surblVsevolod Stakhov2010-09-291-4/+31
| | | | | * Fixes to url extracter * Check urls in redirector even if they are listed in surbl whitelist
* * New trie based url scanner (based on libcamel)Vsevolod Stakhov2010-09-211-141/+363
| | | | | * Small fixes to rspamd perl client * Write fuzzy hashes info to log
* * Reopen log file by USR1 signalVsevolod Stakhov2010-07-191-1/+6
| | | | * Add reopenlog method to FreeBSD rc script
* * Fix parsing of some broken urlsVsevolod Stakhov2010-07-051-1/+1
|
* * Fix detection of numeric urls (reported by citrin)Vsevolod Stakhov2010-07-051-2/+13
| | | | * Write real time of message's scan to log (not only virtual)
* * Introduce new logging system:Vsevolod Stakhov2009-12-221-11/+11
| | | | | | | | | - independent and customizeable buffering - line buffering - errors handling support - custom (ip based) debug - append function name automaticaly (based on __FUNCTION__) - add some logic to logs system
* * Retab, no functional changesVsevolod Stakhov2009-10-021-243/+227
|
* * Handle base10 encoded ip addresses correctlyVsevolod Stakhov2009-08-311-1/+2
|
* * Strip urls from space charactersVsevolod Stakhov2009-08-051-0/+18
|
* * Strip url line from spacesVsevolod Stakhov2009-07-301-0/+1
|