Commit Graph

82 Commits

Author SHA1 Message Date
Vsevolod Stakhov
137ab1cb78 [Feature] Lua_magic: Add a sane CSV heuristic 2021-12-03 14:37:52 +00:00
Vsevolod Stakhov
73280f3dab [Minor] Lua_magic: Add another spammy JS pattern 2021-08-24 13:24:53 +01:00
Vsevolod Stakhov
229cf1c94b [Fix] Add a special logic for text part with no text extraction 2021-08-16 11:24:20 +01:00
Vsevolod Stakhov
ef16ff4ddd [Minor] Lua_magic: Fix detection for the parts with file names 2021-08-16 11:24:20 +01:00
Vsevolod Stakhov
333b82b2bf [Minor] Lua_magic: Slight re tune 2021-08-13 14:01:53 +01:00
Vsevolod Stakhov
872247289f [Minor] Lua_magic: Fix xml detection 2021-08-13 14:00:35 +01:00
Vsevolod Stakhov
e0c16f4cf4 [Minor] Lua_magic: Do not treat xml as html
Suggested by: @citrin
2021-08-13 13:47:08 +01:00
Vsevolod Stakhov
b41417742e [Minor] Escape regexp quantifiers 2021-07-17 09:53:10 +01:00
Vsevolod Stakhov
a9fe8f9048 [Minor] Do not treat ASiC files as pure archives
Issue: #3790
Closes: #3790
2021-06-17 14:01:43 +01:00
Vsevolod Stakhov
8062da9e66 [Minor] Add ASICE format support
Issue: #3790
2021-06-17 13:51:18 +01:00
Vsevolod Stakhov
268bc590b6 [Minor] Another try to fix the logic around 8bit charsets handling 2021-05-17 21:20:22 +01:00
Vsevolod Stakhov
f8e81ccc2c [Minor] Fix non-utf inputs checking 2021-05-17 21:13:53 +01:00
Vsevolod Stakhov
c0e15f8039 [Minor] Try to fix another boundary issue 2021-05-07 12:08:18 +01:00
Vsevolod Stakhov
9d5a725bd6 [Minor] Sigh, another condition fix 2021-05-06 22:42:18 +01:00
Vsevolod Stakhov
24054b0a09 [Minor] Fix check condition 2021-05-06 22:24:00 +01:00
Vsevolod Stakhov
f4d5f9c4e4 [Minor] Lua_magic: Return utf8 check as it is useful for many cases 2021-04-22 14:22:52 +01:00
Vsevolod Stakhov
12e0300930 [Minor] Lua_magic: Really add an exclusion... 2021-04-22 14:13:28 +01:00
Vsevolod Stakhov
adefe59d6f [Minor] Lua_magic: Make an exclusion for message parts 2021-04-22 14:08:47 +01:00
Vsevolod Stakhov
401baa2b86 [Rework] Lua_magic: Try to detect text parts with 8bit characters for non-utf8 encodings 2021-04-22 14:08:11 +01:00
Vsevolod Stakhov
8cc2fca85d [Minor] Lua_magic: Add vcf magic pattern 2021-01-12 15:21:40 +00:00
Vsevolod Stakhov
d5bf9ae28d [Minor] Lua_magic: Minor changes 2021-01-04 16:10:02 +00:00
Andrew Lewis
7679fd1222 [Minor] Magic: try avoid false positive exe detection 2021-01-04 13:41:04 +02:00
Vsevolod Stakhov
2aaf02a82c [Minor] Lua_magic: Use larger span to look for text patterns 2020-12-03 11:51:03 +00:00
Vsevolod Stakhov
f5bebd97db [Minor] Lua_magic: Add ics pattern 2020-11-20 11:24:17 +00:00
Vsevolod Stakhov
f22fd35fc8 [Minor] Lua_magic: Do not treat by default non-text part as text 2020-11-20 11:24:17 +00:00
Andrew Lewis
68badebdac [Minor] rspamd_text:byte() metamethod 2020-11-16 20:13:03 +02:00
Vsevolod Stakhov
7998548c40 [Minor] Lua_magic: RTF should not be treated as text without conversion 2020-11-04 11:35:47 +00:00
Vsevolod Stakhov
63869b7d7b [Minor] Lua_magic: Improve html detection heuristic 2020-10-21 14:26:56 +01:00
Vsevolod Stakhov
f08ffd73e8 [Minor] Lua_magic: Improve html patterns 2020-09-25 10:58:45 +01:00
Vsevolod Stakhov
9ce8ae9803 [Minor] Add alzip magic numbers 2020-09-20 22:55:36 +01:00
Vsevolod Stakhov
56ed3ffdd6 [Minor] Improve exe files detection
Issue: #3481
2020-09-06 23:09:57 +01:00
Anton Yuzhaninov
d8c76f9fcb [Minor] Improve lua_magic HTML content type check
Types which are used for html (xhtml) attachments:
text/html
application/html
application/xhtml+xml

Type which cannot be found in the wild:
text/htm
2020-08-20 10:05:51 +01:00
Vsevolod Stakhov
f722832e3a [Minor] Do not autodetect html parts as text parts 2020-07-16 16:14:22 +01:00
Vsevolod Stakhov
95c95db6d4 [Minor] Add sanity check 2020-06-16 13:52:32 +01:00
Vsevolod Stakhov
afaa9500bb [Minor] Further improve patterns 2020-05-11 17:03:00 +01:00
Vsevolod Stakhov
2c07393bed [Minor] Remove redundant pattern 2020-05-11 17:00:01 +01:00
Vsevolod Stakhov
b56b5fc224 [Minor] Improve pdf magic detection 2020-05-11 16:31:30 +01:00
Vsevolod Stakhov
f5299d51ba [Minor] Add limit for number of files processed in archives 2020-04-28 15:48:57 +01:00
Vsevolod Stakhov
a89cf52149 [Fix] Disable text detection heuristics for encrypted parts
Issue: #3349
2020-04-28 15:38:47 +01:00
Vsevolod Stakhov
d6f2dc6707 [Minor] Lua_magic: Add ical and vcard support 2019-11-21 13:59:50 +00:00
Vsevolod Stakhov
052ccc761e [Feature] Support uuencoding 2019-10-23 17:03:19 +01:00
Vsevolod Stakhov
784746733a [Feature] Lua_scanners: Use lua magic for inclusion/exclusion logic 2019-10-14 16:46:31 +01:00
Vsevolod Stakhov
13a09de8ed [Feature] Multimap: Do not check files in office archives 2019-10-14 14:55:36 +01:00
Vsevolod Stakhov
2568de9762 [Minor] Lua_magic: Fix csv detection 2019-10-07 14:54:34 +01:00
Vsevolod Stakhov
12fdcf7bad [Minor] Fix text parts heuristic 2019-10-07 14:33:50 +01:00
Vsevolod Stakhov
d6ea2092cd [Minor] Be more strict when checking for text parts 2019-10-07 11:45:43 +01:00
Vsevolod Stakhov
82106bb106 [Minor] Lua_magic: Extend dmg pattern 2019-10-05 15:17:08 +01:00
Vsevolod Stakhov
e4dbb877a3 [Minor] Lua_magic: Relax jpeg pattern 2019-09-24 18:42:52 +01:00
Vsevolod Stakhov
415bcce9f4 [Minor] Lua_magic: Add bat files detection 2019-09-24 15:59:58 +01:00
Vsevolod Stakhov
99cc1fcb69 [Project] Lua_magic: Add images patterns 2019-09-24 15:43:04 +01:00