3659 Commits (054f1bc28974979d2de2a6c57c321e042614aea1)
 

Author SHA1 Message Date
  Nick Burch 054f1bc289 Implement the ExtRst part of a UnicodeString 14 years ago
  Nick Burch d7470746e1 If there's no more data but there ought to be for unicode strings, in a partly corrupt file, give a warning and substitute in empty strings 14 years ago
  Nick Burch 9780245c08 Add more documentation, and make the constructor public for testing use 14 years ago
  Nick Burch 398b457631 Improve unhandled continue record message by saying what it follows 14 years ago
  Nick Burch 90c599b4bb Fix a couple of warnings 14 years ago
  Nick Burch 017ac3523a New debugging class, useful for when figuring out how to split on continue records, where continue records lie etc 14 years ago
  Nick Burch 796f95c0b4 Tidy up some of the IntMapper/UnicodeString bits with generics 14 years ago
  Nick Burch 62dc99b25f records.UnicodeString isn't actually a Record, just a common part that exists in many other records. As such, move it to records.common, which is where all the other record components live 14 years ago
  Maxim Valyanskiy a998b7764f bug#48068: RecordFormatException: The content of an excel record cannot exceed 8224 bytes 14 years ago
  Yegor Kozlov 5bb2570155 avoid creating unnecessary temp files, tighten up usage of temp files in unit tests 14 years ago
  Maxim Valyanskiy 2956525db2 revert previous commit 14 years ago
  Maxim Valyanskiy 4e3c970131 ExtractorFactory: save OOXML stream into temporary file before text extraction - this reduces memory usage and allows temporary file cleanup 14 years ago
  Yegor Kozlov 691795a115 Avoid creating temporary files when opening OPC packages from input stream 14 years ago
  Nick Burch a57d32eeb2 Add a command line wrapper around ExtractorFactory, mostly aimed at when debugging rather than production 14 years ago
  Nick Burch 878e9880a5 Remove some generics warnings from the HPSF custom properties and sections parts 14 years ago
  Nick Burch c8f8a1d8a0 Tweak the generics definition on workbook so that method signatures on hssf and xssf won't need to change 14 years ago
  Nick Burch b9621dfb14 Tweak the iterator section to avoid casts as we have generics, and then ditch the hssf duplicate bit (everyone should be pointed at the ss.usermodel version) 14 years ago
  Nick Burch 0779f79cc7 Fix a few generics warnings 14 years ago
  Nick Burch 2a4805f938 Improved how HSMF handles multiple recipients 14 years ago
  Maxim Valyanskiy ababd504b5 add more powerpoint xml mime types 14 years ago
  Nick Burch 63387c5c31 Add PublisherTextExtractor support to ExtractorFactory 14 years ago
  Nick Burch cd5573765c Add status message for recent change from maxcom 14 years ago
  Maxim Valyanskiy b1c8c26708 XSLF: text extraction from tables 14 years ago
  Nick Burch 14b87c4232 Make it possible to return null on missing chunks, rather than the exception 14 years ago
  Nick Burch 9c6aa08e4f Do a manual sync, as per the common skins README, to avoid a build warning 14 years ago
  Nick Burch f37c8f303a Add embeded (attachment) support to the outlook text extractor 14 years ago
  Nick Burch 07551a0925 Rename the outlook extractor to be more consistent with other extractors 14 years ago
  Nick Burch f7ccc5d5f5 Wire up the new HSMFTextExtactor to the ExtractorFactory 14 years ago
  Nick Burch c01272208e Add a text extractor to HSMF for simpler extraction of text from .msg files 14 years ago
  Nick Burch d1b18c3a4d Fix generics warnings 14 years ago
  Nick Burch ed6bc2b09d More work on the recipient related chunks, including a helper method to do best-effort finding of the recipients email address 14 years ago
  Nick Burch 4aedf8cb5e Quick bit of refactoring to save parsing the type and id twice 14 years ago
  Nick Burch 363f50e17a Support fetching the message date from the submission id 14 years ago
  Nick Burch 3a85e0969e Docs tweaks 14 years ago
  Nick Burch 3f56b83377 Tweak a few tests, and add in a few more chunk types 14 years ago
  Nick Burch 4c002588af Complete chunk parser tests, and make more chunk groups available 14 years ago
  Nick Burch 4903ecfca1 Fix some chunk types, fix the directory descent, fix the Msg2txt example, and start on fixing core tests 14 years ago
  Nick Burch 8b89688ddb Shuffle where some of the HSMF tests live to better match package names, and stub out a few more tests 14 years ago
  Nick Burch 41994b053f Start on major HSMF refactoring. Should compile, but not quite all tests pass as a little bit of work is left 14 years ago
  Nick Burch 643a43720b Add a couple more HSMF chunk types, and use Generics in a few places 14 years ago
  Nick Burch 8f24725930 Skip warnings we can't avoid 14 years ago
  Nick Burch 25bd56f569 Initialise the link type of HSSFHyperLink, so that getType() on it works 14 years ago
  Nick Burch dc1c8ec482 More FeatRecord support 14 years ago
  Nick Burch 698c9b1279 Add in a few bits of Generics to avoid warnings 14 years ago
  Nick Burch 9e213d5d78 Add a note on the different shared feature types 14 years ago
  Yegor Kozlov 58fd770335 improved usermodel to prevent data corruption when setting array formulas, mimic Excel behaviour and block changing cells included in multi-cell arrays 14 years ago
  Yegor Kozlov 5c08e67cd0 removed svn:eol-style=native from binary files, uncommented related tests; see Bugzilla 48427 14 years ago
  Nick Burch e4a4574a1e Start testing the contents of FeatRecord 14 years ago
  Nick Burch 9d432275a9 Add Generics types to avoid warnings 14 years ago
  Nick Burch 302bce265a Make InternalSheet capture the feature related records 14 years ago