Andreas Beeker
47a71f0396
Reverted DocumentSummary hack in codepage guessing code
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1793601 13f79535-47bb-0310-9956-ffa450edef68
7 years ago
Tim Allison
77a0bb753f
bug 50955 - add logging per Javen's recommendation. Thank you!
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1791021 13f79535-47bb-0310-9956-ffa450edef68
7 years ago
Tim Allison
4657756a12
bug 50955 - incorporate info from the DocumentSummaryInformation for
guessing the encoding. Back off to the old method if DocSummInfo is
not available. Thanks to Andreas Beeker for recommending this direction.
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1791002 13f79535-47bb-0310-9956-ffa450edef68
7 years ago
Tim Allison
97c5c54eb7
bug 50955 - try originally guessed codepage, back off to 1252 if that fails
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790904 13f79535-47bb-0310-9956-ffa450edef68
7 years ago
Nick Burch
fe53bc134e
Javadoc fixes
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790239 13f79535-47bb-0310-9956-ffa450edef68
7 years ago
Tim Allison
55ed734108
bug 50955 and bug 60953 improve Big5 reader; ensure one character
per byte pair
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790172 13f79535-47bb-0310-9956-ffa450edef68
7 years ago
Tim Allison
6fe3b75bfd
bug 50955 -- word 6.0 charset fix
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1790061 13f79535-47bb-0310-9956-ffa450edef68
7 years ago
Nick Burch
efb790ef95
#57919 Add in-place and new-File write methods to POIDocument
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1753619 13f79535-47bb-0310-9956-ffa450edef68
8 years ago
Javen O'Neal
0796bce0fe
bug 59170: remove deprecated HSLF and HWPF methods
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1748783 13f79535-47bb-0310-9956-ffa450edef68
8 years ago
Sergey Vladimirov
0ccd78dc30
different workarounds for old Word format
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1195133 13f79535-47bb-0310-9956-ffa450edef68
12 years ago
Sergey Vladimirov
b29c0af97c
extract FibBase from FIB
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1178019 13f79535-47bb-0310-9956-ffa450edef68
12 years ago
Sergey Vladimirov
49697de696
Add Word-to-Text converter and use it as replacement for WordExtractor
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1155336 13f79535-47bb-0310-9956-ffa450edef68
13 years ago
Sergey Vladimirov
d31694057d
replace StringBuffer with StringBuilder in TextPiece
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1155208 13f79535-47bb-0310-9956-ffa450edef68
13 years ago
Sergey Vladimirov
4c724bf71c
replace ComplexFileTable with single-element-one right after load; replace text piece table as well
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1150675 13f79535-47bb-0310-9956-ffa450edef68
13 years ago
Sergey Vladimirov
9684b7f1b7
add getOverallRange() method definition to HWPFDocumentCore
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1143709 13f79535-47bb-0310-9956-ffa450edef68
13 years ago
Nick Burch
f22e831a0f
Since a DirectoryNode has a reference to the underlying POIFSFileSystem, tidy up the POIDocument constructor to not need both passing in
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@1053521 13f79535-47bb-0310-9956-ffa450edef68
13 years ago
Maxim Valyanskiy
36d2b0d346
HWPF: Improve reading of auto-saved ("complex") documents
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960587 13f79535-47bb-0310-9956-ffa450edef68
14 years ago
Nick Burch
2d9df14178
More Word 6 / Word 95 Support
HWPFOldDocument now processes a few more table sections, and so we can fake up some
basic Ranges. This allows us to do paragraph level text extraction
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@960102 13f79535-47bb-0310-9956-ffa450edef68
14 years ago
Nick Burch
999aecbaa1
Basic text extraction support for old Word 6 and Word 95 documents via some HWPF extensions
git-svn-id: https://svn.apache.org/repos/asf/poi/trunk@959346 13f79535-47bb-0310-9956-ffa450edef68
14 years ago