30 Commits (d8421364646f195743625f24848205f999cdca53)

Author SHA1 Message Date
  Andreas Beeker d842136464 Sonar fixes - "static" base class members should not be accessed via derived types 4 years ago
  Andreas Beeker 2a982f62ec Replace StringBuffer with StringBuilder 4 years ago
  Dominik Stadler f6eab84243 Simplify some string operations 5 years ago
  Andreas Beeker cdb3de9dc0 #62355 - unsplit packages - 3 - only imports 6 years ago
  Dominik Stadler 813fa7edd3 Don't wrap RuntimeException 7 years ago
  Dominik Stadler 2c8f25c864 IntelliJ warnings, Javadoc, Formatting 7 years ago
  Javen O'Neal 2a884279c1 bug 59170: remove deprecated WordExtractor constructor; add version when deprecated to HWPF HeaderStories 8 years ago
  Dominik Stadler b983911836 Eclipse warnings, Generics, unused members in some examples and scratchpad classes. 9 years ago
  Sergey Vladimirov ead63d1e9c simplify API to Word file's part processing, like including page headers / footers into plain text and HTML 12 years ago
  Sergey Vladimirov 49697de696 Add Word-to-Text converter and use it as replacement for WordExtractor 12 years ago
  Sergey Vladimirov 4c724bf71c replace ComplexFileTable with single-element-one right after load; replace text piece table as well 12 years ago
  Sergey Vladimirov 34ed972af9 add content of main text box to document text 13 years ago
  Nick Burch 8cd8659010 Fix support for sections in old word 6 / word 95 files 13 years ago
  Nick Burch 2d9df14178 More Word 6 / Word 95 Support 14 years ago
  Nick Burch 999aecbaa1 Basic text extraction support for old Word 6 and Word 95 documents via some HWPF extensions 14 years ago
  Josh Micich 27b61f396a Fixed compiler warnings - unnecessary throws declaration 14 years ago
  Yegor Kozlov b6dc852992 fixed WordExtractor to avoid ArrayIndexOutOfBoundsException when encountering empty footnote block 15 years ago
  Yegor Kozlov 8e1165ea7e Support for text extraction of footnotes, endnotes and comments in HWPF, see Bugzilla 47400 15 years ago
  Josh Micich 852e34bfea Changed CRLF to LF in scratchpad. Minor fixes for compiler warnings and formatting 15 years ago
  Nick Burch e4ff06ec79 Initial ExtractorFactory support for building TextExtractors for embedded documents 15 years ago
  Nick Burch 3638f76a8a Merged revisions 638786-685284 (non-contiguous ranges; full list omitted) via svnmerge from 15 years ago
  Nick Burch 97e8e39eb6 Add HWPF support for stripping out fields (eg macros), and make this optionally happen always for headers and footers 15 years ago
  Nick Burch 77002f6491 Merged revisions 638786-684370 (non-contiguous ranges; full list omitted) via svnmerge from 15 years ago
  Nick Burch a1f745fee3 Add header/footer support to HWPF WordExtractor 15 years ago
  Nick Burch 7411759977 Get most of the hwpf tests passing again 15 years ago
  Nick Burch d8a63301aa Merged revisions 638786-638802,638805-638811,638813-638814,638816-639230,639233-639241,639243-639253,639255-639486,639488-639601,639603-639835,639837-639917,639919-640056,640058-640710,640712-641156,641158-641184,641186-641795,641797-641798,641800-641933,641935-641963,641965-641966,641968-641995,641997-642230,642232-642562,642564-642565,642568-642570,642572-642573,642576-642736,642739-642877,642879,642881-642890,642892-642903,642905-642945,642947-643624,643626-643653,643655-643669,643671,643673-643830,643832-643833,643835-644342,644344-644472,644474-644508,644510-645347,645349-645351,645353-645559,645561-645565,645568-645951,645953-646193,646195-646313 via svnmerge from 16 years ago
  Nick Burch 40b5fb8af8 Provide a common ole2 implementation of POITextExtractor, which gives access to the document metadata 16 years ago
  Nick Burch 6a72c5656a Implement an Excel text extractor, and put all the existing text extractors under a common superclass, so they're easier to find and use 16 years ago
  Mark Thomas c60210ede4 Add standard licence header 17 years ago
  Nick Burch 036a97a268 If we're created with an InputStream, check to see if it's really RTF before proceeding 18 years ago
  Nick Burch f26c5c8468 Add in a main() method, so people can stop moaning 18 years ago
  Nick Burch 5f0cd6fc76 Add constructors taking the main objects, rather than just input streams 18 years ago
  Nick Burch 8b4718ba90 Do the javadoc comment correctly 18 years ago
  Nick Burch 5bacacee85 Friendly wrapper on HWPF for extracting text from Word Documents 18 years ago