From: Glen Stampoultzis Date: Thu, 1 May 2003 23:35:19 +0000 (+0000) Subject: These should have been removed during the merge. X-Git-Tag: REL_2_0_PRE1~25 X-Git-Url: https://source.dussan.org/?a=commitdiff_plain;h=c2da5fec8e10f6a4b104c4cf4d1b3f028c368e90;p=poi.git These should have been removed during the merge. git-svn-id: https://svn.apache.org/repos/asf/jakarta/poi/trunk@353075 13f79535-47bb-0310-9956-ffa450edef68 --- diff --git a/src/documentation/xdocs/3rdparty.xml b/src/documentation/xdocs/3rdparty.xml deleted file mode 100644 index dac1516edd..0000000000 --- a/src/documentation/xdocs/3rdparty.xml +++ /dev/null @@ -1,67 +0,0 @@ - - - -
- Third Party Contributions - - - -
- - - -
-

- See How to contribute to Poi. -

- -
- -
-

- These are not necessarily deemed to be high enough quality to be included in the - core distribution, but they have been tested under - several key environments, they are provided under the same license - as Poi, and they are included in the POI distribution under the - contrib/ directory. -

- -

- None as yet! - although you can expect that some of the links - listed below will eventually migrate to the "contributed components" level, and - then maybe even into the main distribution. -

-
- -
-

Submissions of modifications - to Poi which are awaiting review. Anyone can - comment on them on the dev mailing list - code reviewers are needed! - Use these at your own risk - although Poi has no guarantee - either, these patches have not been reviewed, let alone accepted. -

-
- -
-

The other extensions listed here are not endorsed by the Poi - project either - they are provided as a convenience only. They may or may not work, - they may or may not be open source, etc. -

- -

To have a link added to this table, see How to contribute - to POI.

- - - - - - - - - - -
Name and LinkTypeDescriptionStatusLicensingContact
- -
- -
diff --git a/src/documentation/xdocs/book.xml b/src/documentation/xdocs/book.xml deleted file mode 100644 index d027788edf..0000000000 --- a/src/documentation/xdocs/book.xml +++ /dev/null @@ -1,62 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/casestudies.xml b/src/documentation/xdocs/casestudies.xml deleted file mode 100644 index a1c228dd2e..0000000000 --- a/src/documentation/xdocs/casestudies.xml +++ /dev/null @@ -1,99 +0,0 @@ - - - - -
- Jakarta POI - Case Studies - - - - -
- - -
-

- A number of people are using POI for a variety of purposes. As with - any new API or technology, the first question people generally ask - is not "how can I" but rather "Who else is doing what I'm about to - do?" This is understandable with the abysmal success rate in the - software business. These case statements are meant to help create - confidence and understanding. -

-
-
-

- We are actively seeking case studies for this page (after all it - just started). Andy Oliver (acoliver at apache dot org) has - agreed to have a few T-Shirts printed with the POI logo (once its - chosen) for the first - few best submissions. To submit a case study, either - - submit a patch for this page (preferred) or email it to the - mailing list - . -

-
-
-
-

- The Bank of Lithuania - reports financial statistical data to Excel format using the - Jakarta POI - project's - HSSF API. The system is based on Oracle JServer and - utilizes a Java stored procedure that outputs to XLS format - using the HSSF API. - Arian Lashkov (alaskov at lbank.lt) -

-
-
-

-Bit Tracker (http://www.bittracker.com) is the world's first and only web-based drill bit tracking system to manage your company's critical bit information and use that data to its full potential. It manages all bit related data, including their usage, locations, how they were used, and results such as rate of penetration and dull grade after use. This data needs to be available in Excel format for backwards compatibility and other uses in the industry. After using CSV and HTML formats, we needed something better for creating the spreadsheets and POI is the answer. It works great and was easy to implement. Kudos to the POI team. -

-

- Travis Reeder (travis at thinkvirtual dot com) -

-
-
-

- Edwards and Kelcey Technology (http://www.ekcorp.com/) developed a - Facility - Managament and Maintenance System for the Telecommunications industry - based - on Turbine and Velocity. Originally the invoicing was done with a simple - CVS - sheet which was then marked up by accounts and customized for each client. - As growth has been consistent with the application, the requirement for - invoices that need not be touched by hand increased. POI provided the - solution to this issue, integrating easily and transparently into the - system. POI HSSF was used to create the invoices directly from the server - in - Excel 97 format and now services over 150 unique invoices per month. -

-

- Cameron Riley (crileyNO@ SPAMekmail.com) -

-
-
-

- ClickFind Inc. used the POI - projects HSSF API to provide their medical - research clients with an Excel export from their electronic data - collection web service Data Collector 3.0. The POI team's assistance - allowed ClickFind to give their clients a data format that requires less - technical expertise than the XML format used by the Data Collector - application. This was important to ClickFind as many of their current - and potential clients are already using Excel in their day-to-day - operations and in established procedures for handling their generated - clinical data. - Jared Walker (jared.walker at clickfind.com) -

-
-
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/changes.xml b/src/documentation/xdocs/changes.xml deleted file mode 100644 index 10fc771a3d..0000000000 --- a/src/documentation/xdocs/changes.xml +++ /dev/null @@ -1,107 +0,0 @@ - - - - - - - - - - - - - - - Support for zoom level - Freeze and split pane support - Row and column headers on printouts - - - Custom Data Format Support - Enhanced Unicode Support for Russian and Japanese - Enhanced formula support including read-only for - "optimized if" statements. - Support for cloning objects - Fixes for header/footer - Spanish Documentation translations - Support for preserving VBA macros - - - Removed runtime dependency on commons logging. - Formula support - - - Removed depedency on commons logging. Now define poi.logging system property to enable logging to standard out. - Fixed SST string handling so that spreadsheets with rich text or extended text will be read correctly. - - - New project build. - New project documentation system based on Cocoon. - Package rename - Various bug fixes - Early stages of HSF development (not ready for development) - Initial low level record support for charting (not complete) - - - Created new event model - Optimizations made to HSSF including aggregate records for - values, rows, etc. - predictive sizing, offset based writing (instead of lots of - array copies) - minor re-factoring and bug fixes. - - - Minor documentation updates. - - - Added DataFormat helper class and exposed set and get format - on HSSFCellStyle - Fixed column width apis (unit wise) and various javadoc on - the subject - Fix for Dimensions record (again)... (one of these days I'll - write a unit test for this ;-p). - Some optimization on sheet creation. - - - - - - Added MulBlank, Blank, ColInfo - Added log4j facility and removed all sys.out type logging - Added support for adding font's, styles and corresponding - high level api for styling cells - added support for changing row height, cell width and default - row height/cell width. - Added fixes for internationalization (UTF-16 should work now - from HSSFCell.setStringValue, etc when the encoding is set) - added support for adding/removing and naming sheets. - - - Bugfix release. We were throwing an exception when reading - RKRecord objects. - - - Got continuation records to work (read/write) - Added various pre-support for formulas - Massive API reorganization, repackaging. - BiffViewer class added for validating HSSF & POI and/or - HSSF Output. - Better API support for modification. - - - Added encoding flag to high and low level api to use utf-16 - when needed (HSSFCell.setEncoding()) - added read only support for Label records (which are - reinterpreted as LabelSST when written) - Broken continuation record implementation (oops) - BiffViewer class added for validating HSSF & POI and/or - HSSF Output. - - - Support for read/write and modify. - Read only support for MulRK records (converted to Number when - writing) - - - - diff --git a/src/documentation/xdocs/dtd/ISOdia.pen b/src/documentation/xdocs/dtd/ISOdia.pen deleted file mode 100644 index df5c05d2ed..0000000000 --- a/src/documentation/xdocs/dtd/ISOdia.pen +++ /dev/null @@ -1,34 +0,0 @@ - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/dtd/ISOgrk1.pen b/src/documentation/xdocs/dtd/ISOgrk1.pen deleted file mode 100644 index cdab380fc3..0000000000 --- a/src/documentation/xdocs/dtd/ISOgrk1.pen +++ /dev/null @@ -1,74 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/dtd/ISOlat1.pen b/src/documentation/xdocs/dtd/ISOlat1.pen deleted file mode 100644 index aff4e217d4..0000000000 --- a/src/documentation/xdocs/dtd/ISOlat1.pen +++ /dev/null @@ -1,79 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/dtd/ISOnum.pen b/src/documentation/xdocs/dtd/ISOnum.pen deleted file mode 100644 index 02605bfb58..0000000000 --- a/src/documentation/xdocs/dtd/ISOnum.pen +++ /dev/null @@ -1,109 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - -" > - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/dtd/ISOpub.pen b/src/documentation/xdocs/dtd/ISOpub.pen deleted file mode 100644 index 04a5732b45..0000000000 --- a/src/documentation/xdocs/dtd/ISOpub.pen +++ /dev/null @@ -1,110 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/dtd/ISOtech.pen b/src/documentation/xdocs/dtd/ISOtech.pen deleted file mode 100644 index b3b39f9f1b..0000000000 --- a/src/documentation/xdocs/dtd/ISOtech.pen +++ /dev/null @@ -1,85 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/faq.xml b/src/documentation/xdocs/faq.xml deleted file mode 100644 index a702b247be..0000000000 --- a/src/documentation/xdocs/faq.xml +++ /dev/null @@ -1,286 +0,0 @@ - - - - - - - Why is reading a simple sheet taking so long? - - - You've probably enabled logging. Logging is intended only for - autopsie style debugging. Having it enabled will reduce performance - by a factor of at least 100. Logging is helpful for understanding - why POI can't read some file or developing POI itself. Important - errors are thrown as exceptions, which means you probably don't need - logging. - - - - - What is the HSSF "eventmodel"? - - - The HSSF eventmodel package is a new API for reading XLS files more efficiently. It does - require more knowledge on the part of the user, but reduces memory consumption by more than - tenfold. It is based on the AWT event model in combination with SAX. If you need read-only - access to a given XLS file, this is the best way to do it. - - - - - - Why can't read the document I created using Star Office 5.1? - - - Star Office 5.1 writes some records using the older BIFF standard. This causes some problems - with POI which supports only BIFF8. - - - - - Why am I getting an exception each time I attempt to read my spreadsheet? - - - It's possible your spreadsheet contains a feature that is not currently supported by HSSF. - For example - spreadsheets containing cells with rich text are not currently supported. - - - - - Does HSSF support protected spreadsheets? - - - Protecting a spreadsheet encrypts it. We wont touch encryption because we're not legally educated - and don't understand the full implications of trying to implement this. If you wish to have a go - at this feel free to add it as a plugin module. We wont be hosting it here however. - - - - - How do you tell if a field contains a date with HSSF? - - - Excel stores dates as numbers therefore the only way to determine if a cell is - actually stored as a date is to look at the formatting. There is a helper method - in HSSFDateUtil (since the 1.7.0-dev release) that checks for this. - Thanks to Jason Hoffman for providing the solution. - - - case HSSFCell.CELL_TYPE_NUMERIC: - double d = cell.getNumericCellValue(); - // test if a date! - if (HSSFDateUtil.isCellDateFormatted(cell)) { - // format in form of M/D/YY - cal.setTime(HSSFDateUtil.getJavaDate(d)); - cellText = - (String.valueOf(cal.get(Calendar.YEAR))).substring(2); - cellText = cal.get(Calendar.MONTH)+1 + "/" + - cal.get(Calendar.DAY_OF_MONTH) + "/" + - cellText; - } - - - - - I'm trying to stream an XLS file from a servlet and I'm having some trouble. What's the problem? - - -

- The problem usually manifests itself as the junk characters being shown on - screen. The problem persists even though you have set the correct mime type. -

-

- The short answer is, don't depend on IE to display a binary file type properly if you stream it via a - servlet. Every minor version of IE has different bugs on this issue. -

-

- The problem in most versions of IE is that it does not use the mime type on - the HTTP response to determine the file type; rather it uses the file extension - on the request. Thus you might want to add a - .xls to your request - string. For example - http://yourserver.com/myServelet.xls?param1=xx. This is - easily accomplished through URL mapping in any servlet container. Sometimes - a request like - http://yourserver.com/myServelet?param1=xx&dummy=file.xls is also - known to work. - -

-

- To guarantee opening the file properly in Excel from IE, write out your file to a - temporary file under your web root from your servelet. Then send an http response - to the browser to do a client side redirection to your temp file. (Note that using a - server side redirect using RequestDispatcher will not be effective in this case) -

-

- Note also that when you request a document that is opened with an - external handler, IE sometimes makes two requests to the webserver. So if your - generating process is heavy, it makes sense to write out to a temporary file, so that multiple - requests happen for a static file. -

-

- None of this is particular to Excel. The same problem arises when you try to - generate any binary file dynamically to an IE client. For example, if you generate - pdf files using - FOP, you will come across many of the same issues. - -

- -
-
- - - I want to set a cell format (Data format of a cell) of a excel sheet as ###,###,###.#### or ###,###,###.0000. Is it possible using POI ? - - -

- Yes. You first need to get a HSSFDataFormat object from the workbook and call getFormat with the desired format. Some examples are here. -

-
-
- - - I want to set a cell format (Data format of a cell) of a excel sheet as text. Is it possible using POI ? - - -

- Yes. This is a built-in format for excel that you can get from HSSFDataFormat object using the format string "@". Also, the string "text" will alias this format. -

-
-
- - - How do I add a border around a merged cell? - - - Add blank cells around where the cells normally would have been and set the borders individually for each cell. - We will probably enhance HSSF in the future to make this process easier. - - - - - I tried to set cell values and Excel sheet name on my native language, - but I failed to do it. :( - - - By default HSSF uses cell values and sheet names as compressed unicode, - so to support localization you should use Unicode. - To do it you should set it manually: - - // for sheet name - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet s = wb.createSheet(); - wb.setSheetName( 0, "SomeUnicodeName", HSSFWorkbook.ENCODING_UTF_16 ); - - // for cell value - HSSFRow r = s.createRow( 0 ); - HSSFCell c = r.createCell( (short)0 ); - c.setCellType( HSSFCell.CELL_TYPE_STRING ); - c.setEncoding( HSSFCell.ENCODING_UTF_16 ); - c.setCellValue( "\u0422\u0435\u0441\u0442\u043E\u0432\u0430\u044F" ); - Make sure you make the call to setEncoding() before calling setCellValue(), otherwise what you pass in won't be interpreted properly. - - - - - I'm having trouble creating a spreadsheet using POI using - websphere 3.5. - - - Make sure you have fix pack 4 installed. - - - - I am using styles when creating a workbook in POI, but Excel refuses to open the file, complaining about "Too Many Styles". - - -

You just create the styles OUTSIDE of the loop in which you create cells.

-

GOOD:

- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - HSSFRow row = null; - - // Aqua background - HSSFCellStyle style = wb.createCellStyle(); - style.setFillBackgroundColor(HSSFColor.AQUA.index); - style.setFillPattern(HSSFCellStyle.BIG_SPOTS); - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue("X"); - cell.setCellStyle(style); - - // Orange "foreground", foreground being the fill foreground not the font color. - style = wb.createCellStyle(); - style.setFillForegroundColor(HSSFColor.ORANGE.index); - style.setFillPattern(HSSFCellStyle.SOLID_FOREGROUND); - - for (int x = 0; x < 1000; x++) { - - // Create a row and put some cells in it. Rows are 0 based. - row = sheet.createRow((short) k); - - for (int y = 0; y < 100; y++) { - cell = row.createCell((short) k); - cell.setCellValue("X"); - cell.setCellStyle(style); - } - } - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -

BAD:

- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - HSSFRow row = null; - - for (int x = 0; x < 1000; x++) { - // Aqua background - HSSFCellStyle style = wb.createCellStyle(); - style.setFillBackgroundColor(HSSFColor.AQUA.index); - style.setFillPattern(HSSFCellStyle.BIG_SPOTS); - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue("X"); - cell.setCellStyle(style); - - // Orange "foreground", foreground being the fill foreground not the font color. - style = wb.createCellStyle(); - style.setFillForegroundColor(HSSFColor.ORANGE.index); - style.setFillPattern(HSSFCellStyle.SOLID_FOREGROUND); - - // Create a row and put some cells in it. Rows are 0 based. - row = sheet.createRow((short) k); - - for (int y = 0; y < 100; y++) { - cell = row.createCell((short) k); - cell.setCellValue("X"); - cell.setCellStyle(style); - } - } - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); -
- - - - Will Poi read any spreadsheet and rewrite it with modifications. - - - Poi is not guanteed to read the contents of any spreadsheet. - Certain features may cause spreadsheets to fail to read. More - problematic is rewriting spreadsheets. Poi tried hard to - preserve the records of the original spreadsheet but some - features may cause problems. We advise that you limit the - formatting of spreadsheets you process so as to not be - unpleasantly suprised at a later stage. - - - -
-
diff --git a/src/documentation/xdocs/getinvolved/book.xml b/src/documentation/xdocs/getinvolved/book.xml deleted file mode 100644 index 97cc62343b..0000000000 --- a/src/documentation/xdocs/getinvolved/book.xml +++ /dev/null @@ -1,15 +0,0 @@ - - - - - - - - - - - - diff --git a/src/documentation/xdocs/getinvolved/branching.xml b/src/documentation/xdocs/getinvolved/branching.xml deleted file mode 100644 index 4246d83443..0000000000 --- a/src/documentation/xdocs/getinvolved/branching.xml +++ /dev/null @@ -1,97 +0,0 @@ - - - - - -
- Branching - - - -
- - -
-

- Branches are tagged in the following way: -

-
    -
  • REL_1_5_BRANCH
  • -
  • REL_2_0_BRANCH
  • -
-

- Merge points should be tagged as follows: -

-
    -
  • REL_1_5_BRANCH_MERGE1
  • -
  • REL_1_5_BRANCH_MERGE2
  • -
  • etc...
  • -
-

- Releases should be tagged as: -

-
    -
  • REL_1_5
  • -
  • REL_1_5_1
  • -
  • REL_1_5_2
  • -
  • etc...
  • -
- -
-
-

- Don't forget which branch you are currently on. This is critically - important. Committing stuff to the wrong branch causes all sorts of - headaches. Best to name your checkout after the branch you are on. -

-
-
-

- All branching is currently managed by Glen Stampoultzis. If you wish - to create your own branch please let him know. Merging is also - handled by Glen. Just pop him a mail if you feel it's necessary to - create a branch or perform a merge. -

-

- The reason to go through a single point for branching is that it can be - an easy thing to get wrong. Having a single person managing branches - means there is less chance of getting getting our wires crossed with this - difficult area of CVS. -

-
-
-

- The following branches are currently active: -

- - - - - - - - - - - - - -
- Branch - - Description -
- HEAD - - This is the trunk and is always active. Currently it is being used to continue development - of the 2.0 release. -
- REL_1_5_BRANCH - - All bug fixes not specifically relevant to the 2.0 work should be placed in this branch. - From here they will merged back to the trunk and the merge point marked. -
-
- - -
\ No newline at end of file diff --git a/src/documentation/xdocs/getinvolved/index.xml b/src/documentation/xdocs/getinvolved/index.xml deleted file mode 100644 index c33d85059c..0000000000 --- a/src/documentation/xdocs/getinvolved/index.xml +++ /dev/null @@ -1,111 +0,0 @@ - - - - -
- Contribution to POI - - - - - - -
- - - -
-
-

- Any information in here that might be perceived as legal information is - informational only. We're not lawyers, so consult a legal professional - if needed. -

-
-
-

- The POI project is OpenSource - and developed/distributed under the - Apache Software License. Unlike other licenses this license allows - free open source development; however, it does not require you to release - your source or use any particular license for your source. If you wish - to contribute to POI (which you're very welcome and encouraged to do so) - then you must agree to release the rights of your source to us under this - license. -

-
-
-

- In short, stay away, stay far far away. Implementing these file formats - in POI is done strictly by using public information. Public information - includes sources from other open source projects, books that state the - purpose intended is for allowing implementation of the file format and - do not require any non-disclosure agreement and just hard work. - We are intent on keeping it - legal, by contributing patches you agree to do the same. -

-

- If you've ever received information regarding the OLE 2 Compound Document - Format under any type of exclusionary agreement from Microsoft, or - (probably illegally) received such information from a person bound by - such an agreement, you cannot participate in this project. (Sorry) -

-

- Those submitting patches that show insight into the file format may be - asked to state explicitly that they are eligible or possibly sign an - agreement. -

-
-
-
-
    -
  • Read the rest of the website, understand what POI is and what it does, - the project vision, etc.
  • -
  • Use POI a bit, look for gaps in the documentation and examples.
  • -
  • Join the mail lists and share your knowledge with others.
  • -
  • Get CVS and check out the POI source tree
  • -
  • Documentation is always the best place to start contributing, maybe you found that if the documentation just told you how to do X then it would make more sense, modify the documentation.
  • -
  • Get used to building POI, you'll be doing it a lot, be one with the build, know its targets, etc.
  • -
  • Write Unit Tests. Great way to understand POI. Look for classes that aren't tested, or aren't tested on a public/protected method level, start there.
  • -
  • (HSSF)Get the Excel 97 Developer's Kit - its out of print but its dirt cheap (seen copies for under $15 (US)) used on amazon. It explains the Excel file format.
  • -
  • Submit patches (see below) of your contributions, modifications.
  • -
  • Fill out new features, see Bug database for suggestions.
  • -
-
-
-

- Create patches by getting the latest sources from CVS (the HEAD). - Alter or add files as appropriate. Then, from the jakarta-poi directiory, - type cvs diff -u > mypatch.patch. This will capture all of your changes - in a patch file of the appropriate format. Next, if you've added any - files, create an archive (tar.bz2 preferred as its the smallest) in a - path-preserving archive format, relative to your jakarta-poi directory. - (Note: If you use WinCVS, move to - [Admin] -> [Command Line] Menu and type "cvs diff -u" at - [Enter a cvs line command] input field ([Commandline Settings] Tab), - while selecting the target directories or files, in order to create - unified diffs. - In other words, [Alt+A]+[Alt+C]+[Alt+C] and type "cvs diff -u".) - You'll attach both files in the next step. -

-

- Patches are submitted via the Bug Database. - Create a new bug, set the subject to [PATCH] followed by a brief description. Explain you patch and any special instructions and submit/save it. - Next, go back to the bug, and create attachements for the patch files you - created. Be sure to describe not only the files purpose, but its format. - (Is that ZIP or a tgz or a bz2 or what?). -

-

- Make sure your patches include the @author tag on any files you've altered - or created. Make sure you've documented your changes and altered the - examples/etc to reflect them. Any new additions should have unit tests. - Lastly, ensure that you've provided approriate javadoc. (see - Coding - Standards). Patches that are of low quality may be rejected or - the contributer may be asked to bring them up to spec. -

-
- - -
diff --git a/src/documentation/xdocs/hdf/book.xml b/src/documentation/xdocs/hdf/book.xml deleted file mode 100644 index 6dfc05464d..0000000000 --- a/src/documentation/xdocs/hdf/book.xml +++ /dev/null @@ -1,15 +0,0 @@ - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/hdf/docoverview.xml b/src/documentation/xdocs/hdf/docoverview.xml deleted file mode 100644 index 977a9a904c..0000000000 --- a/src/documentation/xdocs/hdf/docoverview.xml +++ /dev/null @@ -1,94 +0,0 @@ - - - - -
- HDF - Word file format - - - -
- - -
- -

The purpose of this document is to give a brief high level overview of the - HDF document format. This document does not go into in-depth technical - detail and is only meant as a supplement to the Microsoft Word 97 Binary - File Format freely available at Wotsit.org.

-

The OLE file format is not discussed in this document. It is assumed that - the reader has a working knowledge of the POIFS API.

- -
-

A Word file is made up of the document text and data structures - containing formatting information about the text. Of course, this is a - very simplified illustration. There are fields and macros and other - things that have not been considered. At this stage, HDF is mainly - concerned with formatted text.

-
-
-

The entry point for HDF's reading of a Word file is the File Information - Block (FIB). This structure is the entry point for the locations and size - of a document's text and data structures. The FIB is located at the - beginning of the main stream.

-
-

The document's text is also located in the main stream. Its starting - location is given as FIB.fcMin and its length is given in bytes by - FIB.ccpText. These two values are not very useful in getting the text - because of unicode. There may be unicode text intermingled with ASCII - text. That brings us to the piece table.

-

The piece table is used to divide the text into non-unicode and unicode - pieces. The size and offset are given in FIB.fcClx and FIB.lcbClx - respectively. The piece table may contain Property Modifiers (prm). - These are for complex(fast-saved) files and are skipped. Each text piece - contains offsets in the main stream that contain text for that piece. - If the piece uses unicode, the file offset is masked with a certain bit. - Then you have to unmask the bit and divide by 2 to get the real file - offset.

-
-
-
-

All text formatting is based on styles contained in the StyleSheet. - The StyleSheet is a data structure containing among other things, style - descriptions. Each style description can contain a paragraph style and - a character style or simply a character style. Each style description - is stored in a compressed version on file. Basically these are deltas - from another style.

-

Eventually, you have to chain back to the nil style which is an - imaginary style with certain implied values.

-
-
-

Paragraph and Character formatting properties for a document's text are - stored on file as deltas from some base style in the Stylesheet. The - deltas are used to create a complete uncompressed style in memory.

-

Uncompressed paragraph styles are represented by the Pargraph - Properties(PAP) data structure. Uncompressed character styles are - represented by the Character Properties(CHP) data structure. The styles - for the document text are stored in compressed format in the - corresponding Formatted Disk Pages (FKP). A compressed PAP is referred - to as a PAPX and a compressed CHP is a CHPX. The FKP locations are - stored in the bin table. There are seperate bin tables for CHPXs and - PAPXs. The bin tables' locations and sizes are stored in the FIB.

-

A FKP is a 512 byte OLE page. It contains the offsets of the beginning - and end of each paragraph/character run in the main stream and the - compressed properties for that interval. The compessed PAPX is based on - its base style in the StyleSheet. The compressed CHPX is based on the - enclosing paragraph's base style in the Stylesheet.

-
-
-

All compressed properties(CHPX, PAPX, SEPX) contain a grpprl. A grpprl - is an array of sprms. A sprm defines a delta from some base property. - There is a table of possible sprms in the Word 97 spec. Each sprm is a - two byte operand followed by a parameter. The parameter size depends on - the sprm. Each sprm describes an operation that should be performed on - the base style. After every sprm in the grpprl is performed on the base - style you will have the style for the paragraph, character run, - section, etc.

-
-
-
-
- -
- diff --git a/src/documentation/xdocs/hdf/index.xml b/src/documentation/xdocs/hdf/index.xml deleted file mode 100644 index afdd822b00..0000000000 --- a/src/documentation/xdocs/hdf/index.xml +++ /dev/null @@ -1,34 +0,0 @@ - - - - -
- HDF - Overview - - - - - -
- - -
- -

HDF is the name of OUR port of the Microsoft Word 97(-2002) file format to - pure Java.

-

HDF is still in early development. It is in the - scratchpad section of the - CVS. Source code in the org.apache.poi.hdf.extractor tree is - legacy code. Source in the org.apache.poi.hdf.model - tree is the old legacy code refactored into an object model. Check the How-To - page for detailed examples on using HDF. -

-

- We are looking for developers!!! If you are interested in helping with HDF - familiarize yourself with the source code and just start coding. Make sure - you read the guidelines for - getting involved

-
- -
diff --git a/src/documentation/xdocs/historyandfuture.xml b/src/documentation/xdocs/historyandfuture.xml deleted file mode 100755 index 741f6aad5c..0000000000 --- a/src/documentation/xdocs/historyandfuture.xml +++ /dev/null @@ -1,134 +0,0 @@ - - - - -
- Project History - - - -
- - - - -
- -

The POI project was dreamed up back around April 2001, when - Andy Oliver landed a short term contract to do Java-based - reporting to Excel. He'd done this project a few times before - and knew right where to look for the tools he needed. - Ironically, the API he used to use had skyrocketed from around - $300 ($US) to around $10K ($US). He figured it would take two - people around six months to write an Excel port so he - recommended the client fork out the $10K. -

- -

Around June 2001, Andy started thinking how great it would - be to have an open source Java tool to do this and, while he - had some spare time, he started on the project and learned - about OLE 2 Compound Document Format. After hitting some real - stumpers he realized he'd need help. He posted a message to - his local Java User's Group (JUG) and asked if anyone else - would be interested. He lucked out and the most talented Java - programmer he'd ever met, Marc Johnson, joined the project. He - ran rings around Andy at porting OLE 2 CDF and rewrote his - skeletal code into a more sophisticated library. It took Marc - a few iterations to get something they were happy with. -

- -

While Marc worked on that, Andy ported XLS to Java, based - on Marc's library. Several users wrote in asking to read XLS - (not just write as had originally been planned) and one user - had special requests for a different use for POIFS. Before - long, the project scope had tripled. POI 1.0 was released a - month later than planned, but with far more features. Marc - quickly wrote the serializer framework and HSSF Serializer in - record time and Andy banged out more documentation and worked - on making people aware of the project -

- -

Shortly before the release, POI was fortunate to come into - contact with Nicola -Ken- Barrozzi who gave them samples for - the HSSF Serializer and help uncover its unfortunate bugs - (which were promptly fixed). More recently, Ken ported most - of the POI project documentation to XML from Andy's crappy - HTML docs he wrote with Star Office. -

- -

Around the same time as the release, Glen Stampoultzis -joined the project. Glen was ticked off at Andy's flippant attitude -towards adding graphing to HSSF. Glen got so ticked off he decided to -grab a hammer and do it himself. Glen has already become an integral -part of the POI development community; his contributions to HSSF have -already started making waves. -

- -

Somewhere in there we decided to finally submit the project - to The Apache - Cocoon Project, only to discover the project had - outgrown fitting nicely into just Cocoon long ago. - Furthermore, Andy started eyeing other projects he'd like to - see POI functionality added to. So it was decided to donate - the Serializers and Generators to Cocoon, other POI - integration components to other projects, and the POI APIs - would become part of Jakarta. It was a bumpy road but it - looks like everything turned out since you're reading this! -

- -
- -
-

First we'll tackle this from a project standpoint: Well, we - made an offer to Microsoft and Actuate (tongue in cheek - ... well mostly) that we'd quit the project and retire if - they'd simply write us each a really large check. I've yet to - get a phone call or email so I'm assuming they're not going to - pay us to go away. -

-

Next, we've got some work to do here at Jakarta to finish - integrating POI into the community. Furthermore, we're - still transitioning the Serializer to Cocoon. -

-

HSSF, during the 2.0 cycle, will undergo a few - optimizations. We'll also be adding new features like a full - implementation of Formulas and custom text formats. We're - hoping to be able to generate smaller files by adding - write-support for RK, MulRK and MulBlank records. I'm also - going to work on a Cocoon 2 Generator. Currently, reading is - not very efficient in HSSF. This is mainly because in order to - write or modify, one needs to be able to update upstream - pointers to downstream data. To do this you have to have - everything between in memory. A Generator would allow SAX - events to be processed instead. (This will be based on the low - level structures). One of the great things about this is that, - you'll not only have a more efficient way to read the file, - you'll have a great way to use spreadsheets as XML data - sources. -

-

The HSSF Serializer, will further separate into a general - framework for creating serializers for other formats and the - HSSF Serializer specific implementation. (This is largely - already true). We'll also be adding support for features - already supported by HSSF (styles, fonts, text formats). We're - hoping to add support for formulas during this cycle. -

-

We're beginning to expand our scope yet again. If we could - do all of this for XLS files, what about Doc files or PPT - files? We're thinking that our next component (HDF - Horrible - Document Format) should follow the same pattern. We're hoping - that new blood will join the team and allow us to tackle this - even faster (in part because POIFS is already finished). But - maybe what we need most is you!

-
- - -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
- - -
diff --git a/src/documentation/xdocs/hpsf/book.xml b/src/documentation/xdocs/hpsf/book.xml deleted file mode 100644 index 529baed75a..0000000000 --- a/src/documentation/xdocs/hpsf/book.xml +++ /dev/null @@ -1,21 +0,0 @@ - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/hpsf/how-to.xml b/src/documentation/xdocs/hpsf/how-to.xml deleted file mode 100644 index aeea5e1dc9..0000000000 --- a/src/documentation/xdocs/hpsf/how-to.xml +++ /dev/null @@ -1,868 +0,0 @@ - - - - - -
- HPSF HOW-TO - - - -
- -
- -

This HOW-TO is organized in three sections. You should read them - sequentially because the later sections build upon the earlier ones.

- -
    -
  1. -

    The first section explains how to read - the most important standard properties of a Microsoft Office - document. Standard properties are things like title, author, creation - date etc. It is quite likely that you will find here what you need and - don't have to read the other sections.

    -
  2. - -
  3. -

    The second section goes a small step - further and focusses on reading additional standard properties. It also - talks about exceptions that may be thrown when dealing with HPSF and - shows how you can read properties of embedded objects.

    -
  4. - -
  5. -

    The third section tells how to read - non-standard properties. Non-standard properties are application-specific - triples consisting of an ID, a type, and a value.

    -
  6. -
- - - - -
- - This section explains how to read - the most important standard properties of a Microsoft Office - document. Standard properties are things like title, author, creation - date etc. Chances are that you will find here what you need and - don't have to read the other sections. - -

The first thing you should understand is that properties are stored in - separate documents inside the POI filesystem. (If you don't know what a - POI filesystem is, read the POIFS - documentation.) A document in a POI filesystem is also called a - stream.

- -

The following example shows how to read a POI filesystem's - "title" property. Reading other properties is similar. Consider the API - documentation of org.apache.poi.hpsf.SummaryInformation to - learn which methods are available!

- -

The standard properties this section focusses on can be found in a - document called \005SummaryInformation located in the root of the - POI filesystem. The notation \005 in the document's name means - the character with the decimal value of 5. In order to read the title, an - application has to perform the following steps:

- -
    -
  1. -

    Open the document \005SummaryInformation located in the root - of the POI filesystem.

    -
  2. -
  3. -

    Create an instance of the class SummaryInformation from - that document.

    -
  4. -
  5. -

    Call the SummaryInformation instance's - getTitle() method.

    -
  6. -
- -

Sounds easy, doesn't it? Here are the steps in detail.

- - -
- -

An application that wants to open a document in a POI filesystem - (POIFS) proceeds as shown by the following code fragment. (The full - source code of the sample application is available in the - examples section of the POI source tree as - ReadTitle.java.

- - -import java.io.*; -import org.apache.poi.hpsf.*; -import org.apache.poi.poifs.eventfilesystem.*; - -// ... - -public static void main(String[] args) - throws IOException -{ - final String filename = args[0]; - POIFSReader r = new POIFSReader(); - r.registerListener(new MyPOIFSReaderListener(), - "\005SummaryInformation"); - r.read(new FileInputStream(filename)); -} - -

The first interesting statement is

- - POIFSReader r = new POIFSReader(); - -

It creates a - org.apache.poi.poifs.eventfilesystem.POIFSReader instance - which we shall need to read the POI filesystem. Before the application - actually opens the POI filesystem we have to tell the - POIFSReader which documents we are interested in. In this - case the application should do something with the document - \005SummaryInformation.

- - -r.registerListener(new MyPOIFSReaderListener(), - "\005SummaryInformation"); - -

This method call registers a - org.apache.poi.poifs.eventfilesystem.POIFSReaderListener - with the POIFSReader. The POIFSReaderListener - interface specifies the method processPOIFSReaderEvent - which processes a document. The class - MyPOIFSReaderListener implements the - POIFSReaderListener and thus the - processPOIFSReaderEvent method. The eventing POI filesystem - calls this method when it finds the \005SummaryInformation - document. In the sample application MyPOIFSReaderListener is - a static class in the ReadTitle.java source file.

- -

Now everything is prepared and reading the POI filesystem can - start:

- - r.read(new FileInputStream(filename)); - -

The following source code fragment shows the - MyPOIFSReaderListener class and how it retrieves the - title.

- - -static class MyPOIFSReaderListener implements POIFSReaderListener -{ - public void processPOIFSReaderEvent(POIFSReaderEvent event) - { - SummaryInformation si = null; - try - { - si = (SummaryInformation) - PropertySetFactory.create(event.getStream()); - } - catch (Exception ex) - { - throw new RuntimeException - ("Property set stream \"" + - event.getPath() + event.getName() + "\": " + ex); - } - final String title = si.getTitle(); - if (title != null) - System.out.println("Title: \"" + title + "\""); - else - System.out.println("Document has no title."); - } -} - - -

The line

- - SummaryInformation si = null; - -

declares a SummaryInformation variable and initializes it - with null. We need an instance of this class to access the - title. The instance is created in a try block:

- - si = (SummaryInformation) - PropertySetFactory.create(event.getStream()); - -

The expression event.getStream() returns the input stream - containing the bytes of the property set stream named - \005SummaryInformation. This stream is passed into the - create method of the factory class - org.apache.poi.hpsf.PropertySetFactory which returns - a org.apache.poi.hpsf.PropertySet instance. It is more or - less safe to cast this result to SummaryInformation, a - convenience class with methods like getTitle(), - getAuthor() etc.

- -

The PropertySetFactory.create method may throw all sorts - of exceptions. We'll deal with them in the next sections. For now we just - catch all exceptions and throw a RuntimeException - containing the message text of the origin exception.

- -

If all goes well, the sample application retrieves the title and prints - it to the standard output. As you can see you must be prepared for the - case that the POI filesystem does not have a title.

- - final String title = si.getTitle(); -if (title != null) - System.out.println("Title: \"" + title + "\""); -else - System.out.println("Document has no title."); - -

Please note that a Microsoft Office document does not necessarily - contain the \005SummaryInformation stream. The documents created - by the Microsoft Office suite have one, as far as I know. However, an - Excel spreadsheet exported from StarOffice 5.2 won't have a - \005SummaryInformation stream. In this case the applications - won't throw an exception but simply does not call the - processPOIFSReaderEvent method. You have been warned!

-
-
- - -
- - This section focusses on reading additional standard properties. It - also talks about exceptions that may be thrown when dealing with HPSF and - shows how you can read properties of embedded objects. - -

A couple of additional standard properties are not - contained in the \005SummaryInformation stream explained above, - for example a document's category or the number of multimedia clips in a - PowerPoint presentation. Microsoft has invented an additional stream named - \005DocumentSummaryInformation to hold these properties. With two - minor exceptions you can proceed exactly as described above to read the - properties stored in \005DocumentSummaryInformation:

- -
    -
  • Instead of \005SummaryInformation use - \005DocumentSummaryInformation as the stream's name.

  • -
  • Replace all occurrences of the class - SummaryInformation by - DocumentSummaryInformation.

  • -
- -

And of course you cannot call getTitle() because - DocumentSummaryInformation has different query methods. See - the Javadoc API documentation for the details!

- -

In the previous section the application simply caught all - exceptions and was in no way interested in any - details. However, a real application will likely want to know what went - wrong and act appropriately. Besides any IO exceptions there are three - HPSF resp. POI specific exceptions you should know about:

- -
-
NoPropertySetStreamException:
-
-

This exception is thrown if the application tries to create a - PropertySet instance from a stream that is not a - property set stream. (SummaryInformation and - DocumentSummaryInformation are subclasses of - PropertySet.) A faulty property set stream counts as not - being a property set stream at all. An application should be prepared to - deal with this case even if it opens streams named - \005SummaryInformation or - \005DocumentSummaryInformation only. These are just names. A - stream's name by itself does not ensure that the stream contains the - expected contents and that this contents is correct.

-
- -
UnexpectedPropertySetTypeException
-

This exception is thrown if a certain type of property set is - expected somewhere (e.g. a SummaryInformation or - DocumentSummaryInformation) but the provided property - set is not of that type.

- -
MarkUnsupportedException
-

This exception is thrown if an input stream that is to be parsed - into a property set does not support the - InputStream.mark(int) operation. The POI filesystem uses - the DocumentInputStream class which does support this - operation, so you are safe here. However, if you read a property set - stream from another kind of input stream things may be - different.

-
- -

Many Microsoft Office documents contain embedded - objects, for example an Excel sheet on a page in a Word - document. Embedded objects may have property sets of their own. An - application can open these property set streams as described above. The - only difference is that they are not located in the POI filesystem's root - but in a nested directory instead. Just register a - POIFSReaderListener for the property set streams you are - interested in. For example, the POIBrowser application in the - contrib section tries to open each and every document in a POI filesystem - as a property set stream. If this operation was successful it displays the - properties.

-
- - -
- - This section tells how to read non-standard properties. Non-standard - properties are application-specific ID/type/value triples. - -
-

Now comes the real hardcode stuff. As mentioned above, - SummaryInformation and - DocumentSummaryInformation are just special cases of the - general concept of a property set. This concept says that a - property set consists of properties and that each - property is an entity with an ID, a - type, and a value.

- -

Okay, that was still rather easy. However, to make things more - complicated, Microsoft in its infinite wisdom decided that a property set - shalt be broken into one or more sections. Each section - holds a bunch of properties. But since that's still not complicated - enough, a section may have an optional dictionary that - maps property IDs to property names - we'll explain - later what that means.

- -

The procedure to get to the properties is the following:

- -
    -
  1. Use the PropertySetFactory class to - create a PropertySet object from a property set stream. If - you don't know whether an input stream is a property set stream, just - try to call PropertySetFactory.create(java.io.InputStream): - You'll either get a PropertySet instance returned or an - exception is thrown.
  2. - -
  3. Call the PropertySet's method getSections() - to get the sections contained in the property set. Each section is - an instance of the Section class.
  4. - -
  5. Each section has a format ID. The format ID of the first section in a - property set determines the property set's type. For example, the first - (and only) section of the SummaryInformation property set has a format - ID of F29F85E0-4FF9-1068-AB-91-08-00-2B-27-B3-D9. You can - get the format ID with Section.getFormatID().
  6. - -
  7. The properties contained in a Section can be retrieved - with Section.getProperties(). The result is an array of - Property instances.
  8. - -
  9. A property has a name, a type, and a value. The Property - class has methods to retrieve them.
  10. -
-
- -
-

Let's have a look at a sample Java application that dumps all property - set streams contained in a POI file system. The full source code of this - program can be found as ReadCustomPropertySets.java in the - examples area of the POI source code tree. Here are the key - sections:

- - import java.io.*; -import java.util.*; -import org.apache.poi.hpsf.*; -import org.apache.poi.poifs.eventfilesystem.*; -import org.apache.poi.util.HexDump; - -

The most important package the application needs is - org.apache.poi.hpsf.*. This package contains the HPSF - classes. Most classes named below are from the HPSF package. Of course we - also need the POIFS event file system's classes and java.io.* - since we are dealing with POI I/O. From the java.util package - we use the List and Iterator class. The class - org.apache.poi.util.HexDump provides a methods to dump byte - arrays as nicely formatted strings.

- - public static void main(String[] args) - throws IOException -{ - final String filename = args[0]; - POIFSReader r = new POIFSReader(); - - /* Register a listener for *all* documents. */ - r.registerListener(new MyPOIFSReaderListener()); - r.read(new FileInputStream(filename)); -} - -

The POIFSReader is set up in a way that the listener - MyPOIFSReaderListener is called on every file in the POI file - system.

-
- -
-

The listener class tries to create a PropertySet from each - stream using the PropertySetFactory.create() method:

- - static class MyPOIFSReaderListener implements POIFSReaderListener -{ - public void processPOIFSReaderEvent(POIFSReaderEvent event) - { - PropertySet ps = null; - try - { - ps = PropertySetFactory.create(event.getStream()); - } - catch (NoPropertySetStreamException ex) - { - out("No property set stream: \"" + event.getPath() + - event.getName() + "\""); - return; - } - catch (Exception ex) - { - throw new RuntimeException - ("Property set stream \"" + - event.getPath() + event.getName() + "\": " + ex); - } - - /* Print the name of the property set stream: */ - out("Property set stream \"" + event.getPath() + - event.getName() + "\":"); - -

Creating the PropertySet is done in a try - block, because not each stream in the POI file system contains a property - set. If it is some other file, the - PropertySetFactory.create() throws a - NoPropertySetStreamException, which is caught and - logged. Then the program continues with the next stream. However, all - other types of exceptions cause the program to terminate by throwing a - runtime exception. If all went well, we can print the name of the property - set stream.

-
- -
-

The next step is to print the number of sections followed by the - sections themselves:

- - /* Print the number of sections: */ -final long sectionCount = ps.getSectionCount(); -out(" No. of sections: " + sectionCount); - -/* Print the list of sections: */ -List sections = ps.getSections(); -int nr = 0; -for (Iterator i = sections.iterator(); i.hasNext();) -{ - /* Print a single section: */ - Section sec = (Section) i.next(); - - // See below for the complete loop body. -} - -

The PropertySet's method getSectionCount() - returns the number of sections.

- -

To retrieve the sections, use the getSections() - method. This method returns a java.util.List containing - instances of the Section class in their proper order.

- -

The sample code shows a loop that retrieves the Section - objects one by one and prints some information about each one. Here is - the complete body of the loop:

- - /* Print a single section: */ -Section sec = (Section) i.next(); -out(" Section " + nr++ + ":"); -String s = hex(sec.getFormatID().getBytes()); -s = s.substring(0, s.length() - 1); -out(" Format ID: " + s); - -/* Print the number of properties in this section. */ -int propertyCount = sec.getPropertyCount(); -out(" No. of properties: " + propertyCount); - -/* Print the properties: */ -Property[] properties = sec.getProperties(); -for (int i2 = 0; i2 < properties.length; i2++) -{ - /* Print a single property: */ - Property p = properties[i2]; - int id = p.getID(); - long type = p.getType(); - Object value = p.getValue(); - out(" Property ID: " + id + ", type: " + type + - ", value: " + value); -} -
- -
-

The first method called on the Section instance is - getFormatID(). As explained above, the format ID of the - first section in a property set determines the type of the property - set. Its type is ClassID which is essentially a sequence of - 16 bytes. A real application using its own type of a custom property set - should have defined a unique format ID and, when reading a property set - stream, should check the format ID is equal to that unique format ID. The - sample program just prints the format ID it finds in a section:

- - String s = hex(sec.getFormatID().getBytes()); -s = s.substring(0, s.length() - 1); -out(" Format ID: " + s); - -

As you can see, the getFormatID() method returns a - ClassID object. An array containing the bytes can be - retrieved with ClassID.getBytes(). In order to get a nicely - formatted printout, the sample program uses the hex() helper - method which in turn uses the POI utility class HexDump in - the org.apache.poi.util package. Another helper method is - out() which just saves typing - System.out.println().

-
- -
-

Before getting the properties, it is possible to find out how many - properties are available in the section via the - Section.getPropertyCount(). The sample application uses this - method to print the number of properties to the standard output:

- - int propertyCount = sec.getPropertyCount(); -out(" No. of properties: " + propertyCount); - -

Now its time to get to the properties themselves. You can retrieve a - section's properties with the method - Section.getProperties():

- - Property[] properties = sec.getProperties(); - -

As you can see the result is an array of Property - objects. This class has three methods to retrieve a property's ID, its - type, and its value. The following code snippet shows how to call - them:

- - for (int i2 = 0; i2 < properties.length; i2++) -{ - /* Print a single property: */ - Property p = properties[i2]; - int id = p.getID(); - long type = p.getType(); - Object value = p.getValue(); - out(" Property ID: " + id + ", type: " + type + - ", value: " + value); -} -
- -
-

The output of the sample program might look like the following. It - shows the summary information and the document summary information - property sets of a Microsoft Word document. However, unlike the first and - second section of this HOW-TO the application does not have any code - which is specific to the SummaryInformation and - DocumentSummaryInformation classes.

- - Property set stream "/SummaryInformation": - No. of sections: 1 - Section 0: - Format ID: 00000000 F2 9F 85 E0 4F F9 10 68 AB 91 08 00 2B 27 B3 D9 ....O..h....+'.. - No. of properties: 17 - Property ID: 1, type: 2, value: 1252 - Property ID: 2, type: 30, value: Titel - Property ID: 3, type: 30, value: Thema - Property ID: 4, type: 30, value: Rainer Klute (Autor) - Property ID: 5, type: 30, value: Test (Stichwörter) - Property ID: 6, type: 30, value: This is a document for testing HPSF - Property ID: 7, type: 30, value: Normal.dot - Property ID: 8, type: 30, value: Unknown User - Property ID: 9, type: 30, value: 3 - Property ID: 18, type: 30, value: Microsoft Word 9.0 - Property ID: 12, type: 64, value: Mon Jan 01 00:59:25 CET 1601 - Property ID: 13, type: 64, value: Thu Jul 18 16:22:00 CEST 2002 - Property ID: 14, type: 3, value: 1 - Property ID: 15, type: 3, value: 20 - Property ID: 16, type: 3, value: 93 - Property ID: 19, type: 3, value: 0 - Property ID: 17, type: 71, value: [B@13582d -Property set stream "/DocumentSummaryInformation": - No. of sections: 2 - Section 0: - Format ID: 00000000 D5 CD D5 02 2E 9C 10 1B 93 97 08 00 2B 2C F9 AE ............+,.. - No. of properties: 14 - Property ID: 1, type: 2, value: 1252 - Property ID: 2, type: 30, value: Test - Property ID: 14, type: 30, value: Rainer Klute (Manager) - Property ID: 15, type: 30, value: Rainer Klute IT-Consulting GmbH - Property ID: 5, type: 3, value: 3 - Property ID: 6, type: 3, value: 2 - Property ID: 17, type: 3, value: 111 - Property ID: 23, type: 3, value: 592636 - Property ID: 11, type: 11, value: false - Property ID: 16, type: 11, value: false - Property ID: 19, type: 11, value: false - Property ID: 22, type: 11, value: false - Property ID: 13, type: 4126, value: [B@56a499 - Property ID: 12, type: 4108, value: [B@506411 - Section 1: - Format ID: 00000000 D5 CD D5 05 2E 9C 10 1B 93 97 08 00 2B 2C F9 AE ............+,.. - No. of properties: 7 - Property ID: 0, type: 0, value: {6=Test-JaNein, 5=Test-Zahl, 4=Test-Datum, 3=Test-Text, 2=_PID_LINKBASE} - Property ID: 1, type: 2, value: 1252 - Property ID: 2, type: 65, value: [B@c9ba38 - Property ID: 3, type: 30, value: This is some text. - Property ID: 4, type: 64, value: Wed Jul 17 00:00:00 CEST 2002 - Property ID: 5, type: 3, value: 27 - Property ID: 6, type: 11, value: true -No property set stream: "/WordDocument" -No property set stream: "/CompObj" -No property set stream: "/1Table" - -

There are some interesting items to note:

- -
    -
  • The first property set (summary information) consists of a single - section, the second property set (document summary information) consists - of two sections.
  • - -
  • Each section type (identified by its format ID) has its own domain of - property ID. For example, in the second property set the properties with - ID 2 have different meanings in the two section. By the way, the format - IDs of these sections are not equal, but you have to - look hard to find the difference.
  • - -
  • The properties are not in any particular order in the section, - although they slightly tend to be sorted by their IDs.
  • -
-
- -
-

Properties in the same section are distinguished by their IDs. This is - similar to variables in a programming language like Java, which are - distinguished by their names. But unlike variable names, property IDs are - simple integral numbers. There is another similarity, however. Just like - a Java variable has a certain scope (e.g. a member variables in a class), - a property ID also has its scope of validity: the section.

- -

Two property IDs in sections with different section format IDs - don't have the same meaning even though their IDs might be equal. For - example, ID 4 in the first (and only) section of a summary - information property set denotes the document's author, while ID 4 in the - first section of the document summary information property set means the - document's byte count. The sample output above does not show a property - with an ID of 4 in the first section of the document summary information - property set. That means that the document does not have a byte - count. However, there is a property with an ID of 4 in the - second section: This is a user-defined property ID - we'll get - to that topic in a minute.

- -

So, how can you find out what the meaning of a certain property ID in - the summary information and the document summary information property set - is? The standard property sets as such don't have any hints about the - meanings of their property IDs. For example, the summary - information property set does not tell you that the property ID 4 stands - for the document's author. This is external knowledge. Microsoft defined - standard meanings for some of the property IDs in the summary information - and the document summary information property sets. As a help to the Java - and POI programmer, the class PropertyIDMap in the - org.apache.poi.hpsf.wellknown package defines constants - for the "well-known" property IDs. For example, there is the - definition

- - public final static int PID_AUTHOR = 4; - -

These definitions allow you to use symbolic names instead of - numbers.

- -

In order to provide support for the other way, too, - i.e. to map - property IDs to property names - the class PropertyIDMap - defines two static methods: - getSummaryInformationProperties() and - getDocumentSummaryInformationProperties(). Both return - java.util.Map objects which map property IDs to - strings. Such a string gives a hint about the property's meaning. For - example, - PropertyIDMap.getSummaryInformationProperties().get(4) - returns the string "PID_AUTHOR". An application could use this string as - a key to a localized string which is displayed to the user, e.g. "Author" - in English or "Verfasser" in German. HPSF might provide such - language-dependend ("localized") mappings in a later release.

- -

Usually you won't have to deal with those two maps. Instead you should - call the Section.getPIDString(int) method. It returns the - string associated with the specified property ID in the context of the - Section object.

- -

Above you learned that property IDs have a meaning in the scope of a - section only. However, there are two exceptions to the rule: The property - IDs 0 and 1 have a fixed meaning in all sections:

- - - - - - - - - - - - - - - - -
Property IDMeaning
0The property's value is a dictionary, i.e. a - mapping from property IDs to strings.
1The property's value is the number of a codepage, - i.e. a mapping from character codes to characters. All strings in the - section containing this property must be interpreted using this - codepage. Typical property values are 1252 (8-bit "western" characters) - or 1200 (16-bit Unicode characters).
-
- -
-

A property is nothing without its value. It is stored in a property set - stream as a sequence of bytes. You must know the property's - type in order to properly interpret those bytes and - reasonably handle the value. A property's type is one of the so-called - Microsoft-defined "variant types". When you call - Property.getType() you'll get a long value - which denoting the property's variant type. The class - Variant in the org.apache.poi.hpsf package - holds most of those long values as named constants. For - example, the constant VT_I4 = 3 means a signed integer value - of four bytes. Examples of other types are VT_LPSTR = 30 - meaning a null-terminated string of 8-bit characters, VT_LPWSTR = - 31 which means a null-terminated Unicode string, or VT_BOOL - = 11 denoting a boolean value.

- -

In most cases you won't need a property's type because HPSF does all - the work for you.

-
- -
-

When an application wants to retrieve a property's value and calls - Property.getValue(), HPSF has to interpret the bytes making - out the value according to the property's type. The type determines how - many bytes the value consists of and what - to do with them. For example, if the type is VT_I4, HPSF - knows that the value is four bytes long and that these bytes - comprise a signed integer value in the little-endian format. This is - quite different from e.g. a type of VT_LPWSTR. In this case - HPSF has to scan the value bytes for a Unicode null character and collect - everything from the beginning to that null character as a Unicode - string.

- -

The good new is that HPSF does another job for you, too: It maps the - variant type to an adequate Java type.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Variant type:Java type:
VT_I2java.lang.Integer
VT_I4java.lang.Long
VT_FILETIMEjava.util.Date
VT_LPSTRjava.lang.String
VT_LPWSTRjava.lang.String
VT_CFbyte[]
VT_BOOLjava.lang.Boolean
- -

The bad news is that there are still a couple of variant types HPSF - does not yet support. If it encounters one of these types it - returns the property's value as a byte array and leaves it to be - interpreted by the application.

- -

An application retrieves a property's value by calling the - Property.getValue() method. This method's return type is the - abstract Object class. The getValue() method - looks up the property's variant type, reads the property's value bytes, - creates an instance of an adequate Java type, assigns it the property's - value and returns it. Primitive types like int or - long will be returned as the corresponding class, - e.g. Integer or Long.

-
- - -
-

The property with ID 0 has a very special meaning: It is a - dictionary mapping property IDs to property names. We - have seen already that the meanings of standard properties in the - summary information and the document summary information property sets - have been defined by Microsoft. The advantage is that the labels of - properties like "Author" or "Title" don't have to be stored in the - property set. However, a user can define custom fields in, say, Microsoft - Word. For each field the user has to specify a name, a type, and a - value.

- -

The names of the custom-defined fields (i.e. the property names) are - stored in the document summary information second section's - dictionary. The dictionary is a map which associates - property IDs with property names.

- -

The method Section.getPIDString(int) not only returns with - the well-known property names of the summary information and document - summary information property sets, but with self-defined properties, - too. It should also work with self-defined properties in self-defined - sections.

-
- -
- Improve codepage support! - -

The property with ID 1 holds the number of the codepage which was used - to encode the strings in this section. The present HPSF codepage support - is still very limited: When reading property value strings, HPSF - distinguishes between 16-bit characters and 8-bit characters. 16-bit - characters should be Unicode characters and thus be okay. 8-bit - characters are interpreted according to the platform's default character - set. This is fine as long as the document being read has been written on - a platform with the same default character set. However, if you receive a - document from another region of the world and want to process it with - HPSF you are in trouble - unless the creator used Unicode, of course.

-
- -
-

There are still some aspects of HSPF left which are not covered by this - HOW-TO. You should dig into the Javadoc API documentation to learn - further details. Since you've struggled through this document up to this - point, you are well prepared.

-
-
-
- -
- - diff --git a/src/documentation/xdocs/hpsf/index.xml b/src/documentation/xdocs/hpsf/index.xml deleted file mode 100644 index 0324452ca0..0000000000 --- a/src/documentation/xdocs/hpsf/index.xml +++ /dev/null @@ -1,54 +0,0 @@ - - - - - -
- HPSF (Horrible Property Set Format) - Overview - - - -
- -
-

Microsoft applications like "Word", "Excel" or "Powerpoint" let the user - describe his document by properties like "title", "category" and so on. The - application itself adds further information: last author, creation date - etc. These document properties are stored in so-called property set - streams. A property set stream is a separate document within a - POI filesystem. We'll call property - set streams mostly just "property sets". HPSF is POI's pure-Java - implementation to read (and in future to write) property sets.

- -

The HPSF HOWTO describes what a Java - application should do to read a property set using HPSF and to retrieve the - information it needs.

- -

HPSF supports OLE2 property set streams in general, and is not limited to - the special case of document properties in the Microsoft Office files - mentioned above. The HPSF description - describes the internal structure of property set streams. A separate - document explains the internal of thumbnail - images.

-
- -
- - diff --git a/src/documentation/xdocs/hpsf/internals.xml b/src/documentation/xdocs/hpsf/internals.xml deleted file mode 100644 index 867167074d..0000000000 --- a/src/documentation/xdocs/hpsf/internals.xml +++ /dev/null @@ -1,1186 +0,0 @@ - - - - - -
- HPSF Internals: The Horrible Property Set Format - - - -
- -
- -
- -

A Microsoft Office document is internally organized like a filesystem - with directory and files. Microsoft calls these files - streams. A document can have properties attached to it, - like author, title, number of words etc. These metadata are not stored in - the main stream of, say, a Word document, but instead in a dedicated - stream with a special format. Usually this stream's name is - \005SummaryInformation, where \005 represents - the character with a decimal value of 5.

- -

A single piece of information in the stream is called a - property, for example the document title. Each property - has an integral ID (e.g. 2 for title), a - type (telling that the title is a string of bytes) and a - value (what this is should be obvious). A stream - containing properties is called a - property set stream.

- -

This document describes the internal structure of a property set stream, - i.e. the Horrible Property Set Format (HDF). It does not - describe how a Microsoft Office document is organized internally and how - to retrieve a stream from it. See the POIFS documentation for that kind of - stuff.

- -

The Horrible Property Set Format is not only used in the Summary - Information stream in the top-level document of a Microsoft Office - document. Often there is also a property set stream named - \005DocumentSummaryInformation with additional properties. - Embedded documents may have their own property set streams. You cannot - tell by a stream's name whether it is a property set stream or not. - Instead you have to open the stream and look at its bytes.

-
- - - -
- -

Before delving into the details of the property set stream format we - have to have a short look at data types. Integral values are stored in the - so-called little endian format. In this format the bytes - that make out an integral value are stored in the "wrong" order. For - example, the decimal value 4660 is 0x1234 in the hexadecimal notation. If - you think this should be represented by a byte 0x12 followed by another - byte 0x34, you are right. This is called the big endian - format. In the little endian format, however, this order is reversed and - the low-value byte comes first: 0x3412. -

- -

The following table gives an overview about some important data - types:

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
-

Name

-
-

Length

-
-

Example (Little Endian)

-
-

Example (Big Endian)

-

Bytes

-

1 byte

-

0x12

0x12

Word

-

2 bytes

-

0x1234

0x3412

DWord

-

4 bytes

-

0x12345678

0x78563412

-

ClassID

-

A sequence of one DWord, two Words and eight Bytes

-
-

16 bytes

-
-

0xE0859FF2F94F6810AB9108002B27B3D9 resp. - E0859FF2-F94F-6810-AB-91-08-00-2B-27-B3-D9

-
-

0xF29F85E04FF91068AB9108002B27B3D9 resp. - F29F85E0-4FF9-1068-AB-91-08-00-2B-27-B3-D9

-
-

The ClassID examples are given here in two different notations. The - second notation without the "0x" at the beginning and with dashes - inside shows the internal grouping into one DWord, two Words and eight - Bytes.

-
-

Watch out: Microsoft documentation and tools show class IDs - a little bit differently like - F29F85E0-4FF9-1068-AB91-08002B27B3D9. - However, that representation is (intentionally?) misleading with - respect to endianess.

-
-
- - - -
- -

A property set stream consists of three main parts:

- -
    -
  1. -

    The header and

    -
  2. -
  3. -

    the section(s) containing the properties.

    -
  4. -
-
- - - -
- -

The first bytes in a property set stream is the header. - It has a fixed length and looks like this:

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
-

Offset

-
-

Type

-
-

Contents

-
-

Remarks

-
-

0

-
-

Word

-

0xFFFE

-

If the first four bytes of a stream do not contain these values, the - stream is not a property set stream.

-
-

2

-
-

Word

-

0x0000

-

4

-
-

DWord

-
-

Denotes the operating system and the OS version under which this - stream was created. The operating system ID is in the DWord's higher - word (after little endian decoding): 0x0000 for Win16, - 0x0001 for Macintosh and 0x0002 for Win32 - that's - all. The reader is most likely aware of the fact that there are some - more operating systems. However, Microsoft does not seem to know.

-
-

8

-
-

ClassID

-

0x00000000000000000000000000000000

-

Most property set streams have this value but this is not - required.

-
-

24

-
-

DWord

-
-

0x01000000 or greater

-
-

Section count. This field's value should be equal to 1 or greater. - Microsoft claims that this is a "reserved" field, but it seems to tell - how many sections (see below) are following in the stream. This would - really make sense because otherwise you could not know where and how - far you should read section data.

-
-
- - - -
- -

Following the header is the section list. This is an array of pairs each - consisting of a section format ID and an offset. This array has as many - pairs of ClassID and and DWord fields as the section count field in the - header says. The Summary Information stream contains a single section, the - Document Summary Information stream contains two.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
-

Type

-
-

Contents

-
-

Remarks

-
-

ClassID

-
-

Section format ID

-
-

0xF29F85E04FF91068AB9108002B27B3D9 for the single section - in the Summary Information stream.

- -

0xD5CDD5022E9C101B939708002B2CF9AE for the first - section in the Document Summary Information stream.

-
-

DWord

-
-

Offset

-
-

The number of bytes between the beginning of the stream and the - beginning of the section within the stream.

-
-

ClassID

-
-

Section format ID

-
-

...

-
-

DWord

-
-

Offset

-
-

...

-
-

...

-
-

...

-
-

...

-
-
- - - -
- -

A section is divided into three parts: the section header (with the - section length and the number of properties in the section), the - properties list (with type and offset of each property), and the - properties themselves. Here are the details:

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
-

 

-
-

Type

-
-

Contents

-
-

Remarks

-
-

Section header

-
-

DWord

-
-

Length

-
-

The length of the section in bytes.

-
-

DWord

-
-

Property count

-
-

The number of properties in the section.

-
-

Properties list

-
-

DWord

-
-

Property ID

-
-

The property ID tells what the property means. For example, an ID of - 0x0002 in the Summary Information stands for the document's - title. See the Property IDs - chapter below for more details.

-
-

DWord

-
-

Offset

-
-

The number of bytes between the beginning of the section and the - property.

-
-

...

-
-

...

-
-

...

-
-

Properties

-
-

DWord

-
-

Property type ("variant")

-
-

This is the property's data type, e.g. an integer value, a byte - string or a Unicode string. See the - Property Types chapter for - details!

-

Field length depends on the property type - ("variant")

-

Property value

-
-

This field's length depends on the property's type. These are the - bytes that make out the DWord, the byte string or some other data of - fixed or variable length.

- -

The property value's length is always stored in an area which is a - multiple of 4 in length. If the property is shorter, e.g. a byte - string of 13 bytes, the remaining bytes are padded with 0x00 - bytes.

-
-

...

-
-

...

-
-

...

-
-
- - - -
- - -

As seen above, a section holds a property list: an array with property - IDs and offsets. The property ID gives each property a meaning. For - example, in the Summary Information stream the property ID 2 says that - this property is the document's title.

- -

If you want to know a property ID's meaning, it is not sufficient to - know the ID itself. You must also know the - section format ID. For example, in the Document Summary - Information stream the property ID 2 means not the document's title but - its category. Due to Microsoft's infinite wisdom the section format ID is - not part of the section. Thus if you have only a section without the - stream it is in, you cannot make any sense of the properties because you - do not know what they mean.

- -

So each section format ID has its own name space of property IDs. - Microsoft defined some "well-known" property IDs for the Summary - Information and the Document Summary Information streams. You can extend - them by your own additional IDs. This will be described below.

- -
- -

The Summary Information stream has a single section with a section - format ID of 0xF29F85E04FF91068AB9108002B27B3D9. The following - table defines the meaning of its property IDs. Each row associates a - property ID with a name and an ID string. (The property - type is just for informational purposes given here. As we have - seen above, the type is always given along with the value.)

- -

The property name is a readable string which could be - displayed to the user. However, this string is useful only for users who - understand English. The property name does not help with other - languages.

- -

The property ID string is about the same but looks more - technically and is nothing a user should bother with. You could the ID - string and map it to an appropriate display string in a particular - language. Of course you could do that with the property ID as well and - with less overhead, but people (including software developers) tend to be - better in remembering symbolic constants than remembering numbers.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Property ID

Property Name

Property ID String

Property Type

2

Title

PID_TITLE

VT_LPSTR

3

Subject

PID_SUBJECT

VT_LPSTR

4

Author

PID_AUTHOR

VT_LPSTR

5

Keywords

PID_KEYWORDS

VT_LPSTR

6

Comments

PID_COMMENTS

VT_LPSTR

7

Template

PID_TEMPLATE

VT_LPSTR

8

Last Saved By

PID_LASTAUTHOR

VT_LPSTR

9

Revision Number

PID_REVNUMBER

VT_LPSTR

10

Total Editing Time

PID_EDITTIME

VT_FILETIME

11

Last Printed

PID_LASTPRINTED

VT_FILETIME

12

Create Time/Date

PID_CREATE_DTM

VT_FILETIME

13

Last Saved Time/Date

PID_LASTSAVE_DTM

VT_FILETIME

14

Number of Pages

PID_PAGECOUNT

VT_I4

15

Number of Words

PID_WORDCOUNT

VT_I4

16

Number of Characters

PID_CHARCOUNT

VT_I4

17

Thumbnail

PID_THUMBNAIL

VT_CF

18

Name of Creating Application

PID_APPNAME

VT_LPSTR

19

Security

PID_SECURITY

VT_I4

-
- - - -
- -

The Document Summary Information stream has two sections with a section - format ID of 0xD5CDD5022E9C101B939708002B2CF9AE for the first - one. The following table defines the meaning of the property IDs in the - first section. See the preceeding section for interpreting the table.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Property ID

Property name

Property ID string

VT type

0

Dictionary

PID_DICTIONARY

[Special format]

1

Code page

PID_CODEPAGE

VT_I2

2

Category

PID_CATEGORY

VT_LPSTR

3

PresentationTarget

PID_PRESFORMAT

VT_LPSTR

4

Bytes

PID_BYTECOUNT

VT_I4

5

Lines

PID_LINECOUNT

VT_I4

6

Paragraphs

PID_PARCOUNT

VT_I4

7

Slides

PID_SLIDECOUNT

VT_I4

8

Notes

PID_NOTECOUNT

VT_I4

9

HiddenSlides

PID_HIDDENCOUNT

VT_I4

10

MMClips

PID_MMCLIPCOUNT

VT_I4

11

ScaleCrop

PID_SCALE

VT_BOOL

12

HeadingPairs

PID_HEADINGPAIR

VT_VARIANT | VT_VECTOR

13

TitlesofParts

PID_DOCPARTS

VT_LPSTR | VT_VECTOR

14

Manager

PID_MANAGER

VT_LPSTR

15

Company

PID_COMPANY

VT_LPSTR

16

LinksUpTo Date

PID_LINKSDIRTY

VT_BOOL

-
-
- - - -
- - -

A property consists of a DWord type field followed by the - property value. The property type is an integer value and tells how the - data byte following it are to be interpreted. In the Microsoft world it is - also known as the variant.

- -

The Usage column says where a variant type may occur. Not all - of them are allowed in a property set but just those marked with a [P]. - [V] - may appear in a VARIANT, [T] - may - appear in a TYPEDESC, [P] - may appear in an OLE property - set, [S] - may appear in a Safe Array.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Variant ID

Variant Type

Usage

Description

0

VT_EMPTY

[V] [P]

nothing

1

VT_NULL

[V] [P]

SQL style Null

2

VT_I2

[V] [T] [P] [S]

2 byte signed int

3

VT_I4

[V] [T] [P] [S]

4 byte signed int

4

VT_R4

[V] [T] [P] [S]

4 byte real

5

VT_R8

[V] [T] [P] [S]

8 byte real

6

VT_CY

[V] [T] [P] [S]

currency

7

VT_DATE

[V] [T] [P] [S]

date

8

VT_BSTR

[V] [T] [P] [S]

OLE Automation string

9

VT_DISPATCH

[V] [T] [P] [S]

IDispatch *

10

VT_ERROR

[V] [T] [S]

SCODE

11

VT_BOOL

[V] [T] [P] [S]

True=-1, False=0

12

VT_VARIANT

[V] [T] [P] [S]

VARIANT *

13

VT_UNKNOWN

[V] [T] [S]

IUnknown *

14

VT_DECIMAL

[V] [T] [S]

16 byte fixed point

16

VT_I1

[T]

signed char

17

VT_UI1

[V] [T] [P] [S]

unsigned char

18

VT_UI2

[T] [P]

unsigned short

19

VT_UI4

[T] [P]

unsigned short

20

VT_I8

[T] [P]

signed 64-bit int

21

VT_UI8

[T] [P]

unsigned 64-bit int

22

VT_INT

[T]

signed machine int

23

VT_UINT

[T]

unsigned machine int

24

VT_VOID

[T]

C style void

25

VT_HRESULT

[T]

Standard return type

26

VT_PTR

[T]

pointer type

27

VT_SAFEARRAY

[T]

(use VT_ARRAY in VARIANT)

28

VT_CARRAY

[T]

C style array

29

VT_USERDEFINED

[T]

user defined type

30

VT_LPSTR

[T] [P]

null terminated string

31

VT_LPWSTR

[T] [P]

wide null terminated string

64

VT_FILETIME

[P]

FILETIME

65

VT_BLOB

[P]

Length prefixed bytes

66

VT_STREAM

[P]

Name of the stream follows

67

VT_STORAGE

[P]

Name of the storage follows

68

VT_STREAMED_OBJECT

[P]

Stream contains an object

69

VT_STORED_OBJECT

[P]

Storage contains an object

70

VT_BLOB_OBJECT

[P]

Blob contains an object

71

VT_CF

[P]

Clipboard format

72

VT_CLSID

[P]

A Class ID

0x1000

VT_VECTOR

[P]

simple counted array

0x2000

VT_ARRAY

[V]

SAFEARRAY*

0x4000

VT_BYREF

[V]

void* for local use

0x8000

VT_RESERVED



0xFFFF

VT_ILLEGAL



0xFFF

VT_ILLEGALMASKED



0xFFF

VT_TYPEMASK



-
- - - -
- -

In order to assemble the HPSF description I used information publically - available on the Internet only. The references given below have been very - helpful. If you have any amendments or corrections, please let us know! - Thank you!

- -
    - -
  1. -

    In - Understanding OLE - documents, Ken Kyler gives an introduction to OLE2 - documents - and especially to property sets. He names the property names, types, and - IDs of the Summary Information and Document Summary Information - stream.

    -
  2. - -
  3. -

    The - ActiveX Programmer's - Reference at - http://www.dwam.net/docs/oleref/ - seems a little outdated, but that's what I have found.

    -
  4. - -
  5. -

    An overview of the VT_ types is in - Variant - Type Definitions.

    -
  6. - -
  7. -

    What is a FILETIME? The answer can be found - under , http://www.vbapi.com/ref/f/filetime.html or - http://www.cs.rpi.edu/courses/fall01/os/FILETIME.html. - In short: The FILETIME structure holds a date and time associated - with a file. The structure identifies a 64-bit integer specifying the - number of 100-nanosecond intervals which have passed since January 1, - 1601. This 64-bit value is split into the two dwords stored in the - structure.

    -
  8. - -
  9. -

    Information about the code page property in the - DocumentSummaryInformation stream is available at http://msdn.microsoft.com/library/default.asp?url=/library/en-us/stg/stg/property_id_1.asp.

    -
  10. - -
  11. -

    This documentation origins from the HPSF description available at http://www.rainer-klute.de/~klute/Software/poibrowser/doc/HPSF-Description.html.

    -
  12. -
-
-
- -
- - diff --git a/src/documentation/xdocs/hpsf/thumbnails.xml b/src/documentation/xdocs/hpsf/thumbnails.xml deleted file mode 100644 index 032736727b..0000000000 --- a/src/documentation/xdocs/hpsf/thumbnails.xml +++ /dev/null @@ -1,182 +0,0 @@ - - - - - -
- HPSF THUMBNAIL HOW-TO - - - -
- -
- -

Thumbnail information is stored as a VT_CF, or Thumbnail Variant. The - Thumbnail Variant is used to store various types of information in a - clipboard. The VT_CF can store information in formats for the Macintosh or - Windows clipboard.

- -

There are many types of data that can be copied to the clipboard, but the - only types of information needed for thumbnail manipulation are the image - formats.

- -

The VT_CF structure looks like this:

- - - - - - - - - - - - - - -
Element:Clipboard SizeClipboard Format TagClipboard Data
Size:32 bit unsigned integer (DWord)32 bit signed integer (DWord)variable length (byte array)
- -

The Clipboard Size refers to the size (in bytes) of Clipboard Data - (variable size) plus the Clipboard Format (four bytes).

- -

Clipboard Format Tag has four possible values:

- - - - - - - - - - - - - - - - - - - - - - - - - - - -
ValueIdentifierDescription
-1LCFTAG_WINDOWSa built-in Windows© clipboard format value
-2LCFTAG_MACINTOSHa Macintosh clipboard format value
-3LCFTAG_FMTIDa format identifier (FMTID) This is rarely used.
0LCFTAG_NODATANo data This is rarely used.
-
- - - -
- -

Windows clipboard data has four image formats for thumbnails:

- - - - - - - - - - - - - - - - - - - - - - - - - - - -
ValueIdentifierDescription
3CF_METAFILEPICTWindows metafile format - recommended
8CF_DIBDevice Independent Bitmap
14CF_ENHMETAFILEEnhanced Windows metafile format
2CF_BITMAPBitmap - Obsolete - Use CF_DIB instead
-
- -
- -

The most common format for thumbnails on the Windows platform is the - Windows metafile format. The Clipboard places and extra header in front of - a the standard Windows Metafile Format data.

- -

The Clipboard Data byte array looks like this when an image is stored in - Windows' Clipboard WMF format.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
IdentifierCF_METAFILEPICTmmwidthheighthandleWMF data
Size32 bit unsigned int16 bit unsigned(?) int16 bit unsigned(?) int16 bit unsigned(?) int16 bit unsigned(?) intbyte array - variable length
DescriptionClipboard WMFMapping ModeImage WidthImage Heighthandle to the WMF data array in memory, or 0standard WMF byte stream
-
- - -
-

FIXME: Describe the Device Independent Bitmap - format!

-
- - - -
-

FIXME: Describe the Macintosh clipboard formats!

-
- - -
- - diff --git a/src/documentation/xdocs/hpsf/todo.xml b/src/documentation/xdocs/hpsf/todo.xml deleted file mode 100644 index f62d9d373d..0000000000 --- a/src/documentation/xdocs/hpsf/todo.xml +++ /dev/null @@ -1,65 +0,0 @@ - - - - - -
- To Do - - - -
- -
- -

The following functionalities should be added to HPFS:

- -
    -
  1. -

    Add writing capability for property sets. Presently property sets can - be read only.

    -
  2. -
  3. -

    Add codepage support: Presently the bytes making out the string in a - property's value are interpreted using the platform's default character - set.

    -
  4. -
  5. -

    Add resource bundles to - org.apache.poi.hpsf.wellknown to ease - localizations. This would be useful for mapping standard property IDs to - localized strings. Example: The property ID 4 could be mapped to "Author" - in English or "Verfasser" in German.

    -
  6. -
  7. -

    Implement reading functionality for those property types that are not - yet supported. HPSF should return proper Java types instead of just byte - arrays.

    -
  8. -
  9. -

    Add WMF to java.awt.Image example code in Thumbnail - HOW TO.

    -
  10. -
-
- -
- - diff --git a/src/documentation/xdocs/hssf/alternatives.xml b/src/documentation/xdocs/hssf/alternatives.xml deleted file mode 100644 index d66f33288d..0000000000 --- a/src/documentation/xdocs/hssf/alternatives.xml +++ /dev/null @@ -1,114 +0,0 @@ - - - - -
- HSSF - Alternatives to HSSF - - - - -
- - -
-

- Maybe it's unwise to advertise your competitors but we believe - competition is good and we have the best support reading and - write Excel workbooks currently available. -

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
ProductURLDescription
JSC - jsc.tomschuetz.com - Reading, calculating standard and VBA functions with - Java at runtime. -
Formula One - ReportingEngines (division of Actuate Corporation) - An alternative to this project is to - buy the $5000 per server installation.
Visual Basic - www.microsoft.com - Give up XML and write Visual Basic code on a Microsoft Windows based - Environment or output in Microsoft's beta and primarily undocumented - XML for office format.
JExcelhttp://stareyes.homeip.net:8888Frequently unavailable. Little currently known about it's capabilities.
JWorkbookhttp://www.object-refinery.com/jworkbook/index.htmlThis effort supports Gnumeric and Excel, however the Excel part is done using POI anyway.
xlReaderhttp://www.sourceforge.net/projects/xlrdProvides decent support for reading Excel.
Excel ODBC Driverhttp://www.nwlink.com/~leewal/content/exceljavasample.htmODBC offers a somewhat wierd method for using Excel.
ExtenXLShttp://www.extentech.com/products/ExtenXLS/docs/intro3.jspCommercial library for reading, modifying and writing Excel spreadsheets. Not cheap but - certainly a lot more affordable than Formula 1. No idea as to it's quality.
J-Integra Java-Excel Bridgehttp://www.intrinsyc.com/products/bridging/jintegra.aspUses DCOM to an Excel instance on a windows machine.
Perl & C-There are a number of perl and C libraries, however none of them are consistent.
VistaJDBChttp://www.vistaportal.com/products/vistajdbc.htmVistaJDBC driver works with both StarOffice and Excel spreadsheets and - can access data using standard SQL statements without any API programming. - VistaJDBC also implemented ability to choose by not just rows and columns but by - specific cells, ranges of cells, etc. -
Coldtags Excel Tag Libraryhttp://www.servletsuite.com/servlets/exceltag.htm - This library outputs a simple CSV file, in which cells can - contain numbers or text. You could output a CSV file without its - help, but it gives a little more readability/structure to the code, and - could be extended to handle more complexity. When - you invoke one of these JSP pages from your browser, you open up an Excel - spreadsheet. There's no formatting, worksheets, or anything fancy like that. - So it's not strictly a competitor but it does the job. -
-
- -
diff --git a/src/documentation/xdocs/hssf/book.xml b/src/documentation/xdocs/hssf/book.xml deleted file mode 100644 index a36db153f6..0000000000 --- a/src/documentation/xdocs/hssf/book.xml +++ /dev/null @@ -1,29 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/hssf/chart.xml b/src/documentation/xdocs/hssf/chart.xml deleted file mode 100644 index 8b02b74601..0000000000 --- a/src/documentation/xdocs/hssf/chart.xml +++ /dev/null @@ -1,1506 +0,0 @@ - - - - -
- Chart record information - - - -
- -
-

- This document is intended as a work in progress for describing - our current understanding of how the chart records are are - written to produce a valid chart. -

-
-
-

- The following records detail the records written for a - 'simple' bar chart. -

- - - ============================================ - rectype = 0xec, recsize = 0xc8 - -BEGIN DUMP--------------------------------- - 00000000 0F 00 02 F0 C0 00 00 00 10 00 08 F0 08 00 00 00 ................ - 00000010 02 00 00 00 02 04 00 00 0F 00 03 F0 A8 00 00 00 ................ - 00000020 0F 00 04 F0 28 00 00 00 01 00 09 F0 10 00 00 00 ....(........... - 00000030 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000040 02 00 0A F0 08 00 00 00 00 04 00 00 05 00 00 00 ................ - 00000050 0F 00 04 F0 70 00 00 00 92 0C 0A F0 08 00 00 00 ....p........... - 00000060 02 04 00 00 00 0A 00 00 93 00 0B F0 36 00 00 00 ............6... - 00000070 7F 00 04 01 04 01 BF 00 08 00 08 00 81 01 4E 00 ..............N. - 00000080 00 08 83 01 4D 00 00 08 BF 01 10 00 11 00 C0 01 ....M........... - 00000090 4D 00 00 08 FF 01 08 00 08 00 3F 02 00 00 02 00 M.........?..... - 000000A0 BF 03 00 00 08 00 00 00 10 F0 12 00 00 00 00 00 ................ - 000000B0 04 00 C0 02 0A 00 F4 00 0E 00 66 01 20 00 E9 00 ..........f. ... - 000000C0 00 00 11 F0 00 00 00 00 ........ - -END DUMP----------------------------------- - recordid = 0xec, size =200 - [UNKNOWN RECORD:ec] - .id = ec - [/UNKNOWN RECORD] - - ============================================ - rectype = 0x5d, recsize = 0x1a - -BEGIN DUMP--------------------------------- - 00000000 15 00 12 00 05 00 02 00 11 60 00 00 00 00 B8 03 .........`...... - 00000010 87 03 00 00 00 00 00 00 00 00 .......... - -END DUMP----------------------------------- - recordid = 0x5d, size =26 - [UNKNOWN RECORD:5d] - .id = 5d - [/UNKNOWN RECORD] - - ============================================ - rectype = 0x809, recsize = 0x10 - -BEGIN DUMP--------------------------------- - 00000000 00 06 20 00 FE 1C CD 07 C9 40 00 00 06 01 00 00 .. ......@...... - -END DUMP----------------------------------- - recordid = 0x809, size =16 - [BOF RECORD] - .version = 600 - .type = 20 - .build = 1cfe - .buildyear = 1997 - .history = 40c9 - .requiredversion = 106 - [/BOF RECORD] - - ============================================ - rectype = 0x14, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x14, size =0 - [HEADER] - .length = 0 - .header = null - [/HEADER] - - ============================================ - rectype = 0x15, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x15, size =0 - [FOOTER] - .footerlen = 0 - .footer = null - [/FOOTER] - - ============================================ - rectype = 0x83, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 00 00 .. - -END DUMP----------------------------------- - recordid = 0x83, size =2 - [HCENTER] - .hcenter = false - [/HCENTER] - - ============================================ - rectype = 0x84, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 00 00 .. - -END DUMP----------------------------------- - recordid = 0x84, size =2 - [VCENTER] - .vcenter = false - [/VCENTER] - - ============================================ - rectype = 0xa1, recsize = 0x22 - -BEGIN DUMP--------------------------------- - 00000000 00 00 12 00 01 00 01 00 01 00 04 00 00 00 B8 03 ................ - 00000010 00 00 00 00 00 00 E0 3F 00 00 00 00 00 00 E0 3F .......?.......? - 00000020 0F 00 .. - -END DUMP----------------------------------- - recordid = 0xa1, size =34 - [PRINTSETUP] - .papersize = 0 - .scale = 18 - .pagestart = 1 - .fitwidth = 1 - .fitheight = 1 - .options = 4 - .ltor = false - .landscape = false - .valid = true - .mono = false - .draft = false - .notes = false - .noOrientat = false - .usepage = false - .hresolution = 0 - .vresolution = 952 - .headermargin = 0.5 - .footermargin = 0.5 - .copies = 15 - [/PRINTSETUP] - - ============================================ - rectype = 0x33, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 03 00 .. - -END DUMP----------------------------------- - recordid = 0x33, size =2 - [UNKNOWN RECORD:33] - .id = 33 - [/UNKNOWN RECORD] - - ============================================ - rectype = 0x1060, recsize = 0xa - -BEGIN DUMP--------------------------------- - 00000000 A0 23 08 16 C8 00 00 00 05 00 .#........ - -END DUMP----------------------------------- - recordid = 0x1060, size =10 - [FBI] - .xBasis = 0x23A0 (9120 ) - .yBasis = 0x1608 (5640 ) - .heightBasis = 0x00C8 (200 ) - .scale = 0x0000 (0 ) - .indexToFontTable = 0x0005 (5 ) - [/FBI] - - ============================================ - rectype = 0x1060, recsize = 0xa - -BEGIN DUMP--------------------------------- - 00000000 A0 23 08 16 C8 00 01 00 06 00 .#........ - -END DUMP----------------------------------- - recordid = 0x1060, size =10 - [FBI] - .xBasis = 0x23A0 (9120 ) - .yBasis = 0x1608 (5640 ) - .heightBasis = 0x00C8 (200 ) - .scale = 0x0001 (1 ) - .indexToFontTable = 0x0006 (6 ) - [/FBI] - - ============================================ - rectype = 0x12, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 00 00 .. - -END DUMP----------------------------------- - recordid = 0x12, size =2 - [PROTECT] - .rowheight = 0 - [/PROTECT] - - ============================================ - Offset 0xf22 (3874) - rectype = 0x1001, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 00 00 .. - -END DUMP----------------------------------- - recordid = 0x1001, size =2 - [UNITS] - .units = 0x0000 (0 ) - [/UNITS] - - ============================================ - Offset 0xf28 (3880) - rectype = 0x1002, recsize = 0x10 - -BEGIN DUMP--------------------------------- - 00000000 00 00 00 00 00 00 00 00 58 66 D0 01 40 66 22 01 ........Xf..@f". - -END DUMP----------------------------------- - recordid = 0x1002, size =16 - [CHART] - .x = 0x00000000 (0 ) - .y = 0x00000000 (0 ) - .width = 0x01D06658 (30434904 ) - .height = 0x01226640 (19031616 ) - [/CHART] - - ============================================ - Offset 0xf3c (3900) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0xf40 (3904) - rectype = 0xa0, recsize = 0x4 - -BEGIN DUMP--------------------------------- - 00000000 01 00 01 00 .... - -END DUMP----------------------------------- - recordid = 0xa0, size =4 - [SCL] - .numerator = 0x0001 (1 ) - .denominator = 0x0001 (1 ) - [/SCL] - - ============================================ - Offset 0xf48 (3912) - rectype = 0x1064, recsize = 0x8 - -BEGIN DUMP--------------------------------- - 00000000 00 00 01 00 00 00 01 00 ........ - -END DUMP----------------------------------- - recordid = 0x1064, size =8 - [PLOTGROWTH] - .horizontalScale = 0x00010000 (65536 ) - .verticalScale = 0x00010000 (65536 ) - [/PLOTGROWTH] - - ============================================ - Offset 0xf54 (3924) - rectype = 0x1032, recsize = 0x4 - -BEGIN DUMP--------------------------------- - 00000000 00 00 02 00 .... - -END DUMP----------------------------------- - recordid = 0x1032, size =4 - [FRAME] - .borderType = 0x0000 (0 ) - .options = 0x0002 (2 ) - .autoSize = false - .autoPosition = true - [/FRAME] - - ============================================ - Offset 0xf5c (3932) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0xf60 (3936) - rectype = 0x1007, recsize = 0xc - -BEGIN DUMP--------------------------------- - 00000000 00 00 00 00 00 00 FF FF 09 00 4D 00 ..........M. - -END DUMP----------------------------------- - recordid = 0x1007, size =12 - [LINEFORMAT] - .lineColor = 0x00000000 (0 ) - .linePattern = 0x0000 (0 ) - .weight = 0xFFFF (-1 ) - .format = 0x0009 (9 ) - .auto = true - .drawTicks = false - .unknown = false - .colourPaletteIndex = 0x004D (77 ) - [/LINEFORMAT] - - ============================================ - Offset 0xf70 (3952) - rectype = 0x100a, recsize = 0x10 - -BEGIN DUMP--------------------------------- - 00000000 FF FF FF 00 00 00 00 00 01 00 01 00 4E 00 4D 00 ............N.M. - -END DUMP----------------------------------- - recordid = 0x100a, size =16 - [AREAFORMAT] - .foregroundColor = 0x00FFFFFF (16777215 ) - .backgroundColor = 0x00000000 (0 ) - .pattern = 0x0001 (1 ) - .formatFlags = 0x0001 (1 ) - .automatic = true - .invert = false - .forecolorIndex = 0x004E (78 ) - .backcolorIndex = 0x004D (77 ) - [/AREAFORMAT] - - ============================================ - Offset 0xf84 (3972) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0xf88 (3976) - rectype = 0x1003, recsize = 0xc - -BEGIN DUMP--------------------------------- - 00000000 01 00 01 00 20 00 1F 00 01 00 00 00 .... ....... - -END DUMP----------------------------------- - recordid = 0x1003, size =12 - [SERIES] - .categoryDataType = 0x0001 (1 ) - .valuesDataType = 0x0001 (1 ) - .numCategories = 0x0020 (32 ) - .numValues = 0x001F (31 ) - .bubbleSeriesType = 0x0001 (1 ) - .numBubbleValues = 0x0000 (0 ) - [/SERIES] - - ============================================ - Offset 0xf98 (3992) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0xf9c (3996) - rectype = 0x1051, recsize = 0x8 - -BEGIN DUMP--------------------------------- - 00000000 00 01 00 00 00 00 00 00 ........ - -END DUMP----------------------------------- - recordid = 0x1051, size =8 - [AI] - .linkType = 0x00 (0 ) - .referenceType = 0x01 (1 ) - .options = 0x0000 (0 ) - .customNumberFormat = false - .indexNumberFmtRecord = 0x0000 (0 ) - .formulaOfLink = (org.apache.poi.hssf.record.LinkedDataFormulaField@1ee3914 ) - [/AI] - - ============================================ - Offset 0xfa8 (4008) - rectype = 0x1051, recsize = 0x13 - -BEGIN DUMP--------------------------------- - 00000000 01 02 00 00 00 00 0B 00 3B 00 00 00 00 1E 00 01 ........;....... - 00000010 00 01 00 ... - -END DUMP----------------------------------- - recordid = 0x1051, size =19 - [AI] - .linkType = 0x01 (1 ) - .referenceType = 0x02 (2 ) - .options = 0x0000 (0 ) - .customNumberFormat = false - .indexNumberFmtRecord = 0x0000 (0 ) - .formulaOfLink = (org.apache.poi.hssf.record.LinkedDataFormulaField@e5855a ) - [/AI] - - ============================================ - Offset 0xfbf (4031) - rectype = 0x1051, recsize = 0x13 - -BEGIN DUMP--------------------------------- - 00000000 02 02 00 00 69 01 0B 00 3B 00 00 00 00 1F 00 00 ....i...;....... - 00000010 00 00 00 ... - -END DUMP----------------------------------- - recordid = 0x1051, size =19 - [AI] - .linkType = 0x02 (2 ) - .referenceType = 0x02 (2 ) - .options = 0x0000 (0 ) - .customNumberFormat = false - .indexNumberFmtRecord = 0x0169 (361 ) - .formulaOfLink = (org.apache.poi.hssf.record.LinkedDataFormulaField@95fd19 ) - [/AI] - - ============================================ - Offset 0xfd6 (4054) - rectype = 0x1051, recsize = 0x8 - -BEGIN DUMP--------------------------------- - 00000000 03 01 00 00 00 00 00 00 ........ - -END DUMP----------------------------------- - recordid = 0x1051, size =8 - [AI] - .linkType = 0x03 (3 ) - .referenceType = 0x01 (1 ) - .options = 0x0000 (0 ) - .customNumberFormat = false - .indexNumberFmtRecord = 0x0000 (0 ) - .formulaOfLink = (org.apache.poi.hssf.record.LinkedDataFormulaField@11b9fb1 ) - [/AI] - - ============================================ - Offset 0xfe2 (4066) - rectype = 0x1006, recsize = 0x8 - -BEGIN DUMP--------------------------------- - 00000000 FF FF 00 00 00 00 00 00 ........ - -END DUMP----------------------------------- - recordid = 0x1006, size =8 - [DATAFORMAT] - .pointNumber = 0xFFFF (-1 ) - .seriesIndex = 0x0000 (0 ) - .seriesNumber = 0x0000 (0 ) - .formatFlags = 0x0000 (0 ) - .useExcel4Colors = false - [/DATAFORMAT] - - ============================================ - Offset 0xfee (4078) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0xff2 (4082) - rectype = 0x105f, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 00 00 .. - -END DUMP----------------------------------- - recordid = 0x105f, size =2 - [UNKNOWN RECORD] - .id = 105f - [/UNKNOWN RECORD] - - ============================================ - Offset 0xff8 (4088) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0xffc (4092) - rectype = 0x1045, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 00 00 .. - -END DUMP----------------------------------- - recordid = 0x1045, size =2 - [SeriesToChartGroup] - .chartGroupIndex = 0x0000 (0 ) - [/SeriesToChartGroup] - - ============================================ - Offset 0x1002 (4098) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x1006 (4102) - rectype = 0x1044, recsize = 0x4 - -BEGIN DUMP--------------------------------- - 00000000 0A 00 00 00 .... - -END DUMP----------------------------------- - recordid = 0x1044, size =4 - [SHTPROPS] - .flags = 0x000A (10 ) - .chartTypeManuallyFormatted = false - .plotVisibleOnly = true - .doNotSizeWithWindow = false - .defaultPlotDimensions = true - .autoPlotArea = false - .empty = 0x00 (0 ) - [/SHTPROPS] - - ============================================ - Offset 0x100e (4110) - rectype = 0x1024, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 02 00 .. - -END DUMP----------------------------------- - recordid = 0x1024, size =2 - [DEFAULTTEXT] - .categoryDataType = 0x0002 (2 ) - [/DEFAULTTEXT] - - ============================================ - Offset 0x1014 (4116) - rectype = 0x1025, recsize = 0x20 - -BEGIN DUMP--------------------------------- - 00000000 02 02 01 00 00 00 00 00 DB FF FF FF C4 FF FF FF ................ - 00000010 00 00 00 00 00 00 00 00 B1 00 4D 00 50 2B 00 00 ..........M.P+.. - -END DUMP----------------------------------- - recordid = 0x1025, size =32 - [TEXT] - .horizontalAlignment = 0x02 (2 ) - .verticalAlignment = 0x02 (2 ) - .displayMode = 0x0001 (1 ) - .rgbColor = 0x00000000 (0 ) - .x = 0xFFFFFFDB (-37 ) - .y = 0xFFFFFFC4 (-60 ) - .width = 0x00000000 (0 ) - .height = 0x00000000 (0 ) - .options1 = 0x00B1 (177 ) - .autoColor = true - .showKey = false - .showValue = false - .vertical = false - .autoGeneratedText = true - .generated = true - .autoLabelDeleted = false - .autoBackground = true - .rotation = 0 - .showCategoryLabelAsPercentage = false - .showValueAsPercentage = false - .showBubbleSizes = false - .showLabel = false - .indexOfColorValue = 0x004D (77 ) - .options2 = 0x2B50 (11088 ) - .dataLabelPlacement = 0 - .textRotation = 0x0000 (0 ) - [/TEXT] - - ============================================ - Offset 0x1038 (4152) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0x103c (4156) - rectype = 0x104f, recsize = 0x14 - -BEGIN DUMP--------------------------------- - 00000000 02 00 02 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000010 00 00 00 00 .... - -END DUMP----------------------------------- - recordid = 0x104f, size =20 - [UNKNOWN RECORD] - .id = 104f - [/UNKNOWN RECORD] - - ============================================ - Offset 0x1054 (4180) - rectype = 0x1026, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 05 00 .. - -END DUMP----------------------------------- - recordid = 0x1026, size =2 - [FONTX] - .fontIndex = 0x0005 (5 ) - [/FONTX] - - ============================================ - Offset 0x105a (4186) - rectype = 0x1051, recsize = 0x8 - -BEGIN DUMP--------------------------------- - 00000000 00 01 00 00 00 00 00 00 ........ - -END DUMP----------------------------------- - recordid = 0x1051, size =8 - [AI] - .linkType = 0x00 (0 ) - .referenceType = 0x01 (1 ) - .options = 0x0000 (0 ) - .customNumberFormat = false - .indexNumberFmtRecord = 0x0000 (0 ) - .formulaOfLink = (org.apache.poi.hssf.record.LinkedDataFormulaField@913fe2 ) - [/AI] - - ============================================ - Offset 0x1066 (4198) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x106a (4202) - rectype = 0x1024, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 03 00 .. - -END DUMP----------------------------------- - recordid = 0x1024, size =2 - [DEFAULTTEXT] - .categoryDataType = 0x0003 (3 ) - [/DEFAULTTEXT] - - ============================================ - Offset 0x1070 (4208) - rectype = 0x1025, recsize = 0x20 - -BEGIN DUMP--------------------------------- - 00000000 02 02 01 00 00 00 00 00 DB FF FF FF C4 FF FF FF ................ - 00000010 00 00 00 00 00 00 00 00 B1 00 4D 00 50 2B 00 00 ..........M.P+.. - -END DUMP----------------------------------- - recordid = 0x1025, size =32 - [TEXT] - .horizontalAlignment = 0x02 (2 ) - .verticalAlignment = 0x02 (2 ) - .displayMode = 0x0001 (1 ) - .rgbColor = 0x00000000 (0 ) - .x = 0xFFFFFFDB (-37 ) - .y = 0xFFFFFFC4 (-60 ) - .width = 0x00000000 (0 ) - .height = 0x00000000 (0 ) - .options1 = 0x00B1 (177 ) - .autoColor = true - .showKey = false - .showValue = false - .vertical = false - .autoGeneratedText = true - .generated = true - .autoLabelDeleted = false - .autoBackground = true - .rotation = 0 - .showCategoryLabelAsPercentage = false - .showValueAsPercentage = false - .showBubbleSizes = false - .showLabel = false - .indexOfColorValue = 0x004D (77 ) - .options2 = 0x2B50 (11088 ) - .dataLabelPlacement = 0 - .textRotation = 0x0000 (0 ) - [/TEXT] - - ============================================ - Offset 0x1094 (4244) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0x1098 (4248) - rectype = 0x104f, recsize = 0x14 - -BEGIN DUMP--------------------------------- - 00000000 02 00 02 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000010 00 00 00 00 .... - -END DUMP----------------------------------- - recordid = 0x104f, size =20 - [UNKNOWN RECORD] - .id = 104f - [/UNKNOWN RECORD] - - ============================================ - Offset 0x10b0 (4272) - rectype = 0x1026, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 06 00 .. - -END DUMP----------------------------------- - recordid = 0x1026, size =2 - [FONTX] - .fontIndex = 0x0006 (6 ) - [/FONTX] - - ============================================ - Offset 0x10b6 (4278) - rectype = 0x1051, recsize = 0x8 - -BEGIN DUMP--------------------------------- - 00000000 00 01 00 00 00 00 00 00 ........ - -END DUMP----------------------------------- - recordid = 0x1051, size =8 - [AI] - .linkType = 0x00 (0 ) - .referenceType = 0x01 (1 ) - .options = 0x0000 (0 ) - .customNumberFormat = false - .indexNumberFmtRecord = 0x0000 (0 ) - .formulaOfLink = (org.apache.poi.hssf.record.LinkedDataFormulaField@1f934ad ) - [/AI] - - ============================================ - Offset 0x10c2 (4290) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x10c6 (4294) - rectype = 0x1046, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 01 00 .. - -END DUMP----------------------------------- - recordid = 0x1046, size =2 - [AXISUSED] - .numAxis = 0x0001 (1 ) - [/AXISUSED] - - ============================================ - Offset 0x10cc (4300) - rectype = 0x1041, recsize = 0x12 - -BEGIN DUMP--------------------------------- - 00000000 00 00 DF 01 00 00 DD 00 00 00 B3 0B 00 00 56 0B ..............V. - 00000010 00 00 .. - -END DUMP----------------------------------- - recordid = 0x1041, size =18 - [AXISPARENT] - .axisType = 0x0000 (0 ) - .x = 0x000001DF (479 ) - .y = 0x000000DD (221 ) - .width = 0x00000BB3 (2995 ) - .height = 0x00000B56 (2902 ) - [/AXISPARENT] - - ============================================ - Offset 0x10e2 (4322) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0x10e6 (4326) - rectype = 0x104f, recsize = 0x14 - -BEGIN DUMP--------------------------------- - 00000000 02 00 02 00 3A 00 00 00 5E 00 00 00 58 0D 00 00 ....:...^...X... - 00000010 E5 0E 00 00 .... - -END DUMP----------------------------------- - recordid = 0x104f, size =20 - [UNKNOWN RECORD] - .id = 104f - [/UNKNOWN RECORD] - - ============================================ - Offset 0x10fe (4350) - rectype = 0x101d, recsize = 0x12 - -BEGIN DUMP--------------------------------- - 00000000 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000010 00 00 .. - -END DUMP----------------------------------- - recordid = 0x101d, size =18 - [AXIS] - .axisType = 0x0000 (0 ) - .reserved1 = 0x00000000 (0 ) - .reserved2 = 0x00000000 (0 ) - .reserved3 = 0x00000000 (0 ) - .reserved4 = 0x00000000 (0 ) - [/AXIS] - - ============================================ - Offset 0x1114 (4372) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0x1118 (4376) - rectype = 0x1020, recsize = 0x8 - -BEGIN DUMP--------------------------------- - 00000000 01 00 01 00 01 00 01 00 ........ - -END DUMP----------------------------------- - recordid = 0x1020, size =8 - [CATSERRANGE] - .crossingPoint = 0x0001 (1 ) - .labelFrequency = 0x0001 (1 ) - .tickMarkFrequency = 0x0001 (1 ) - .options = 0x0001 (1 ) - .valueAxisCrossing = true - .crossesFarRight = false - .reversed = false - [/CATSERRANGE] - - ============================================ - Offset 0x1124 (4388) - rectype = 0x1062, recsize = 0x12 - -BEGIN DUMP--------------------------------- - 00000000 1C 90 39 90 02 00 00 00 01 00 00 00 00 00 1C 90 ..9............. - 00000010 FF 00 .. - -END DUMP----------------------------------- - recordid = 0x1062, size =18 - [AXCEXT] - .minimumCategory = 0x901C (-28644 ) - .maximumCategory = 0x9039 (-28615 ) - .majorUnitValue = 0x0002 (2 ) - .majorUnit = 0x0000 (0 ) - .minorUnitValue = 0x0001 (1 ) - .minorUnit = 0x0000 (0 ) - .baseUnit = 0x0000 (0 ) - .crossingPoint = 0x901C (-28644 ) - .options = 0x00FF (255 ) - .defaultMinimum = true - .defaultMaximum = true - .defaultMajor = true - .defaultMinorUnit = true - .isDate = true - .defaultBase = true - .defaultCross = true - .defaultDateSettings = true - [/AXCEXT] - - ============================================ - Offset 0x113a (4410) - rectype = 0x101e, recsize = 0x1e - -BEGIN DUMP--------------------------------- - 00000000 02 00 03 01 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000010 00 00 00 00 00 00 00 00 23 00 4D 00 2D 00 ........#.M.-. - -END DUMP----------------------------------- - recordid = 0x101e, size =30 - [TICK] - .majorTickType = 0x02 (2 ) - .minorTickType = 0x00 (0 ) - .labelPosition = 0x03 (3 ) - .background = 0x01 (1 ) - .labelColorRgb = 0x00000000 (0 ) - .zero1 = 0x0000 (0 ) - .zero2 = 0x0000 (0 ) - .options = 0x0023 (35 ) - .autoTextColor = true - .autoTextBackground = true - .rotation = 0 - .autorotate = true - .tickColor = 0x004D (77 ) - .zero3 = 0x002D (45 ) - [/TICK] - - ============================================ - Offset 0x115c (4444) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x1160 (4448) - rectype = 0x101d, recsize = 0x12 - -BEGIN DUMP--------------------------------- - 00000000 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000010 00 00 .. - -END DUMP----------------------------------- - recordid = 0x101d, size =18 - [AXIS] - .axisType = 0x0001 (1 ) - .reserved1 = 0x00000000 (0 ) - .reserved2 = 0x00000000 (0 ) - .reserved3 = 0x00000000 (0 ) - .reserved4 = 0x00000000 (0 ) - [/AXIS] - - ============================================ - Offset 0x1176 (4470) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0x117a (4474) - rectype = 0x101f, recsize = 0x2a - -BEGIN DUMP--------------------------------- - 00000000 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000010 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000020 00 00 00 00 00 00 00 00 1F 01 .......... - -END DUMP----------------------------------- - recordid = 0x101f, size =42 - [VALUERANGE] - .minimumAxisValue = (0.0 ) - .maximumAxisValue = (0.0 ) - .majorIncrement = (0.0 ) - .minorIncrement = (0.0 ) - .categoryAxisCross = (0.0 ) - .options = 0x011F (287 ) - .automaticMinimum = true - .automaticMaximum = true - .automaticMajor = true - .automaticMinor = true - .automaticCategoryCrossing = true - .logarithmicScale = false - .valuesInReverse = false - .crossCategoryAxisAtMaximum = false - .reserved = true - [/VALUERANGE] - - ============================================ - Offset 0x11a8 (4520) - rectype = 0x101e, recsize = 0x1e - -BEGIN DUMP--------------------------------- - 00000000 02 00 03 01 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000010 00 00 00 00 00 00 00 00 23 00 4D 00 00 00 ........#.M... - -END DUMP----------------------------------- - recordid = 0x101e, size =30 - [TICK] - .majorTickType = 0x02 (2 ) - .minorTickType = 0x00 (0 ) - .labelPosition = 0x03 (3 ) - .background = 0x01 (1 ) - .labelColorRgb = 0x00000000 (0 ) - .zero1 = 0x0000 (0 ) - .zero2 = 0x0000 (0 ) - .options = 0x0023 (35 ) - .autoTextColor = true - .autoTextBackground = true - .rotation = 0 - .autorotate = true - .tickColor = 0x004D (77 ) - .zero3 = 0x0000 (0 ) - [/TICK] - - ============================================ - Offset 0x11ca (4554) - rectype = 0x1021, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 01 00 .. - -END DUMP----------------------------------- - recordid = 0x1021, size =2 - [AXISLINEFORMAT] - .axisType = 0x0001 (1 ) - [/AXISLINEFORMAT] - - ============================================ - Offset 0x11d0 (4560) - rectype = 0x1007, recsize = 0xc - -BEGIN DUMP--------------------------------- - 00000000 00 00 00 00 00 00 FF FF 09 00 4D 00 ..........M. - -END DUMP----------------------------------- - recordid = 0x1007, size =12 - [LINEFORMAT] - .lineColor = 0x00000000 (0 ) - .linePattern = 0x0000 (0 ) - .weight = 0xFFFF (-1 ) - .format = 0x0009 (9 ) - .auto = true - .drawTicks = false - .unknown = false - .colourPaletteIndex = 0x004D (77 ) - [/LINEFORMAT] - - ============================================ - Offset 0x11e0 (4576) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x11e4 (4580) - rectype = 0x1035, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1035, size =0 - [PLOTAREA] - [/PLOTAREA] - - ============================================ - Offset 0x11e8 (4584) - rectype = 0x1032, recsize = 0x4 - -BEGIN DUMP--------------------------------- - 00000000 00 00 03 00 .... - -END DUMP----------------------------------- - recordid = 0x1032, size =4 - [FRAME] - .borderType = 0x0000 (0 ) - .options = 0x0003 (3 ) - .autoSize = true - .autoPosition = true - [/FRAME] - - ============================================ - Offset 0x11f0 (4592) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0x11f4 (4596) - rectype = 0x1007, recsize = 0xc - -BEGIN DUMP--------------------------------- - 00000000 80 80 80 00 00 00 00 00 00 00 17 00 ............ - -END DUMP----------------------------------- - recordid = 0x1007, size =12 - [LINEFORMAT] - .lineColor = 0x00808080 (8421504 ) - .linePattern = 0x0000 (0 ) - .weight = 0x0000 (0 ) - .format = 0x0000 (0 ) - .auto = false - .drawTicks = false - .unknown = false - .colourPaletteIndex = 0x0017 (23 ) - [/LINEFORMAT] - - ============================================ - Offset 0x1204 (4612) - rectype = 0x100a, recsize = 0x10 - -BEGIN DUMP--------------------------------- - 00000000 C0 C0 C0 00 00 00 00 00 01 00 00 00 16 00 4F 00 ..............O. - -END DUMP----------------------------------- - recordid = 0x100a, size =16 - [AREAFORMAT] - .foregroundColor = 0x00C0C0C0 (12632256 ) - .backgroundColor = 0x00000000 (0 ) - .pattern = 0x0001 (1 ) - .formatFlags = 0x0000 (0 ) - .automatic = false - .invert = false - .forecolorIndex = 0x0016 (22 ) - .backcolorIndex = 0x004F (79 ) - [/AREAFORMAT] - - ============================================ - Offset 0x1218 (4632) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x121c (4636) - rectype = 0x1014, recsize = 0x14 - -BEGIN DUMP--------------------------------- - 00000000 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000010 00 00 00 00 .... - -END DUMP----------------------------------- - recordid = 0x1014, size =20 - [CHARTFORMAT] - .xPosition = 0 - .yPosition = 0 - .width = 0 - .height = 0 - .grBit = 0 - [/CHARTFORMAT] - - ============================================ - Offset 0x1234 (4660) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0x1238 (4664) - rectype = 0x1017, recsize = 0x6 - -BEGIN DUMP--------------------------------- - 00000000 00 00 96 00 00 00 ...... - -END DUMP----------------------------------- - recordid = 0x1017, size =6 - [BAR] - .barSpace = 0x0000 (0 ) - .categorySpace = 0x0096 (150 ) - .formatFlags = 0x0000 (0 ) - .horizontal = false - .stacked = false - .displayAsPercentage = false - .shadow = false - [/BAR] - - ============================================ - Offset 0x1242 (4674) - rectype = 0x1022, recsize = 0xa - -BEGIN DUMP--------------------------------- - 00000000 00 00 00 00 00 00 00 00 0F 00 .......... - -END DUMP----------------------------------- - recordid = 0x1022, size =10 - [UNKNOWN RECORD] - .id = 1022 - [/UNKNOWN RECORD] - - ============================================ - Offset 0x1250 (4688) - rectype = 0x1015, recsize = 0x14 - -BEGIN DUMP--------------------------------- - 00000000 D6 0D 00 00 1E 06 00 00 B5 01 00 00 D5 00 00 00 ................ - 00000010 03 01 1F 00 .... - -END DUMP----------------------------------- - recordid = 0x1015, size =20 - [LEGEND] - .xAxisUpperLeft = 0x00000DD6 (3542 ) - .yAxisUpperLeft = 0x0000061E (1566 ) - .xSize = 0x000001B5 (437 ) - .ySize = 0x000000D5 (213 ) - .type = 0x03 (3 ) - .spacing = 0x01 (1 ) - .options = 0x001F (31 ) - .autoPosition = true - .autoSeries = true - .autoXPositioning = true - .autoYPositioning = true - .vertical = true - .dataTable = false - [/LEGEND] - - ============================================ - Offset 0x1268 (4712) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0x126c (4716) - rectype = 0x104f, recsize = 0x14 - -BEGIN DUMP--------------------------------- - 00000000 05 00 02 00 D6 0D 00 00 1E 06 00 00 00 00 00 00 ................ - 00000010 00 00 00 00 .... - -END DUMP----------------------------------- - recordid = 0x104f, size =20 - [UNKNOWN RECORD] - .id = 104f - [/UNKNOWN RECORD] - - ============================================ - Offset 0x1284 (4740) - rectype = 0x1025, recsize = 0x20 - -BEGIN DUMP--------------------------------- - 00000000 02 02 01 00 00 00 00 00 DB FF FF FF C4 FF FF FF ................ - 00000010 00 00 00 00 00 00 00 00 B1 00 4D 00 70 37 00 00 ..........M.p7.. - -END DUMP----------------------------------- - recordid = 0x1025, size =32 - [TEXT] - .horizontalAlignment = 0x02 (2 ) - .verticalAlignment = 0x02 (2 ) - .displayMode = 0x0001 (1 ) - .rgbColor = 0x00000000 (0 ) - .x = 0xFFFFFFDB (-37 ) - .y = 0xFFFFFFC4 (-60 ) - .width = 0x00000000 (0 ) - .height = 0x00000000 (0 ) - .options1 = 0x00B1 (177 ) - .autoColor = true - .showKey = false - .showValue = false - .vertical = false - .autoGeneratedText = true - .generated = true - .autoLabelDeleted = false - .autoBackground = true - .rotation = 0 - .showCategoryLabelAsPercentage = false - .showValueAsPercentage = false - .showBubbleSizes = false - .showLabel = false - .indexOfColorValue = 0x004D (77 ) - .options2 = 0x3770 (14192 ) - .dataLabelPlacement = 0 - .textRotation = 0x0000 (0 ) - [/TEXT] - - ============================================ - Offset 0x12a8 (4776) - rectype = 0x1033, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1033, size =0 - [BEGIN] - [/BEGIN] - - ============================================ - Offset 0x12ac (4780) - rectype = 0x104f, recsize = 0x14 - -BEGIN DUMP--------------------------------- - 00000000 02 00 02 00 00 00 00 00 00 00 00 00 00 00 00 00 ................ - 00000010 00 00 00 00 .... - -END DUMP----------------------------------- - recordid = 0x104f, size =20 - [UNKNOWN RECORD] - .id = 104f - [/UNKNOWN RECORD] - - ============================================ - Offset 0x12c4 (4804) - rectype = 0x1051, recsize = 0x8 - -BEGIN DUMP--------------------------------- - 00000000 00 01 00 00 00 00 00 00 ........ - -END DUMP----------------------------------- - recordid = 0x1051, size =8 - [AI] - .linkType = 0x00 (0 ) - .referenceType = 0x01 (1 ) - .options = 0x0000 (0 ) - .customNumberFormat = false - .indexNumberFmtRecord = 0x0000 (0 ) - .formulaOfLink = (org.apache.poi.hssf.record.LinkedDataFormulaField@1d05c81 ) - [/AI] - - ============================================ - Offset 0x12d0 (4816) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x12d4 (4820) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x12d8 (4824) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x12dc (4828) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - Offset 0x12e0 (4832) - rectype = 0x1034, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0x1034, size =0 - [END] - [/END] - - ============================================ - rectype = 0x200, recsize = 0xe - -BEGIN DUMP--------------------------------- - 00000000 00 00 00 00 1F 00 00 00 00 00 01 00 00 00 .............. - -END DUMP----------------------------------- - recordid = 0x200, size =14 - [DIMENSIONS] - .firstrow = 0 - .lastrow = 1f - .firstcol = 0 - .lastcol = 1 - .zero = 0 - [/DIMENSIONS] - - ============================================ - rectype = 0x1065, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 02 00 .. - -END DUMP----------------------------------- - recordid = 0x1065, size =2 - [SINDEX] - .index = 0x0002 (2 ) - [/SINDEX] - - ============================================ - rectype = 0x1065, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 01 00 .. - -END DUMP----------------------------------- - recordid = 0x1065, size =2 - [SINDEX] - .index = 0x0001 (1 ) - [/SINDEX] - - ============================================ - rectype = 0x1065, recsize = 0x2 - -BEGIN DUMP--------------------------------- - 00000000 03 00 .. - -END DUMP----------------------------------- - recordid = 0x1065, size =2 - [SINDEX] - .index = 0x0003 (3 ) - [/SINDEX] - - ============================================ - rectype = 0xa, recsize = 0x0 - -BEGIN DUMP--------------------------------- - **NO RECORD DATA** - -END DUMP----------------------------------- - recordid = 0xa, size =0 - [EOF] - [/EOF] - - - -

- The next section breaks those records down into an easier - to read format: -

- -[UNKNOWN RECORD:ec] -[UNKNOWN RECORD:5d] -[BOF RECORD] - [HEADER] - [FOOTER] - [HCENTER] - [VCENTER] - [PRINTSETUP] - [UNKNOWN RECORD:33] - [FBI] - [FBI] - [PROTECT] - [UNITS] - [CHART] - [BEGIN] - [SCL] // zoom magnification - [PLOTGROWTH] // font scaling - [FRAME] // border around text - [BEGIN] // default line and area format - [LINEFORMAT] - [AREAFORMAT] - [END] - [SERIES] // start of series - [BEGIN] - [AI] // LINK_TYPE_TITLE_OR_TEXT - [AI] // LINK_TYPE_VALUES - [AI] // LINK_TYPE_CATEGORIES - [AI] // ?? - [DATAFORMAT] // Formatting applies to series? - [BEGIN] // ?? - [UNKNOWN RECORD] - [END] - [SeriesToChartGroup] // Used to support > 1 chart? - [END] - [SHTPROPS] // Some defaults for how chart is displayed. - [DEFAULTTEXT] // Describes the characteristics of the next - // record - [TEXT] // Details of the text that follows in the - // next section - [BEGIN] - [UNKNOWN RECORD] // POS record... looks like I missed this one. - // docs seem to indicate it's better to use - // defaults... - [FONTX] // index to font record. - [AI] // link to text? seems to be linking to nothing - [END] - [DEFAULTTEXT] // contains a category type of 3 which is not - // documented (sigh). - [TEXT] // defines position, color etc for text on chart. - [BEGIN] - [UNKNOWN RECORD] // Another pos record - [FONTX] // font - [AI] // reference type is DIRECT (not sure what this - // means) - [END] - [AXISUSED] // number of axis on the chart. - [AXISPARENT] // axis size and location - [BEGIN] // beginning of axis details - [UNKNOWN RECORD] // Another pos record. - [AXIS] // Category axis - [BEGIN] - [CATSERRANGE] // defines tick marks and other stuff - [AXCEXT] // unit information - [TICK] // tick formating characteristics - [END] - [AXIS] // Value axis - [BEGIN] - [VALUERANGE] // defines tick marks and other stuff - [TICK] // tick formating characteristics - [AXISLINEFORMAT] // major grid line axis format - [LINEFORMAT] // what do the lines look like? - [END] - [PLOTAREA] // marks that the frame following belongs - // to the frame. - [FRAME] // border - [BEGIN] - [LINEFORMAT] // border line - [AREAFORMAT] // border area - [END] - [CHARTFORMAT] // marks a chart group - [BEGIN] - [BAR] // indicates a bar chart - [UNKNOWN RECORD] // apparently this record is ignoreable - [LEGEND] // positioning for the legend - [BEGIN] - [UNKNOWN RECORD] // another position record. - [TEXT] // details of the text that follows - // in the next section - [BEGIN] - [UNKNOWN RECORD] // yet another pos record - [AI] // another link (of type direct) - [END] - [END] - [END] - [END] - [END] - [DIMENSIONS] - [SINDEX] - [SINDEX] - [SINDEX] -[EOF] - -

- Just a quick note on some of the unknown records: -

-
    -
  • EC: MSODRAWING - A Microsoft drawing record. (Need to - track down where this is documented).
  • -
  • 5D: OBJ: Description of a drawing object. (This is going to - be a PITA to implement).
  • -
  • 33: Not documented. :-(
  • -
  • 105f: Not documented. :-(
  • -
  • 104f: POS: Position record (should be able to safely leave this out).
  • -
  • 1022: CHARTFORMATLINK: Can be left out.
  • -
-

- It is currently suspected that many of those records could be - left out when generating a bar chart from scratch. The way - we will be proceeding with this is to write code that generates - most of these records and then start removing them to see - how this effects the chart in excel. -

-
-
-
    -
  • - Unknown record (sid=00eb) is inserted before the SST - record. -
  • - - ============================================ - rectype = 0xeb, recsize = 0x5a - -BEGIN DUMP--------------------------------- - 00000000 0F 00 00 F0 52 00 00 00 00 00 06 F0 18 00 00 00 ....R........... - 00000010 01 08 00 00 02 00 00 00 02 00 00 00 01 00 00 00 ................ - 00000020 01 00 00 00 03 00 00 00 33 00 0B F0 12 00 00 00 ........3....... - 00000030 BF 00 08 00 08 00 81 01 09 00 00 08 C0 01 40 00 ..............@. - 00000040 00 08 40 00 1E F1 10 00 00 00 0D 00 00 08 0C 00 ..@............. - 00000050 00 08 17 00 00 08 F7 00 00 10 .......... - -END DUMP----------------------------------- - recordid = 0xeb, size =90 - [UNKNOWN RECORD:eb] - .id = eb - [/UNKNOWN RECORD] - - ============================================ - -
  • - Any extra font records are inserted as needed -
  • -
  • - Chart records inserted after DBCell records. -
  • -
-
- -
\ No newline at end of file diff --git a/src/documentation/xdocs/hssf/diagram1.xml b/src/documentation/xdocs/hssf/diagram1.xml deleted file mode 100644 index 9d1cd2a7f9..0000000000 --- a/src/documentation/xdocs/hssf/diagram1.xml +++ /dev/null @@ -1,19 +0,0 @@ - - - - -
- HSSF - Overview - - - - -
- - -
- -
- -
diff --git a/src/documentation/xdocs/hssf/diagrams.xml b/src/documentation/xdocs/hssf/diagrams.xml deleted file mode 100644 index 1a1c1bbd19..0000000000 --- a/src/documentation/xdocs/hssf/diagrams.xml +++ /dev/null @@ -1,38 +0,0 @@ - - - - -
- HSSF - Overview - - - - -
- - -
-

- This section is intended for diagrams (UML/etc) that help - explain HSSF. -

-
    -
  • - HSSF usermodel class diagram - - by Matthew Young (myoung at westernasset dot com) -
  • -
-

- Have more? Add a new "bug" to the bug database with [DOCUMENTATION] - prefacing the description and a link to the file on an http server - somewhere. If you don't have your own webserver, then you can email it - to (acoliver at apache dot org) provided its < 5MB. Diagrams should be - in some format that can be read at least on Linux and Windows. Diagrams - that can be edited are preferrable, but lets face it, there aren't too - many good affordable UML tools yet! And no they don't HAVE to be UML... - just useful. -

-
- -
diff --git a/src/documentation/xdocs/hssf/formula.xml b/src/documentation/xdocs/hssf/formula.xml deleted file mode 100644 index d3003d8b94..0000000000 --- a/src/documentation/xdocs/hssf/formula.xml +++ /dev/null @@ -1,90 +0,0 @@ - - - - -
- Formula Support - - - -
- -
-

- This document describes the current state of formula support in POI. - The information in this document applies to the 2.0-dev version of POI (i.e. CVS HEAD). - Since this area is a work in progress, this document will be updated with new features as and - when they are added. -

- -
-
-

- In org.apache.poi.hssf.usermodel.HSSFCell - setCellFormula("formulaString") is used to add a formula to sheet and - getCellFormula() is used to retrieve the string representation of a formula. -

-

- We aim to support the complete excel grammer for formulas. Thus, the string that you pass in - to the setCellFormula call should be what you expect to type into excel. Also, note - that you should NOT add a "=" to the front of the string. -

-
-
-
    -
  • Cell References
  • -
  • String, integer and floating point literals
  • -
  • Area references
  • -
  • Relative or absolute references
  • -
  • Arithmetic Operators
  • -
  • Sheet Functions
  • -
-
-
-
    -
  • - The formula parser now has the ability to parse formulas containing strings. However - formulas that return a string value are not yet supported. -
  • -
  • Formula tokens in Excel are stored in one of three possible classes : - Reference, Value and Array. Based on the location of a token, its class can change - in complicated and undocumented ways. While we have support for most cases, we - are not sure if we have covered all bases (since there is no documentation for this area.) - We would therefore like you to report any - occurence of #VALUE! in a cell upon opening a POI generated workbook in excel. (Check that - typing the formula into Excel directly gives a valid result.) -
  • - -
-
-
-
    -
  • Array formulas
  • -
  • Formulas with logical operations (IF)
  • -
  • Sheet References in formulas
  • -
  • Everything else :)
  • -
-
- -
-

- Formulas in Excel are stored as sequences of tokens in Reverse Polish Notation order. The - open office XLS spec is the best - documentation you will find for the format. -

- -

- The tokens used by excel are modelled as individual *Ptg classes in the - org.apache.poi.hssf.record.formula package. -

-

- The task of parsing a formula string into an array of RPN ordered tokens is done by the - org.apache.poi.hssf.record.formula.FormulaParser class. This class implements a hand - written recursive descent parser. -

-

Check out the javadocs for details. -

-
- - -
\ No newline at end of file diff --git a/src/documentation/xdocs/hssf/hacking-hssf.xml b/src/documentation/xdocs/hssf/hacking-hssf.xml deleted file mode 100644 index 1cdf65af04..0000000000 --- a/src/documentation/xdocs/hssf/hacking-hssf.xml +++ /dev/null @@ -1,72 +0,0 @@ - - - - -
- Hacking HSSF - - - - -
- -
-

- You might find the - 'Excel 97 Developer's Kit' (out of print, Microsoft Press, no - restrictive covenants, available on Amazon.com) helpful for - understanding the file format. -

-

- Also useful is the open office XLS spec. We - are collaborating with the maintainer of the spec so if you think you can add something to their - document just send through your changes. -

-
-
-
    -
  1. - Look at OpenOffice.org or Gnumeric sources if its implemented there. -
  2. -
  3. - Use org.apache.poi.hssf.dev.BiffViewer to view the structure of the - file. Experiment by adding one criteria entry at a time. See what it - does to the structure, infer behavior and structure from it. Using the - unix diff command (or get cygwin from www.cygwin.com for windows) you - can figure out a lot very quickly. Unimplemented records show up as - 'UNKNOWN' and prints a hex dump. -
  4. -
-
-
-

- Low level records can be time consuming to created. We created a record - generator to help generate some of the simpler tasks. -

-

- We use XML - descriptors to generate the Java code (which sure beats the heck out of - the PERL scripts originally used ;-) for low level records. The - generator is kinda alpha-ish right now and could use some enhancement, - so you may find that to be about 1/2 of the work. Notice this is in - org.apache.poi.hssf.record.definitions. -

-
-
-

One thing to note: If you are making a large code contribution we need to ensure - any participants in this process have never - signed a "Non Disclosure Agreement" with Microsoft, and have not - received any information covered by such an agreement. If they have - they'll not be able to participate in the POI project. For large contributions we - may ask you to sign an agreement.

-
-
-

Check our todo list or simply look for missing functionality. Start small - and work your way up.

-
-
-

Make sure you read the contributing section - as it contains more generation information about contributing to Poi in general.

-
- -
\ No newline at end of file diff --git a/src/documentation/xdocs/hssf/how-to.xml b/src/documentation/xdocs/hssf/how-to.xml deleted file mode 100644 index e9db5b9798..0000000000 --- a/src/documentation/xdocs/hssf/how-to.xml +++ /dev/null @@ -1,502 +0,0 @@ - - - - -
- The New Halloween Document - - - - - -
- -
- -
-

This release of the how-to outlines functionality for the CVS HEAD. - Those looking for information on previous releases should - look in the documentation distributed with that release.

-

- This release allows numeric and string cell values to be written to - or read from an XLS file as well as reading and writing dates. Also - in this release is row and column sizing, cell styling (bold, - italics, borders,etc), and support for both built-in and user - defined data formats. New - to this release is an event-based API for reading XLS files. - It differs greatly from the read/write API - and is intended for intermediate developers who need a smaller - memory footprint. It will also serve as the basis for the HSSF - Generator.

-
-
-
-
- -

The high level API (package: org.apache.poi.hssf.usermodel) - is what most people should use. Usage is very simple. -

-

Workbooks are created by creating an instance of - org.apache.poi.hssf.usermodel.HSSFWorkbook. -

-

Sheets are created by calling createSheet() from an existing - instance of HSSFWorkbook, the created sheet is automatically added in - sequence to the workbook. Sheets do not in themselves have a sheet - name (the tab at the bottom); you set - the name associated with a sheet by calling - HSSFWorkbook.setSheetName(sheetindex,"SheetName",encoding). - The name may be in 8bit format (HSSFWorkbook.ENCODING_COMPRESSED_UNICODE) - or Unicode (HSSFWorkbook.ENCODING_UTF_16). Default encoding is 8bit per char. -

-

Rows are created by calling createRow(rowNumber) from an existing - instance of HSSFSheet. Only rows that have cell values should be - added to the sheet. To set the row's height, you just call - setRowHeight(height) on the row object. The height must be given in - twips, or 1/20th of a point. If you prefer, there is also a - setRowHeightInPoints method. -

-

Cells are created by calling createCell(column, type) from an - existing HSSFRow. Only cells that have values should be added to the - row. Cells should have their cell type set to either - HSSFCell.CELL_TYPE_NUMERIC or HSSFCell.CELL_TYPE_STRING depending on - whether they contain a numeric or textual value. Cells must also have - a value set. Set the value by calling setCellValue with either a - String or double as a parameter. Individual cells do not have a - width; you must call setColumnWidth(colindex, width) (use units of - 1/256th of a character) on the HSSFSheet object. (You can't do it on - an individual basis in the GUI either).

-

Cells are styled with HSSFCellStyle objects which in turn contain - a reference to an HSSFFont object. These are created via the - HSSFWorkbook object by calling createCellStyle() and createFont(). - Once you create the object you must set its parameters (colors, - borders, etc). To set a font for an HSSFCellStyle call - setFont(fontobj). -

-

Once you have generated your workbook, you can write it out by - calling write(outputStream) from your instance of Workbook, passing - it an OutputStream (for instance, a FileOutputStream or - ServletOutputStream). You must close the OutputStream yourself. HSSF - does not close it for you. -

-

Here is some example code (excerpted and adapted from - org.apache.poi.hssf.dev.HSSF test class):

- -
-
- -

Reading in a file is equally simple. To read in a file, create a -new instance of org.apache.poi.poifs.Filesystem, passing in an open InputStream, such as a FileInputStream -for your XLS, to the constructor. Construct a new instance of -org.apache.poi.hssf.usermodel.HSSFWorkbook passing the -Filesystem instance to the constructor. From there you have access to -all of the high level model objects through their assessor methods -(workbook.getSheet(sheetNum), sheet.getRow(rownum), etc). -

-

Modifying the file you have read in is simple. You retrieve the -object via an assessor method, remove it via a parent object's remove -method (sheet.removeRow(hssfrow)) and create objects just as you -would if creating a new xls. When you are done modifying cells just -call workbook.write(outputstream) just as you did above.

-

An example of this can be seen in -org.apache.poi.hssf.dev.HSSF.

-
-
-
- -

The event API is brand new. It is intended for intermediate - developers who are willing to learn a little bit of the low level API - structures. Its relatively simple to use, but requires a basic - understanding of the parts of an Excel file (or willingness to - learn). The advantage provided is that you can read an XLS with a - relatively small memory footprint. -

-

To use this API you construct an instance of - org.apache.poi.hssf.eventmodel.HSSFRequest. Register a class you - create that supports the - org.apache.poi.hssf.eventmodel.HSSFListener interface using the - HSSFRequest.addListener(yourlistener, recordsid). The recordsid - should be a static reference number (such as BOFRecord.sid) contained - in the classes in org.apache.poi.hssf.record. The trick is you - have to know what these records are. Alternatively you can call - HSSFRequest.addListenerForAllRecords(mylistener). In order to learn - about these records you can either read all of the javadoc in the - org.apache.poi.hssf.record package or you can just hack up a - copy of org.apache.poi.hssf.dev.EFHSSF and adapt it to your - needs. TODO: better documentation on records.

-

Once you've registered your listeners in the HSSFRequest object - you can construct an instance of - org.apache.poi.poifs.filesystem.FileSystem (see POIFS howto) and - pass it your XLS file inputstream. You can either pass this, along - with the request you constructed, to an instance of HSSFEventFactory - via the HSSFEventFactory.processWorkbookEvents(request, Filesystem) - method, or you can get an instance of DocumentInputStream from - Filesystem.createDocumentInputStream("Workbook") and pass - it to HSSFEventFactory.processEvents(request, inputStream). Once you - make this call, the listeners that you constructed receive calls to - their processRecord(Record) methods with each Record they are - registered to listen for until the file has been completely read. -

-

A code excerpt from org.apache.poi.hssf.dev.EFHSSF (which is - in CVS or the source distribution) is reprinted below with excessive - comments:

- -
-
- -

The low level API is not much to look at. It consists of lots of -"Records" in the org.apache.poi.hssf.record.* package, -and set of helper classes in org.apache.poi.hssf.model.*. The -record classes are consistent with the low level binary structures -inside a BIFF8 file (which is embedded in a POIFS file system). You -probably need the book: "Microsoft Excel 97 Developer's Kit" -from Microsoft Press in order to understand how these fit together -(out of print but easily obtainable from Amazon's used books). In -order to gain a good understanding of how to use the low level APIs -should view the source in org.apache.poi.hssf.usermodel.* and -the classes in org.apache.poi.hssf.model.*. You should read the -documentation for the POIFS libraries as well.

-
-
- -

The HSSF application is nothing more than a test for the high -level API (and indirectly the low level support). The main body of -its code is repeated above. To run it: -

-
    -
  • download the poi-alpha build and untar it (tar xvzf - tarball.tar.gz) -
  • -
  • set up your classpath as follows: - export HSSFDIR={wherever you put HSSF's jar files} -export LOG4JDIR={wherever you put LOG4J's jar files} -export CLASSPATH=$CLASSPATH:$HSSFDIR/hssf.jar:$HSSFDIR/poi-poifs.jar:$HSSFDIR/poi-util.jar:$LOG4JDIR/jog4j.jar -
  • type: - java org.apache.poi.hssf.dev.HSSF ~/myxls.xls write
  • -
-

-

This should generate a test sheet in your home directory called "myxls.xls".

-
    -
  • Type: - java org.apache.poi.hssf.dev.HSSF ~/input.xls output.xls -
    -
    -This is the read/write/modify test. It reads in the spreadsheet, modifies a cell, and writes it back out. -Failing this test is not necessarily a bad thing. If HSSF tries to modify a non-existant sheet then this will -most likely fail. No big deal.
  • -
-
-
-

Poi can dynamically select it's logging implementation. Poi trys to - create a logger using the System property named "org.apache.poi.util.POILogger". - Out of the box this can be set to one of three values: -

-
    -
  • org.apache.poi.util.CommonsLogger
  • -
  • org.apache.poi.util.NullLogger
  • -
  • org.apache.poi.util.SystemOutLogger
  • -
-

- If the property is not defined or points to an invalid classthen the NullLogger is used. -

-

- Refer to the commons logging package level javadoc for more information concerning how to - configure commons logging. -

-
-
- -

HSSF has a number of tools useful for developers to debug/develop -stuff using HSSF (and more generally XLS files). We've already -discussed the app for testing HSSF read/write/modify capabilities; -now we'll talk a bit about BiffViewer. Early on in the development of -HSSF, it was decided that knowing what was in a record, what was -wrong with it, etc. was virtually impossible with the available -tools. So we developed BiffViewer. You can find it at -org.apache.poi.hssf.dev.BiffViewer. It performs two basic -functions and a derivative. -

-

The first is "biffview". To do this you run it (assumes -you have everything setup in your classpath and that you know what -you're doing enough to be thinking about this) with an xls file as a -parameter. It will give you a listing of all understood records with -their data and a list of not-yet-understood records with no data -(because it doesn't know how to interpret them). This listing is -useful for several things. First, you can look at the values and SEE -what is wrong in quasi-English. Second, you can send the output to a -file and compare it. -

-

The second function is "big freakin dump", just pass a -file and a second argument matching "bfd" exactly. This -will just make a big hexdump of the file. -

-

Lastly, there is "mixed" mode which does the same as -regular biffview, only it includes hex dumps of certain records -intertwined. To use that just pass a file with a second argument -matching "on" exactly.

-

In the next release cycle we'll also have something called a -FormulaViewer. The class is already there, but its not very useful -yet. When it does something, we'll document it.

- -
-
- -

This release contains code that supports "internationalization" -or more accurately non-US/UK languages; however, it has not been -tested with the new API changes (please help us with this). We've -shifted focus a bit for this release in recognition of the -international support we've gotten. We're going to focus on western -European languages for our first beta. We're more than happy to -accept help in supporting non-Western European languages if someone -who knows what they're doing in this area is willing to pitch in! -(There is next to no documentation on what is necessary to support -such a move and its really hard to support a language when you don't even -know the alphabet).

-

This release of HSSF does not yet support Formulas. I've been -focusing on the requests I've gotten in. That being said, if we get -more user feedback on what is most useful first we'll aim for that. -As a general principal, HSSF's goal is to support HSSF-Serializer -(meaning an emphasis on write). We would like to hear from you! How -are you using HSSF/POIFS? How would you like to use it? What features -are most important first? -

-
- -
- -
- -
diff --git a/src/documentation/xdocs/hssf/index.xml b/src/documentation/xdocs/hssf/index.xml deleted file mode 100644 index 9b96f3c390..0000000000 --- a/src/documentation/xdocs/hssf/index.xml +++ /dev/null @@ -1,47 +0,0 @@ - - - - -
- HSSF - Overview - - - - -
- - -
- -

HSSF is the POI Project's pure Java implementation of the Excel '97(-2002) file format.

-

HSSF provides a way to read spreadsheets create, modify, read and write XLS spreadsheets - It provides: -

-
    -
  • low level structures for those with special needs
  • -
  • an eventmodel api for efficient read-only access
  • -
  • a full usermodel api for creating, reading and modifying XLS files
  • -
-

- Truth be told there is probably a better way to generate your spreadsheet - generation (yet you'll still be using HSSF indirectly). At the time of - this writing we're in the process of moving the HSSF Serializer over to - the Apache Cocoon - Project. With Cocoon you can serialize any XML datasource (of - which might be a ESQL page outputting in SQL for instance) by simply - applying the stylesheet and designating the serializer. -

-

- If you're merely reading spreadsheet data, then use the eventmodel api - in the org.apache.poi.hssf.eventmodel package. -

-

- If you're modifying spreadsheet data then use the usermodel api. You - can also generate spreadsheets this way, but using Cocoon (which will do - it this way indirectly) is the best way...we promise. -

- -
- -
diff --git a/src/documentation/xdocs/hssf/limitations.xml b/src/documentation/xdocs/hssf/limitations.xml deleted file mode 100644 index 18b379e1a9..0000000000 --- a/src/documentation/xdocs/hssf/limitations.xml +++ /dev/null @@ -1,52 +0,0 @@ - -
- Limitations - - - -
- -
-

- The intent of this document is to outline some of the known limitations of the - POI HSSF API's. It is not intended to be complete list of every bug or missing - feature of HSSF, rather it's purpose is to provide a broad feel for some of the - functionality that is missing or broken. -

-
    -
  • - Charts

    - You can not currently create charts. This is planned for the 2.0 release. You can - however create a chart in Excel, modify the chart data values using HSSF and write - a new spreadsheet out. This is possible because POI attempts to keep existing records - intact as far as possible.

    -
  • -
  • - Rich Text

    - HSSF does not support rich text cells. Rich text cells are - cells that have multiple fonts and styles in the once cell. Any attempt to read - a spreadsheet that has rich text cells will throw an exception. This feature may - be supported in the future but it is not currently planned. Patches are welcome.

    -
  • -
  • - Outlines

    - It is not yet possible to create outlines. Reading a spreadsheet with outlines - may work correctly but has not been tested. Write support for outlines may - be added in the future but it is not currently planned. Patches are welcome.

    -
  • -
  • - Macros

    - Macros can not be created. The are currently no plans to support macros. Reading - workbooks containing macros is supported but attempting to write those workbooks - will fail. This is because macros are stored as extra file sytems within the - compound document, and these are not currently kept when the file is rewritten.

    -
  • -
  • - Pivot Tables

    - Generating pivot tables is not supported. Reading spreadsheets containing pivot tables - has not been tested. -
  • -
-
- -
diff --git a/src/documentation/xdocs/hssf/quick-guide.xml b/src/documentation/xdocs/hssf/quick-guide.xml deleted file mode 100644 index 070a2f5468..0000000000 --- a/src/documentation/xdocs/hssf/quick-guide.xml +++ /dev/null @@ -1,698 +0,0 @@ - - - - -
- Busy Developers' Guide to HSSF Features - - - -
- -
-

- Want to use HSSF read and write spreadsheets in a hurry? This guide is for you. If you're after - more in-depth coverage of the HSSF user-API please consult the HOWTO - guide as it contains actual descriptions of how to use this stuff. -

-
-
    -
  • How to create a new workbook
  • -
  • How to create a sheet
  • -
  • How to create cells
  • -
  • How to create date cells
  • -
  • Working with different types of cells
  • -
  • Aligning cells
  • -
  • Working with borders
  • -
  • Fills and color
  • -
  • Merging cells
  • -
  • Working with fonts
  • -
  • Custom colors
  • -
  • Reading and writing
  • -
  • Use newlines in cells.
  • -
  • Create user defined data formats.
  • -
  • Fit sheet to one page
  • -
  • Set print area for a sheet.
  • -
  • Set page numbers on the footer of a sheet.
  • -
  • Shift rows.
  • -
  • Set a sheet as selected.
  • -
  • Set the zoom magnification for a sheet.
  • -
  • Create split and freeze panes.
  • -
  • Repeating rows and columns.
  • -
  • Headers and Footers.
  • -
-
-
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet1 = wb.createSheet("new sheet"); - HSSFSheet sheet2 = wb.createSheet("second sheet"); - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short)0); - // Create a cell and put a value in it. - HSSFCell cell = row.createCell((short)0); - cell.setCellValue(1); - - // Or do it on one line. - row.createCell((short)1).setCellValue(1.2); - row.createCell((short)2).setCellValue("This is a string"); - row.createCell((short)3).setCellValue(true); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short)0); - - // Create a cell and put a date value in it. The first cell is not styled - // as a date. - HSSFCell cell = row.createCell((short)0); - cell.setCellValue(new Date()); - - // we style the second cell as a date (and time). It is important to - // create a new cell style from the workbook otherwise you can end up - // modifying the built in style and effecting not only this cell but other cells. - HSSFCellStyle cellStyle = wb.createCellStyle(); - cellStyle.setDataFormat(HSSFDataFormat.getFormat("m/d/yy h:mm")); - cell = row.createCell((short)1); - cell.setCellValue(new Date()); - cell.setCellStyle(cellStyle); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - HSSFRow row = sheet.createRow((short)2); - row.createCell((short) 0).setCellValue(1.1); - row.createCell((short) 1).setCellValue(new Date()); - row.createCell((short) 2).setCellValue("a string"); - row.createCell((short) 3).setCellValue(true); - row.createCell((short) 4).setCellType(HSSFCell.CELL_TYPE_ERROR); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - public static void main(String[] args) - throws IOException - { - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - HSSFRow row = sheet.createRow((short) 2); - createCell(wb, row, (short) 0, HSSFCellStyle.ALIGN_CENTER); - createCell(wb, row, (short) 1, HSSFCellStyle.ALIGN_CENTER_SELECTION); - createCell(wb, row, (short) 2, HSSFCellStyle.ALIGN_FILL); - createCell(wb, row, (short) 3, HSSFCellStyle.ALIGN_GENERAL); - createCell(wb, row, (short) 4, HSSFCellStyle.ALIGN_JUSTIFY); - createCell(wb, row, (short) 5, HSSFCellStyle.ALIGN_LEFT); - createCell(wb, row, (short) 6, HSSFCellStyle.ALIGN_RIGHT); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - - } - - /** - * Creates a cell and aligns it a certain way. - * - * @param wb the workbook - * @param row the row to create the cell in - * @param column the column number to create the cell in - * @param align the alignment for the cell. - */ - private static void createCell(HSSFWorkbook wb, HSSFRow row, short column, short align) - { - HSSFCell cell = row.createCell(column); - cell.setCellValue("Align It"); - HSSFCellStyle cellStyle = wb.createCellStyle(); - cellStyle.setAlignment(align); - cell.setCellStyle(cellStyle); - } - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short) 1); - - // Create a cell and put a value in it. - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue(4); - - // Style the cell with borders all around. - HSSFCellStyle style = wb.createCellStyle(); - style.setBorderBottom(HSSFCellStyle.BORDER_THIN); - style.setBottomBorderColor(HSSFColor.BLACK.index); - style.setBorderLeft(HSSFCellStyle.BORDER_THIN); - style.setLeftBorderColor(HSSFColor.GREEN.index); - style.setBorderRight(HSSFCellStyle.BORDER_THIN); - style.setRightBorderColor(HSSFColor.BLUE.index); - style.setBorderTop(HSSFCellStyle.BORDER_MEDIUM_DASHED); - style.setTopBorderColor(HSSFColor.BLACK.index); - cell.setCellStyle(style); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short) 1); - - // Aqua background - HSSFCellStyle style = wb.createCellStyle(); - style.setFillBackgroundColor(HSSFColor.AQUA.index); - style.setFillPattern(HSSFCellStyle.BIG_SPOTS); - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue("X"); - cell.setCellStyle(style); - - // Orange "foreground", foreground being the fill foreground not the font color. - style = wb.createCellStyle(); - style.setFillForegroundColor(HSSFColor.ORANGE.index); - style.setFillPattern(HSSFCellStyle.SOLID_FOREGROUND); - cell = row.createCell((short) 2); - cell.setCellValue("X"); - cell.setCellStyle(style); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - HSSFRow row = sheet.createRow((short) 1); - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue("This is a test of merging"); - - sheet.addMergedRegion(new Region(1,(short)1,1,(short)2)); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short) 1); - - // Create a new font and alter it. - HSSFFont font = wb.createFont(); - font.setFontHeightInPoints((short)24); - font.setFontName("Courier New"); - font.setItalic(true); - font.setStrikeout(true); - - // Fonts are set into a style so create a new one to use. - HSSFCellStyle style = wb.createCellStyle(); - style.setFont(font); - - // Create a cell and put a value in it. - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue("This is a test of fonts"); - cell.setCellStyle(style); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet(); - HSSFRow row = sheet.createRow((short) 0); - HSSFCell cell = row.createCell((short) 0); - cell.setCellValue("Default Palette"); - - //apply some colors from the standard palette, - // as in the previous examples. - //we'll use red text on a lime background - - HSSFCellStyle style = wb.createCellStyle(); - style.setFillForegroundColor(HSSFColor.LIME.index); - style.setFillPattern(HSSFCellStyle.SOLID_FOREGROUND); - - HSSFFont font = wb.createFont(); - font.setColor(HSSFColor.RED.index); - style.setFont(font); - - cell.setCellStyle(style); - - //save with the default palette - FileOutputStream out = new FileOutputStream("default_palette.xls"); - wb.write(out); - out.close(); - - //now, let's replace RED and LIME in the palette - // with a more attractive combination - // (lovingly borrowed from freebsd.org) - - cell.setCellValue("Modified Palette"); - - //creating a custom palette for the workbook - HSSFPalette palette = wb.getCustomPalette(); - - //replacing the standard red with freebsd.org red - palette.setColorAtIndex(HSSFColor.RED.index, - (byte) 153, //RGB red (0-255) - (byte) 0, //RGB green - (byte) 0 //RGB blue - ); - //replacing lime with freebsd.org gold - palette.setColorAtIndex(HSSFColor.LIME.index, (byte) 255, (byte) 204, (byte) 102); - - //save with the modified palette - // note that wherever we have previously used RED or LIME, the - // new colors magically appear - out = new FileOutputStream("modified_palette.xls"); - wb.write(out); - out.close(); - -
- -
- - POIFSFileSystem fs = - new POIFSFileSystem(new FileInputStream("workbook.xls")); - HSSFWorkbook wb = new HSSFWorkbook(fs); - HSSFSheet sheet = wb.getSheetAt(0); - HSSFRow row = sheet.getRow(2); - HSSFCell cell = row.getCell((short)3); - if (cell == null) - cell = row.createCell((short)3); - cell.setCellType(HSSFCell.CELL_TYPE_STRING); - cell.setCellValue("a test"); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet s = wb.createSheet(); - HSSFRow r = null; - HSSFCell c = null; - HSSFCellStyle cs = wb.createCellStyle(); - HSSFFont f = wb.createFont(); - HSSFFont f2 = wb.createFont(); - - cs = wb.createCellStyle(); - - cs.setFont( f2 ); - //Word Wrap MUST be turned on - cs.setWrapText( true ); - - r = s.createRow( (short) 2 ); - r.setHeight( (short) 0x349 ); - c = r.createCell( (short) 2 ); - c.setCellType( HSSFCell.CELL_TYPE_STRING ); - c.setCellValue( "Use \n with word wrap on to create a new line" ); - c.setCellStyle( cs ); - s.setColumnWidth( (short) 2, (short) ( ( 50 * 8 ) / ( (double) 1 / 20 ) ) ); - - FileOutputStream fileOut = new FileOutputStream( "workbook.xls" ); - wb.write( fileOut ); - fileOut.close(); -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("format sheet"); - HSSFCellStyle style; - HSSFDataFormat format = wb.createDataFormat(); - HSSFRow row; - HSSFCell cell; - short rowNum = 0; - short colNum = 0; - - row = sheet.createRow(rowNum++); - cell = row.createCell(colNum); - cell.setCellValue(11111.25); - style = wb.createCellStyle(); - style.setDataFormat(format.getFormat("0.0")); - cell.setCellStyle(style); - - row = sheet.createRow(rowNum++); - cell = row.createCell(colNum); - cell.setCellValue(11111.25); - style = wb.createCellStyle(); - style.setDataFormat(format.getFormat("#,##0.0000")); - cell.setCellStyle(style); - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("format sheet"); - HSSFPrintSetup ps = sheet.getPrintSetup() - - sheet.setAutobreaks(true) - - ps.setFitHeight((short)1); - ps.setFitWidth((short)1); - - - // Create various cells and rows for spreadsheet. - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("Sheet1"); - wb.setPrintArea(0, "$A$1:$C$2"); - //sets the print area for the first sheet - //Alternatively: - //wb.setPrintArea(0, 0, 1, 0, 0) is equivalent to using the name reference (See the JavaDocs for more details) - - // Create various cells and rows for spreadsheet. - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - - - -
- - -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("format sheet"); - HSSFFooter footer = sheet.getFooter() - - footer.setRight( "Page " + HSSFFooter.page() + " of " + HSSFFooter.numPages() ); - - - - // Create various cells and rows for spreadsheet. - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- - -
-

- The convenience functions live in contrib and provide - utility features such as setting borders around merged - regions and changing style attributes without explicitly - creating new styles. -

- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet1 = wb.createSheet( "new sheet" ); - - // Create a merged region - HSSFRow row = sheet1.createRow( (short) 1 ); - HSSFRow row2 = sheet1.createRow( (short) 2 ); - HSSFCell cell = row.createCell( (short) 1 ); - cell.setCellValue( "This is a test of merging" ); - Region region = new Region( 1, (short) 1, 4, (short) 4 ); - sheet1.addMergedRegion( region ); - - // Set the border and border colors. - final short borderMediumDashed = HSSFCellStyle.BORDER_MEDIUM_DASHED; - HSSFRegionUtil.setBorderBottom( borderMediumDashed, - region, sheet1, wb ); - HSSFRegionUtil.setBorderTop( borderMediumDashed, - region, sheet1, wb ); - HSSFRegionUtil.setBorderLeft( borderMediumDashed, - region, sheet1, wb ); - HSSFRegionUtil.setBorderRight( borderMediumDashed, - region, sheet1, wb ); - HSSFRegionUtil.setBottomBorderColor(HSSFColor.AQUA.index, region, sheet1, wb); - HSSFRegionUtil.setTopBorderColor(HSSFColor.AQUA.index, region, sheet1, wb); - HSSFRegionUtil.setLeftBorderColor(HSSFColor.AQUA.index, region, sheet1, wb); - HSSFRegionUtil.setRightBorderColor(HSSFColor.AQUA.index, region, sheet1, wb); - - // Shows some usages of HSSFCellUtil - HSSFCellStyle style = wb.createCellStyle(); - style.setIndention((short)4); - HSSFCellUtil.createCell(row, 8, "This is the value of the cell", style); - HSSFCell cell2 = HSSFCellUtil.createCell( row2, 8, "This is the value of the cell"); - HSSFCellUtil.setAlignment(cell2, wb, HSSFCellStyle.ALIGN_CENTER); - - // Write out the workbook - FileOutputStream fileOut = new FileOutputStream( "workbook.xls" ); - wb.write( fileOut ); - fileOut.close(); - -
- - -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("row sheet"); - - // Create various cells and rows for spreadsheet. - - // Shift rows 6 - 11 on the spreadsheet to the top (rows 0 - 5) - sheet.shiftRows(5, 10, -5); - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- - -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("row sheet"); - sheet.setSelected(true); - - // Create various cells and rows for spreadsheet. - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- - -
-

- The zoom is expressed as a fraction. For example to - express a zoom of 75% use 3 for the numerator and - 4 for the denominator. -

- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet1 = wb.createSheet("new sheet"); - sheet1.setZoom(3,4); // 75 percent magnification - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- - -
-

- There are two types of panes you can create; freeze panes and split panes. -

-

- A freeze pane is split by columns and rows. You create - a freeze pane using the following mechanism: -

-

- sheet1.createFreezePane( 3, 2, 3, 2 ); -

-

- The first two parameters are the columns and rows you - wish to split by. The second two parameters indicate - the cells that are visible in the bottom right quadrant. -

-

- - Split pains appear differently. The split area is - divided into four separate work area's. The split - occurs at the pixel level and the user is able to - adjust the split by dragging it to a new position. -

-

- - Split panes are created with the following call: -

-

- sheet2.createSplitPane( 2000, 2000, 0, 0, HSSFSheet.PANE_LOWER_LEFT ); -

-

- - The first parameter is the x position of the split. - This is in 1/20th of a point. A point in this case - seems to equate to a pixel. The second parameter is - the y position of the split. Again in 1/20th of a point. -

-

- The last parameter indicates which pane currently has - the focus. This will be one of HSSFSheet.PANE_LOWER_LEFT, - PANE_LOWER_RIGHT, PANE_UPPER_RIGHT or PANE_UPPER_LEFT. -

- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet1 = wb.createSheet("new sheet"); - HSSFSheet sheet2 = wb.createSheet("second sheet"); - HSSFSheet sheet3 = wb.createSheet("third sheet"); - HSSFSheet sheet4 = wb.createSheet("fourth sheet"); - - // Freeze just one row - sheet1.createFreezePane( 0, 1, 0, 1 ); - // Freeze just one column - sheet2.createFreezePane( 1, 0, 1, 0 ); - // Freeze the columns and rows (forget about scrolling position of the lower right quadrant). - sheet3.createFreezePane( 2, 2 ); - // Create a split with the lower left side being the active quadrant - sheet4.createSplitPane( 2000, 2000, 0, 0, HSSFSheet.PANE_LOWER_LEFT ); - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- - -
-

- It's possible to set up repeating rows and columns in - your printouts by using the setRepeatingRowsAndColumns() - function in the HSSFWorkbook class. -

-

- This function Contains 5 parameters. - The first parameter is the index to the sheet (0 = first sheet). - The second and third parameters specify the range for the columns to repreat. - To stop the columns from repeating pass in -1 as the start and end column. - The fourth and fifth parameters specify the range for the rows to repeat. - To stop the columns from repeating pass in -1 as the start and end rows. -

- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet1 = wb.createSheet("new sheet"); - HSSFSheet sheet2 = wb.createSheet("second sheet"); - - // Set the columns to repeat from column 0 to 2 on the first sheet - wb.setRepeatingRowsAndColumns(0,0,2,-1,-1); - // Set the the repeating rows and columns on the second sheet. - wb.setRepeatingRowsAndColumns(1,4,5,1,2); - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
-

- Example is for headers but applies directly to footers. -

- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - HSSFHeader header = sheet.getHeader(); - header.setCenter("Center Header"); - header.setLeft("Left Header"); - header.setRight(HSSFHeader.font("Stencil-Normal", "Italic") + - HSSFHeader.fontSize((short) 16) + "Right w/ Stencil-Normal Italic font and size 16"); - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
-
-
- -
diff --git a/src/documentation/xdocs/hssf/record-generator.xml b/src/documentation/xdocs/hssf/record-generator.xml deleted file mode 100644 index df5b95f6b6..0000000000 --- a/src/documentation/xdocs/hssf/record-generator.xml +++ /dev/null @@ -1,194 +0,0 @@ - - - - -
- Record Generator HOWTO - - - - -
- -
- -
-

- The record generator was born from frustration with translating - the Excel records to Java classes. Doing this manually is a time - consuming process. It's also very easy to make mistakes. -

-

- A utility was needed to take the defintition of what a - record looked like and do all the boring and repetitive work. -

-
- -
-

- The record generator takes XML as input and produces the following - output: -

    -
  • A Java file capabile of decoding and encoding the record.
  • -
  • A test class that provides a fill-in-the-blanks implementation - of a test case for ensuring the record operates as - designed.
  • -
-

-
-
-

- The record generator is invoked as an Ant target - (generate-records). It goes through looking for all files in - src/records/defintitions ending with _record.xml. - It then creates two files; the Java record definition and the - Java test case template. -

-

- The records themselves have the following general layout: -

- - The frame record indicates whether there is a border - around the displayed text of a chart. - Glen Stampoultzis (glens at apache.org) - - - - - - - - - - - - ]]> -

- The following table details the allowable types and sizes for - the fields. -

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
TypeSizeJava Type
int1byte
int2short
int4int
int8long
intvarwordarray of shorts
bits1A byte comprising of a bits (defined by the bit element) -
bits2An short comprising of a bits
bits4A int comprising of a bits
float8double
hbstringjava expressionString
-

- The Java records are regenerated each time the record generator is - run, however the test stubs are only created if the test stub does - not already exist. What this means is that you may change test - stubs but not the generated records. -

-
-
-

- Occationally the builtin types are not enough. More control - over the encoding and decoding of the streams is required. This - can be achieved using a custom type. -

-

- A custom type lets you escape to java to define the way in which - the field encodes and decodes. To code a custom type you - declare your field like this: -

- - ]]> -

- Where the class name specified after custom: is a - class implementing the interface CustomField. -

-

- You can then implement the encoding yourself. -

-
-
-

- The record generation works by taking an XML file and styling it - using XLST. Given that XSLT is a little limited in some ways it was - necessary to add a little Java code to the mix. -

-

- See record.xsl, record_test.xsl, FieldIterator.java, - RecordUtil.java, RecordGenerator.java -

-

- There is a corresponding "type" generator for HDF. - See the HDF documentation for details. -

-
-
-

- The record generator does not handle all possible record types and - goes not intend to perform this function. When dealing with a - non-standard record sometimes the cost-benifit of coding the - record by hand will be greater than attempting modify the - generator. The main point of the record generator is to save - time, so keep that in mind. -

-

- Currently the the XSL file that generates the record calls out to - Java objects. The Java code for the record generation is - currently quite messy with minimal comments. -

-
-
- -
diff --git a/src/documentation/xdocs/hssf/use-case.xml b/src/documentation/xdocs/hssf/use-case.xml deleted file mode 100644 index ee37cf59d8..0000000000 --- a/src/documentation/xdocs/hssf/use-case.xml +++ /dev/null @@ -1,182 +0,0 @@ - - - - -
- HSSF Use Cases - - - -
- -
-
- -

Primary Actor: HSSF client

-

Scope: HSSF

-

Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF client- wants to read content - of HSSF file
  • -
  • HSSF - understands HSSF file
  • -
  • POIFS - understands underlying POI - file system
  • -
-

Precondition: None

-

Minimal Guarantee: None

-

Main Success Guarantee:

-
    -
  1. HSSF client requests HSSF to read - a HSSF file, providing an InputStream - containing HSSF file in question.
  2. -
  3. HSSF requests POIFS to read the HSSF - file, passing the InputStream - object to POIFS (POIFS use case 1, read existing file system)
  4. -
  5. HSSF reads the "Workbook" - file (use case 4, read workbook entry)
  6. -
-

Extensions:

-

2a. Exceptions -thrown by POIFS will be passed on to the HSSF client.

-
-
- -

Primary Actor: HSSF client

-

Scope: HSSF

-

Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF client- wants to write file - out.
  • -
  • HSSF - knows how to write file - out.
  • -
  • POIFS - knows how to write file - system out.
  • -
-

Precondition:

-
    -
  • File has been - read (use case 1, read existing HSSF file) and subsequently modified - or file has been created (use case 3, create HSSF file)
  • -
-

Minimal Guarantee: None

-

Main Success Guarantee:

-
    -
  1. HSSF client - provides an OutputStream to - write the file to.
  2. -
  3. HSSF writes - the "Workbook" to its associated POIFS file system (use case - 5, write workbook entry)
  4. -
  5. HSSF - requests POIFS to write its file system out, using the OutputStream - obtained from the HSSF client (POIFS use case 2, write file system).
  6. -
-

Extensions:

-

3a. Exceptions -from POIFS are passed to the HSSF client.

- -
-
- -

Primary Actor: HSSF client

-

Scope: HSSF

-

-Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF client- wants to create a new - file.
  • -
  • HSSF - knows how to create a new - file.
  • -
  • POIFS - knows how to creat a new - file system.
  • -
-

Precondition:

-

Minimal Guarantee: None

-

Main Success Guarantee:

-
    -
  1. HSSF requests - POIFS to create a new file system (POIFS use case 3, create new file - system)
  2. -
-

Extensions: -None

- -
-
-

Primary Actor: HSSF

-

Scope: HSSF

-

-Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF - knows how to read the - workbook entry
  • -
  • POIFS - knows how to manage the file - system.
  • -
-

Precondition:

-
    -
  • The file - system has been read (use case 1, read existing HSSF file) or has - been created and written to (use case 3, create HSSF file system; - use case 5, write workbook entry).
  • -
-

Minimal -Guarantee: None

-

Main Success Guarantee:

-
    -
  1. - HSSF requests POIFS for the "Workbook" file
  2. -
  3. POIFS returns - an InputStream for the file.
  4. -
  5. HSSF reads - from the InputStream provided by POIFS
  6. -
  7. HSSF closes - the InputStream provided by POIFS
  8. -
-

Extensions:

-

3a. Exceptions -thrown by POIFS will be passed on

-
-
- - -

Primary Actor: HSSF

-

Scope: HSSF

-

-Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF - knows how to manage the - write the workbook entry.
  • -
  • POIFS - knows how to manage the file - system.
  • -
-

Precondition: -

-
    -
  • Either an existing HSSF file has - been read (use case 1, read existing HSSF file) or an HSSF file has - been created (use case 3, create HSSF file).
  • -
-

Minimal Guarantee: None

-

Main Success Guarantee:

-
    -
  1. HSSF - checks the POIFS file system directory for the "Workbook" - file (POIFS use case 8, read file system directory)
  2. -
  3. If "Workbook" is in the directory, HSSF requests POIFS to - replace it with the new workbook entry (POIFS use case 4, replace file - in file system). Otherwise, HSSF requests POIFS to write the new - workbook file, with the name "Workbook" (POIFS use case 6, - write new file to file system)
  4. -
-

Extensions:None

-
- -
- -
diff --git a/src/documentation/xdocs/index.xml b/src/documentation/xdocs/index.xml deleted file mode 100644 index 1649f8e2fc..0000000000 --- a/src/documentation/xdocs/index.xml +++ /dev/null @@ -1,174 +0,0 @@ - - - - -
- Welcome to POI - - - - -
- - -
-
-

- You can now read your POI news in a handy dandy - RSS feed or read it here (links to - other syndication formats there as well). Thanks to the good guys of the - Blojsom project for writing such - a great piece of software! Subscribe using your favorite aggregator today and - stay current on important POI news and events such as releases, new development - features, etc. -

-
-
-

- The latest development release of POI is version 1.10. - Binary distributions are available here - and source distributions are here. -

- - - -
-
-

- The POI documentation translation project has begun. The first ones - to start are the Spanish - (espaniol) and Japanese, and - German - translations. Others are welcome. Please feel - free to participate! -

-
-
-

- Voting for the POI logo contest is now complete. Thank you for your votes. -

- - - -
-
-
-

- The POI project consists of APIs for manipulating various file formats - based upon Microsoft's OLE 2 Compound Document format using pure Java. -

-

- OLE 2 Compound Document Format based files include most Microsoft Office - files such as XLS and DOC. -

-

- As a general policy we try to collaborate as much as possible with other projects to - provide this functionality. Examples include: Cocoon for - which you'll soon find generators and serializers for our projects; - Open Office.org with whom we collaborate in documenting the - XLS format; and Lucene for which we'll soon have file - format interpretors. When practical, we donate components directly to those projects for POI-enabling them. -

-
-

- We'll tackle this on a component level. POI refers to the whole project. -

-

- So why should you use POIFS or HSSF? -

-

- You'd use POIFS if you had a document written in OLE 2 Compound Document Format, probably written using - MFC, that you needed to read in Java. Alternatively, you'd use POI to write OLE 2 Compound Document Format - if you needed to inter-operate with software running on the Windows platform. We are not just bragging when - we say that POIFS is the most complete and correct port of this file format to date! -

-

- You'd use HSSF if you needed to read or write an XLS (Excel) file using Java. You can also read and modify - spreadsheets using this API, although right now writing is more mature. -

-
- -
-

- POI stands for Poor Obfuscation Implementation. Why would we name our project such a derogatory name? Well, - Microsoft's OLE 2 Compound Document Format is a poorly conceived thing. It is essentially an archive structured - much like the old DOS FAT filesystem. Redmond chose, instead of using tar, gzip, zip or arc, to invent their own - archive format that does not provide any standard encryption or compression, is not very appendable and is prone - to fragmentation. -

-

- Poi is also a Hawaiian delicacy that Merriam Webster's dictionary defines as: - "A Hawaiian food of taro root cooked, pounded, and kneaded to a paste and often allowed to ferment." This seemed - strangely descriptive of the file format. -

-

- So if you like acronyms, then POI is an acronym. If you hate them, then we just used the name of the food for our - project. If you wish to signify your love or hate for acronyms, use POI or Poi to refer to the project respectively. -

-
- -
- - -
-
-

A common misconception is that POI writes Excel files. POI is the name of the project. POI contains several - components, one of which, HSSF, writes Excel files. The following are components of the entire POI project - and a brief summary of their purpose.

-
-
-

POIFS is the oldest and most stable part of the project. It is our port of the OLE 2 Compound Document Format to - pure Java. It supports both read and write functionality. All of our components ultimately rely on it by - definition. Please see the POIFS project page for more information.

-
-
-

HSSF is our port of the Microsoft Excel 97(-2002) file format (BIFF8) to pure Java. It supports read and write - capability. Please see the HSSF project page for more information.

-
-
-

HDF is our port of the Microsoft Word 97 file format to pure Java. It supports read and write capability. - Please see the HDF project page for more information. This component is - in the early stages of design. Jump in!

-
-
-

HPSF is our port of the OLE 2 property set format to pure - Java. Property sets are mostly use to store a document's properties - (title, author, date of last modification etc.), but they can be used - for application-specific purposes as well. Currently HPSF supports - read functionality only. Please see the HPSF project page for more - information.

-
- -
- -
-

The HSSF Serializer, which was part of our 1.0 release and last builds on - Sourceforge, has been donated to the - Cocoon project, and is available starting from version - 2.0.2.

-
- -
-

- So you'd like to contribute to the project? Great! We need enthusiastic, hard-working, talented folks to help - us on the project in several areas. The first is bug reports and feature requests! The second is documentation - - we'll be at your every beck and call if you've got a critique or you'd like to contribute or otherwise improve - the documentation. We could especially use some help documenting the HSSF file format! Last, but not least, we - could use some binary crunching Java coders to chew through the convolution that characterizes Microsoft's file - formats and help us port new ones to a superior Java platform! -

-

So if you're motivated, ready, and have the time, join the mail lists and we'll be happy to help you get started on the - project! -

- - -
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/mirrors.xml b/src/documentation/xdocs/mirrors.xml deleted file mode 100644 index 0d92ddb0f3..0000000000 --- a/src/documentation/xdocs/mirrors.xml +++ /dev/null @@ -1,37 +0,0 @@ - - - - -
- - - - -
- - -
-

- These are mirrors of the - POI website. - If you know of others...report them! :-) -

-
-
-
    -
  • Austrian Mirror of Jakarta POI
  • -
-
-
-
    -
  • Jakarta site partially translated into Korean and Mirrored
  • -
-
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/news.xml b/src/documentation/xdocs/news.xml deleted file mode 100644 index 304ffd44e7..0000000000 --- a/src/documentation/xdocs/news.xml +++ /dev/null @@ -1,212 +0,0 @@ - - - - -
- - - - -
- - -
-

- These are articles/etc. posted about POI around the web. If you - see POI in the news or mentioned at least somewhat prominently - on a site (not your homepage that you put the work POI on in - order to get us to link you and by the why here is a picture of - your wife in kids) then send a patch to the list. In general - equal time will be given so please feel free to send inflamatory - defamation as well as favorable, technical and factual. Really - stupid things won't be mentioned (sorry). -

-
-
-
    -
  • - Discussion about using POI on AS/400s -
  • -
  • - Discussion from back when we almost had POI as the filter for KOffice if politics and licenses hadn't killed iit -
  • -
  • - Java discussion on O'Reilly Network including discussion about POI - O'Reilly.net -
  • -
  • - Poor Obfuscation Implementation. - Blog of David M. Johnson -
  • -
  • - - POI 1.5-dev-rc2 released - JSurfer -
  • - -
  • - Google says we're the most important in our category -
  • -
  • - It's POI-fect - Tony Sintes, Javaworld -
  • -
  • - - Nicola announces POI serialization code - - Matthew Langham's Radio Weblog -
  • -
  • - - Jakarta POI 1.4583 Released - JavaLobby -
  • -
  • - - POI project moves to Jakarta (OLE 2 CDF/Excel/Word in - pure java) - JavaLobby -
  • -
  • - - List of Java libraries to read and write image and document files - Marco Schmidt's homepage (normally we wouldn't - feature someone's homepage but its an extensive list of - information including "alternatives to POI" (for those - of you who are very wealthy). But heck I think I'll - bookmark his page for myself since he's like got every - piece of info known to man linked or featured on it! -
  • -
  • - - The Experiences of an Operator (Måns af Klercker) - - radio.weblogs.com -
  • -
  • - - DATACONV - Data Conversion Tools: Office - DATACONV -
  • -
  • - - Chicago Developer Page - -
  • -
  • - - POI/POI Serialization Project - - Man you know you've hit the bigtime when - O'Reilly Likes you.. ;-) -
  • -
  • - - News Around the Net - - Java World -
  • - -
-
-
-
    -
  • - - Een Excel-werkboek maken vanuit Java - Lieven Smits - -
  • -
-
-
-
    -
  • Apache POI verffentlicht - entwicker.com -
  • -
  • - - Apache Jakarta-Projekt bringt Word und Excel in die Java-Welt - jsp-develop.de (for the misguided who use JSP ;-) ) -
  • -
  • - - Neues Apache-Projekt bringt Word- und Excel nach Java - - entwickler.com -
  • -
-
-
-
    -
  • - - OLE2 desde Java nativo - - javaHispano -
  • -
  • - Spanish discussion about Excel and Java including POI from Wrox forums -
  • -
-
-
-
    -
  • - - Excel/OLE accessibles - - Da Linux French Page -
  • -
  • - Discussion on POI in French -
  • -
-
-
-
    -
  • - 100% PureJava... - Dr. Panda Portal -
  • -
  • - - What's new with java? - - gimlay.org -
  • -
  • Java?Excel????? - appears to show how to use Japanese with POI
  • -
  • Various discussion in Japanese including on POI
  • -
  • Japanese resources on Jakarta projects including POI
  • -
  • Kishida's site various information, includes a snip about POI and Japanese.
  • - -
-
-
-
    -
  • - - Probably a translation of the Javalobby announcement of 1.5-final - Java-??????? -
  • -
-
-
-
    -
  • - Various discussion in Korean about Excel output/APIs including POI -
  • -
-
-
-

- If you can read one of these languages, send mail to the list - telling us what language it is and we'll categorize it! -

-
    -
  • - - If I had to guess, I'd say this is Thai, but - maybe you actually know - javacentrix.com -
  • -
-
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/news/book.xml b/src/documentation/xdocs/news/book.xml deleted file mode 100644 index d934b6a724..0000000000 --- a/src/documentation/xdocs/news/book.xml +++ /dev/null @@ -1,19 +0,0 @@ - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/news/logocontest.xml b/src/documentation/xdocs/news/logocontest.xml deleted file mode 100644 index 239bf29d2d..0000000000 --- a/src/documentation/xdocs/news/logocontest.xml +++ /dev/null @@ -1,174 +0,0 @@ - - - - -
- - - - - -
- - -
-

- Here are the current logo submissions. Thanks to the artists! -

-
-

- -

-
-
-

-     - -

-
-
-

- -

-
-
-

-     - -

-
-
-

-     -     - -

-
-
-

-     -     - -

-

-     - -

-
-
-

- -

-
-
-

-     - -

-
-
-

-     - -

-
-
-

-     - -

-

-     - -

-

-     - -

-

-     - -

-

-     - -

-

-     - -

-
-
-

-     - -

-
-
-

- Contact Person: Fancy at: fancy at my-feiqi.com -

-

-     - -

-

- -

-

- -

-

- -

-

- -

-

- -

-

- -

-

- -

-

- -

-
-
-

- -

-

- -

-

- -

-

- -

-

- -

-

- -

-

- -

-

- -

-
-
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/overview.xml b/src/documentation/xdocs/overview.xml deleted file mode 100644 index be98f36397..0000000000 --- a/src/documentation/xdocs/overview.xml +++ /dev/null @@ -1,70 +0,0 @@ - - - - -
- Overview - - - -
- - -
-

The POI project is the master project for developing pure - Java ports of file formats based on Microsoft's OLE 2 Compound - Document Format. OLE 2 Compound Document Format is used by - Microsoft Office Documents, as well as by programs using MFC - property sets to serialize their document objects. -

-
-
-

- There following are ports, packages or components contained in the POI project. -

-
-

- POIFS is the set of APIs - for reading and writing OLE 2 Compound Document Formats using (only) Java. -

-
- -
-

- HSSF is the set of APIs - for reading and writing Microsoft Excel 97(-XP) spreadsheet using (only) Java. -

-
- -
-

- HDF is the set of APIs - for reading and writing Microsoft Word 97(-XP) spreadsheet using (only) Java. -

-
- -
-

- HPSF is the set of APIs - for reading property sets using (only) Java. -

-
- -
-

- POI-Utils are general purpose artifacts - from POI development that have not yet been implemented elsewhere. We're - always looking to donate these and maintain them as part of a general library - used in another project. These are things we need to complete our mission but - are generally outside of it. -

-
-
- -
- - Copyright (c) @year@ The Poi Project All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/patches.xml b/src/documentation/xdocs/patches.xml deleted file mode 100644 index 2887174d94..0000000000 --- a/src/documentation/xdocs/patches.xml +++ /dev/null @@ -1,23 +0,0 @@ - - - -
Patch Queue

- This is an informal list - in chronological order - - of some of the noteworthy patches that have been posted - to the developers mailing list. - These patches are not (yet) part of the Poi project, but need reviewing for possible - inclusion. This system was instituted because, due to the large volume of mail and - the lack of time of the committers, some patches tended to get forgotten about. This - queue does not guarantee that any patch will be reviewed within a reasonable time frame, - but it does at least make them easier to find! -

Reviewers wanted! - If you have time to review and/or test these patches, - we would be grateful for your time. Please post comments to the dev mailing lists. -

- Before submitting a patch, please read the page on Third-Party - Contributions. The preferred submission method for patches is: -

  • Post to Poi developers list
  • Describe the patch, the reason for it and (if necessary) why this is important.
  • Generate the patch in diff -u format from CVS
  • Also generate a documentation patch or new file, if this is something that should be documented. -
  • Post as an attachment rather than inline (unless it is trivially small).

Following the above guidelines will facilitate your patch being reviewed - and applied efficiently.

[Under Construction] Archive links will be added later. - Please do not bother the patch submitters/authors without first reading the - relevant post(s) in the mailing list archives.

Vapourware will not be listed.

idSummaryReviewerResolutionStatus

See also additional list of patches to be added in To Do. -

diff --git a/src/documentation/xdocs/plan/POI10Vision.xml b/src/documentation/xdocs/plan/POI10Vision.xml deleted file mode 100644 index b2b8c6c3e2..0000000000 --- a/src/documentation/xdocs/plan/POI10Vision.xml +++ /dev/null @@ -1,509 +0,0 @@ - - - - -
- POI 1.0 Vision Document - - - - -
- - - -
-

- (21-Jan-02) While this document is just full of useful project - introductory information and I do suggest those interested in getting - involved in the project read it, it is woefully out of date. -

-

- We deliberately allowed this document to run out of date because it - is a good reflection of what the original vision was for POI 1.0. - You'll note that some of the terminology is not used in quite the same - way any longer. I've made some minor corrections where reading this - confused me. An example: in some places this document may refer to - POI API instead of POIFS API. When this vision was written we had - an incomplete understanding of the project. -

-

- Lastly, the scope of the project expanded dramatically near the end - of the 1.0 cycle. Our vision at the time was to focus merely on the - Excel port (having no idea how the project would grow or be received) - and provide the OLE 2 Compound Document port for others to port later - formats. We now plan to spearhead these ports under the umbrella of - the POI project. So, you've been warned. Read on, but just realize - that we had a fuzzy view of things to come, and hindsight is 20-20. -

-

- If I recall major holes were: a complete understanding of the format - of OLE 2 Compound Document format, Excel file format, and exactly how - Cocoon 2 Serializers worked. (that just about covers the whole range - huh?) -

-
- -
-
-

- The purpose of this document is to - collect, analyze and define high-level requirements, user needs and - features of the HSSF Serializer for Cocoon 2 and related libraries. - The HSSF Serializer is a java class supporting the Serializer - interface from the Cocoon 2 project and outputting in a compatible - format of that used by the spreadsheet program Microsoft Excel '97. - The HSSF Serializer will be responsible for converting XML - spreadsheet-like documents into Excel-compatible XLS spreadsheets. -

-
- - -
-

- Many web apps today hit a brick wall - when it comes to the user request that they be able to easily - manipulate their reports and data extracts in the popular Microsoft - Excel spreadsheet format. This often causes inferior technologies to be - chosen for the project simply because they easily support this - format. This project seeks to extend existing XML, Java and Apache - Cocoon 2 project technologies by: -

- -
    -
  • - providing an extensible library - (POIFS) which reads/writes in a compatable format to OLE 2 Compound - Document Format (aka Structured Storage Format) for easy - implementation of other document types; -
  • -
  • - providing a library (HSSF) for - manipulating spreadsheet data and outputting it in a compatible - format to Microsoft Excel XLS format; -
  • -
  • - and providing a Cocoon 2 - Serializer (HSSFSerializer) for serializing XML documents as - Excel-compatible spreadsheets. -
  • -
- -
-
-
-
-

- There are a number of enthusiastic - users of XML, UNIX and Java technology. Secondly, the Microsoft - solution for outputting Office Document formats often involves - actually manipulating the software as an OLE Server. This method - provides extremely low performance, extremely high overhead and is - only capable of handling one document at a time. -

-
    -
  1. - Our intended audience for the HSSF - Serializer portion of this project are developers writing reports or - data extracts in XML format. -
  2. -
  3. - Our intended audience for the HSSF - library portion of this project is ourselves as we are developing - the Serializer and anyone who needs to write to Excel spreadsheets - in a non-XML Java environment or who has specific needs not - addressed by the Serializer. -
  4. -
  5. - Our intended audience for the - "POIFS" OLE 2 Compound Document format reader/writer is - ourselves as we are writing the HSSF library and secondly, anyone - wishing to provide other libraries for reading/writing OLE 2 - Compound Document Format in Java. -
  6. -
-
-
-

- The users of this software shall be - developers in a Java environment on any Operating System or power - users who are capable of XML document generation/deployment. -

-
-
-

- The OLE 2 Compound Document format is - undocumented for all practical purposes and cryptic for all - impractical purposes. Developer needs in this area include - documentation and an easy to use library for reading and writing in - this format without requiring the developer to have intimate - knowledge of the format. -

-

- There is currently no good way to write - to Microsoft Excel documents from Java or from a non-Microsoft - Windows based platform for that matter. Developers need an easy to - use library that supports a reasonable feature set and allows - seperation of data from formatting/stylistic concerns. -

-

- There is currently no good way to - transform XML data to Microsoft Excel. Apache's Cocoon 2 project - supplies a complete framework for XML, but nothing for outputting in - Excel's XLS format. Developers and power users alike need a simple - method to output XML documents to Excel through server-side - processing. -

- - -
-
-

- Originally there weren't any decent alternatives for reading or writing - to Excel. This has changed somewhat. -

-
-
-
-
-

- The produced code shall be licensed by - the Apache License as used by the Cocoon 2 project and maintained on - a project page until such time as the Cocoon 2 developers accept it - as a donation (at which time the copyright will be turned over to - them). -

-
-
-

- For developers on a Java and/or XML - environment this project will provide all the tools necessary for - outputting XML data in the Microsoft Excel format. This project seeks - to make the use of Microsoft Windows based servers unnecessary for - file format considerations and to fully document the OLE 2 Compound - Document format. The project aims not only to provide the tools for - serializing XML to Excel's file format and the tools for writing to - that file format from Java, but also to provide the tools for later - projects to convert other OLE 2 Compound Document formats to pure - Java APIs. -

-
-
-

- HSSF Serializer for Apache Cocoon 2 -

- - - - - - - - - - - - - - - - - - - - - -
- Benefit - - Supporting Features -
- Standard XML tag language for sheet data - - Serializer will transform documents utilizing a defined tag - language -
- Utilize XML to output in Excel - - Serializer will output in Excel -
- Java API to output in Excel on any platform - - The project will develop an API that outputs in Excel using - pure Java. -
- Make it easy for developers to port other OLE 2 Compound - Document-based formats to Java. - - The POIFS library will contain both a high-level abstraction - along with low-level constructs. The project will fully document - the OLE 2 Compound Document Format. -
-
-
-
    -
  • - The HSSF Serializer will run on - any Java 2 supporting platform with Apache Cocoon 2 installed along - with the HSSF and POIFS APIs. -
  • -
  • - The HSSF API requires a Java 2 - implementation and the POI API. -
  • -
  • - The POIFS API requires a Java 2 - implementation. -
  • -
-
-
-
-

- The POIFS API will include: -

-
    -
  • - Low level structures representing - the structures in a POI filesystems. -
  • -
  • - A low-level API for - creating/manipulating POI filesystems. -
  • -
  • - A set of high level interfaces - abstracting the user from the POI filesystem constructs and - representing it as a standard filesystem (Files, directories, etc) -
  • -
-

- The HSSF API will include: -

-
    -
  • - Low level structures representing - the structures in an Excel file. -
  • -
  • - A low-level API for creating and - manipulating Excel files and writing them into POI filesystems. -
  • -
  • - A high level model and style - interface for manipulating spreadsheet data without knowing anything - about the Excel format itself. -
  • -
-
-

- The POI Filesystem API includes: -

-
    -
  • An implementation of Big Blocks
  • -
  • An implementation of Small Blocks
  • -
  • An implementation of Header Blocks
  • -
  • An implementation of Block Allocation Tables
  • -
  • An implementation of Property Sets
  • -
  • An implementation of the POI - filesystem including functions to get and set the above constructs; - compound functions for reading/writing files/directories. -
  • -
  • An abstraction of the POI - filesystem providing interfaces representing Files, Directories, - FileSystems in normal terminology and encapulating the above - constructs. -
  • -
  • Full documentation of the POI file - format. -
  • -
  • Full documentation of the APIs and - interfaces provided through Javadoc, user documentation (aimed at - developers using the APIs) -
  • -
  • Examples aimed at teaching the - user to write code using POI. (titled: recipes for POI) -
  • -
  • Performance specifications. - (Example POI filesystems rated by some measure of complexity along - with system specifications and execution times for given operations) -
  • -
-
-
-

- The HSSF API includes: -

-
    -
  • An implementation of Record - (binary 2 byte type followed by 2 byte size (n) followed by n bytes)
  • -
  • Implementations of many standard - record types mapping the data bytes to fields along with methods to - reserialize those fields
  • -
  • An implementation of the HSSF File - including functions to get/set the above constructs, create a blank - file with the minimum required record types and mappings between - getting/setting data and style in a workbook to the creation of - record types, and read HSSF files.
  • -
  • An abstraction of the HSSF file - format providing interfaces representing the HSSF File, HSSF - Workbook, HSSF Sheet, HSSF Column, HSSF Formulas in a manner - seperating the data from the styling and encapsulating the above - constructs.
  • -
  • Full documentation of the HSSF - file format (which will be a subset of the Excel '97 File format). - This must be done with care for legal reasons.
  • -
  • Full documentation of the APIs and - interfaces provided through Javadoc, user documentation (aimed at - developers using the apis).
  • -
  • Examples aimed at teaching - developers to use the APIs. -
  • -
  • Performance specifications. - (Example files rated by some measure of complexity along with system - specifications and execution times for given operations - possibly - the same files used for POI's tests)
  • -
-
-
-

- The HSSF Serializer subproject: -

-
    -
  • A class supporting the Cocoon 2 - Serializer Interface.
  • -
  • An interface between the SAX - events and the HSSF APIs.
  • -
  • A specified tag language for using - with the Serializer.
  • -
  • Documentation on the tag language - for the HSSF Serializer
  • -
  • Normal javadocs.
  • -
  • Example XML files
  • -
  • Performance specifications. - (Example XML docs and stylesheets rated by some measure of - complexity along with system specifications and execution times)
  • -
-
-
-
-
-

- All Java code will be 100% pure Java. -

-
-
-

- The minimum system requirements for POIFS are: -

-
    -
  • 64 Mbytes memory
  • -
  • Java 2 environment
  • -
  • Pentium or better processor (or equivalent on other platforms)
  • -
-

- The minimum system requirements for HSSF are: -

-
    -
  • 64 Mbytes memory
  • -
  • Java 2 environment
  • -
  • Pentium or better processor (or equivalent on other platforms)
  • -
  • POIFS API
  • -
-

- The minimum system requirements for the HSSF Serializer are: -

-
    -
  • 64 Mbytes memory
  • -
  • Java 2 environment
  • -
  • Pentium or better processor (or equivalent on other platforms)
  • -
  • Cocoon 2
  • -
  • HSSF API
  • -
  • POI API
  • -
-
-
-

- All components must perform well enough - to be practical for use in a webserver environment (especially - Cocoon2/Tomcat/Apache combo) -

-
-
-

- The software will run primarily in - developer environments. We should make some allowances for - not-highly-technical users to write XML documents for the HSSF - Serializer. All other components will assume intermediate Java 2 - knowledge. No XML knowledge will be required except for using the - HSSF Serializer. As much documentation as is practical shall be - required for all components as XML is relatively new, and the - concepts introduced for writing spreadsheets and to POI filesystems - will be brand new to Java and many Java developers. -

-
-
-
-
-

- The filesystem as read and written by - POI shall be fully documented and explained so that the average Java - developer can understand it. -

-
-
-

- The POI API will be fully documented - through Javadoc. A walkthrough of using the high level POI API shall - be provided. No documentation outside of the Javadoc shall be - provided for the low-level POI APIs. -

-
-
-

- The HSSF File Format as implemented by - the HSSF API will be fully documented. No documentation will be - provided for features that are not supported by HSSF API that are - supported by the Excel 97 File Format. Care will be taken not to - infringe on any "legal stuff". -

-
-
-

- The HSSF API will be documented by - javadoc. A walkthrough of using the high level HSSF API shall be - provided. No documentation outside of the Javadoc shall be provided - for the low level HSSF APIs. -

-
- -
-

- The HSSF Serializer will be documented - by javadoc. -

-
- -
-

- The XML tag language along with - function and usage shall be fully documented. Examples will be - provided as well. -

-
-
-
-
-

- filesystem shall refer only to the POI formatted archive. -

-
-
-

- file shall refer to the embedded data stream within a - POI filesystem. This will be the actual embedded document. -

-
-
- -
diff --git a/src/documentation/xdocs/plan/POI20Vision.xml b/src/documentation/xdocs/plan/POI20Vision.xml deleted file mode 100644 index 000fe8baa8..0000000000 --- a/src/documentation/xdocs/plan/POI20Vision.xml +++ /dev/null @@ -1,582 +0,0 @@ - - - - -
- POI 2.0 Vision Document - - - - - - -
- - - -
-

- This is the POI 2.0 cycle vision document. Although the vision - has not changed and this document is certainly not out of date and - the vision has not changed, the structure of the project has - changed a bit. We're not going to change the vision document to - reflect this (however proper that may be) because it would only - involve deletion. There is no purpose in providing less - information provided we give clarification. -

-

- This document was created before the POI components for - Apache Cocoon - were accepted into the Cocoon project itself. It was also - written before POI was accepted into Jakarta. So while the - vision hasn't changed some of the components are actually now - part of other projects. We'll still be working on them on the - same timeline roughly (minus the overhead of coordination with - other groups), but they are no longer technically part of the - POI project itself. -

-
- -
-
-

- The purpose of this document is to - collect, analyze and define high-level requirements, user needs, - and features of the second release of the POI project software. - The POI project currently consists of the following components: - the HSSF Serializer, the HSSF library and the POIFS library. -

-
    -
  • - The HSSF Serializer is a set of Java classes whose main - class supports the Serializer interface from the Cocoon - 2 project and outputs the serialized data in a format - compatible with the spreadsheet program Microsoft Excel - '97. -
  • -
  • - The HSSF library is a set of classes for reading and - writing Microsoft Excel 97 file format using pure Java. -
  • -
  • - The POIFS library is a set of classes for reading and - writing Microsoft's OLE 2 Compound Document format using - pure Java. -
  • -
-

By the completion of this release cycle the POI project will also - include the HSSF Generator and the HDF library. -

-
    -
  • The HSSF Generator will be responsible for using HSSF to read - in the XLS (Excel 97) file format and create SAX events. The HSSF - Generator will support the applicable interfaces specified by the - Apache Cocoon 2 project. -
  • -
  • The HDF library will provide a set of high level interfaces - for reading and writing Microsoft Word 97 file format using pure - Java.
  • -
- -
- - -
-

- The first release of the POI project - was an astounding success. This release seeks to build on that - success by: -

-
    -
  • - Refactoring POIFS into imput and - output classes as well as an event-driven API for reading. -
  • -
  • - Refactor HSSF for greater - performance as well as an event-driven API for reading -
  • -
  • - Extend HSSF by adding the ability to read and write formulas. -
  • -
  • - Extend HSSF by adding the ability to read and write - user-defined styles. -
  • -
  • - Create a Cocoon 2 Generator for HSSF using the same tags - as the HSSF Serializer. -
  • -
  • - Create a new library (HDF) for reading and writing - Microsoft Word DOC format. -
  • -
  • - Refactor the HSSFSerializer into a separate extensible - POIFSSerializer and HSSFSerializer -
  • -
  • - Providing the create excel charts. (write only) -
  • -
-
-
-
-
-

- There are a number of enthusiastic - users of XML, UNIX and Java technology. Furthermore, the Microsoft - solution for outputting Office Document formats often involves - actually manipulating the software as an OLE Server. This method - provides extremely low performance, extremely high overhead and is - only capable of handing one document at a time. -

-
    -
  1. - Our intended audience for the HSSF - Serializer portion of this project are developers writing reports or - data extracts in XML format. -
  2. -
  3. - Our intended audience for the HSSF - library portion of this project is ourselves as we are developing - the HSSF serializer and anyone who needs to read and write Excel - spreadsheets in a non-XML Java environment, or who has specific - needs not addressed by the Serializer -
  4. -
  5. - Our intended audience for the - POIFS library is ourselves as we are developing the HSSF and HDF - libraries and anyone wishing to provide other libraries for - reading/writing other file formats utilizing the OLE 2 Compound - Document Format in Java. -
  6. -
  7. - Our intended audience for the HSSF - generator are developers who need to export Excel spreadsheets to - XML in a non-proprietary environment. -
  8. -
  9. - Our intended audience for the HDF - library is ourselves, as we will be developing a HDF Serializer in a - later release, and anyone wishing to add .DOC file processing and - creation to their projects. -
  10. -
-
-
-

- The users of this software shall be - developers in a Java environment on any operating system, or power - users who are capable of XML document generation/deployment. -

-
-
-

- The HSSF library currently requires a - full object representation to be created before reading values. This - results in very high memory utilization. We need to reduce this - substantially for reading. It would be preferable to do this for - writing, but it may not be possible due to the constraints imposed by - the file format itself. Memory utilization during read is our top - user complaint. -

-

- The POIFS library currently requires a - full object representation to be created before reading values. This - results in very high memory utilization. We need to reduce this - substantially for reading. -

-

- The HSSF library currently ignores - formula cells and identifies them as "UnknownRecord" at the - lower level of the API. We must provide a way to read and write - formulas. This is now the top requested feature. -

-

- The HSSF library currently does not support - charts. This is a key requirement of some users who wish to use HSSF - in a reporting engine. -

-

- The HSSF Serializer currently does not - provide serialization for cell styling. User's will want stylish - spreadsheets to result from their XML. -

-

- There is currently no way to generate - the XML from an XLS that is consistent with the format used by the - HSSF Serializer. -

-

- There should be a way to read and write - the DOC file format using pure Java. -

- -
-
-

- Originally there weren't any decent alternatives for reading or writing - to Excel. This has changed somewhat. -

-
-
-
-
-

- The produced code shall be licensed by - the Apache License as used by the Cocoon 2 project (APL 1.1) and - maintained on at http://poi.sourceforge.net - and http://sourcefoge.net/projects/poi. - It is our hope to at some point integrate with the various Apache - projects (xml.apache.org and jakarta.apache.org), at which point we'd - turn the copyright over to them. -

-
-
-

- For developers on a Java and/or XML - environment this project will provide all the tools necessary for - outputting XML data in the Microsoft Excel format. This project seeks - to make the use of Microsoft Windows based servers unnecessary for - file format considerations and to fully document the OLE 2 Compound - Document format. The project aims not only to provide the tools for - serializing XML to Excel and Word file formats and the tools for - writing to those file formats from Java, but also to provide the - tools for later projects to convert other OLE 2 Compound Document - formats to pure Java APIs. -

-
-
-

- HSSF Serializer for Apache Cocoon 2 -

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
- Benefit - - Supporting Features -
- Ability to serialize styles from XML spreadsheets. - - HSSFSerialzier will support styles. -
- Ability to read and write formulas in XLS files. - - HSSF will support reading/writing formulas. -
- Ability to output in MS Word on any platform using Java. - - The project will develop an API that outputs in Word format - using pure Java. -
- Enhance performance for reading and writing XLS files. - - HSSF will undergo a number of performance enhancements. HSSF - will include a new event-based API for reading XLS files. POIFS - will support a new event-based API for reading OLE2 CDF files. -
- Ability to generate XML from XLS files - - The project will develop an HSSF Generator. -
- The ability to generate charts - - HSSF will provide low level support for chart records as well - as high level API support for generating charts. The ability - to read chart information will not initially be provided. -
-
-
-
    -
  • - The HSSF Serializer and Generator - will support the Gnumeric 1.0 XML tag language. -
  • -
  • - The HSSF Generator and HSSF - Serializer will be mutually validating. It should be possible to - have an XLS file created by the Serializer run through the Generator - and the output back through the Serializer (via the Cocoon pipeline) - and get the same file or a reasonable facimille (no one cares if it - differs by the order of the binary records in some minor but - non-visually recognizable manner). -
  • -
  • - The HSSF Generator will run on any - Java 2 supporting platform with Apache Cocoon 2 installed along with - the HSSF and POIFS APIs. -
  • -
  • - The HSSF Serializer will run on - any Java 2 supporting platform with Apache Cocoon 2 installed along - with the HSSF and POIFS APIs. -
  • -
  • - The HDF API requires a Java 2 - implementation and the POIFS API. -
  • -
  • - The HSSF API requires a Java 2 - implementation and the POIFS API. -
  • -
  • - The POIFS API requires a Java 2 - implementation. -
  • - -
-
-
-
-

- Enhancements to the POIFS API will - include: -

-
    -
  • - An event driven API for reading - POIFS Filesystems. -
  • -
  • - A low-level API for - creating/manipulating POI filesystems. -
  • -
  • - Code improvements supporting - greater separation between read and write structures. -
  • -
-

- Enhancements to the HSSF API will - include: -

-
    -
  • - An event driven API for reading - XLS files. -
  • -
  • - Performance improvements. -
  • -
  • - Formula support (read/write) -
  • -
  • - Support for user-defined data - formats -
  • -
  • - Better documentation of the file - format and structure. -
  • -
  • - An API for creation of charts. -
  • -
-

- The HSSF Generator will include: -

-
    -
  • - A set of classes supporting the - Cocoon 2 Generator interfaces providing a method for reading XLS - files and outputting SAX events. -
  • -
  • - The same tag format used by the - HSSFSerializer in any given release. -
  • -
-

- The HDF API will include: -

-
    -
  • - An event driven API for reading - DOC files. -
  • -
  • - A set of high and low level APIs - for reading and writing DOC files. -
  • -
  • - Documentation of the DOC file - format or enhancements to existing documentation. -
  • -
-
-
-
-

- All Java code will be 100% pure Java. -

-
-
-

- The minimum system requirements for the POIFS API are: -

-
    -
  • 64 Mbytes memory
  • -
  • Java 2 environment
  • -
  • Pentium or better processor (or equivalent on other platforms)
  • -
-

- The minimum system requirements for the the HSSF API are: -

-
    -
  • 64 Mbytes memory
  • -
  • Java 2 environment
  • -
  • Pentium or better processor (or equivalent on other platforms)
  • -
  • POIFS API
  • -
-

- The minimum system requirements for the the HDF API are: -

-
    -
  • 64 Mbytes memory
  • -
  • Java 2 environment
  • -
  • Pentium or better processor (or equivalent on other platforms)
  • -
  • POIFS API
  • -
- -

- The minimum system requirements for the HSSF Serializer are: -

-
    -
  • 64 Mbytes memory
  • -
  • Java 2 environment
  • -
  • Pentium or better processor (or equivalent on other platforms)
  • -
  • Cocoon 2
  • -
  • HSSF API
  • -
  • POI API
  • -
-
-
-

- All components must perform well enough - to be practical for use in a webserver environment (especially - the "killer trio": Cocoon2/Tomcat/Apache combo) -

-
-
-

- The software will run primarily in - developer environments. We should make some allowances for - not-highly-technical users to write XML documents for the HSSF - Serializer. All other components will assume intermediate Java 2 - knowledge. No XML knowledge will be required except for using the - HSSF Serializer. As much documentation as is practical shall be - required for all components as XML is relatively new, and the - concepts introduced for writing spreadsheets and to POI filesystems - will be brand new to Java and many Java developers. -

-
-
-
-
-

- The filesystem as read and written by - POI shall be fully documented and explained so that the average Java - developer can understand it. -

-
-
-

- The POI API will be fully documented - through Javadoc. A walkthrough of using the high level POI API shall - be provided. No documentation outside of the Javadoc shall be - provided for the low-level POI APIs. -

-
-
-

- The HSSF File Format as implemented by - the HSSF API will be fully documented. No documentation will be - provided for features that are not supported by HSSF API that are - supported by the Excel 97 File Format. Care will be taken not to - infringe on any "legal stuff". Additionally, we are - collaborating with the fine folks at OpenOffice.org on - *free* documentation of the format. -

-
-
-

- The HSSF API will be documented by - javadoc. A walkthrough of using the high level HSSF API shall be - provided. No documentation outside of the Javadoc shall be provided - for the low level HSSF APIs. -

-
-
-

- The HDF API will be documented by - javadoc. A walkthrough of using the high level HDF API shall be - provided. No documentation outside of the Javadoc shall be provided - for the low level HDF APIs. -

-
-
-

- The HSSF Serializer will be documented - by javadoc. -

-
-
-

- The HSSF Generator will be documented - by javadoc. -

-
-
-

- The XML tag language along with - function and usage shall be fully documented. Examples will be - provided as well. -

-
-
-
-
-

- filesystem shall refer only to the POI formatted archive. -

-
-
-

- file shall refer to the embedded data stream within a - POI filesystem. This will be the actual embedded document. -

-
-
- -
diff --git a/src/documentation/xdocs/plan/book.xml b/src/documentation/xdocs/plan/book.xml deleted file mode 100644 index 12857f774f..0000000000 --- a/src/documentation/xdocs/plan/book.xml +++ /dev/null @@ -1,20 +0,0 @@ - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/plan/index.xml b/src/documentation/xdocs/plan/index.xml deleted file mode 100644 index 74e44e3ada..0000000000 --- a/src/documentation/xdocs/plan/index.xml +++ /dev/null @@ -1,58 +0,0 @@ - - - - -
- Planning Documentation - Overview - - - - -
- - -
- -

This is a collection of notes to assist with long-term planning and - development. -

- -

There is much discussion of issues and research topics (RT) threads on - the dev mailing list (and elsewhere). However, details - get lost in the sheer volume. This is the place to document the summary of - discussions on some key topics. Some new and complex capabilities will take - lots of design and specification before they can be implemented. -

- -

Another use for this collection of notes is as a place to quickly store - a snippet from an email discussion or even a link to a discussion thread. - The concepts can then be fleshed-out over time. -

- -

Anyone can participate in this process. Please get involved in discussion - on dev and contribute patches for these summary planning - documents via the normal contribution - process. -

- -

These planning documents are intended to be concise notes only. They are - also ever-evolving, because as issues are addressed these notes will be - revised. -

-
- -
- -
    -
  • Release Plan - - major things to do before the 2.0 release
  • -
  • Documentation - - revisions and additions are required
  • -
  • See the general To Do list - and the dev email archives for other issues
  • -
-
- - -
diff --git a/src/documentation/xdocs/plan/release.xml b/src/documentation/xdocs/plan/release.xml deleted file mode 100644 index 0c660e0859..0000000000 --- a/src/documentation/xdocs/plan/release.xml +++ /dev/null @@ -1,63 +0,0 @@ - - - - -
- Release Plan 2.0 - Planning Documentation - - - - -
- - -
-

Todo

- -
- - -
diff --git a/src/documentation/xdocs/poifs/book.xml b/src/documentation/xdocs/poifs/book.xml deleted file mode 100644 index 9ee98e5fd0..0000000000 --- a/src/documentation/xdocs/poifs/book.xml +++ /dev/null @@ -1,17 +0,0 @@ - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/poifs/fileformat.xml b/src/documentation/xdocs/poifs/fileformat.xml deleted file mode 100644 index 9d9277c623..0000000000 --- a/src/documentation/xdocs/poifs/fileformat.xml +++ /dev/null @@ -1,667 +0,0 @@ - - - -
- POIFS File System Internals - - - -
- -
-
-

POIFS file systems are essentially normal files stored on a - Java-compatible platform's native file system. They are - typically identified by names ending in a four character - extension noting what type of data they contain. For - example, a file ending in ".xls" would likely - contain spreadsheet data, and a file ending in - ".doc" would probably contain a word processing - document. POIFS file systems are called "file - system", because they contain multiple embedded files - in a manner similar to traditional file systems. Along - functional lines, it would be more accurate to call these - POIFS archives. For the remainder of this document it is - referred to as a file system in order to avoid confusion - with the "files" it contains.

-

POIFS file systems are compatible with those document - formats used by a well-known software company's popular - office productivity suite and programs outputting - compatible data. Because the POIFS file system does not - provide compression, encryption or any other worthwhile - feature, its not a good choice unless you require - interoperability with these programs.

-

The POIFS file system does not encode the documents - themselves. For example, if you had a word processor file - with the extension ".doc", you would actually - have a POIFS file system with a document file archived - inside of that file system.

-
-
-

This document utilizes the numeric types as described by - the Java Language Specification, which can be found at - http://java.sun.com. In - short:

-
    -
  • A byte is an 8 bit signed integer ranging from - -128 to 127.
  • -
  • A short is a 16 bit signed integer ranging from - -32768 to 32767
  • -
  • An int is a 32 bit signed integer ranging from - -2147483648 to 2147483647
  • -
  • A long is a 64 bit signed integer ranging from - -9.22E18 to 9.22E18.
  • -
-

The Java Language Specification spells out a number of - other types that are not referred to by this document.

-

Where this document makes references to "endian - conversion" it is referring to the byte order of - stored numbers. Numbers in "little-endian order" - are stored with the least significant byte first. In - order to properly read a short, for example, you'd read two - bytes and then shift the second byte 8 bits to the left - before performing an or operation to it - against the first byte. The following code illustrates this - method:

- -public int getShort (byte[] rec) -{ - return ((rec[1] << 8) | (rec[0] & 0x00ff)); -} -
-
-

This is a walkthrough of a POIFS file system and how it is - put together. It is not intended to give a concise - description but to give a "big picture" of the - general structure and how it's interpreted.

-

A POIFS file system begins with a header. This header - identifies locations in the file by function and provides a - sanity check identifying a file as a POIFS file system.

-

The first 64 bits of the header compose a magic number - identifier. This identifier tells the client software - that this is indeed a POIFS file system and that it should - be treated as such. This is a "sanity check" to - make sure this is a POIFS file system and not some other - format. The header also contains an array of block - numbers. These block numbers refer to blocks in the - file. When these blocks are read together they form the - Block Allocation Table. The header also contains a - pointer to the first element in the property table, - also known as the root element, and a pointer to the - small Block Allocation Table (SBAT).

-

The block allocation table or BAT, along with - the property table, specify which blocks in the file - system belong to which files. After the header block, the - file system is divided into identically sized blocks of - data, numbered from 0 to however many blocks there are in - the file system. For each file in the file system, its - entry in the property table includes the index of the first - block in the array of blocks. Each block's index into the - array of blocks is also its index into the BAT, and the - integer value stored at that index in the BAT gives the - index of the next block in the array (and thus the index of - the next BAT value). A special value is stored in the BAT - to indicate "end of file".

-

The property table is essentially the directory - storage for the file system. It consists of the name of the - file or directory, its start block in both the file - system and BAT, and its actual size. The first - property in the property table is the root - element. It has two purposes: to be a directory entry - (the root of the directory tree, to be specific), and to - hold the start block for the small block data.

-

Small block data is a special file that contains the data - for small files (less than 4K bytes). It subdivides its - blocks into smaller blocks and there is a special small - block allocation table that, like the main BAT for larger - files, is used to map a small file to its small blocks.

-
-
-

The POIFS file system begins with a header - block. The first 64 bits of the header form a long - file type id or magic number identifier of - 0xE11AB1A1E011CFD0L. This is basically a - sanity check. If this isn't the first thing in the header - (and consequently the file system) then this is not a - POIFS file system and should be read with some other - library.

-

It's important to know the most important parts of the - header. These are discussed in the rest of this - section.

-
-

At offset 0x2C is an int specifying the number - of elements in the BAT array. The array at - 0x4C an array of ints. This array contains the - indices of every block in the Block Allocation - Table.

-
-
-

Very large POIFS archives may have more blocks than can - be addressed by the BAT blocks enumerated in the header - block. How large? Well, the BAT array in the header can - contain up to 109 BAT block indices; each BAT block - references up to 128 blocks, and each block is 512 - bytes, so we're talking about 109 * 128 * 512 = - 6.8MB. That's a pretty respectable document! But, you - could have much more data than that, and in today's - world of cheap gigabyte drives, why not? So, the BAT - may be extended in that event. The integer value at - offset 0x44 of the header is the index of the - first extended BAT (XBAT) block. At offset - 0x48 of the header, there is an int value that - specifies how many XBAT blocks there are. The XBAT - blocks begin at the specified index into the array of - blocks making up the POIFS file system, and continue in - sequence for the specified count of XBAT blocks.

-

Each XBAT block contains the indices of up to 128 BAT - blocks, so the document size can be expanded by another - 8MB for each XBAT block. The BAT blocks indexed by an - XBAT block are appended to the end of the list of BAT - blocks enumerated in the header block. Thus the BAT - blocks enumerated in the header block are BAT blocks 0 - through 108, the BAT blocks enumerated in the first - XBAT block are BAT blocks 109 through 236, the BAT - blocks enumerated in the second XBAT block are BAT - blocks 237 through 364, and so on.

-

Through the use of XBAT blocks, the limit on the - overall document size is that imposed by the 4-byte - block indices; if the indices are unsigned ints, the - maximum file size is 2 terabytes, 1 terabyte if the - indices are treated as signed ints. Either way, I have - yet to see a disk drive large enough to accommodate - such a file on the shelves at the local office supply - stores.

-
-
-

If a file contained in a POIFS archive is smaller than - 4096 bytes, it is stored in small blocks. Small blocks - are 64 bytes in length and are contained within big - blocks, up to 8 to a big block. As the main BAT is used - to navigate the array of big blocks, so the small - block allocation table is used to navigate the - array of small blocks. The SBAT's start block index is - found at offset 0x3C of the header block, and - remaining blocks constituting the SBAT are found by - walking the main BAT as if it were an ordinary file in - the POIFS file system (this process is described - below).

-
-
-

An integer at address 0x30 specifies the start - index of the property table. This integer is specified - as a "block index". The Property Table - is stored, as is almost everything in a POIFS file - system, in big blocks and walked via the BAT. The - Property Table is described below.

-
-
-
-

The property table is essentially nothing more than the - directory system. Properties are 128 byte records - contained within the 512 byte blocks. The first property - is always the Root Entry. The following applies to - individual properties within a property table:

-
    -
  • At offset 0x00 in the property is the - "name". This is stored as an - uncompressed 16 bit unicode string. In short every - other byte corresponds to an "ASCII" - character. The size of this string is stored at offset - 0x40 (string size) as a short.
  • -
  • At offset 0x42 is the property type - (byte). The type is 1 for directory, 2 for file or 5 - for the Root Entry.
  • -
  • At offset 0x43 is the node color - (byte). The color is either 1, (black), or 0, - (red). Properties are apparently meant to be arranged - in a red-black binary tree, subject to the following - rules: -
      -
    1. The root of the tree is always black
    2. -
    3. Two consecutive nodes cannot both be red
    4. -
    5. A property is less than another property if its - name length is less than the other property's name - length
    6. -
    7. If two properties have the same name length, the - sort order is determined by the sort order of the - properties' names.
    8. -
  • -
  • At offset 0x44 is the index (int) of the - previous property.
  • -
  • At offset 0x48 is the index (int) of the - next property.
  • -
  • At offset 0x4C is the index (int) of the - first directory entry. This is used by - directory entries.
  • -
  • At offset 0x74 is an integer giving the - start block for the file described by this - property. This index corresponds to an index in the - array of indices that is the Block Allocation Table - (or the Small Block Allocation Table) as well as the - index of the first block in the file. This is used by - files and the root entry.
  • -
  • At offset 0x78 is an integer giving the total - actual size of the file pointed at by this - property. If the file size is less than 4096, the file - is stored in small blocks and the SBAT is used to walk - the small blocks making up the file. If the file size - is 4096 or larger, the file is stored in big blocks - and the main BAT is used to walk the big blocks making - up the file. The exception to this rule is the Root - Entry, which, regardless of its size, is - always stored in big blocks and the main BAT is - used to walk the big blocks making up this special - file.
  • -
-
-
-

The Root Entry in the Property Table - contains the information necessary to read and write - small files, which are files less than 4096 bytes - long. The start block field of the Root Entry is the - start index of the Small Block Array, which is - read like any other file in the POIFS file system. Since - the SBAT cannot be used without the Small Block Array, - the Root Entry MUST be read or written using the Block - Allocation Table. The blocks making up the Small - Block Array are divided into 64-byte small blocks, up to - the size indicated in the Root Entry (which should always - be a multiple of 64).

-
-
-

The individual properties form a directory tree, with the - Root Entry as the directory tree's root, as shown - in the accompanying drawing. Note the numbers in - parentheses in each node; they represent the node's index - in the array of properties. The NEXT_PROP, - PREVIOUS_PROP, and CHILD_PROP fields hold - these indices, and are used to navigate the tree.

- -

Each directory entry (i.e., a property whose type is - directory or root entry) uses its - CHILD_PROP field to point to one of its - subordinate (child) properties. It doesn't seem to matter - which of its children it points to. Thus in the previous - drawing, the Root Entry's CHILD_PROP field may contain 1, - 4, or the index of one of its other children. Similarly, - the directory node (index 1) may have, in its CHILD_PROP - field, 2, 3, or the index of one of its other - children.

-

The children of a given directory property point to each - other in a similar fashion by using their - NEXT_PROP and PREVIOUS_PROP fields.

-

Unused NEXT_PROP, PREVIOUS_PROP, and - CHILD_PROP fields contain the marker value of - -1. All file properties have a value of -1 for their - CHILD_PROP fields for example.

-
-
-

The BAT blocks are pointed at by the bat array - contained in the header and supplemented, if necessary, - by the XBAT blocks. These blocks form a large - table of integers. These integers are block numbers. The - Block Allocation Table holds chains of integers. - These chains are terminated with -2. The elements in - these chains refer to blocks in the files. The starting - block of a file is NOT specified in the BAT. It is - specified by the property for a given file. The - elements in this BAT are both the block number (within - the file minus the header) and the number of the - next BAT element in the chain. This can be thought of as - a linked list of blocks. The BAT array contains the links - from one block to the next, including the end of chain - marker.

-

Here's an example: Let's assume that the BAT begins as - follows:

-

BAT[ 0 ] = 2

-

BAT[ 1 ] = 5

-

BAT[ 2 ] = 3

-

BAT[ 3 ] = 4

-

BAT[ 4 ] = 6

-

BAT[ 5 ] = -2

-

BAT[ 6 ] = 7

-

BAT[ 7 ] = -2

-

...

-

Now, if we have a file whose Property Table entry says it - begins with index 0, we walk the BAT array and see that - the file consists of blocks 0 (because the start block is - 0), 2 (because BAT[ 0 ] is 2), 3 (BAT[ 2 ] is 3), 4 (BAT[ - 3 ] is 4), 6 (BAT[ 4 ] is 6), and 7 (BAT[ 6 ] is 7). It - ends at block 7 because BAT[ 7 ] is -2, which is the end - of chain marker.

-

Similarly, a file beginning at index 1 consists of - blocks 1 and 5.

-

Other special numbers in a BAT array are:

-
    -
  • -1, which indicates an unused block
  • -
  • -3, which indicates a "special" block, such - as a block used to make up the Small Block Array, the - Property Table, the main BAT, or the SBAT
  • -
-
-
-

The following outlines the basic file system structures.

-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
FieldDescriptionOffsetLengthDefault value or const
FILETYPEMagic number identifying this as a POIFS file - system.0x0000Long0xE11AB1A1E011CFD0
UK1Unknown constant0x0008Integer0
UK2Unknown Constant0x000CInteger0
UK3Unknown Constant0x0014Integer0
UK4Unknown Constant (revision?)0x0018Short0x003B
UK5Unknown Constant (version?)0x001AShort0x0003
UK6Unknown Constant0x001CShort-2
LOG_2_BIG_BLOCK_SIZELog, base 2, of the big block size0x001EShort9 (2 ^ 9 = 512 bytes)
LOG_2_SMALL_BLOCK_SIZELog, base 2, of the small block size0x0020Integer6 (2 ^ 6 = 64 bytes)
UK7Unknown Constant0x0024Integer0
UK8Unknown Constant0x0028Integer0
BAT_COUNTNumber of elements in the BAT array0x002CIntegerrequired
PROPERTIES_STARTBlock index of the first block of the property - table0x0030Integerrequired
UK9Unknown Constant0x0034Integer0
UK10Unknown Constant0x0038Integer0x00001000
SBAT_STARTBlock index of first big block containing the small - block allocation table (SBAT)0x003CInteger-2
SBAT_Block_CountNumber of big blocks holding the SBAT0x0040Integer1
XBAT_STARTBlock index of the first block in the Extended Block - Allocation Table (XBAT)0x0044Integer-2
XBAT_COUNTNumber of elements in the Extended Block Allocation - Table (to be added to the BAT)0x0048Integer0
BAT_ARRAYArray of block indices constituting the Block - Allocation Table (BAT)0x004C, 0x0050, 0x0054 ... 0x01FCInteger[]-1 for unused elements, at least first element must - be filled.
N/AHeader block data not otherwise described in this - tableN/AN/A-1
-
-
- - - - - - - - - - - - - - - -
FieldDescriptionOffsetLengthDefault value or const
BAT_ELEMENTAny given element in the BAT block0x0000, 0x0004, 0x0008, ... 0x01FCInteger -
    -
  • -1 = unused
  • -
  • -2 = end of chain
  • -
  • -3 = special (e.g., BAT block)
  • -
-

All other values point to the next element in the - chain and the next index of a block composing the - file.

-
-
-
- - - - - - - - - - - - - - - -
FieldDescriptionOffsetLengthDefault value or const
Properties[]This block contains the properties.0x0000, 0x0080, 0x0100, 0x0180128 bytesAll unused space is set to -1.
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
FieldDescriptionOffsetLengthDefault value or const
NAMEA unicode null-terminated uncompressed 16bit string - (lose the high bytes) containing the name of the - property.0x00, 0x02, 0x04, ... 0x3EShort[]0x0000 for unused elements, field required, 32 - (0x40) element max
NAME_SIZENumber of characters in the NAME field0x40ShortRequired
PROPERTY_TYPEProperty type (directory, file, or root)0x42Byte1 (directory), 2 (file), or 5 (root entry)
NODE_COLORNode color0x43Byte0 (red) or 1 (black)
PREVIOUS_PROPPrevious property index0x44Integer-1
NEXT_PROPNext property index0x48Integer-1
CHILD_PROPFirst child property index0x4cInteger-1
SECONDS_1Seconds component of the created timestamp?0x64Integer0
DAYS_1Days component of the created timestamp?0x68Integer0
SECONDS_2Seconds component of the modified timestamp?0x6CInteger0
DAYS_2Days component of the modified timestamp?0x70Integer0
START_BLOCKStarting block of the file, used as the first block - in the file and the pointer to the next block from - the BAT0x74IntegerRequired
SIZEActual size of the file this property points - to. (used to truncate the blocks to the real - size).0x78Integer0
-
-
-
- -
diff --git a/src/documentation/xdocs/poifs/how-to.xml b/src/documentation/xdocs/poifs/how-to.xml deleted file mode 100644 index a9336a9114..0000000000 --- a/src/documentation/xdocs/poifs/how-to.xml +++ /dev/null @@ -1,370 +0,0 @@ - - - -
- How To Use the POIFS APIs - - - -
- -
-

This document describes how to use the POIFS APIs to read, write, and modify files that employ a POIFS-compatible data structure to organize their content.

-
-
    -
  • 02.10.2002 - completely rewritten from original documents on sourceforge
  • -
-
-
-

This document is intended for Java developers who need to use the POIFS APIs to read, write, or modify files that employ a POIFS-compatible data structure to organize their content. It is not necessary for developers to understand the POIFS data structures, and an explanation of those data structures is beyond the scope of this document. It is expected that the members of the target audience will understand the rudiments of a hierarchical file system, and familiarity with the event pattern employed by Java APIs such as AWT would be helpful.

-
-
-

This document attempts to be consistent in its terminology, which is defined here:

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
TermDefinition
DirectoryA special file that may contain other directories and documents.
DirectoryEntryRepresentation of a directory within another directory.
DocumentA file containing data, such as word processing data or a spreadsheet workbook.
DocumentEntryRepresentation of a document within a directory.
EntryRepresentation of a file in a directory.
FileA named entity, managed and contained by the file system.
File SystemThe POIFS data structures, plus the contained directories and documents, which are maintained in a hierarchical directory structure.
Root DirectoryThe directory at the base of a file system. All file systems have a root directory. The POIFS APIs will not allow the root directory to be removed or renamed, but it can be accessed for the purpose of reading its contents or adding files (directories and documents) to it.
-
-
-
-

This section covers reading a file system. There are two ways to read a file system; these techniques are sketched out in the following table, and then explained in greater depth in the sections following the table.

- - - - - - - - - - - - - - - - -
TechniqueAdvantagesDisadvantages
Conventional Reading -
    -
  • Simpler API similar to reading a conventional file system.
  • -
  • Can read documents in any order.
  • -
-
-
    -
  • All files are resident in memory, whether your application needs them or not.
  • -
-
Event-Driven Reading -
    -
  • Reduced footprint -- only the documents you care about are processed.
  • -
  • Improved performance -- no time is wasted reading the documents you're not interested in.
  • -
-
-
    -
  • More complicated API.
  • -
  • Need to know in advance which documents you want to read.
  • -
  • No control over the order in which the documents are read.
  • -
  • No way to go back and get additional documents except to re-read the file system, which may not be possible, e.g., if the file system is being read from an input stream that lacks random access support.
  • -
-
-
-

In this technique for reading, the entire file system is loaded into memory, and the entire directory tree can be walked by an application, reading specific documents at the application's leisure.

-
-

Before an application can read a file from the file system, the file system needs to be loaded into memory. This is done by using the org.apache.poi.poifs.filesystem.POIFSFileSystem class. Once the file system has been loaded into memory, the application may need the root directory. The following code fragment will accomplish this preparation stage:

- -// need an open InputStream; for a file-based system, this would be appropriate: -// InputStream stream = new FileInputStream(fileName); -POIFSFileSystem fs; -try -{ - fs = new POIFSFileSystem(inputStream); -} -catch (IOException e) -{ - // an I/O error occurred, or the InputStream did not provide a compatible - // POIFS data structure -} -DirectoryEntry root = fs.getRoot(); -

Assuming no exception was thrown, the file system can then be read.

-

Note: loading the file system can take noticeable time, particularly for large file systems.

-
-
-

Once the file system has been loaded into memory and the root directory has been obtained, the root directory can be read. The following code fragment shows how to read the entries in an org.apache.poi.poifs.filesystem.DirectoryEntry instance:

- -// dir is an instance of DirectoryEntry ... -for (Iterator iter = dir.getEntries(); iter.hasNext(); ) -{ - Entry entry = (Entry)iter.next(); - System.out.println("found entry: " + entry.getName()); - if (entry instanceof DirectoryEntry) - { - // .. recurse into this directory - } - else if (entry instanceof DocumentEntry) - { - // entry is a document, which you can read - } - else - { - // currently, either an Entry is a DirectoryEntry or a DocumentEntry, - // but in the future, there may be other entry subinterfaces. The - // internal data structure certainly allows for a lot more entry types. - } -} -
-
-

There are a couple of ways to read a document, depending on whether the document resides in the root directory or in another directory. Either way, you will obtain an org.apache.poi.poifs.filesystem.DocumentInputStream instance.

-
-

The DocumentInputStream class is a simple implementation of InputStream that makes a few guarantees worth noting:

-
    -
  • available() always returns the number of bytes in the document from your current position in the document.
  • -
  • markSupported() returns true.
  • -
  • mark(int limit) ignores the limit parameter; basically the method marks the current position in the document.
  • -
  • reset() takes you back to the position when mark() was last called, or to the beginning of the document if mark() has not been called.
  • -
  • skip(long n) will take you to your current position + n (but not past the end of the document).
  • -
-

The behavior of available means you can read in a document in a single read call like this:

- -byte[] content = new byte[ stream.available() ]; -stream.read(content); -stream.close(); -

The combination of mark, reset, and skip provide the basic mechanisms needed for random access of the document contents.

-
-
-

If the document resides in the root directory, you can obtain a DocumentInputStream like this:

- -// load file system -try -{ - DocumentInputStream stream = filesystem.createDocumentInputStream(documentName); - // process data from stream -} -catch (IOException e) -{ - // no such document, or the Entry represented by documentName is not a - // DocumentEntry -} -
-
-

A more generic technique for reading a document is to obtain an org.apache.poi.poifs.filesystem.DirectoryEntry instance for the directory containing the desired document (recall that you can use getRoot() to obtain the root directory from its file system). From that DirectoryEntry, you can then obtain a DocumentInputStream like this:

- -DocumentEntry document = (DocumentEntry)directory.getEntry(documentName); -DocumentInputStream stream = new DocumentInputStream(document); - -
-
-
-
-

The event-driven API for reading documents is a little more complicated and requires that your application know, in advance, which files it wants to read. The benefit of using this API is that each document is in memory just long enough for your application to read it, and documents that you never read at all are not in memory at all. When you're finished reading the documents you wanted, the file system has no data structures associated with it at all and can be discarded.

-
-

The preparation phase involves creating an instance of org.apache.poi.poifs.eventfilesystem.POIFSReader and to then register one or more org.apache.poi.poifs.eventfilesystem.POIFSReaderListener instances with the POIFSReader.

- -POIFSReader reader = new POIFSReader(); -// register for everything -reader.registerListener(myOmnivorousListener); -// register for selective files -reader.registerListener(myPickyListener, "foo"); -reader.registerListener(myPickyListener, "bar"); -// register for selective files -reader.registerListener(myOtherPickyListener, new POIFSDocumentPath(), - "fubar"); -reader.registerListener(myOtherPickyListener, new POIFSDocumentPath( - new String[] { "usr", "bin" ), "fubar"); -
-
-

org.apache.poi.poifs.eventfilesystem.POIFSReaderListener is an interface used to register for documents. When a matching document is read by the org.apache.poi.poifs.eventfilesystem.POIFSReader, the POIFSReaderListener instance receives an org.apache.poi.poifs.eventfilesystem.POIFSReaderEvent instance, which contains an open DocumentInputStream and information about the document.

-

A POIFSReaderListener instance can register for individual documents, or it can register for all documents; once it has registered for all documents, subsequent (and previous!) registration requests for individual documents are ignored. There is no way to unregister a POIFSReaderListener.

-

Thus, it is possible to register a single POIFSReaderListener for multiple documents - one, some, or all documents. It is guaranteed that a single POIFSReaderListener will receive exactly one notification per registered document. There is no guarantee as to the order in which it will receive notification of its documents, as future implementations of POIFSReader are free to change the algorithm for walking the file system's directory structure.

-

It is also permitted to register more than one POIFSReaderListener for the same document. There is no guarantee of ordering for notification of POIFSReaderListener instances that have registered for the same document when POIFSReader processes that document.

-

It is guaranteed that all notifications occur in the same thread. A future enhancement may be made to provide multi-threaded notifications, but such an enhancement would very probably be made in a new reader class, a ThreadedPOIFSReader perhaps.

-

The following table describes the three ways to register a POIFSReaderListener for a document or set of documents:

- - - - - - - - - - - - - - - - - -
Method SignatureWhat it does
registerListener(POIFSReaderListener listener)registers listener for all documents.
registerListener(POIFSReaderListener listener, String name)registers listener for a document with the specified name in the root directory.
registerListener(POIFSReaderListener listener, POIFSDocumentPath path, String name)registers listener for a document with the specified name in the directory described by path
-
-
-

The org.apache.poi.poifs.filesystem.POIFSDocumentPath class is used to describe a directory in a POIFS file system. Since there are no reserved characters in the name of a file in a POIFS file system, a more traditional string-based solution for describing a directory, with special characters delimiting the components of the directory name, is not feasible. The constructors for the class are used as follows:

- - - - - - - - - - - - - - - - - - - - - - - - - -
Constructor exampleDirectory described
new POIFSDocumentPath()The root directory.
new POIFSDocumentPath(null)The root directory.
new POIFSDocumentPath(new String[ 0 ])The root directory.
new POIFSDocumentPath(new String[ ] { "foo", "bar"} )in Unix terminology, "/foo/bar".
new POIFSDocumentPath(new POIFSDocumentPath(new String[] { "foo" }), new String[ ] { "fu", "bar"} )in Unix terminology, "/foo/fu/bar".
-
-
-

Processing org.apache.poi.poifs.eventfilesystem.POIFSReaderEvent events is relatively easy. After all of the POIFSReaderListener instances have been registered with POIFSReader, the POIFSReader.read(InputStream stream) method is called.

-

Assuming that there are no problems with the data, as the POIFSReader processes the documents in the specified InputStream's data, it calls registered POIFSReaderListener instances' processPOIFSReaderEvent method with a POIFSReaderEvent instance.

-

The POIFSReaderEvent instance contains information to identify the document (a POIFSDocumentPath object to identify the directory that the document is in, and the document name), and an open DocumentInputStream instance from which to read the document.

-
-
-
-
-

Writing a file system is very much like reading a file system in that there are multiple ways to do so. You can load an existing file system into memory and modify it (removing files, renaming files) and/or add new files to it, and write it, or you can start with a new, empty file system:

- -POIFSFileSystem fs = new POIFSFileSystem(); -
-

There are two restrictions on the names of files in a file system that must be considered when creating files:

-
    -
  1. The name of the file must not exceed 31 characters. If it does, the POIFS API will silently truncate the name to fit.
  2. -
  3. The name of the file must be unique within its containing directory. This seems pretty obvious, but if it isn't spelled out, there'll be hell to pay, to be sure. Uniqueness, of course, is determined after the name has been truncated, if the original name was too long to begin with.
  4. -
-
-
-

A document can be created by acquiring a DirectoryEntry and calling one of the two createDocument methods:

- - - - - - - - - - - - - - - - -
Method SignatureAdvantagesDisadvantages
CreateDocument(String name, InputStream stream) -
    -
  • Simple API.
  • -
-
-
    -
  • Increased memory footprint (document is in memory until file system is written).
  • -
-
CreateDocument(String name, int size, POIFSWriterListener writer) -
    -
  • Decreased memory footprint (only very small documents are held in memory, and then only for a short time).
  • -
-
-
    -
  • More complex API.
  • -
  • Determining document size in advance may be difficult.
  • -
  • Lose control over when document is to be written.
  • -
-
-

Unlike reading, you don't have to choose between the in-memory and event-driven writing models; both can co-exist in the same file system.

-

Writing is initiated when the POIFSFileSystem instance's writeFilesystem() method is called with an OutputStream to write to.

-

The event-driven model is quite similar to the event-driven model for reading, in that the file system calls your org.apache.poi.poifs.filesystem.POIFSWriterListener when it's time to write your document, just as the POIFSReader calls your POIFSReaderListener when it's time to read your document. Internally, when writeFilesystem() is called, the final POIFS data structures are created and are written to the specified OutputStream. When the file system needs to write a document out that was created with the event-driven model, it calls the POIFSWriterListener back, calling its processPOIFSWriterEvent() method, passing an org.apache.poi.poifs.filesystem.POIFSWriterEvent instance. This object contains the POIFSDocumentPath and name of the document, its size, and an open org.apache.poi.poifs.filesystem.DocumentOutputStream to which to write. A DocumentOutputStream is a wrapper over the OutputStream that was provided to the POIFSFileSystem to write to, and has the responsibility of making sure that the document your application writes fits within the size you specified for it.

-
-
-

Creating a directory is similar to creating a document, except that there's only one way to do so:

- -DirectoryEntry createdDir = existingDir.createDirectory(name); -
-
-

As with reading documents, it is possible to create a new document or directory in the root directory by using convenience methods of POIFSFileSystem.

- - - - - - - - - - - - - - - - - -
DirectoryEntry Method SignaturePOIFSFileSystem Method Signature
createDocument(String name, InputStream stream)createDocument(InputStream stream, String name)
createDocument(String name, int size, POIFSWriterListener writer)createDocument(String name, int size, POIFSWriterListener writer)
createDirectory(String name)createDirectory(String name)
-
-
-
-

It is possible to modify an existing POIFS file system, whether it's one your application has loaded into memory, or one which you are creating on the fly.

-
-

Removing a document is simple: you get the Entry corresponding to the document and call its delete() method. This is a boolean method, but should always return true, indicating that the operation succeeded.

-
-
-

Removing a directory is also simple: you get the Entry corresponding to the directory and call its delete() method. This is a boolean method, but, unlike deleting a document, may not always return true, indicating that the operation succeeded. Here are the reasons why the operation may fail:

-
    -
  • The directory still has files in it (to check, call isEmpty() on its DirectoryEntry; is the return value false?)
  • -
  • The directory is the root directory. You cannot remove the root directory.
  • -
-
-
-

Regardless of whether the file is a directory or a document, it can be renamed, with one exception - the root directory has a special name that is expected by the components of a major software vendor's office suite, and the POIFS API will not let that name be changed. Renaming is done by acquiring the file's corresponding Entry instance and calling its renameTo method, passing in the new name.

-

Like delete, renameTo returns true if the operation succeeded, otherwise false. Reasons for failure include these:

-
    -
  • The new name is the same as another file in the same directory. And don't forget - if the new name is longer than 31 characters, it will be silently truncated. In its original length, the new name may have been unique, but truncated to 31 characters, it may not be unique any longer.
  • -
  • You tried to rename the root directory.
  • -
-
-
- -
diff --git a/src/documentation/xdocs/poifs/html/POIFSDesignDocument.html b/src/documentation/xdocs/poifs/html/POIFSDesignDocument.html deleted file mode 100755 index 0f18722fd6..0000000000 --- a/src/documentation/xdocs/poifs/html/POIFSDesignDocument.html +++ /dev/null @@ -1,1279 +0,0 @@ - - - POIFS Design Document - - - POIFS Design Document -

- This document describes the design of the POIFS system. It is - organized as follows: -

- -

-
    -
  1. - Scope -

    - This document is written as part of an iterative process. - As that process is not yet complete, neither is this - document. -

    -
  2. -
  3. - Assumptions -

    - The design of POIFS is not dependent on the code written - for the proof-of-concept prototype POIFS package. -

    -
  4. -
  5. - Design - Considerations -

    - As usual, the primary considerations in the design of the - POIFS assumption involve the classic space-time tradeoff. - In this case, the main consideration has to involve - minimizing the memory footprint of POIFS. POIFS may be - called upon to create relatively large documents, and in - web application server, it may be called upon to create - several documents simultaneously, and it will likely - co-exist with other Serializer systems, competing with - those other systems for space on the server. -

    -

    - We've addressed the risk of being too slow through a - proof-of-concept prototype. This prototype for POIFS - involved reading an existing file, decomposing it into its - constituent documents, composing a new POIFS from the - constituent documents, and writing the POIFS file back to - disk and verifying that the output file, while not - necessarily a byte-for-byte image of the input file, could - be read by the application that generated the input file. - This prototype proved to be quite fast, reading, - decomposing, and re-generating a large (300K) file in 2 to - 2.5 seconds. -

    -

    - While the POIFS format allows great flexibility in laying - out the documents and the other internal data structures, - the layout of the filesystem will be kept as simple as - possible. -

    -
  6. -
  7. - Design -

    - The design of the POIFS is broken down into two parts: - discussion of the classes and - interfaces, and discussion of how - these classes and interfaces will be used to convert an - appropriate Java InputStream (such as an XML stream) to a - POIFS output stream containing an HSSF document. -

    - Classes and Interfaces -

    - The classes and interfaces used in the POIFS are broken - down as follows: -

    - - - - - - - - - - - - - - - - - - - - - -
    PackageContents
    net.sourceforge.poi.poifs.storageBlock classes and interfaces
    net.sourceforge.poi.poifs.propertyProperty classes and interfaces
    net.sourceforge.poi.poifs.filesystemFilesystem classes and interfaces
    net.sourceforge.poi.utilUtility classes and interfaces
    -
      -
    1. - Block Classes and - Interfaces -

      - The block classes and interfaces are shown - in the following class diagram. -

      -

      - -

      - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
      Class/InterfaceDescription
      BATBlockThe BATBlock class - represents a single big block - containing 128 BAT - entries.
      Its - _fields array is - used to read and write the BAT entries - into the _data - array.
      Its - createBATBlocks - method is used to create an array of - BATBlock instances from an array of - int BAT entries.
      Its - calculateStorageRequirements - method calculates the number of BAT - blocks necessary to hold the specified - number of BAT entries.
      BigBlockThe BigBlock class is an - abstract class representing the common - big block of 512 bytes. It implements - BlockWritable, - trivially delegating the - writeBlocks method - of BlockWritable to its own abstract - writeData - method.
      BlockWritableThe BlockWritable interface - defines a single method, - writeBlocks, that - is used to write an implementation's - block data to an - OutputStream.
      DocumentBlockThe DocumentBlock class is - used by a Document to holds - its raw data. It also retains the - number of bytes read, as this is used - by the Document class to determine the - total size of the data, and is also - used internally to determine whether - the block was filled by the - InputStream or - not.
      The - DocumentBlock - constructor is passed an - InputStream from which to - fill its _data - array.
      The size - method returns the number of bytes - read (_bytes_read - when the instance was - constructed.
      The - partiallyRead - method returns true if the - _data array was - not completely filled, which may be - interpreted by the Document as having - reached the end of file - point.
      Typical use of the - DocumentBlock class is like - this:
      while - (true)
      {
          DocumentBlock - block = new - DocumentBlock(stream);
          blocks.add(block);
          size - += - block.size();
          if - (block.partiallyRead())
          {
              break;
          }
      }
      HeaderBlockThe HeaderBlock class is - used to contain the data found in a - POIFS header.
      Its IntegerField - members are used to read and write the - appropriate entries into the - _data - array.
      Its - setBATBlocks, - setPropertyStart, - and setXBATStart - methods are used to set the - appropriate fields in the - _data - array.
      The - calculateXBATStorageRequirements - method is used to determine how many - XBAT blocks are necessary to - accommodate the specified number of - BAT blocks. -
      PropertyBlockThe PropertyBlock class is - used to contain Property - instances for the PropertyTable - class.
      It contains an array, - _properties of 4 - Property instances, which together - comprise the 512 bytes of a BigBlock.
      The - createPropertyBlockArray - method is used to convert a - List of Property - instances into an array of - PropertyBlock instances. The number of - Property instances is rounded up to a - multiple of 4 by creating empty - anonymous inner class extensions of - Property.
      -
    2. -
    3. - Property Classes - and Interfaces -

      - The property classes and interfaces are - shown in the following class diagram. -

      -

      - -

      - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
      Class/InterfaceDescription
      DirectoryThe Directory interface is - implemented by the RootProperty - class. It is not strictly necessary - for the initial POIFS implementation, - but when the POIFS supports directory - elements, this interface will be - more widely implemented, and so is - included in the design at this point - to ease the eventual support of - directory elements.
      Its methods are - a getter/setter pair, - getChildren, - returning an Iterator of - Property - instances; and - addChild, which - will allow the caller to add another - Property instance to the Directory's - children.
      DocumentPropertyThe DocumentProperty class - is a trivial extension of Property and is - used by Document to keep - track of its associated entry in the - PropertyTable.
      Its - constructor takes a name and the - document size, on the assumption that - the Document will not create a - DocumentProperty until after it has - created the storage for the document - data and therefore knows how much data - there is.
      FileThe File interface - specifies the behavior of reading and - writing the next and previous child - fields of a Property.
      PropertyThe Property class is an - abstract class that defines the basic - data structure of an element of the Property - Table.
      Its ByteField, ShortField, and - IntegerField - members are used to read and write - data into the appropriate locations in - the _raw_data - array.
      The - _index member is - used to hold a Propery instance's - index in the List of - Property instances maintained by PropertyTable, - which is used to populate the child - property of parent Directory - properties and the next property and - previous property of sibling File - properties.
      The - _name, - _next_file, and - _previous_file - members are used to help fill the - appropriate fields of the _raw_data - array.
      Setters are provided for - some of the fields (name, property - type, node color, child property, - size, index, start block), as well as - a few getters (index, child - property).
      The - preWrite method is - abstract and is used by the owning - PropertyTable to iterate through its - Property instances and prepare each - for writing.
      The - shouldUseSmallBlocks - method returns true if the Property's - size is sufficiently small - how small - is none of the caller's business. -
      PropertyBlockSee the description in PropertyBlock.
      PropertyTableThe PropertyTable class - holds all of the DocumentProperty - instances and the RootProperty - instance for a Filesystem - instance.
      It maintains a - List of its Property - instances - (_properties), and - when prepared to write its data by a - call to preWrite, - it gets and holds an array of PropertyBlock - instances - (_blocks.
      It - also maintains its start block in its - _start_block - member.
      It has a method, - getRoot, to get - the RootProperty, returning it as an - implementation of Directory, and a - method to add a Property, - addProperty, and a - method to get its start block, - getStartBlock.
      RootPropertyThe RootProperty class acts - as the Directory for - all of the DocumentProperty - instance. As such, it is more of a - pure directory - entry than a proper root - entry in the Property - Table, but the initial POIFS - implementation does not warrant the - additional complexity of a full-blown - root entry, and so it is not modeled - in this design.
      It maintains a - List of its children, - _children, in - order to perform its - directory-oriented duties.
      -
    4. -
    5. - Filesystem - Classes and Interfaces -

      - The property classes and interfaces are - shown in the following class diagram. -

      -

      - -

      - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
      Class/InterfaceDescription
      FilesystemThe Filesystem class is the - top-level class that manages the - creation of a POIFS document.
      It - maintains a PropertyTable - instance in its - _property_table - member, a HeaderBlock - instance in its - _header_block - member, and a List of its - Document - instances in its - _documents - member.
      It provides methods for a - client to create a document - (createDocument), - and a method to write the Filesystem - to an OutputStream - (writeFilesystem).
      BATBlockSee the description in BATBlock
      BATManagedThe BATManaged interface - defines common behavior for objects - whose location in the written file is - managed by the Block - Allocation Table.
      It defines - methods to get a count of the - implementation's BigBlock - instances - (countBlocks), and - to set an implementation's start block - (setStartBlock).
      BlockAllocationTableThe BlockAllocationTable is - an implementation of the POIFS Block - Allocation Table. It is only - created when the Filesystem is - about to be written to an - OutputStream.
      It - contains an IntList of block - numbers for all of the BATManaged - implementations owned by the - Filesystem, - _entries, which is - filled by calls to - allocateSpace.
      It - fills its array, - _blocks, of BATBlock - instances when its - createBATBlocks - method is called. This method has to - take into account its own storage - requirements, as well as those of the - XBAT blocks, and so calls - BATBlock.calculateStorageRequirements - and - HeaderBlock.calculateXBATStorageRequirements - repeatedly until the counts returned - by those methods stabilize.
      The - countBlocks method - returns the number of BATBlock - instances created by the preceding - call to createBlocks.
      BlockWritableSee the description in BlockWritable
      DocumentThe Document class is used - to contain a document, such as an HSSF - workbook.
      It has its own DocumentProperty - (_property) and - stores its data in a collection of DocumentBlock - instances - (_blocks).
      It - has a method, - getDocumentProperty, - to get its DocumentProperty.
      DocumentBlockSee the description in DocumentBlock
      DocumentPropertySee the description in DocumentProperty
      HeaderBlockSee the description in HeaderBlock
      PropertyTableSee the description in PropertyTable
      -
    6. -
    7. - Utility Classes - and Interfaces -

      - The utility classes and interfaces are - shown in the following class diagram. -

      -

      - -

      - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
      Class/InterfaceDescription
      BitFieldThe BitField class is used - primarily by HSSF code to manage - bit-mapped fields of HSSF records. It - is not likely to be used in the POIFS - code itself and is only included here - for the sake of complete documentation - of the POI utility classes.
      ByteFieldThe ByteField class is an - implementation of FixedField for - the purpose of managing reading and - writing to a byte-wide field in an - array of bytes.
      FixedFieldThe FixedField interface - defines a set of methods for reading a - field from an array of - bytes or from an - InputStream, and for - writing a field to an array of - bytes. Implementations - typically require an offset in their - constructors that, for the purposes of - reading and writing to an array of - bytes, makes sure that - the correct bytes in the - array are read or written.
      HexDumpThe HexDump class is a - debugging class that can be used to - dump an array of bytes to - an OutputStream. The - static method dump - takes an array of bytes, - a long offset that is - used to label the output, an open - OutputStream, and an - int index that specifies - the starting index within the array of - bytes.
      The data is - displayed 16 bytes per line, with each - byte displayed in hexadecimal format - and again in printable form, if - possible (a byte is considered - printable if its value is in the range - of 32 ... 126).
      Here is an example - of a small array of bytes - with an offset of - 0x110:
      00000110 C8 00 00 00 FF 7F 90 01 00 00 00 00 00 00 05 01 ................
      00000120 41 00 72 00 69 00 61 00 6C 00                   A.r.i.a.l.
      IntegerFieldThe IntegerField class is - an implementation of FixedField for - the purpose of managing reading and - writing to an integer-wide field in an - array of bytes.
      IntListThe IntList class is a - work-around for functionality missing - in Java (see http://developer.java.sun.com/developer/bugParade/bugs/4487555.html - for details); it is a simple growable - array of ints that gets - around the requirement of wrapping and - unwrapping ints in - Integer instances in - order to use the - java.util.List - interface.
      IntList mimics - the functionality of the - java.util.List interface - as much as possible.
      LittleEndianThe LittleEndian class - provides a set of static methods for - reading and writing - shorts, - ints, longs, - and doubles in and out of - byte arrays, and out of - InputStreams, preserving - the Intel byte ordering and encoding - of these values.
      LittleEndianConstsThe LittleEndianConsts - interface defines the width of a - short, int, - long, and - double as stored by Intel - processors.
      LongFieldThe LongField class is an - implementation of FixedField for - the purpose of managing reading and - writing to a long-wide field in an - array of bytes.
      ShortFieldThe ShortField class is an - implementation of FixedField for - the purpose of managing reading and - writing to a short-wide field in an - array of bytes.
      ShortListThe ShortList class is a - work-around for functionality missing - in Java (see http://developer.java.sun.com/developer/bugParade/bugs/4487555.html - for details); it is a simple growable - array of shorts that gets - around the requirement of wrapping and - unwrapping shorts in - Short instances in order - to use the java.util.List - interface.
      ShortList mimics - the functionality of the - java.util.List interface - as much as possible.
      StringUtilThe StringUtil class - manages the processing of Unicode - strings.
      -
    8. -
    - Scenarios -

    - This section describes the scenarios of how the - POIFS classes and interfaces will be used to - convert an appropriate XML stream to a POIFS - output stream containing an HSSF document. -

    -

    - It is broken down as suggested by the following - scenario diagram: -

    -

    - -

    - - - - - - - - - - - - - - - - - -
    StepDescription
    1The Filesystem is - created by the client application.
    2The client - application tells the Filesystem to create a - document, providing an - InputStream and the name of the - document. This may be repeated several - times.
    3The client - application asks the Filesystem to write its - data to an OutputStream.
    -
      -
    1. -

      - Initialization -

      -

      - Initialization of the POIFS system is - shown in the following scenario diagram: -

      -

      - -

      - - - - - - - - - - - - - - - - - -
      StepDescription
      1The Filesystem - object, which is created for each - request to convert an appropriate XML - stream to a POIFS output stream - containing an HSSF document, creates - its PropertyTable.
      2The PropertyTable - creates its RootProperty - instance, making the RootProperty the - first Property - in its List of Property - instances.
      3The Filesystem - creates its HeaderBlock - instance. It should be noted that the - decision to create the HeaderBlock at - Filesystem initialization is - arbitrary; creation of the HeaderBlock - could easily and harmlessly be - postponed to the appropriate moment in - writing the - filesystem.
      -
    2. -
    3. -

      - Creating a - Document -

      -

      - Creating and adding a document to a POIFS - system is shown in the following scenario - diagram: -

      -

      - -

      - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
      StepDescription
      1The Filesystem - instance creates a new Document - instance. It will store the newly - created Document in a - List of BATManaged - instances.
      2The Document reads - data from the provided - InputStream, storing the - data in DocumentBlock - instances. It keeps track of the byte - count as it reads the data.
      3The Document creates - a DocumentProperty - to keep track of its property - data. The byte count is stored in the - newly created DocumentProperty - instance.
      4The Filesystem - requests the newly created DocumentProperty - from the newly created Document - instance.
      5The Filesystem - sends the newly created DocumentProperty - to the Filesystem's PropertyTable - so that the PropertyTable can add the - DocumentProperty to its - List of Property - instances.
      6The Filesystem gets - the RootProperty - from its PropertyTable.
      7The Filesystem adds - the newly created DocumentProperty - to the RootProperty.
      -

      - Although typical deployment of the POIFS - system will only entail adding a single Document (the - workbook) to the Filesystem, there - is nothing in the design to prevent - multiple Documents from being added to the - Filesystem. This flexibility can be - employed to write summary information - document(s) in addition to the workbook. -

      -
    4. -
    5. -

      - Writing the - Filesystem -

      -

      - Writing the filesystem is shown in the - following scenario diagram: -

      -

      - -

      - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
      StepDescription
      1The Filesystem adds - the PropertyTable - to its List of BATManaged - instances and calls the - PropertyTable's - preWrite - method. The action taken by the - PropertyTable is shown in the PropertyTable - preWrite scenario diagram.
      2The Filesystem - creates the BlockAllocationTable.
      3The Filesystem gets - the block count from the BATManaged - instance. These three steps are - repeated for each BATManaged - instance in the Filesystem's - List of BATManaged - instances (i.e., the Documents, in - order of their addition to the - Filesystem, followed by the PropertyTable).
      4The Filesystem - sends the block count to the BlockAllocationTable, - which adds the appropriate entries to - is IntList of - entries, returning the starting block - for the newly added entries.
      5The Filesystem - gives the start block number to the BATManaged - instance. If the BATManaged instance - is a Document, - it sets the start block field in its - DocumentProperty.
      6The Filesystem - tells the BlockAllocationTable - to create its BatBlocks.
      7The Filesystem - gives the BAT information to the HeaderBlock so - that it can set its BAT fields and, if - necessary, create XBAT blocks.
      8If the filesystem is - unusually large (over 7MB), the - HeaderBlock - will create XBAT blocks to contain the - BAT data that it cannot hold - directly. In this case, the Filesystem - tells the HeaderBlock where those - additional blocks will be stored.
      9The Filesystem - gives the PropertyTable - start block to the HeaderBlock.
      10The Filesystem - tells the BlockWritable - instance to write its blocks to the - provided - OutputStream.
      This - step is repeated for each - BlockWritable instance, in this - order:
      -
        -
      1. - The HeaderBlock. -
      2. -
      3. - Each Document, - in the order in which it was - added to the Filesystem. -
      4. -
      5. - The PropertyTable. -
      6. -
      7. - The BlockAllocationTable -
      8. -
      9. - The XBAT blocks created by the - HeaderBlock, - if any. -
      10. -
      -

      - PropertyTable - preWrite scenario diagram -

      -

      - -

      - - - - - - - - - - - - - - - - - - - - - - - - -
      StepDescription
      1The PropertyTable - calls setIndex for - each of its Property - instances, so that each Property now - knows its index within the - PropertyTable's List of - Property instances.
      2 The PropertyTable - requests the PropertyBlock - class to create an array of PropertyBlock - instances.
      3The PropertyBlock - calculates the number of empty Property - instances it needs to create and - creates them. The algorithm for the - number to create is:
      - block_count = (properties.size() - + 3) / 4;
      emptyPropertiesNeeded = - (block_count * 4) - - properties.size();
      4 The PropertyBlock - creates the required number of PropertyBlock - instances from the List - of Property - instances, including the newly created - empty Property - instances.
      5The PropertyTable - calls preWrite on - each of its Property - instances. For DocumentProperty - instances, this call is a no-op. For - the RootProperty, - the action taken is shown in the RootProperty - preWrite scenario diagram.
      -

      - RootProperty - preWrite scenario diagram -

      -

      - -

      - - - - - - - - - - - - - - - - - - -
      StepDescription
      1The RootProperty - sets its child property with the index - of the child Property that is - first in its List of - children.
      2The RootProperty - sets its child's next property field - with the index of the child's next - sibling in the RootProperty's - List of children. If the - child is the last in the - List, its next property - field is set to -1.These two steps are - repeated for each File in the RootProperty's - List of - children.
      3The RootProperty - sets its child's previous property - field with a value of - -1.
      -
    6. -
    -
  8. -
- - \ No newline at end of file diff --git a/src/documentation/xdocs/poifs/index.xml b/src/documentation/xdocs/poifs/index.xml deleted file mode 100644 index 9a244bdee2..0000000000 --- a/src/documentation/xdocs/poifs/index.xml +++ /dev/null @@ -1,40 +0,0 @@ - - - -
- PoiFS - Overview - - - - -
- -
-

POIFS is a pure Java implementation of the OLE 2 Compound - Document format.

-

By definition, all APIs developed by the POI project are - based somehow on the POIFS API.

-

A common confusion is on just what POIFS buys you or what OLE - 2 Compound Document format is exactly. POIFS does not buy you - DOC, or XLS, but is necessary to generate or read DOC or XLS - files. You see, all file formats based on the OLE 2 Compound - Document Format have a common structure. The OLE 2 Compound - Document Format is essentially a convoluted archive - format. Think of POIFS as a "zip" library. Once you can get - the data in a zip file you still need to interpret the - data. As a general rule, while all of our formats use - POIFS, most of them attempt to abstract you from it. There - are some circumstances where this is not possible, but as a - general rule this is true.

-

If you're an end user type just looking to generate XLS - files, then you'd be looking for HSSF not POIFS; however, if - you have legacy code that uses MFC property sets, POIFS is - for you! Regardless, you may or may not need to know how to - use POIFS but ultimately if you use technologies that come - from the POI project, you're using POIFS underneith. Perhaps - we should have a branding campaign "POIFS Inside!". ;-)

- -
- -
diff --git a/src/documentation/xdocs/poifs/usecases.xml b/src/documentation/xdocs/poifs/usecases.xml deleted file mode 100644 index 4d1290c323..0000000000 --- a/src/documentation/xdocs/poifs/usecases.xml +++ /dev/null @@ -1,690 +0,0 @@ - - - -
- POIFS Use Cases - - - -
- -
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS client
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS client- wants to read content of file - system
  • -
  • POIFS - understands POIFS file system
  • -
-
Precondition:None
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. POIFS client requests POIFS to read a POIFS file - system, providing an InputStream - containing POIFS file system in question.
  2. -
  3. POIFS reads from the InputStream in - 512 byte blocks
  4. -
  5. POIFS verifies that the first block begins with - the well known signature - (0xE11AB1A1E011CFD0)
  6. -
  7. POIFS reads the Block Allocation Table from the - first block and, if necessary, from the XBAT - blocks.
  8. -
  9. POIFS obtains the start block of the Property - Table and reads the Property Table (use case 9, - read file)
  10. -
  11. POIFS reads the individual entries in the Property - Table
  12. -
  13. POIFS obtains the start block of the Small Block - Allocation Table and reads the Small Block - Allocation Table (use case 9, read file)
  14. -
  15. POIFS obtains the start block of the Small Block - store from the first entry in the Property Table - and reads the Small Block Array (use case 9, read - file)
  16. -
-
Extensions: -
    -
  • 2a. If the last block read is not a 512 byte - block, the InputStream is not that of - a POIFS file system, and POIFS throws an - appropriate exception.
  • -
  • 3a. If the signature is incorrect, the - InputStream is not that of a POIFS - file system, and POIFS throws an appropriate - exception.
  • -
-
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS client
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS client- wants to write file system out.
  • -
  • POIFS - knows how to write file system out.
  • -
-
Precondition: -
    -
  • File system has been read (use case 1, read - existing file system) and subsequently modified - (use case 4, replace file in file system; use case - 5, delete file from file system; or use case 6, - write new file to file system; in any - combination)
  • -
-

or

-
    -
  • File system has been created (use case 3, create - new file system)
  • -
-
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. POIFS client provides an OutputStream - to write the file system to.
  2. -
  3. POIFS gets the sizes of the Property Table and - each file in the file system.
  4. -
  5. If any files in the file system requires storage - in a Small Block Array, POIFS creates a Small - Block Array of sufficient size to hold all of the - small files.
  6. -
  7. POIFS calculates the number of big blocks needed - to hold all of the large files, the Property - Table, and, if necessary, the Small Block Array - and the Small Block Allocation Table.
  8. -
  9. POIFS creates a set of big blocks sufficient to - store the Block Allocation Table
  10. -
  11. POIFS creates and writes the header block
  12. -
  13. POIFS writes out the XBAT blocks, if needed.
  14. -
  15. POIFS writes out the Small Block Array, if - needed
  16. -
  17. POIFS writes out the Small Block Allocation Table, - if needed
  18. -
  19. POIFS writes out the Property Table
  20. -
  21. POIFS writes out the large files, if needed
  22. -
  23. POIFS closes the OutputStream.
  24. -
-
Extensions: -
    -
  • 6a. Exceptions writing to the - OutputStream will be propagated back - to the POIFS client.
  • -
  • 7a. Exceptions writing to the - OutputStream will be propagated back - to the POIFS client.
  • -
  • 8a. Exceptions writing to the - OutputStream will be propagated back - to the POIFS client.
  • -
  • 9a. Exceptions writing to the - OutputStream will be propagated back - to the POIFS client.
  • -
  • 10a. Exceptions writing to the - OutputStream will be propagated back - to the POIFS client.
  • -
  • 11a. Exceptions writing to the - OutputStream will be propagated back - to the POIFS client.
  • -
  • 12a. Exceptions closing the - OutputStream will be propagated back - to the POIFS client.
  • -
-
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS client
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS client- wants to create a new file - system
  • -
  • POIFS - knows how to create a new file system
  • -
-
Precondition:None
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. POIFS creates an empty Property Table.
  2. -
-
Extensions:None
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS client
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS client- wants to replace an existing file in - the file system
  • -
  • POIFS - knows how to manage the file system
  • -
-
Precondition: -

Either

-
    -
  • The file system has been read (use case 1, read - existing file system) and a file has been - extracted from the file system (use case 7, read - existing file from file system)
  • -
-

or

-
    -
  • The file system has been created (use case 3, - create new file system) and a file has been - written to the file system (use case 6, write new - file to file system)
  • -
-
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. POIFS discards storage of the existing file.
  2. -
  3. POIFS updates the existing file's entry in the - Property Table
  4. -
  5. POIFS stores the new file's data
  6. -
-
Extensions: -
    -
  • 1a. POIFS throws an exception if the file does not - exist.
  • -
-
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS client
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS client- wants to remove a file from a file - system
  • -
  • POIFS - knows how to manage the file system
  • -
-
Precondition: -

Either

-
    -
  • The file system has been read (use case 1, read - existing file system) and a file has been - extracted from the file system (use case 7, read - existing file from file system)
  • -
-

or

-
    -
  • The file system has been created (use case 3, - create new file system) and a file has been - written to the file system (use case 6, write new - file to file system)
  • -
-
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. POIFS discards the specified file's storage.
  2. -
  3. POIFS discards the file's Property Table - entry.
  4. -
-
Extensions: -
    -
  • 1a. POIFS throws an exception if the file does not - exist.
  • -
-
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS client
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS client- wants to add a new file to the file - system
  • -
  • POIFS - knows how to manage the file system
  • -
-
Precondition:The specified file does not yet exist in the file - system
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. The POIFS client provides a file name
  2. -
  3. POIFS creates a new Property Table entry for the - new file
  4. -
  5. POIFS provides the POIFS client with an - OutputStream to write to.
  6. -
  7. The POIFS client writes data to the provided - OutputStream.
  8. -
  9. The POIFS client closes the provided - OutputStream
  10. -
  11. POIFS updates the Property Table entry with the - new file's size
  12. -
-
Extensions: -
    -
  • 1a. POIFS throws an exception if a file with the - specified name already exists in the file - system.
  • -
  • 1b. POIFS throws an exception if the file name is - too long. The limit on file name length is 31 - characters.
  • -
-
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS client
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS client- wants to read a file from the file - system
  • -
  • POIFS - knows how to manage the file system
  • -
-
Precondition: -
    -
  • The file system has been read (use case 1, read - existing file system) or has been created and - written to (use case 3, create new file system; - use case 6, write new file to file system).
  • -
  • The specified file exists in the file system.
  • -
-
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. The POIFS client provides the name of a file to be - read
  2. -
  3. POIFS provides an InputStream to read - from.
  4. -
  5. The POIFS client reads from the - InputStream.
  6. -
  7. The POIFS client closes the - InputStream.
  8. -
-
Extensions:1a. POIFS throws an exception if no file with the - specified name exists.
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS client
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS client- wants to know what files exist in - the file system
  • -
  • POIFS - knows how to manage the file system
  • -
-
Precondition:The file system has been read (use case 1, read - existing file system) or created (use case 3, create - new file system)
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. The POIFS client requests the file system - directory.
  2. -
  3. POIFS returns an Iterator. The - Iterator will not include the root - entry in the Property Table, and may be an - Iterator over an empty - Collection.
  4. -
-
Extensions:None
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS - POIFS needs to read a file, or something - resembling a file (i.e., the Property Table, the - Small Block Array, or the Small Block Allocation - Table)
  • -
-
Precondition:None
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. POIFS begins with a start block, a file size, and - a flag indicating whether to use the Big Block - Allocation Table or the Small Block Allocation - Table
  2. -
  3. POIFS returns an InputStream.
  4. -
  5. Reads from the InputStream are - performed by walking the specified Block - Allocation Table and reading the blocks - indicated.
  6. -
  7. POIFS closes the InputStream when - finished reading the file, or its client wants to - close the InputStream.
  8. -
-
Extensions:3a. An exception will be thrown if the specified Block - Allocation Table is corrupt, as evidenced by an index - pointing to a non-existent block, or by a chain - extending past the known size of the file.
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Primary Actor:POIFS client
Scope:POIFS
Level:Summary
Stakeholders and Interests: -
    -
  • POIFS client- wants to rename an existing file in - the file system.
  • -
  • POIFS - knows how to manage the file system.
  • -
-
Precondition: -
    -
  • The file system is has been read (use case 1, read - existing file system) or has been created and - written to (use case 3, create new file system; - use case 6, write new file to file system.
  • -
  • The specified file exists in the file system.
  • -
  • The new name for the file does not duplicate - another file in the file system.
  • -
-
Minimal Guarantee:None
Main Success Guarantee: -
    -
  1. POIFS updates the Property Table entry for the - specified file with its new name.
  2. -
-
Extensions: -
    -
  • 1a. If the old file name is not in the file - system, POIFS throws an exception.
  • -
  • 1b. If the new file name already exists in the - file system, POIFS throws an exception.
  • -
  • 1c. If the new file name is too long (the limit is - 31 characters), POIFS throws an exception.
  • -
-
-
-
- -
diff --git a/src/documentation/xdocs/references/book.xml b/src/documentation/xdocs/references/book.xml deleted file mode 100644 index 35f8646486..0000000000 --- a/src/documentation/xdocs/references/book.xml +++ /dev/null @@ -1,17 +0,0 @@ - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/references/index.xml b/src/documentation/xdocs/references/index.xml deleted file mode 100644 index f4d6b194ae..0000000000 --- a/src/documentation/xdocs/references/index.xml +++ /dev/null @@ -1,37 +0,0 @@ - - - - -
- Live Sites using Poi - - - - - - - -
- - -
-

Currently we don't have any sites listed that use Poi, but we're sure they're out there. - Help us change this. If you've - written a site that utilises Poi let us know. -

- -
-
-

Publically available products/projects using POI include: -

-
    -
  • JTimeTracker
  • -
-
- -
diff --git a/src/documentation/xdocs/resolutions/book.xml b/src/documentation/xdocs/resolutions/book.xml deleted file mode 100644 index c3f3d1a6aa..0000000000 --- a/src/documentation/xdocs/resolutions/book.xml +++ /dev/null @@ -1,22 +0,0 @@ - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/resolutions/index.xml b/src/documentation/xdocs/resolutions/index.xml deleted file mode 100644 index 2e7cdba846..0000000000 --- a/src/documentation/xdocs/resolutions/index.xml +++ /dev/null @@ -1,37 +0,0 @@ - - - - -
- Resolutions - About this section - - - -
- - -
-

- Every project on Jakarta has resolutions that they vote on. - Decisions are made, etc. But what happens once those decisions - are made? They are archived in the mail list archive never to - be read again (once its not in the top 10 or so posts). So they - get discussed again and again. -

-

- Rather than have that big waste of time, we have this section to - record important POI decisions. Once a decision is passed it - need only be linked to this page (either by creating a page for - it or by simply linking it to the archive messages). Wherever - possible a brief about how many votes for and against an maybe - some background should be posted. -

-

- This section is intended mainly to reduce big waste of time - discussions from taking away from whats important...developing - POI! :-D -

-
- -
diff --git a/src/documentation/xdocs/resolutions/res001.xml b/src/documentation/xdocs/resolutions/res001.xml deleted file mode 100644 index d9a2570e62..0000000000 --- a/src/documentation/xdocs/resolutions/res001.xml +++ /dev/null @@ -1,91 +0,0 @@ - - - - -
- POI Resoluton - Resolution 001 - Minimal Coding Standards - - - -
- - -
-
-

- As the POI project has grown the "styles" used have become more - varied, some see this as a bad thing, but in reality it - can be a good thing. Each can learn from the different - styles by working with different code. That being said - there are some universal "good quality" guidelines that - must be adopted on a project of any proportions. -

-

- Marc Johnson Authored the following resolution: -

-

- On Tue, 2002-01-08 at 22:23, Marc Johnson wrote: - Standards are wonderful; everyone should have a set. - Here's what I propose for coding standards for POI WRT comments (should I - feel the need, I'll post more of these little gems): -

-
    -
  1. - All classes and interfaces MUST have, right at the beginning, the POI - License (see poi/doc/LICENSE). -
  2. -
  3. - All classes and interfaces MUST include class javadoc. Conventionally, - this goes after the package and imports, and before the start of the class - or interface. The class javadoc MUST have at least one @author tag -
  4. -
  5. - All methods that are accessible outside the class MUST have javadoc - comments. In other words, if it isn't private, it MUST have javadoc - comments. Simple getters can consist of a simple @return tag; simple setters - can consist of a simple @param tag. Anything else requires some verbiage - plus all the standard javadoc tags as appropriate. You MUST include @throws - or @exception for any non-runtime exceptions, and you SHOULD document any - runtime exceptions you expect to throw. @throws/@exception tags SHOULD - include an explanation of why that exception would be thrown. If your method - might return null, you MUST say so. An accompanying explanation of the - circumstances for doing so would be nice. -
  6. -
-
-
-
-

- As opposed to the formerly used POI License which was - based on the Apache Public License, now that POI is part of - Jakarta, use the APL 1.1 for the header. Currently, the - Apache Software Foundation requires us to use the full - long version. -

-
-
-

- Tip: No laughing or joking allowed in conversations regarding coding - standards. - Any mail on coding standards will be treated very seriously, - and sent here with a RTFM. -

-
-
-
-

- The motion was passed unanimously with no negative or - neutral votes. -

-
-
-

- Andy didn't feel like going through his mail and sucking - out the comments.. If there is anything you feel should - be added here do it yourself ;-). -

-
-
- -
diff --git a/src/documentation/xdocs/todo.xml b/src/documentation/xdocs/todo.xml deleted file mode 100644 index d403793f2b..0000000000 --- a/src/documentation/xdocs/todo.xml +++ /dev/null @@ -1,52 +0,0 @@ - - - - - - - - - - - - - - - - - - Finish HDF - - - Finish Charts - - - Finish Formulas. - - - - - - Expose functionality in low level records in higher level API - - - Implement more record types. - - - Add more dummy checks (for when API user's do things they - "can't" do). This will involve exploring the various - upper limits on the things Excel can handle. - - - Add support for embedded graphics and other objects. - - - Create new adapter object for handling MulBlank, MulRk, Rk - records. - - - Add a way to copy sheets. - - - - diff --git a/src/documentation/xdocs/trans/book.xml b/src/documentation/xdocs/trans/book.xml deleted file mode 100644 index 9eb97892b5..0000000000 --- a/src/documentation/xdocs/trans/book.xml +++ /dev/null @@ -1,20 +0,0 @@ - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/trans/de/book.xml b/src/documentation/xdocs/trans/de/book.xml deleted file mode 100644 index 5810a96805..0000000000 --- a/src/documentation/xdocs/trans/de/book.xml +++ /dev/null @@ -1,70 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/trans/de/index.xml b/src/documentation/xdocs/trans/de/index.xml deleted file mode 100644 index 4bd2c7d917..0000000000 --- a/src/documentation/xdocs/trans/de/index.xml +++ /dev/null @@ -1,219 +0,0 @@ - - - - -
- Willkommen bei POI - - - - - - -
- - -
-
-

- Das POI-Übersetzungsprojekt hat begonnen. - Den Anfang machen spanisch, - japanisch - und deutsch. Andere Sprachen sind herzlich willkommen. - Machen Sie mit! -

-
-
-

- Die Wahl für das POI-Logo ist beendet. Danke für Ihre Stimmen. -

- - - -
-
-
-

- Das POI-Projekt besteht aus Java-APIs zum Erstellen und - Bearbeiten von Dateiformaten, die auf dem Microsoft-Dateiformat »OLE-2 - Compound Document« beruhen. Dateien in diesem Format sind unter - anderem die meisten Microsoft-Office-Dateien, wie zum - Beispiel Excel- und Word-Dateien. -

-

- Grundsätzlich versuchen wir, möglichst viel mit anderen Projekten - zusammenzuarbeiten, um die gewünschten Funktionalitäten zur Verfügung - zu stellen. Einige Beispiele: Für - Cocoon - werden bald Generatoren und Serializer zur Verfügung stehen. Wir - arbeiten mit - Open Office.org - zusammen, um das Excel-Dateiformat zu - dokumentieren. - Für Lucene - werden bald Filtermodule zur Verfügung stehen. Wir stellen anderen - Projekten Teile des POI-Projektes zur Verfügung, damit diese - die POI-Funktionalitäten nutzen können. -

-
-

- Wir werden diese Frage komponentenweise beantworten. POI besteht aus - einer Reihe von Komponenten, die jeweils unterschiedliche Probleme - angehen. Das Kürzel »POI« steht für das gesamte Projekt. -

-

- Mit POIFS können Sie Dateien oder Dokumente, die im - OLE 2 Compound Document Format geschrieben wurden, mit Java - einlesen. Solche Dateien werden üblicherweise mit der - MFC-Klassenbibliothek erzeugt. - Außerdem können sie POIFS nutzen, um Dateien im OLE 2 Compound - Document Format zu schreiben. Damit können sie zum Beispiel - den Datenaustausch mit der Windows-Plattform sicherstellen. - Wir können guten Gewissens behaupten, daß POIFS die - vollständigste Implementierung dieses Dateiformates ist. -

-

- Mit HSSF können sie Excel-Dateien in Java lesen und - schreiben. Sie können auch Excel-Tabellen lesen und - modifizieren. Allerdings ist die Schreibfunktionalität im Moment am - ausgereiftesten. -

-
- -
-

- POI bedeutet »Poor Obfuscation Implementation« (Schlechte, - verschleiernde Implementierung). - Warum geben wir unserem Projekt einen so abschätzigen Namen? - Nun, das Microsoft OLE 2 Compound Document Format ist einfach - schlecht durchdacht. Von seiner Grundidee her ist es ein - Dateiarchiv mit einer Struktur, die dem alten DOS-FAT-Dateisystem - ähnelt. Die Redmonder haben kein bereits vorhandenes Archivformat - wie tar, gzip, zip oder arc genutzt, sondern stattdessen ein - eigenes Archivformat erfunden, - das keinerlei Standardverschlüsselung oder -komprimierung - bietet, das schlecht erweiterbar ist, und das zur - Fragmentierung neigt. -

-

Poi ist außerdem eine Spezialität der hawaiianischen Küche. Sie wird - in Merriam Webster's - Dictionary beschrieben als: »Ein hawaiianisches Gericht aus - Taro-Wurzeln, die durch Stampfen, Kochen und Kneten zu einer Paste - geformt und oft noch ein wenig gegoren wird.« Dies ist witzigerweise - eine treffende Beschreibung des Dateiformats.

-

- POI ist also eine Abkürzung. Wenn Sie Abkürzungen nicht mögen, - dann denken sie einfach bei Poi an das hawaiianischen Gericht. - Je nachdem, ob Sie Abkürzungen mögen oder nicht, nutzen sie - einfach POI oder Poi, wenn sie dieses Projekt meinen. -

-
-
- - -
-
-

- POI besteht aus mehreren Komponenten, die jeweils unterschiedliche - Aufgaben angehen. Beispielsweise dient die Komponente HSSF dazu, - Excel-Dateien zu schreiben und zu lesen. Es folgt eine Liste aller - Komponenten des POI-Projektes mit einer sehr kurzen Zusammenfassung - ihres Zweckes. -

-
-
-

- POIFS ist der älteste und stabilste Teil des Projektes. POIFS - ist unsere Portierung des OLE 2 Compound Document Formats - in reinem Java. Es unterstützt Lesen und Schreiben. Alle - anderen Komponenten basieren auf POIFS. Mehr Informationen - gibt es auf der POIFS-Seite. -

-
-
-

- HSSF ist unsere Portierung des Microsoft Excel 97(-2002) - Dateiformats in reinem Java. Es unterstützt Lesen und - Schreiben. Mehr Informationen gibt es auf der - HSSF-Seite. -

-
-
-

- HDF ist unsere Portierung des Microsoft Word 97 Datei-Formats - in reinem Java. Es unterstützt Lesen und Schreiben. Mehr - Informationen gibt es auf der - HDF-Seite. - Diese Komponente ist noch nicht sehr weit fortgeschritten. Wir suchen - Entwickler, die mitmachen. -

-
-
-

- HPSF ist unsere Portierung des OLE 2 Property Formats. - Property Sets werden häufig genutzt, um Metadaten eines - Dokuments wie Titel, Autor und Datum zu speichern. Sie - können aber auch für applikationsspezifische Anwendungen - genutzt werden. Mehr Informationen gibt es auf der - HPSF-Seite. -

-
-
-
-

- Der HSSF-Serializer war Teil der Version 1.0 und der letzten Versionen - von Sourceforge. - Er wurde an das Cocoon-Projekt - übertragen und ist Bestandteil von Cocoon seit Version 2.0.2. -

-
-
-

- Sie möchten zu diesem Projekt beitragen? Hervorragend! - Wir brauchen immer begeisterte, fleißige und talentierte Leute, die - uns bei den verschiedenen Aufgaben des Projektes helfen. Nummer eins - sind Fehlerberichte und Vorschläge für neue Funktionen. Nummer zwei - ist die Dokumentation.

-

Egal, ob sie Kritik oder Vorschläge haben, - oder ob Sie Beiträge in Form von Code oder Dokumentation liefern - möchten, immer werden Sie bei uns ein offenes Ohr - finden. Und nicht zuletzt brauchen wir Java-Programmierer, die - sich durch die zahlreichen Ecken und Kanten der Microsoft-Dateiformate - hindurchwühlen und uns dabei helfen, diese Formate auf die - Java-Plattform zu portieren. -

-

- Wenn Sie motiviert sind und Zeit haben, tragen Sie sich - in unsere Mailing-Listen ein, und machen sie mit! Bei der Einarbeitung - helfen wir Ihnen gerne. -

-
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
- - diff --git a/src/documentation/xdocs/trans/es/3rdparty.xml b/src/documentation/xdocs/trans/es/3rdparty.xml deleted file mode 100644 index 785b8c2d5d..0000000000 --- a/src/documentation/xdocs/trans/es/3rdparty.xml +++ /dev/null @@ -1,68 +0,0 @@ - - - -
- Contribuciones de Terceras Partes - - - - -
- - - -
-

- Vea Cómo contribuir a Poi. -

- -
- -
-

- No es que estos tengan, necesariamente, suficiente calidad como para ser incluidos - en el núcleo de la distribución, pero han sido probados bajo - varios entornos clave , se proporcionan bajo la misma licencia que Poi, y se - incluyen en la distribución de POI bajo el directorio - contrib/. -

- -

- ¡Ninguno Todavía! - aunque pude esperarse que algunos de los enlaces - listados a continuación lleguen eventualmente a migrarse al nivel de "componentes contribuidos", - y posteriormente incluso a la mismísima distribución principal. -

-
- -
-

Entregas de modificaciones (Submissions of modifications) - a Poi que esperan revisión. Cualquiera puede realizar comentarios sobre ellas en la lista - de desarrollo - ¡se necesitan revisores del código! - Su utilización cae bajo su responsabilidad - aunque Poi no puede garantizarlo, - estos parches no han sido revisados, cuando menos aceptados. -

-
- -
-

Las otras extensiones listadas aquí tampoco están respaldadas - por el proyecto Poi - se proporcionan sólo por comodidad. Pueden funcionar o no, - pueden ser de código abierto o no, etc. -

- -

Para añadir un enlace a esta tabla, ver Cómo contribuir - a POI.

- - - - - - - - - - -
Nombre y EnlaceTipoDescripciónEstadoLicenciaContacto
- -
- -
diff --git a/src/documentation/xdocs/trans/es/book.xml b/src/documentation/xdocs/trans/es/book.xml deleted file mode 100644 index 602e75cb2f..0000000000 --- a/src/documentation/xdocs/trans/es/book.xml +++ /dev/null @@ -1,61 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/trans/es/casestudies.xml b/src/documentation/xdocs/trans/es/casestudies.xml deleted file mode 100644 index ee78c63980..0000000000 --- a/src/documentation/xdocs/trans/es/casestudies.xml +++ /dev/null @@ -1,67 +0,0 @@ - - - - -
- Jakarta POI - Case Studies - - - - - -
- - -
-

- A number of people are using POI for a variety of purposes. As with - any new API or technology, the first question people generally ask - is not "how can I" but rather "Who else is doing what I'm about to - do?" This is understandable with the abismal success rate in the - software business. These case statements are meant to help create - confidence and understanding. -

-
-
-

- We are actively seeking case studies for this page (after all it - just started). Andy Oliver (acoliver at apache dot org) has - agreed to have a few T-Shirts printed with the POI logo (once its - chosen) for the first - few best submissions. To submit a case study, either - - submit a patch for this page (preferred) or email it to the - mailing list - . -

-
-
-
-

- Edwards and Kelcey Technology (http://www.ekcorp.com/) developed a - Facility - Managament and Maintenance System for the Telecommunications industry - based - on Turbine and Velocity. Originally the invoicing was done with a simple - CVS - sheet which was then marked up by accounts and customized for each client. - As growth has been consistent with the application, the requirement for - invoices that need not be touched by hand increased. POI provided the - solution to this issue, integrating easily and transparently into the - system. POI HSSF was used to create the invoices directly from the server - in - Excel 97 format and now services over 150 unique invoices per month. -

-

- Cameron Riley (crileyNO@ SPAMekmail.com) -

-
-
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/trans/es/changes.xml b/src/documentation/xdocs/trans/es/changes.xml deleted file mode 100644 index 924f74c95e..0000000000 --- a/src/documentation/xdocs/trans/es/changes.xml +++ /dev/null @@ -1,106 +0,0 @@ - - - - - - - - - - - - - - - Soporte para Formato de Datos Personalizado (Custom) - Soporte Unicode Mejorado para Ruso y Japonés - Soporte para Fórmulas Mejorado, incluyendo - sólo-lectura para sentencias tipo "optimizado si" (optimized if). - Soporte para la clonación de objetos - Arreglos en la cabecera/pie - Traducciones de la documentación al Español - Soporte para preservar macros VBA - - - Se quita la dependencia en tiempo de ejecución del registro - (logging) de "commons". - Soporte para fórmulas - - - Se quita la dependencia del registro de "commons". Ahora hay que definir la propiedad del sistema poi.loggin para permitir - que los registros (logs) vayan a la salida estándar. - Se arregla la gestión de la cadena SST para que las hojas de cálculo con texto rico (rich text) o texto - extendido se lean correctamente. - - - Nueva versión (build) del proyecto. - Nuevo sistema de documentación del proyecto basada en Cocoon. - Cambio de nombre del paquete - Varios errores (bugs) corregidos - Etapas preliminares del desarrollo de HFS (no esta listo para el desarrollo) - Soporte inicial de registros de bajo nivel para gráficas (no está completo) - - - Se crea un nuevo modelo de eventos - Se optimiza HSSF, incluyendo registros (records) para - valores, filas, etc. - predicción de tamaño, escritura basada en desplazamiento (en lugar de - multitud de copias de arrays) - un poco de re-factoring (¿re-factorización? mejor no) y corrección de errores. - - - Actualizaciones menores a la documentación. - - - Se añade la clase de ayuda DataFormat y se expone el formato set y get en - HSSFCellStyle - Correcciones a las apis de anchura de columna (en cuanto a las unidades) y - varios comentarios javadoc al respecto - Corrección para el registro de "Dimensions" (de nuevo)... (uno de estos días - escribiré una prueba unitaria (unit test) para esto ;-p). - Alguna optimización en la creación de páginas. - - - Mejoras no registradas - - - Se añaden MulBlank, Blank, ColInfo - Se añade facilidad log4j y se quitan las anotaciones (logs) del tipo sys.out - Se añade soporte para la adición de fuentes, estilos y el api de alto - nivel correspondiente para dar estilo a las celdas - Se añade soporte para cambiar el alto de una fila, el ancho de una celda, y - sus valores por defecto. - Correcciones para internacionalización (UTF-16 debería funcionar ahora - desde HSSFCell.setStringValue, etc cuando se define la codificación) - Soporte para la adición / eliminación y cambio de nombre de hojas. - - - Distribución de corrección de errores. - Lanzamos una excepción cuando leemos objetos de tipo RKRecord. - - - Registros de continuación ya funcionan (lectura/escritura) - Se añade soporte previo para fórmulas - Reorganización del API masiva, re-enpaquetado. - Se añade la clase BiffViewer para validar HSSF & POI y/o la - salida de HSSF. - Se mejora el soporte a la modificación del API. - - - Se añade una bandera de codificación para que las apis de alto - y bajo nivel utilicen utf-16 cuando sea necesario (HSSFCell.setEncoding()) - Se añade soporte de sólo lectura a registros de Etiqueta - (que son reinterpretados como LabelSST cuando se escriben) - Se rompe la implementación del registro de continuación (oops) - Se añade la clase BiffViewer - para validar HSSF & POI y/o la - salida de HSSF. - - - Soporte para lectura/escritura y modificación. - Soporte de sólo lectura para registros de tipo MulRK - (convertidos a Number cuando se escriben) - - - - diff --git a/src/documentation/xdocs/trans/es/faq.xml b/src/documentation/xdocs/trans/es/faq.xml deleted file mode 100644 index 8f61433d3d..0000000000 --- a/src/documentation/xdocs/trans/es/faq.xml +++ /dev/null @@ -1,208 +0,0 @@ - - - - - - - ¿Por qué la lectura de una hoja de cálculo simple lleva tanto tiempo? - - -

- Probablemente hayas habilitado el registro (logging). Dicho registro es - una herramienta útil para la búsqueda de errores (debug). Tenerlo habilitado - reducirá el rendimiento en un factor de al menos 100. El registro es útil para - comprender por qué POI no puede leer algún fichero o para el propio desarrollo - de POI. - Los errores importantes se lanzan como excepciones, lo cual significa que - probablemente no necesites el registro (log). -

-
-
- - - ¿Qué es el "eventmodel" (modelo de evento) de HSSF? - - -

El paquete "eventmodel" de HSSF es un nuevo API para la lectura más eficiente de ficheros - XML. Requiere mayor conocimiento por parte del usuario, pero reduce el consumo de memoria a - una décima parte. Está basado en el modelo de eventos AWT en combinación con SAX. Si necesita - acceso de sólo-lectura a un fichero XML determinado, esta es la mejor manera de hacerlo.

-
- -
- - - ¿Por qué no puedo leer el documento que creé utilizando Star Office 5.1? - - -

Star Office 5.1 escribe algunos registros utilizando el viejo estándar BIFF. - Esto provoca algunos problemas con POI que sólo soporta BIFF8.

-
-
- - - ¿Por qué recibo una excepción cada vez que intento leer mi hoja de cálculo? - - -

Es posible que su hoja de cálculo contenga alguna característica que no esté - soportada actualmente por HSSF. Por ejemplo - hojas de cálculo que contengan - celdas con formato RTF (rich text) no están soportadas actualmente.

-
-
- - - ¿Soporta HSSF hojas de cálculo protegidas? - - -

Al proteger una hoja de cálculo, ésta se cifra. No tocaremos el cifrado, porque no - tenemos el suficiente conocimiento legal y no estamos seguros de las implicaciones que - conllevaría el intentar implementar esto. Si desea intentarlo, es libre de hacerlo y - de añadirlo como un módulo enchufable (plugin). Sin embargo, no lo guardaremos aquí.

-
-
- - - ¿Cómo se sabe si un campo contiene una fecha con HSSF? - - -

Excel almacena las fechas como números. Así la única manera para determinar - si una celda está realmente almacenada como una fecha consiste en mirar su formato. - Hay un método de ayuda (helper) en HSSFDateUtil (desde la distribución 1.7.0-dev) - que lo comprueba. Gracias a Jason Hoffman por proporcionar la solución.

- - -case HSSFCell.CELL_TYPE_NUMERIC: - double d = cell.getNumericCellValue(); - // test if a date! - if (HSSFDateUtil.isCellDateFormatted(cell)) { - // format in form of M/D/YY - cal.setTime(HSSFDateUtil.getJavaDate(d)); - cellText = - (String.valueOf(cal.get(Calendar.YEAR))).substring(2); - cellText = cal.get(Calendar.MONTH)+1 + "/" + - cal.get(Calendar.DAY_OF_MONTH) + "/" + - cellText; - } - - -
-
- - - Estoy intentando ver un fichero XLS enviado como flujo (stream) desde un servlet y tengo - complicaciones. ¿Cuál es el problema? - - -

- El problema normalmente se manifiesta como un montón de caracteres basura - en la pantalla. El problema persiste incluso aunque hayas configurado el tipo mime - correcto. -

-

- La respuesta breve es: no dependas de IE para mostrar un fichero binario. - Escribe un documento adjunto como es debido si lo envías a través de un servlet. - Toda versión de IE tiene diferentes fallos (bugs) en este sentido. -

-

- El problema en la mayoría de las versiones de IE reside en que no utiliza el tipo mime - de la respuesta HTTP para determinar el tipo de fichero; en su lugar utiliza la extensión - del fichero en la petición. Así podría añadir un .xls a su cadena de petición. - Por ejemplo: http://yourserver.com/myServelet.xls?param1=xx. Esto se consigue - fácilmente a través del mapeo de URL en cualquier contenedor servlet. A veces una - petición como - http://yourserver.com/myServelet?param1=xx&dummy=file.xls - también funciona. -

-

- Para garantizar la correcta apertura del fichero en Excel desde IE, escribe - tu fichero a un fichero temporal bajo su raiz web desde tu servlet. Envía entonces - una respuesta http al navegador para que haga una redirección en el lado del cliente - a tu fichero temporal. (Si haces una redirección en el lado del servidor utilizando - RequestDispatcher, tendrás que añadir .xls a la petición como se ha mendionado más - arriba) -

-

- Date cuenta de que cuando pides un documento que se abre con un manejador externo, - IE a veces realiza dos peticiones al servidor web. Así que si tu proceso generador - es pesado, tiene sentido escribir a un fichero temporal, para que peticiones - múltiples utilicen el fichero estático. -

-

- Nada de esto pertenece a Excel. El mismo problema ocurre cuando intentas general - cualquier fichero binario dinámicamente a un cliente IE. Por ejemplo, si generas - ficheros pdf utilizando - FOP, - te encontrarás con los mismos problemas. -

- -
-
- - - Quiero dar formato a una celda (Data format of a cell) de una hoja excel como - ###,###,###.#### o ###,###,###.0000. ¿Es posible hacer esto con POI? - - -

- HSSF no soporta todavía formatos de datos personalizados, sin embargo, - debería ser una facilidad razonablemente sencilla de añadir y aceptaremos - gustosos contribuciones en este área. -

-

- Estos son los formatos incluidos que soporta: -

-

- http://jakarta.apache.org/poi/javadocs/javasrc/org/apache/poi/hssf/usermodel/HSSFDataFormat_java.html#HSSFDataFormat -

-
-
- - - ¿Cómo añado un borde alrededor de una celda unida (merged)? - - -

- Añade celdas vacías alrededor de donde las celdas hubieran estado normalmente y - configura los bordes individualmente para cada celda. - Probablemente mejoraremos HSSF en el futuro para facilitar este proceso. -

-
-
- - - Intenté escribir valores en celdas así como cambiar el nombre de la hoja Excel - en mi lengua nativa, pero no pude hacerlo. :( - - -

- Por defecto HSSF utiliza valores de celdas y nombres de hoja en unicode comprimido, - asi que para soportar la localización deberías utilizar Unicode. - Para hacerlo deberías configurarlo manualmente: -

- - - // - // para el nombre de la hoja - // - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet s = wb.createSheet(); - wb.setSheetName( 0, "SomeUnicodeName", HSSFWorkbook.ENCODING_UTF_16 ); - - - // - // para el valor de la celda - // - HSSFRow r = s.createRow( 0 ); - HSSFCell c = r.createCell( (short)0 ); - c.setCellType( HSSFCell.CELL_TYPE_STRING ); - c.setEncoding( HSSFCell.ENCODING_UTF_16 ); - c.setCellValue( "\u0422\u0435\u0441\u0442\u043E\u0432\u0430\u044F" ); - - -

- Asegúrate de que haces la llamada a setEncoding() antes de llamar a setCellValue(), - si no, lo que le pases no será interpretado correctamente. -

-
-
-
diff --git a/src/documentation/xdocs/trans/es/historyandfuture.xml b/src/documentation/xdocs/trans/es/historyandfuture.xml deleted file mode 100644 index 77e27d0cad..0000000000 --- a/src/documentation/xdocs/trans/es/historyandfuture.xml +++ /dev/null @@ -1,152 +0,0 @@ - - - - -
- Historia del Proyecto - - - - -
- - - - -
- -

- El proyecto POI se gestó tiempo atrás, cerca de abril de 2001, - cuando Andy Oliver obtuvo un contrato de corta duración para realizar - informes Excel basados en Java. Ya había realizado este proyecto unas - cuantas veces antes, y sabía exactamente dónde buscar las herramientas - que necesitaría. - Irónicamente, el API que solía utilizar se había disparado en precio - desde unos $300 ($US) hasta unos $10K ($US). Calculó que a dos personas - les llevaría unos seis meses el portar Excel así que le recomendó al - cliente que pagase los $10K. -

- -

- Cerca de junio de 2001, Andy empezó a pensar lo genial que sería - tener una herramienta Java de código abierto para hacer esto y, - mientras tuvo algo de tiempo libre, comenzó el proyecto y aprendió - cosas sobre el Formato de Documento Compuesto OLE2. Tras chocarse - con varios obstáculos insalvables, se dio cuenta de que necesitaría ayuda. - Publicó un mensaje en su Grupo de Usuarios Java local (JUG) y - preguntó si alguien estaba interesado. Tuvo mucha suerte y el - programador Java de mayor talento que había conocido nunca, - Marc Johnson, se unió al proyecto. A Marc le llevó unas pocas - iteraciones el obtener algo con lo que estaban contentos. -

- -

- Mientras Marc trabajaba en eso, Andy portó XLS a Java, basándose - en la biblioteca de Marc. Varios usuarios escribieron peticiones - para poder leer XLS (no sólo escribirlo como había sido planeado - originalmente) y un usuario tenía peticiones especiales para - un uso diferente de POIFS. Antes de que pasara mucho tiempo, - el alcance del proyecto se había triplicado. POI 1.0 se distribuyó - un mes más tarde de lo planeado, pero con muchas más características. - Marc escribió rápidamente el marco del serializador y el - Serializador HSSF en tiempo récord y Andy generó más documentación - y trabajó en hacer que la gente conociera este proyecto. -

- -

- Poco antes de la distribución, POI tuvo la fortuna de entrar - en contacto con Nicola -Ken- Barrozzi quien proporcionó ejemplos - para el Serializador HSSF y ayudó a descrubir sus desafortunados - fallos (que fueron arreglados de inmediato). Recientemente, Ken - portó la mayoría de la documentación del proyecto POI a XML - partiendo de los documentos HTML cutres que Andy había escrito - con Star Office. -

- -

- Más o menos al mismo tiempo de la primera distribución, Glen Stampoultzis - se unió al proyecto. A Glen le molestaba la actitud impertinente de Andy - en lo que añadir capacidades gráficas a HSSF se refería. Glen se molestó - tanto que decidió coger un martillo y hacerlo él mismo. Glen ya se ha - convertido en parte integral de la comunidad de desarrollo de POI; sus - contribuciones a HSSF ya han comenzado a producir olas. -

- -

- En algún momento decidimos finalmente remitir el proyecto a - El Proyecto Cocoon - de Apache, sólo para descubrir que el proyecto había - crecido encajando perfectamente con Cocoon hacía tiempo. - Lo que es más, Andy comenzó a ojear otros proyectos a los que - le gustaría que se añadiera la funcionalidad de POI. Así que - se decidió donar los Serializadores y Generadores a Cocoon, otros - componentes de integración con POI a otros proyectos, y los APIs - de POI pasarían a formar parte de Jakarta. Fue un camino con - baches, ¡pero parece que todo salió bien puesto que ahora estás - leyendo esto! -

- -
- -
-

- Primero abordaremos esto desde el punto de vista del proyecto: - Bueno, les hicimos la oferta a Microsoft y Actuate (de coña - ... en su mayor parte) de que dejaríamos el proyecto y nos - retiraríamos si simplemente nos firmaban a cada uno un cheque - con muchos ceros. Todavía estoy esperando una llamada o correo - electrónico, así que de momento asumo que no nos van a pagar - para quitarnos de en medio. -

-

- Después, tenemos algo de trabajo que hacer aquí en Jakarta - para terminar de integrar POI en la comunidad. Lo que es más, - todavía estamos realizando la transición del Serializador a - Cocoon. -

-

- HSSF, durante el ciclo 2.0, sufrirá varias optimizaciones. - También añadiremos nuevas características como una implementación - completa de Fórmulas y formatos de texto personalizados. Esperamos - ser capaces de generar ficheros más pequeños añadiendo soporte de - escritura para registros RK, MulRK y MulBlank. A día de hoy, la - lectura en HSSF no es muy eficiente. Esto se debe sobre todo a que - para escribir o modificar, uno necesita ser capaz de actualizar - punteros del flujo de subida (upstream pointers) a datos del flujo - de bajada. Para hacer esto hay que tener todo lo que haya en - medio en memoria. En vez de eso, un Generador permitiría que se - procesaran eventos SAX. (Esto se basará en las estructuras de - bajo nivel). Una de las mejores cosas sobre esto es que así no sólo - tendremos una manera más eficiente de leer el fichero, también - tendremos una magnífica forma de utilizar hojas de cálculo como - fuentes de datos XML. -

-

- El Serializador HSSF, se separará más aún en un marco genérico - para la creación de serializadores para otras plataformas y - en la implementación específica del serializador HSSF. (Esto ya - es cierto en gran medida). También añadiremos soporte para - características ya soportadas por HSSF (estilos, fuentes, formatos - de texto). Esperamos añadir soporte para fórmulas durante este ciclo. -

-

- Estamos empezando a expandir nuestro alcance de nuevo. Si pudimos - hacer todo esto para ficheros XLS, ¿qué hay de ficheros Doc o PPT? - Pensamos que nuestro siguiente componente (HDF - Formato de - Documento Horrible) debería seguir el mismo patrón. Esperamos - que se nos una sangre nueva al equipo y que nos permita abordar - esto con mayor celeridad (en parte porque POIFS ya está terminado). - ¡Pero a lo mejor lo que más necesitamos es a ti! -

-
- - -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
- - -
diff --git a/src/documentation/xdocs/trans/es/hssf/alternatives.xml b/src/documentation/xdocs/trans/es/hssf/alternatives.xml deleted file mode 100644 index 6087c928bd..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/alternatives.xml +++ /dev/null @@ -1,102 +0,0 @@ - - - - -
- HSSF - Programas Alternativos a HSSF - - - -
- - -
-

- Puede ser que no es buen idea hablar de nuestra competidores pero nosotros creemos que la compentencia es bueno y que tenemos disponible hoy en día la mejor solución para escribir y leer archivos de Excel -

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
ProductoURLDescripción
Formula One - www.tidestone.com - - Una alternativa a este proyecto es comprar a $10,000 (dolares EEUU) el "Formula 1 library" y aceptar su API immadura y sus limitaciones. -
Visual Basic - www.microsoft.com - - Deja de usar XML y escriba su programa usando Visual Basic en una computadora de Microsoft Windows. O trata de crear una solución usando la versión beta e indocumentada del nuevo formato de Microsoft llamado 'XML for Office'. -
JExcelhttp://stareyes.homeip.net:8888Frequentemente no está disponible. Se conoce poco acerca de - las habilidades de este programa. -
JWorkbookhttp://www.object-refinery.com/jworkbook/index.html - Este esfuerzo puede trabajar con Gnumeric y Excel, pero la parte de Excel lo hace usando POI de todas formas. -
xlReaderhttp://www.sourceforge.net/projects/xlrdFunciona bastante bien leyendo Excel. -
Excel ODBC Driverhttp://www.nwlink.com/~leewal/content/exceljavasample.htmODBC ofrece una forma bastante rara para trabajar con Excel. -
ExtenXLShttp://www.extentech.com/products/ExtenXLS/docs/intro3.jspUna biblioteca commercial para leer, modificar, y escribir hojas de balance de Excel. No es barato pero cuesta menos que Formula 1. No tenemos idea de la calidad de este software. -
J-Integra Java-Excel Bridgehttp://www.intrinsyc.com/products/bridging/jintegra.aspUsa DCOM a una instancia de Excel en una computadora de Windows. -
Perl & C-Hay varias bibliotecas en los idiomas de Perl y C. Pero no son consistentes o compatibles. -
VistaJDBChttp://www.vistaportal.com/products/vistajdbc.htmEl driver de VistaJDBC trabaja con hojas de balance de StarOffice y Excel y proveen acceso al data sin tener que programar usando SQL estandar. VistaJDBC tambien permite escoger cellulas no solo por columna y fila per tambien cellulas individuales específicas, juegos de celulas, etc. -
Coldtags Excel Tag Libraryhttp://www.servletsuite.com/servlets/exceltag.htm - Esta biblioteca ayuda a crear archivos en el formato de CSV, en la cual se permiten numeros y textos. Puede crear achivos de CSV sin esta ayuda, pero la bibioteca da estructura y leabilidad a su codigo y lo puede extender si necesita manejar casos mas complejos. Cuando un browser encuentra uno de estos páginas de JSP, abre una instancia de Excel automaticamente. No hay formatting, hojas de balance distintas, etc. O sea que, estrictamente, no es un competidor pero en algunos casos es todo lo que se necesita. -
-
- -
diff --git a/src/documentation/xdocs/trans/es/hssf/book.xml b/src/documentation/xdocs/trans/es/hssf/book.xml deleted file mode 100644 index 7595547ec4..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/book.xml +++ /dev/null @@ -1,27 +0,0 @@ - - - - - - - - - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/trans/es/hssf/diagram1.xml b/src/documentation/xdocs/trans/es/hssf/diagram1.xml deleted file mode 100644 index 4fc9edb751..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/diagram1.xml +++ /dev/null @@ -1,19 +0,0 @@ - - - - -
- HSSF - Vista General - - - - -
- - -
- -
- -
diff --git a/src/documentation/xdocs/trans/es/hssf/diagrams.xml b/src/documentation/xdocs/trans/es/hssf/diagrams.xml deleted file mode 100644 index c488d0e000..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/diagrams.xml +++ /dev/null @@ -1,32 +0,0 @@ - - - - -
- HSSF - Vista General - - - - -
- - -
-

- Se espera usar esta sección para diagramas (UML/etc) que ayudan a - explicar HSSF. -

-
    -
  • - Diagrama de Clases del UserModel - - por Matthew Young (myoung at westernasset dot com) -
  • -
-

- Tiene más? Agrega un nuevo "bug" al archivo de bugs con la - palabra [DOCUMENTATION] antes de la descripción y un link al file en algun webserver. Si no tiene su propio webserver, mandaselo por email a (acoliver at apache dot org) siempre y cuando el tamaño del archivo es menor a 5 Mb. Diagramas deben estar en algun formato compatible, por lo menos, con Linux y Windows. Se prefieren diagramas que se pueden cambiar o modificar pero entendemos que hay pocos programas para crear UML que están a buen precio. Y no, no TIENEN que ser UML... solo tienen que ser utiles. -

-
- -
diff --git a/src/documentation/xdocs/trans/es/hssf/formula.xml b/src/documentation/xdocs/trans/es/hssf/formula.xml deleted file mode 100644 index d3003d8b94..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/formula.xml +++ /dev/null @@ -1,90 +0,0 @@ - - - - -
- Formula Support - - - -
- -
-

- This document describes the current state of formula support in POI. - The information in this document applies to the 2.0-dev version of POI (i.e. CVS HEAD). - Since this area is a work in progress, this document will be updated with new features as and - when they are added. -

- -
-
-

- In org.apache.poi.hssf.usermodel.HSSFCell - setCellFormula("formulaString") is used to add a formula to sheet and - getCellFormula() is used to retrieve the string representation of a formula. -

-

- We aim to support the complete excel grammer for formulas. Thus, the string that you pass in - to the setCellFormula call should be what you expect to type into excel. Also, note - that you should NOT add a "=" to the front of the string. -

-
-
-
    -
  • Cell References
  • -
  • String, integer and floating point literals
  • -
  • Area references
  • -
  • Relative or absolute references
  • -
  • Arithmetic Operators
  • -
  • Sheet Functions
  • -
-
-
-
    -
  • - The formula parser now has the ability to parse formulas containing strings. However - formulas that return a string value are not yet supported. -
  • -
  • Formula tokens in Excel are stored in one of three possible classes : - Reference, Value and Array. Based on the location of a token, its class can change - in complicated and undocumented ways. While we have support for most cases, we - are not sure if we have covered all bases (since there is no documentation for this area.) - We would therefore like you to report any - occurence of #VALUE! in a cell upon opening a POI generated workbook in excel. (Check that - typing the formula into Excel directly gives a valid result.) -
  • - -
-
-
-
    -
  • Array formulas
  • -
  • Formulas with logical operations (IF)
  • -
  • Sheet References in formulas
  • -
  • Everything else :)
  • -
-
- -
-

- Formulas in Excel are stored as sequences of tokens in Reverse Polish Notation order. The - open office XLS spec is the best - documentation you will find for the format. -

- -

- The tokens used by excel are modelled as individual *Ptg classes in the - org.apache.poi.hssf.record.formula package. -

-

- The task of parsing a formula string into an array of RPN ordered tokens is done by the - org.apache.poi.hssf.record.formula.FormulaParser class. This class implements a hand - written recursive descent parser. -

-

Check out the javadocs for details. -

-
- - -
\ No newline at end of file diff --git a/src/documentation/xdocs/trans/es/hssf/hacking-hssf.xml b/src/documentation/xdocs/trans/es/hssf/hacking-hssf.xml deleted file mode 100644 index 1cdf65af04..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/hacking-hssf.xml +++ /dev/null @@ -1,72 +0,0 @@ - - - - -
- Hacking HSSF - - - - -
- -
-

- You might find the - 'Excel 97 Developer's Kit' (out of print, Microsoft Press, no - restrictive covenants, available on Amazon.com) helpful for - understanding the file format. -

-

- Also useful is the open office XLS spec. We - are collaborating with the maintainer of the spec so if you think you can add something to their - document just send through your changes. -

-
-
-
    -
  1. - Look at OpenOffice.org or Gnumeric sources if its implemented there. -
  2. -
  3. - Use org.apache.poi.hssf.dev.BiffViewer to view the structure of the - file. Experiment by adding one criteria entry at a time. See what it - does to the structure, infer behavior and structure from it. Using the - unix diff command (or get cygwin from www.cygwin.com for windows) you - can figure out a lot very quickly. Unimplemented records show up as - 'UNKNOWN' and prints a hex dump. -
  4. -
-
-
-

- Low level records can be time consuming to created. We created a record - generator to help generate some of the simpler tasks. -

-

- We use XML - descriptors to generate the Java code (which sure beats the heck out of - the PERL scripts originally used ;-) for low level records. The - generator is kinda alpha-ish right now and could use some enhancement, - so you may find that to be about 1/2 of the work. Notice this is in - org.apache.poi.hssf.record.definitions. -

-
-
-

One thing to note: If you are making a large code contribution we need to ensure - any participants in this process have never - signed a "Non Disclosure Agreement" with Microsoft, and have not - received any information covered by such an agreement. If they have - they'll not be able to participate in the POI project. For large contributions we - may ask you to sign an agreement.

-
-
-

Check our todo list or simply look for missing functionality. Start small - and work your way up.

-
-
-

Make sure you read the contributing section - as it contains more generation information about contributing to Poi in general.

-
- -
\ No newline at end of file diff --git a/src/documentation/xdocs/trans/es/hssf/how-to.xml b/src/documentation/xdocs/trans/es/hssf/how-to.xml deleted file mode 100644 index 39aa03ca22..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/how-to.xml +++ /dev/null @@ -1,498 +0,0 @@ - - - - -
- The New Halloween Document - - - - - -
- -
- -
-

This release of the how-to outlines functionality for the CVS HEAD. - Those looking for information on previous releases should - look in the documentation distributed with that release.

-

- This release allows numeric and string cell values to be written to - or read from an XLS file as well as reading and writing dates. Also - in this release is row and column sizing, cell styling (bold, - italics, borders,etc), and support for built-in data formats. New - to this release is an event-based API for reading XLS files. - It differs greatly from the read/write API - and is intended for intermediate developers who need a smaller - memory footprint. It will also serve as the basis for the HSSF - Generator.

-
-
-
-
- -

The high level API (package: org.apache.poi.hssf.usermodel) - is what most people should use. Usage is very simple. -

-

Workbooks are created by creating an instance of - org.apache.poi.hssf.usermodel.HSSFWorkbook. -

-

Sheets are created by calling createSheet() from an existing - instance of HSSFWorkbook, the created sheet is automatically added in - sequence to the workbook. Sheets do not in themselves have a sheet - name (the tab at the bottom); you set - the name associated with a sheet by calling - HSSFWorkbook.setSheetName(sheetindex,"SheetName",encoding). - The name may be in 8bit format (HSSFWorkbook.ENCODING_COMPRESSED_UNICODE) - or Unicode (HSSFWorkbook.ENCODING_UTF_16). Default encoding is 8bit per char. -

-

Rows are created by calling createRow(rowNumber) from an existing - instance of HSSFSheet. Only rows that have cell values should be - added to the sheet. To set the row's height, you just call - setRowHeight(height) on the row object. The height must be given in - twips, or 1/20th of a point. If you prefer, there is also a - setRowHeightInPoints method. -

-

Cells are created by calling createCell(column, type) from an - existing HSSFRow. Only cells that have values should be added to the - row. Cells should have their cell type set to either - HSSFCell.CELL_TYPE_NUMERIC or HSSFCell.CELL_TYPE_STRING depending on - whether they contain a numeric or textual value. Cells must also have - a value set. Set the value by calling setCellValue with either a - String or double as a parameter. Individual cells do not have a - width; you must call setColumnWidth(colindex, width) (use units of - 1/256th of a character) on the HSSFSheet object. (You can't do it on - an individual basis in the GUI either).

-

Cells are styled with HSSFCellStyle objects which in turn contain - a reference to an HSSFFont object. These are created via the - HSSFWorkbook object by calling createCellStyle() and createFont(). - Once you create the object you must set its parameters (colors, - borders, etc). To set a font for an HSSFCellStyle call - setFont(fontobj). -

-

Once you have generated your workbook, you can write it out by - calling write(outputStream) from your instance of Workbook, passing - it an OutputStream (for instance, a FileOutputStream or - ServletOutputStream). You must close the OutputStream yourself. HSSF - does not close it for you. -

-

Here is some example code (excerpted and adapted from - org.apache.poi.hssf.dev.HSSF test class):

- -
-
- -

Reading in a file is equally simple. To read in a file, create a -new instance of org.apache.poi.poifs.Filesystem, passing in an open InputStream, such as a FileInputStream -for your XLS, to the constructor. Construct a new instance of -org.apache.poi.hssf.usermodel.HSSFWorkbook passing the -Filesystem instance to the constructor. From there you have access to -all of the high level model objects through their assessor methods -(workbook.getSheet(sheetNum), sheet.getRow(rownum), etc). -

-

Modifying the file you have read in is simple. You retrieve the -object via an assessor method, remove it via a parent object's remove -method (sheet.removeRow(hssfrow)) and create objects just as you -would if creating a new xls. When you are done modifying cells just -call workbook.write(outputstream) just as you did above.

-

An example of this can be seen in -org.apache.poi.hssf.dev.HSSF.

-
-
-
- -

The event API is brand new. It is intended for intermediate - developers who are willing to learn a little bit of the low level API - structures. Its relatively simple to use, but requires a basic - understanding of the parts of an Excel file (or willingness to - learn). The advantage provided is that you can read an XLS with a - relatively small memory footprint. -

-

To use this API you construct an instance of - org.apache.poi.hssf.eventmodel.HSSFRequest. Register a class you - create that supports the - org.apache.poi.hssf.eventmodel.HSSFListener interface using the - HSSFRequest.addListener(yourlistener, recordsid). The recordsid - should be a static reference number (such as BOFRecord.sid) contained - in the classes in org.apache.poi.hssf.record. The trick is you - have to know what these records are. Alternatively you can call - HSSFRequest.addListenerForAllRecords(mylistener). In order to learn - about these records you can either read all of the javadoc in the - org.apache.poi.hssf.record package or you can just hack up a - copy of org.apache.poi.hssf.dev.EFHSSF and adapt it to your - needs. TODO: better documentation on records.

-

Once you've registered your listeners in the HSSFRequest object - you can construct an instance of - org.apache.poi.poifs.filesystem.FileSystem (see POIFS howto) and - pass it your XLS file inputstream. You can either pass this, along - with the request you constructed, to an instance of HSSFEventFactory - via the HSSFEventFactory.processWorkbookEvents(request, Filesystem) - method, or you can get an instance of DocumentInputStream from - Filesystem.createDocumentInputStream("Workbook") and pass - it to HSSFEventFactory.processEvents(request, inputStream). Once you - make this call, the listeners that you constructed receive calls to - their processRecord(Record) methods with each Record they are - registered to listen for until the file has been completely read. -

-

A code excerpt from org.apache.poi.hssf.dev.EFHSSF (which is - in CVS or the source distribution) is reprinted below with excessive - comments:

- -
-
- -

The low level API is not much to look at. It consists of lots of -"Records" in the org.apache.poi.hssf.record.* package, -and set of helper classes in org.apache.poi.hssf.model.*. The -record classes are consistent with the low level binary structures -inside a BIFF8 file (which is embedded in a POIFS file system). You -probably need the book: "Microsoft Excel 97 Developer's Kit" -from Microsoft Press in order to understand how these fit together -(out of print but easily obtainable from Amazon's used books). In -order to gain a good understanding of how to use the low level APIs -should view the source in org.apache.poi.hssf.usermodel.* and -the classes in org.apache.poi.hssf.model.*. You should read the -documentation for the POIFS libraries as well.

-
-
- -

The HSSF application is nothing more than a test for the high -level API (and indirectly the low level support). The main body of -its code is repeated above. To run it: -

-
    -
  • download the poi-alpha build and untar it (tar xvzf - tarball.tar.gz) -
  • -
  • set up your classpath as follows: - export HSSFDIR={wherever you put HSSF's jar files} -export LOG4JDIR={wherever you put LOG4J's jar files} -export CLASSPATH=$CLASSPATH:$HSSFDIR/hssf.jar:$HSSFDIR/poi-poifs.jar:$HSSFDIR/poi-util.jar:$LOG4JDIR/jog4j.jar -
  • type: - java org.apache.poi.hssf.dev.HSSF ~/myxls.xls write
  • -
-

-

This should generate a test sheet in your home directory called "myxls.xls".

-
    -
  • Type: - java org.apache.poi.hssf.dev.HSSF ~/input.xls output.xls -
    -
    -This is the read/write/modify test. It reads in the spreadsheet, modifies a cell, and writes it back out. -Failing this test is not necessarily a bad thing. If HSSF tries to modify a non-existant sheet then this will -most likely fail. No big deal.
  • -
-
-
-

Poi can dynamically select it's logging implementation. Poi trys to - create a logger using the System property named "org.apache.poi.util.POILogger". - Out of the box this can be set to one of three values: -

-
    -
  • org.apache.poi.util.CommonsLogger
  • -
  • org.apache.poi.util.NullLogger
  • -
  • org.apache.poi.util.SystemOutLogger
  • -
-

- If the property is not defined or points to an invalid classthen the NullLogger is used. -

-

- Refer to the commons logging package level javadoc for more information concerning how to - configure commons logging. -

-
-
- -

HSSF has a number of tools useful for developers to debug/develop -stuff using HSSF (and more generally XLS files). We've already -discussed the app for testing HSSF read/write/modify capabilities; -now we'll talk a bit about BiffViewer. Early on in the development of -HSSF, it was decided that knowing what was in a record, what was -wrong with it, etc. was virtually impossible with the available -tools. So we developed BiffViewer. You can find it at -org.apache.poi.hssf.dev.BiffViewer. It performs two basic -functions and a derivative. -

-

The first is "biffview". To do this you run it (assumes -you have everything setup in your classpath and that you know what -you're doing enough to be thinking about this) with an xls file as a -parameter. It will give you a listing of all understood records with -their data and a list of not-yet-understood records with no data -(because it doesn't know how to interpret them). This listing is -useful for several things. First, you can look at the values and SEE -what is wrong in quasi-English. Second, you can send the output to a -file and compare it. -

-

The second function is "big freakin dump", just pass a -file and a second argument matching "bfd" exactly. This -will just make a big hexdump of the file. -

-

Lastly, there is "mixed" mode which does the same as -regular biffview, only it includes hex dumps of certain records -intertwined. To use that just pass a file with a second argument -matching "on" exactly.

-

In the next release cycle we'll also have something called a -FormulaViewer. The class is already there, but its not very useful -yet. When it does something, we'll document it.

- -
-
- -

This release contains code that supports "internationalization" -or more accurately non-US/UK languages; however, it has not been -tested with the new API changes (please help us with this). We've -shifted focus a bit for this release in recognition of the -international support we've gotten. We're going to focus on western -European languages for our first beta. We're more than happy to -accept help in supporting non-Western European languages if someone -who knows what they're doing in this area is willing to pitch in! -(There is next to no documentation on what is necessary to support -such a move and its really hard to support a language when you don't even -know the alphabet).

-

This release of HSSF does not yet support Formulas. I've been -focusing on the requests I've gotten in. That being said, if we get -more user feedback on what is most useful first we'll aim for that. -As a general principal, HSSF's goal is to support HSSF-Serializer -(meaning an emphasis on write). We would like to hear from you! How -are you using HSSF/POIFS? How would you like to use it? What features -are most important first? -

-
- -
- -
- -
diff --git a/src/documentation/xdocs/trans/es/hssf/index.xml b/src/documentation/xdocs/trans/es/hssf/index.xml deleted file mode 100644 index 9b96f3c390..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/index.xml +++ /dev/null @@ -1,47 +0,0 @@ - - - - -
- HSSF - Overview - - - - -
- - -
- -

HSSF is the POI Project's pure Java implementation of the Excel '97(-2002) file format.

-

HSSF provides a way to read spreadsheets create, modify, read and write XLS spreadsheets - It provides: -

-
    -
  • low level structures for those with special needs
  • -
  • an eventmodel api for efficient read-only access
  • -
  • a full usermodel api for creating, reading and modifying XLS files
  • -
-

- Truth be told there is probably a better way to generate your spreadsheet - generation (yet you'll still be using HSSF indirectly). At the time of - this writing we're in the process of moving the HSSF Serializer over to - the Apache Cocoon - Project. With Cocoon you can serialize any XML datasource (of - which might be a ESQL page outputting in SQL for instance) by simply - applying the stylesheet and designating the serializer. -

-

- If you're merely reading spreadsheet data, then use the eventmodel api - in the org.apache.poi.hssf.eventmodel package. -

-

- If you're modifying spreadsheet data then use the usermodel api. You - can also generate spreadsheets this way, but using Cocoon (which will do - it this way indirectly) is the best way...we promise. -

- -
- -
diff --git a/src/documentation/xdocs/trans/es/hssf/limitations.xml b/src/documentation/xdocs/trans/es/hssf/limitations.xml deleted file mode 100644 index 18b379e1a9..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/limitations.xml +++ /dev/null @@ -1,52 +0,0 @@ - -
- Limitations - - - -
- -
-

- The intent of this document is to outline some of the known limitations of the - POI HSSF API's. It is not intended to be complete list of every bug or missing - feature of HSSF, rather it's purpose is to provide a broad feel for some of the - functionality that is missing or broken. -

-
    -
  • - Charts

    - You can not currently create charts. This is planned for the 2.0 release. You can - however create a chart in Excel, modify the chart data values using HSSF and write - a new spreadsheet out. This is possible because POI attempts to keep existing records - intact as far as possible.

    -
  • -
  • - Rich Text

    - HSSF does not support rich text cells. Rich text cells are - cells that have multiple fonts and styles in the once cell. Any attempt to read - a spreadsheet that has rich text cells will throw an exception. This feature may - be supported in the future but it is not currently planned. Patches are welcome.

    -
  • -
  • - Outlines

    - It is not yet possible to create outlines. Reading a spreadsheet with outlines - may work correctly but has not been tested. Write support for outlines may - be added in the future but it is not currently planned. Patches are welcome.

    -
  • -
  • - Macros

    - Macros can not be created. The are currently no plans to support macros. Reading - workbooks containing macros is supported but attempting to write those workbooks - will fail. This is because macros are stored as extra file sytems within the - compound document, and these are not currently kept when the file is rewritten.

    -
  • -
  • - Pivot Tables

    - Generating pivot tables is not supported. Reading spreadsheets containing pivot tables - has not been tested. -
  • -
-
- -
diff --git a/src/documentation/xdocs/trans/es/hssf/quick-guide.xml b/src/documentation/xdocs/trans/es/hssf/quick-guide.xml deleted file mode 100644 index 0d1bca1821..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/quick-guide.xml +++ /dev/null @@ -1,404 +0,0 @@ - - - - -
- Busy Developers' Guide to HSSF Features - - - -
- -
-

- Want to use HSSF read and write spreadsheets in a hurry? This guide is for you. If you're after - more in-depth coverage of the HSSF user-API please consult the HOWTO - guide as it contains actual descriptions of how to use this stuff. -

-
-
    -
  • How to create a new workbook
  • -
  • How to create a sheet
  • -
  • How to create cells
  • -
  • How to create date cells
  • -
  • Working with different types of cells
  • -
  • Aligning cells
  • -
  • Working with borders
  • -
  • Fills and color
  • -
  • Merging cells
  • -
  • Working with fonts
  • -
  • Reading and writing
  • -
  • Use newlines in cells.
  • -
  • Create user defined data formats.
  • -
  • Set print area for a sheet.
  • -
  • Set page numbers on the footer of a sheet.
  • -
-
-
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet1 = wb.createSheet("new sheet"); - HSSFSheet sheet2 = wb.createSheet("second sheet"); - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short)0); - // Create a cell and put a value in it. - HSSFCell cell = row.createCell((short)0); - cell.setCellValue(1); - - // Or do it on one line. - row.createCell((short)1).setCellValue(1.2); - row.createCell((short)2).setCellValue("This is a string"); - row.createCell((short)3).setCellValue(true); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short)0); - - // Create a cell and put a date value in it. The first cell is not styled - // as a date. - HSSFCell cell = row.createCell((short)0); - cell.setCellValue(new Date()); - - // we style the second cell as a date (and time). It is important to - // create a new cell style from the workbook otherwise you can end up - // modifying the built in style and effecting not only this cell but other cells. - HSSFCellStyle cellStyle = wb.createCellStyle(); - cellStyle.setDataFormat(HSSFDataFormat.getFormat("m/d/yy h:mm")); - cell = row.createCell((short)1); - cell.setCellValue(new Date()); - cell.setCellStyle(cellStyle); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - HSSFRow row = sheet.createRow((short)2); - row.createCell((short) 0).setCellValue(1.1); - row.createCell((short) 1).setCellValue(new Date()); - row.createCell((short) 2).setCellValue("a string"); - row.createCell((short) 3).setCellValue(true); - row.createCell((short) 4).setCellType(HSSFCell.CELL_TYPE_ERROR); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - public static void main(String[] args) - throws IOException - { - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - HSSFRow row = sheet.createRow((short) 2); - createCell(wb, row, (short) 0, HSSFCellStyle.ALIGN_CENTER); - createCell(wb, row, (short) 1, HSSFCellStyle.ALIGN_CENTER_SELECTION); - createCell(wb, row, (short) 2, HSSFCellStyle.ALIGN_FILL); - createCell(wb, row, (short) 3, HSSFCellStyle.ALIGN_GENERAL); - createCell(wb, row, (short) 4, HSSFCellStyle.ALIGN_JUSTIFY); - createCell(wb, row, (short) 5, HSSFCellStyle.ALIGN_LEFT); - createCell(wb, row, (short) 6, HSSFCellStyle.ALIGN_RIGHT); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - - } - - /** - * Creates a cell and aligns it a certain way. - * - * @param wb the workbook - * @param row the row to create the cell in - * @param column the column number to create the cell in - * @param align the alignment for the cell. - */ - private static void createCell(HSSFWorkbook wb, HSSFRow row, short column, short align) - { - HSSFCell cell = row.createCell(column); - cell.setCellValue("Align It"); - HSSFCellStyle cellStyle = wb.createCellStyle(); - cellStyle.setAlignment(align); - cell.setCellStyle(cellStyle); - } - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short) 1); - - // Create a cell and put a value in it. - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue(4); - - // Style the cell with borders all around. - HSSFCellStyle style = wb.createCellStyle(); - style.setBorderBottom(HSSFCellStyle.BORDER_THIN); - style.setBottomBorderColor(HSSFColor.BLACK.index); - style.setBorderLeft(HSSFCellStyle.BORDER_THIN); - style.setLeftBorderColor(HSSFColor.GREEN.index); - style.setBorderRight(HSSFCellStyle.BORDER_THIN); - style.setRightBorderColor(HSSFColor.BLUE.index); - style.setBorderTop(HSSFCellStyle.BORDER_MEDIUM_DASHED); - style.setTopBorderColor(HSSFColor.BLACK.index); - cell.setCellStyle(style); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short) 1); - - // Aqua background - HSSFCellStyle style = wb.createCellStyle(); - style.setFillBackgroundColor(HSSFColor.AQUA.index); - style.setFillPattern(HSSFCellStyle.BIG_SPOTS); - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue("X"); - cell.setCellStyle(style); - - // Orange "foreground", foreground being the fill foreground not the font color. - style = wb.createCellStyle(); - style.setFillForegroundColor(HSSFColor.ORANGE.index); - style.setFillPattern(HSSFCellStyle.SOLID_FOREGROUND); - cell = row.createCell((short) 2); - cell.setCellValue("X"); - cell.setCellStyle(style); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - HSSFRow row = sheet.createRow((short) 1); - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue("This is a test of merging"); - - sheet.addMergedRegion(new Region(1,(short)1,1,(short)2)); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("new sheet"); - - // Create a row and put some cells in it. Rows are 0 based. - HSSFRow row = sheet.createRow((short) 1); - - // Create a new font and alter it. - HSSFFont font = wb.createFont(); - font.setFontHeightInPoints((short)24); - font.setFontName("Courier New"); - font.setItalic(true); - font.setStrikeout(true); - - // Fonts are set into a style so create a new one to use. - HSSFCellStyle style = wb.createCellStyle(); - style.setFont(font); - - // Create a cell and put a value in it. - HSSFCell cell = row.createCell((short) 1); - cell.setCellValue("This is a test of fonts"); - cell.setCellStyle(style); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - POIFSFileSystem fs = - new POIFSFileSystem(new FileInputStream("workbook.xls")); - HSSFWorkbook wb = new HSSFWorkbook(fs); - HSSFSheet sheet = wb.getSheetAt(0); - HSSFRow row = sheet.getRow(2); - HSSFCell cell = row.getCell((short)3); - if (cell == null) - cell = row.createCell((short)3); - cell.setCellType(HSSFCell.CELL_TYPE_STRING); - cell.setCellValue("a test"); - - // Write the output to a file - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet s = wb.createSheet(); - HSSFRow r = null; - HSSFCell c = null; - HSSFCellStyle cs = wb.createCellStyle(); - HSSFFont f = wb.createFont(); - HSSFFont f2 = wb.createFont(); - - cs = wb.createCellStyle(); - - cs.setFont( f2 ); - //Word Wrap MUST be turned on - cs.setWrapText( true ); - - r = s.createRow( (short) 2 ); - r.setHeight( (short) 0x349 ); - c = r.createCell( (short) 2 ); - c.setCellType( HSSFCell.CELL_TYPE_STRING ); - c.setCellValue( "Use \n with word wrap on to create a new line" ); - c.setCellStyle( cs ); - s.setColumnWidth( (short) 2, (short) ( ( 50 * 8 ) / ( (double) 1 / 20 ) ) ); - - FileOutputStream fileOut = new FileOutputStream( "workbook.xls" ); - wb.write( fileOut ); - fileOut.close(); -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("format sheet"); - HSSFCellStyle style; - HSSFDataFormat format = wb.createDataFormat(); - HSSFRow row; - HSSFCell cell; - short rowNum = 0; - short colNum = 0; - - row = sheet.createRow(rowNum++); - cell = row.createCell(colNum); - cell.setCellValue(11111.25); - style = wb.createCellStyle(); - style.setDataFormat(format.getFormat("0.0")); - cell.setCellStyle(style); - - row = sheet.createRow(rowNum++); - cell = row.createCell(colNum); - cell.setCellValue(11111.25); - style = wb.createCellStyle(); - style.setDataFormat(format.getFormat("#,##0.0000")); - cell.setCellStyle(style); - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("format sheet"); - HSSFPrintSetup ps = sheet.getPrintSetup() - - sheet.setAutobreaks(true) - - ps.setFitHeight((short)1); - ps.setFitWidth((short)1); - - - // Create various cells and rows for spreadsheet. - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- -
- - HSSFWorkbook wb = new HSSFWorkbook(); - HSSFSheet sheet = wb.createSheet("format sheet"); - HSSFFooter footer = sheet.getFooter() - - footer.setRight( "Page " + HSSFFooter.page() + " of " + HSSFFooter.numPages() ); - - - - // Create various cells and rows for spreadsheet. - - FileOutputStream fileOut = new FileOutputStream("workbook.xls"); - wb.write(fileOut); - fileOut.close(); - -
- - -
-
- -
diff --git a/src/documentation/xdocs/trans/es/hssf/record-generator.xml b/src/documentation/xdocs/trans/es/hssf/record-generator.xml deleted file mode 100644 index 503a1599c1..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/record-generator.xml +++ /dev/null @@ -1,114 +0,0 @@ - - - - -
- Record Generator HOWTO - - - - -
- -
- -
-

- The record generator was born from frustration with translating - the Excel records to Java classes. Doing this manually is a time - consuming process. It's also very easy to make mistakes. -

-

- A utility was needed to take the defintition of what a - record looked like and do all the boring stuff. Thus the - record generator was born. -

-
- -
-

- The record generator takes XML as input and produced the following - output: -

    -
  • A Java file capabile of decoding and encoding the record.
  • -
  • A test class with provides a fill-in-the-blanks implementation of a test case - for ensuring the record operates as designed.
  • -
-

-
-
-

- The record generator is invoked as an Ant target (generate-records). It goes - through looking for all files in src/records/defintitions ending with _record.xml. - It then creates two files; the Java record definition and the Java test case template. -

-

- The records themselves have the following general layout: -

- - The frame record indicates whether there is a border - around the displayed text of a chart. - Glen Stampoultzis (glens at apache.org) - - - - - - - - - - - - ]]> -

- Currently the type can be of type int, float or string. The 'int' - type covers bytes, shorts and integers which is selected using a - size of 1, 2 or 4. An additional type called varword is used to - represent a array of word values where the first short is the length - of the array. The string type generation is only partially - implemented. If choosing string you must select a size of 'var'. -

-

- The Java records are regenerated each time the record generator is - run, however the test stubs are only created if the test stub does - not already exist. What this means is that you may change test - stubs but not the generated records. -

-
-
-

- The record generation works by taking an XML file and styling it - using XLST. Given that XSLT is a little limited in some ways it was - necessary to add a little Java code to the mix. -

-

- See record.xsl, record_test.xsl, FieldIterator.java, - RecordUtil.java, RecordGenerator.java -

-
-
-

- The record generator does not handle all possible record types and - is not ment to. Sometimes it's going to make more sense to generate - the records manually. The main point of this thing is to make the - easy stuff simple. -

-

- Currently the record generator is optimized to create Excel records. - It could be adapted to create Word records with a little poking - around. -

-

- Currently the the XSL file that generates the record calls out to - Java objects. This would have been better done as Javascript inside - the XSL file itself. The Java code for the record generation is - currently quite messy with minimal comments. -

-
-
- -
diff --git a/src/documentation/xdocs/trans/es/hssf/use-case.xml b/src/documentation/xdocs/trans/es/hssf/use-case.xml deleted file mode 100644 index ee37cf59d8..0000000000 --- a/src/documentation/xdocs/trans/es/hssf/use-case.xml +++ /dev/null @@ -1,182 +0,0 @@ - - - - -
- HSSF Use Cases - - - -
- -
-
- -

Primary Actor: HSSF client

-

Scope: HSSF

-

Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF client- wants to read content - of HSSF file
  • -
  • HSSF - understands HSSF file
  • -
  • POIFS - understands underlying POI - file system
  • -
-

Precondition: None

-

Minimal Guarantee: None

-

Main Success Guarantee:

-
    -
  1. HSSF client requests HSSF to read - a HSSF file, providing an InputStream - containing HSSF file in question.
  2. -
  3. HSSF requests POIFS to read the HSSF - file, passing the InputStream - object to POIFS (POIFS use case 1, read existing file system)
  4. -
  5. HSSF reads the "Workbook" - file (use case 4, read workbook entry)
  6. -
-

Extensions:

-

2a. Exceptions -thrown by POIFS will be passed on to the HSSF client.

-
-
- -

Primary Actor: HSSF client

-

Scope: HSSF

-

Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF client- wants to write file - out.
  • -
  • HSSF - knows how to write file - out.
  • -
  • POIFS - knows how to write file - system out.
  • -
-

Precondition:

-
    -
  • File has been - read (use case 1, read existing HSSF file) and subsequently modified - or file has been created (use case 3, create HSSF file)
  • -
-

Minimal Guarantee: None

-

Main Success Guarantee:

-
    -
  1. HSSF client - provides an OutputStream to - write the file to.
  2. -
  3. HSSF writes - the "Workbook" to its associated POIFS file system (use case - 5, write workbook entry)
  4. -
  5. HSSF - requests POIFS to write its file system out, using the OutputStream - obtained from the HSSF client (POIFS use case 2, write file system).
  6. -
-

Extensions:

-

3a. Exceptions -from POIFS are passed to the HSSF client.

- -
-
- -

Primary Actor: HSSF client

-

Scope: HSSF

-

-Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF client- wants to create a new - file.
  • -
  • HSSF - knows how to create a new - file.
  • -
  • POIFS - knows how to creat a new - file system.
  • -
-

Precondition:

-

Minimal Guarantee: None

-

Main Success Guarantee:

-
    -
  1. HSSF requests - POIFS to create a new file system (POIFS use case 3, create new file - system)
  2. -
-

Extensions: -None

- -
-
-

Primary Actor: HSSF

-

Scope: HSSF

-

-Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF - knows how to read the - workbook entry
  • -
  • POIFS - knows how to manage the file - system.
  • -
-

Precondition:

-
    -
  • The file - system has been read (use case 1, read existing HSSF file) or has - been created and written to (use case 3, create HSSF file system; - use case 5, write workbook entry).
  • -
-

Minimal -Guarantee: None

-

Main Success Guarantee:

-
    -
  1. - HSSF requests POIFS for the "Workbook" file
  2. -
  3. POIFS returns - an InputStream for the file.
  4. -
  5. HSSF reads - from the InputStream provided by POIFS
  6. -
  7. HSSF closes - the InputStream provided by POIFS
  8. -
-

Extensions:

-

3a. Exceptions -thrown by POIFS will be passed on

-
-
- - -

Primary Actor: HSSF

-

Scope: HSSF

-

-Level: Summary

-

Stakeholders and Interests:

-
    -
  • HSSF - knows how to manage the - write the workbook entry.
  • -
  • POIFS - knows how to manage the file - system.
  • -
-

Precondition: -

-
    -
  • Either an existing HSSF file has - been read (use case 1, read existing HSSF file) or an HSSF file has - been created (use case 3, create HSSF file).
  • -
-

Minimal Guarantee: None

-

Main Success Guarantee:

-
    -
  1. HSSF - checks the POIFS file system directory for the "Workbook" - file (POIFS use case 8, read file system directory)
  2. -
  3. If "Workbook" is in the directory, HSSF requests POIFS to - replace it with the new workbook entry (POIFS use case 4, replace file - in file system). Otherwise, HSSF requests POIFS to write the new - workbook file, with the name "Workbook" (POIFS use case 6, - write new file to file system)
  4. -
-

Extensions:None

-
- -
- -
diff --git a/src/documentation/xdocs/trans/es/index.xml b/src/documentation/xdocs/trans/es/index.xml deleted file mode 100644 index 083e0eac08..0000000000 --- a/src/documentation/xdocs/trans/es/index.xml +++ /dev/null @@ -1,160 +0,0 @@ - - - - -
- Bienvenido a POI - - - - - -
- - -
-
-

- El proyecto de traducción de la documentación de POI ha comenzado. - Los primeros en empezar han sido las traducciones al Español - , al Japonés y al - Alemán. - Otras serán bienvenidas. ¡Podéis participar si queréis! -

-
-
-

- Se ha terminado el plazo de votaciones para el concurso de logotipo de POI. Gracias por vuestros votos. -

- - - -
-
-
-

- El proyecto POI consiste en APIs para manipular varios formatos de ficheros, - basados en el formato de Documento Compuesto OLE 2 de Microsoft, utilizando Java puro. -

-

- Entre los ficheros basados en el formato de Documento Compuesto OLE 2 de Microsoft se - incluyen la mayor parte de los ficheros de Microsoft Office tales como XLS y DOC. -

-

- Como regla general intentamos colaborar lo más posible con otros proyectos para proporcionar esta - funcionalidad. Algunos ejemplos: Cocoon para - el que pronto encontraréis generadores y serializadores para nuestros proyectos; - Open Office.org con quienes colaboramos en la - documentación del formato XLS; y Lucene para - el que pronto tendremos intérpretes del formato de fichero. Cuando sea práctico, donaremos componentes - directamente a aquellos proyectos para dotarles de capacidad-POI. -

-
-

- Abordaremos esto a nivel de componente. POI se refiere al proyecto completo. -

-

- Así que, ¿por qué debería utilizar POIFS o HSSF? -

-

- Utilizarías POIFS si tuvieras un documento escrito en el Formato de Documento Compuesto OLE 2, probablemente - escrito utilizando MFC, que necesitaras leer en Java. Alternativamente, utilizarías POI para escribir - en el Formato de Documento Compuesto OLE 2 si necesitaras inter-operar con programas ejecutándose en la - plataforma Windows. No nos estamos jactando cuando decimos que POIFS es la adaptación más completa y correcta - de este formato de fichero hasta la fecha. -

-

- Utilizaríás HSSF si necesitaras leer o escribir un fichero XLS (Excel) utilizando Java. También se pueden - leer y modificar hojas de cálculo utilizando este API, aunque ahora mismo la escritura está más madura. -

-
- -
-

- POI significa Implementación Pobre de Ofuscación (Poor Obfuscation Implementation). ¿Por qué daríamos a - nuestro proyecto un nombre tan derogatorio? Bien, el Formato de Documento Compuesto OLE 2 de Microsoft es - algo bastante mal concebido. Esencialmente es un fichero estructurado muy a la manera del viejo sistema FAT - del DOS. Redmon eligió, en vez de utilizar tar, gzip, o arc, inventar su propio formato de fichero que no - proporciona ningún estándar de cifrado o compresión, no es fácil de unir con otros ficheros del mismo tipo, y - es dado a sufrir problemas de fragmentación. -

-

- Poi también es una delicatessen Hawaiiana que el diccionario Merriam Webster's - define como "Comida Hawaiiana de raiz de taro cocinada, machacada y amasada en una pasta que a menudo se deja fermentar." - Esto extrañamente parecía una descripción del formato del fichero. -

-

- Así que si te gustan los acrónimos, entonces POI es un acrónimo. Si los odias, entonces sólo usamos el nombre de la - comida para nuestro proyecto. Si deseas expresar amor u odio por los acrónimos, utiliza POI o Poi respectivamente - para referirte al proyecto. -

-
- -
- - -
-
-

Un concepto erróneo es que POI escribe ficheros Excel. POI es el nombre del proyecto. POI contiene varios - componentes, uno de los cuales, HSSF, escribe ficheros Excel. Siguen a continuación los componentes del - proyecto POI completo y un pequeño sumario de su propósito.

-
-
-

POIFS es la parte más vieja y más estable del proyecto. Es nuestra adaptación del Formato de Documento Compuesto - OLE 2 a Java puro. Soporta funcionalidad de lectura y escritura. Todos nuestros componentes se sirven de él por - definición. Por favor, vea la página del proyecto POIFS para más información.

-
-
-

HSSF es nuestra adaptación del formato de fichero de Microsoft Excel 97(-2002) a Java puro. Soporta lectura y - escritura. Por favor, vea la página del proyecto HSSF para más información.

-
-
-

HDF es nuestra adaptación del formato de fichero de Microsoft Word 97 a Java puro. Soporta lectura y escritura. - Por favor, vea la página del proyecto HDF para más información. Este - componente está en la fase inicial de diseño. ¡Salta dentro!

-
-
-

HPSF es nuestra adaptación del formato de conjunto de propiedades OLE 2 a java puro. - Los conjuntos de propiedades se utilizan mayoritariamente para almacenar las propiedades de un documento - (título, autor, fecha de la última modificació, etc.), pero también pueden ser utilizados para propósitos - específicos de una aplicación. Actualmente HPSF soporta sólo funcionalidad de lectura. Por favor, vea - la página del proyecto HPSF para más información.

-
- -
- -
-

El Serializador HSSF, que era parte de nuestra release 1.0 y de las últimas compilaciones en - Sourceforge, ha sido donado al proyecto - Cocoon, y está disponible a partir de la versión - 2.0.2.

-
- -
-

- ¿Así que te gustaría contribuir al proyecto? ¡Genial! Necesitamos gente entusiasta, que trabaje duro, - que tenga talento para ayudarnos con el proyecto en varias áreas. ¡La primera es petición de nuevas - funciones y aviso de errores! La segunda es documentación - estaremos a tu entera disposición si - tienes alguna crítica o te gustaría contribuir o mejorar de alguna forma la documentación. ¡Especialmente - no nos vendría mal algo de ayuda en documentar el formato de fichero HSSF! ¡Por último, aunque no por ello - menos importante, nos vendría bien algunos programadores Java que mastiquen binario, para que le echen el diente - a la convolución que caracteriza los formatos de fichero de Microsoft y para que nos ayude a adaptar nuevos - formatos a una plataforma Java superior! -

-

¡Así que si estás motivado, listo, y tienes tiempo, únete a las listas de correo y estaremos encantados de ayudarte a - empezar en el proyecto! -

- - -
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
- - - diff --git a/src/documentation/xdocs/trans/es/news.xml b/src/documentation/xdocs/trans/es/news.xml deleted file mode 100644 index 74fc28cdb9..0000000000 --- a/src/documentation/xdocs/trans/es/news.xml +++ /dev/null @@ -1,209 +0,0 @@ - - - - -
- - - - - -
- - -
-

- Estos son artículos/etc. sobre POI publicados en la red. Si ves - alguna noticia sobre POI o algún sitio en el que se le menciona - de manera notable (no en tu propia página en la que pones la palabra - POI para que enlacemos contigo y en la que casualmente hay una foto - de tu mujer e hijos) entonces envía un parche a la lista. En general - dedicaremos el mismo tiempo así que no tengáis reparos en enviar - difamaciones coléricas así como comentarios favorables, técnicos y - objetivos. No mencionaremos mensajes realmente estúpidos (lo sentimos). -

-
-
-
    -
  • - Discusión sobre el uso de POI en AS/400s -
  • -
  • - Discusión de cuando casi tuvimos POI como el filtro para KOffice si asuntos de política y licencias no lo hubieran condenado al fracaso -
  • -
  • - Discusión Java en O'Reilly Network incluyendo discusión sobre POI - O'Reilly.net -
  • -
  • - Poor Obfuscation Implementation (Implementación Pobre de Ofuscación). - Blog de David M. Johnson -
  • -
  • - - Descarga POI 1.5-dev-rc2 - JSurfer -
  • - -
  • - Google dice que somos los más importantes en nuestra categoría -
  • -
  • - It's POI-fect - Tony Sintes, Javaworld -
  • -
  • - - Nicola anuncia código de serialización POI - - Matthew Langham's Radio Weblog -
  • -
  • - - Descarga Jakarta POI 1.4583 - JavaLobby -
  • -
  • - - El proyecto POI se mueve a Jakarta (OLE 2 CDF/Excel/Word en - java puro) - JavaLobby -
  • -
  • - - Listado de bibliotecas Java para leer y escribir ficheros de imágenes y documentos - - Página de Marco Schmidt (normalmente no anunciaríamos - la página personal de nadie, pero es un listado extensivo de información - que incluye "alternativas a POI" (para aquellos que son muy ricos). ¡Pero, qué - l3ch3s, creo que voy a marcar esta página para mí puesto que tiene enlaces a - direcciones que en conjunto contienen o enlazan con todo el conocimiento de la humanidad!) -
  • -
  • - - Experiencias de un Operador (Måns af Klercker) - - radio.weblogs.com -
  • -
  • - - DATACONV - Herramientas de Conversión de Datos: Office - DATACONV -
  • -
  • - - Página del Desarrollador de Chicago - -
  • -
  • - - POI/Proyecto de Serialización de POI - - Tío, sabes que estás en lo alto cuando le gustas a - O'Reilly. ;-) -
  • -
  • - - Noticias en la Red - - Java World -
  • - -
-
-
-
    -
  • - - Een Excel-werkboek maken vanuit Java - Lieven Smits - -
  • -
-
-
-
    -
  • Apache POI verffentlicht - entwicker.com -
  • -
  • - - Apache Jakarta-Projekt bringt Word und Excel in die Java-Welt - jsp-develop.de (for the misguided who use JSP ;-) ) -
  • -
  • - - Neues Apache-Projekt bringt Word- und Excel nach Java - - entwickler.com -
  • -
-
-
-
    -
  • - - OLE2 desde Java nativo - - javaHispano -
  • - -
-
-
-
    -
  • - - Excel/OLE accessibles - - Da Linux French Page -
  • -
  • - Discusión sobre POI en francés -
  • -
-
-
-
    -
  • - 100% PureJava... - Dr. Panda Portal -
  • -
  • - - What's new with java? - - gimlay.org -
  • -
  • Java?Excel????? - appears to show how to use Japanese with POI
  • -
  • Various discussion in Japanese including on POI
  • -
  • Japanese resources on Jakarta projects including POI
  • -
  • Kishida's site various information, includes a snip about POI and Japanese.
  • -
-
-
-
    -
  • - - Probably a translation of the Javalobby announcement of 1.5-final - Java-??????? -
  • -
-
-
-
    -
  • - Various discussion in Korean about Excel output/APIs including POI -
  • -
-
-
-

- ¡Si entiendes alguno de estos idiomas, envía un correo a la lista - diciéndonos en qué idioma está escrito y lo pondremos donde corresponda! -

-
    -
  • - - Si tuviera que adivinar, diría que es Tailandés, pero a lo mejor - alguno de vosotros lo sabe con seguridad. - javacentrix.com -
  • -
-
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/trans/es/overview.xml b/src/documentation/xdocs/trans/es/overview.xml deleted file mode 100644 index cb23c66100..0000000000 --- a/src/documentation/xdocs/trans/es/overview.xml +++ /dev/null @@ -1,72 +0,0 @@ - - - - -
- Descripción General - - - - -
- - -
-

El proyecto POI es el proyecto principal para el desarrollo de adaptaciones (ports) - en Java puro de los formatos de fichero basados en el Formato de Documento Compuesto OLE 2 - de Microsoft. El Formato de Documento Compuesto OLE 2 lo utilizan los Documentos Office de - Microsoft, así como por los programas que utilizan conjuntos de propiedades MFC para serializar - sus objetos de tipo documento. -

-
-
-

- Los siguientes son adaptaciones, paquetes o componentes contenidos en el proyecto POI. -

-
-

- POIFS es el conjunto de APIs - (Interfaces de Aplicación) para la lectura y escritura del Formato de Documento Compuesto OLE 2 - utilizando (únicamente) Java. -

-
- -
-

- HSSF es el conjunto de APIs para la lectura y - escritura de hojas de cálculo de Microsoft Excel 97(-XP) utilizando (únicamente) Java. -

-
- -
-

- HDF es el conjunto de APIs para la lectura y - escritura de documentos Word 97(-XP) de Microsoft utilizando (únicamente) Java. -

-
- -
-

- HPSF es el conjunto de APIs para la lectura - de conjuntos de propiedades utilizando (únicamente) Java. -

-
- -
-

- POI-Utils son artefactos de propósito - general surgidos en el desarrollo de POI que no han sido implementados en ningún otro sitio. - Siempre buscamos donarlos y mantenerlos como parte de una biblioteca general utilizada en - algún otro proyecto. Estas son cosas que necesitamos para completar nuestra misión pero que - generalmente estan fuera de ella. -

-
-
- -
- - Copyright (c) @year@ The Poi Project All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/trans/es/patches.xml b/src/documentation/xdocs/trans/es/patches.xml deleted file mode 100644 index 92cea31a93..0000000000 --- a/src/documentation/xdocs/trans/es/patches.xml +++ /dev/null @@ -1,23 +0,0 @@ - - - -
Patch Queue

- This is an informal list - in chronological order - - of some of the noteworthy patches that have been posted - to the developers mailing list. - These patches are not (yet) part of the Poi project, but need reviewing for possible - inclusion. This system was instituted because, due to the large volume of mail and - the lack of time of the committers, some patches tended to get forgotten about. This - queue does not guarantee that any patch will be reviewed within a reasonable time frame, - but it does at least make them easier to find! -

Reviewers wanted! - If you have time to review and/or test these patches, - we would be grateful for your time. Please post comments to the dev mailing lists. -

- Before submitting a patch, please read the page on Third-Party - Contributions. The preferred submission method for patches is: -

  • Post to Poi developers list
  • Describe the patch, the reason for it and (if necessary) why this is important.
  • Generate the patch in diff -u format from CVS
  • Also generate a documentation patch or new file, if this is something that should be documented. -
  • Post as an attachment rather than inline (unless it is trivially small).

Following the above guidelines will facilitate your patch being reviewed - and applied efficiently.

[Under Construction] Archive links will be added later. - Please do not bother the patch submitters/authors without first reading the - relevant post(s) in the mailing list archives.

Vapourware will not be listed.

idSummaryReviewerResolutionStatus

See also additional list of patches to be added in To Do. -

diff --git a/src/documentation/xdocs/trans/es/todo.xml b/src/documentation/xdocs/trans/es/todo.xml deleted file mode 100644 index e1052ac763..0000000000 --- a/src/documentation/xdocs/trans/es/todo.xml +++ /dev/null @@ -1,60 +0,0 @@ - - - - - - - - - - - - - - - - - - Terminar HDF - - - Terminar Gráficas (Charts) - - - Terminar Fórmulas. - - - - - - Exponer funcionalidad de registros a bajo nivel en API de más alto nivel - - - Implementar más tipos de registros (para otras cosas ... no estoy - seguro de lo que significará esto). - - - Implementar más tipos de registros (para otras cosas ... no estoy - seguro de lo que significará esto). - - - Añadir más comprobaciones evidentes (para cuando los usuarios del API - hagan cosas que "no pueden" hacer) - - - Añadir soporte para gráficos embebidos y cosas similares. - - - Crear un nuevo objeto adaptador para manejar registros MulBlank, MulRk, Rk. - - - Añadir alguna manera de copiar hojas. - - - - - - diff --git a/src/documentation/xdocs/trans/es/who.xml b/src/documentation/xdocs/trans/es/who.xml deleted file mode 100644 index 71ca8eef1f..0000000000 --- a/src/documentation/xdocs/trans/es/who.xml +++ /dev/null @@ -1,58 +0,0 @@ - - - - -
- Who we are - - - - -
- - - -
-

- The Poi Project operates on a meritocracy: the more you do, the more - responsibility you will obtain. This page lists all of the people who have - gone the extra mile and are Committers. If you would like to get involved, - the first step is to join the mailing lists. -

- -

- We ask that you please do not send us emails privately asking for support. - We are non-paid volunteers who help out with the project and we do not - necessarily have the time or energy to help people on an individual basis. - Instead, we have set up mailing lists which often contain hundreds of - individuals who will help answer detailed requests for help. The benefit of - using mailing lists over private communication is that it is a shared - resource where others can also learn from common mistakes and as a - community we all grow together. -

- -
-
    -
  • Stefano Mazzocchi (stefano at apache dot org) -
  • -
-
- -
-
    -
  • Andrew C. Oliver (acoliver at apache dot org)
  • -
  • Marc Johnson (mjohnson at apache dot org)
  • -
  • Glen Stampoultzis (glens at apache.org)
  • -
  • Rainer Klute (klute at apache dot org)
  • -
  • Nicola Ken Barozzi (barozzi at nicolaken dot com)
  • -
  • Ryan Ackley (sackley at apache dot org)
  • -
  • Avik Sengupta (avik at apache dot org)
  • -
-
-
- -
-
- - -
diff --git a/src/documentation/xdocs/trans/guidelines.xml b/src/documentation/xdocs/trans/guidelines.xml deleted file mode 100644 index 61ab1cf1fd..0000000000 --- a/src/documentation/xdocs/trans/guidelines.xml +++ /dev/null @@ -1,127 +0,0 @@ - - - - -
- Welcome to POI - - - -
- - -
-

This document hopes to serve as a general introduction and helpful set of - guidelines for translating POI documentation into other languages. We hope - to capture both general information here (such as "how do I test my changes") - as well as language specific guidelines and translation conventions.

-
-
-

- POI's XML based documentation is built along side the sources. To build poi's documentation - you run "./build.sh docs" (UNIX/cygwin) or "build docs" (Windows) from the jakarta-poi - directory. This will put the documentation under the build/docs directory, you can navigate - there using your browser generally by typing in the path name or File -> Open new web location - (or some similar wording) - and browsing to the "index.html" file. You may also want to run "./build.sh clean docs" or - "build clean docs" so that all documentation previously built is erased before running the build. - The words "clean" and "docs" are called "targets", from here on out we will refer to them as - "targets" in which case you may assume you type "./build.sh" or "build" before them in order to - execute them. -

-

- To generate all of teh documentation such as it would appear on the - POI Website you can execute the "site" target (optionally - preceeded by the "clean" target. -

-

- The source for POI's XML documentation is in src/documentation/xdocs. To edit one of these files you can use - a standard text editor. Translated documentation is under src/documentation/xdocs/trans/xx, where xx is a - two to three letter country code, in general this should match the internet domain suffix of the country where - that language generally evolved or just be generally recognizable and unique. The directory structure under - src/documentation/trans/xx should match the structure of src/documentation (the English edition) minus the - trans directory. -

-

- The translated documentation should match the content and meaning of the "master" or English documentation. - All documentation should originate in English (this is for simplicity). While documentation written in other - languages is certainly welcome, it must first be translated (perhaps by posting it to the mail list and - requesting it be translated) into English and applied to the master before being applied to a translation. -

-

- We prefer you donate translations directly to the Jakarta POI - project rather than hosting them offsite. We will make every effort to accomidate you as we greatly appreciate your - efforts. However, we understand that sites located within a country are the fastest and most searchable. Therefore, - we recommend and welcome folks mirroring the POI site and making the translated page the home page. You can do this - either via an HTML copy with some appropriate software - or the preferred method of executing the POI build directly. You can contact us via the mail list for both push and - pull options. The same scripts which regenerate the POI website every 2 hours, should work for others. These are not - yet in CVS as they are nasty dirty shell scripts ;-). If you mirror us, tell us so we can link you. (This will help google - associate you strongly with the project) -

-

- Submitting translations is simple, you follow the same - instructions as you would for submitting a code patch. - Remeber to always generate patchs in diff -u format preserving the context relative to the jakarta-poi directory. Also remember - to submit any new files in a directory preserving archive format. Never post these to the list, always use - Bugzilla - and create attachments per the above linked instructions. -

-
-
-

- Some people feel uncomfortable putting themselves in the <authors> tags at the top of the documentation as they feel that - translation does not give them the right to claim authorship. Please don't feel this way, please add yourself to the authors - tags. It can be assumed that authors on the master documentation are all content creators and any additional authors listed - on the translation that are not on the master document are translators of the documentation. You authored the xx language - version of the document and should freely add yourself there. Additionally, please supply a patch to the - Who We Are page noting you as a developer once you've submitted a few translation patches. You deserve - credit and it helps the project to give you credit. Remember documentation is on par with code contribution. -

-
-
-

- To start a translation for a language not already in existance you must create a directory under src/documentation/xdocs/trans with a - two or three letter designation of the country where the language originated. (For example es = Spanish, de = German) - Copy the book.xml and index.xml file from src/documentation/xdocs directory into the src/documentation/xdocs/trans/xx directory. - Change all paths in the book.xml and index.xml to match the relative location of the English version. For example if there is a - link in index.html that references ./poifs/index.html, you'd change that to ../../poifs/index.html (up 2 directories from trans/xx). - Create a link from the book.xml file in the src/documentation/trans directory (this is necessary or the build will ignore your - documentation) similar to the other languages. - Run the clean target followed by the docs target. If the build is successful, congradulations! If it fails, you probably got one of - the relative paths incorrect! Go fix it (the first error message generally contains the most useful information). If you need help - post to the poi-dev list and ask for it (send the output from the build). -

-

- So now you have a directory with a copy of the index from the master documentation...so what? Well now translate book.xml and index.xml. - Try to build again. It probably won't work. Why? The encoding. At the top of every file there is an encoding="UTF-8" (in general). - This encoding will work for many Western European languages, but not for others, or will require some nasty escape sequencing. This is - where trial and error + guess work come in. This Table of encodings may help. There is a - catch. Your encoding should work on a Linux system under Java 1.3.1 and of course with the build in general. If in doubt, ask. - (This is a practical consideration as thats the setup of the machine currently running the nightly/site builds.) -

-
-
-

- Andy Oliver is the cofounder of the POI project and one of its most active documentation contributers. Well, Andy used to think he - spoke very clearly until he traveled abroad and discovered his speech was composed almost entirely of coloquialisms. This can make some - of the POI documentation difficult to translate, if in doubt...ask. Its also appropriate to eliminate these from the master documentation - where it makes it clearer. -

-
-
-

- In addition to the above practical guidelines we hope to come up with a set of translation guidelines here (or linked from here) for - general use as well as language specific translation guidelines and conventions. We assume that the POI translators will document - them here as they develop. -

-
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/trans/index.xml b/src/documentation/xdocs/trans/index.xml deleted file mode 100644 index 200067493b..0000000000 --- a/src/documentation/xdocs/trans/index.xml +++ /dev/null @@ -1,46 +0,0 @@ - - - - -
- POI Documentation Translations - - - -
- - -
-

- The POI project has always had a very international following. In fact even today POI is - more popular in Europe than in the US where the original founders reside. Today POI is - developed by people from all over the world including India, Austrailia, Japan, and Russia. - We recognize and welcome our geographically and culturally diverse users and contributors and - wish to accomidate you as best as we can. -

-

- Documentation has always been a cornerstone to POI's success. No matter how fluent one is in - a second language, it is always easier to comprehend concepts explained in one's native language, - therefore, we encourage volunteers to come and help us translate the documentation into other - languages. Below is a list of the translations that are currently in progress and their status. -

-
-
- - - - - - - - -
xxLanguageStatusOn/OffsiteLink
deGermanJust beginning (help!)Onlink
esSpanishIndex page, some auxiliary pages translatedOnlink
jpJapaneseComplete XML translation, a few releases oldOfflink
-
- -
- - Copyright (c) @year@ The Apache Software Foundation All rights reserved. - $Revision$ $Date$ - -
-
diff --git a/src/documentation/xdocs/utils/book.xml b/src/documentation/xdocs/utils/book.xml deleted file mode 100644 index d335bf38b4..0000000000 --- a/src/documentation/xdocs/utils/book.xml +++ /dev/null @@ -1,19 +0,0 @@ - - - - - - - - - - - - - - - - - diff --git a/src/documentation/xdocs/utils/index.xml b/src/documentation/xdocs/utils/index.xml deleted file mode 100644 index f3a554ea4b..0000000000 --- a/src/documentation/xdocs/utils/index.xml +++ /dev/null @@ -1,34 +0,0 @@ - - - - -
- Poi Utils - Overview - - - -
- - -
- -

The POI Utils are classes we're looking to donate elsewhere and include. - These are usually classes that while are required for our mission, - are somehow outside of it. General utilities that could be used in - any project are what we would normally put here. If you see one, and you - think "gee that would be great as part of X project" then let us know. - While we wish to put these in their rightful place, we also don't want to - include a 40mb jar file just to process text strings, so that will - be a consideraton. -

-

- Currently, we're looking into which subprojects in the Jakarta Commons project - to donate these too. The "Poi Utils" package won't go away, as there may - be later classes. The idea is that "go ahead and add it, we'll merge it or - find an alternative later, just keep pounding out that poi!" -

- -
- -
diff --git a/src/documentation/xdocs/utils/logging.xml b/src/documentation/xdocs/utils/logging.xml deleted file mode 100644 index c687592e12..0000000000 --- a/src/documentation/xdocs/utils/logging.xml +++ /dev/null @@ -1,84 +0,0 @@ - - - - -
- Poi Utils - Overview - - - - -
- - -
- -

- Logging in POI is used only as a debugging mechanism, not a normal runtime - logging system. Logging is ONLY for autopsie type debugging, and should - NEVER be enabled on a production system. Enabling logging will reduce - performance by at least a factor of 100. If you are not developing - POI or trying to debug why POI isn't reading a file correctly, then DO - NOT enable logging. You've been warned. -

- -

- Hence, we need to be able to easily disable it entirely and make POI not dependent - on any logging package. -

- - - POI is not dependent on commons-logging for running, but not for compiling. - - -
-

- Every class uses a POILogger to log, and gets it using a static method - of the POILogFactory . -

-

- The POILogFactory uses the NullLogger by default; - it can be instructed to use any other POILogger implementation - by setting the system property org.apache.poi.util.POILogger. -

- java -Dorg.apache.poi.util.POILogger=the.package.of.MyPoiLoggerImpl ProgramThatUsesPoi - - Still needs testing. - -
- -
-

- Each class in POI can get its POILogger by calling a static method - of the POILogFactory . -

-
- -
-

- Each class in POI can log using a POILogger, which is an abstract class. - We decided to make our own logging facade because:

-
    -
  1. we need to log many values and we put many methods in this class to facilitate the - programmer, without having him write string concatenations;
  2. -
  3. we need to be able to use POI without any logger package present.
  4. -
-

There are three implementations available, and you can roll out your own, just - extend org.apache.poi.util.POILogger. -

-
-

Discards every logging request.

-
-
-

Sends every logging request to System.out.

-
-
-

Sends every logging request to the Commons Logging package. This can use JDK1.4 logging, - log4j, logkit, and is an actively maintained Jakarta Project.

-
- -
-
- -
diff --git a/src/documentation/xdocs/who.xml b/src/documentation/xdocs/who.xml deleted file mode 100644 index 276dc21ca9..0000000000 --- a/src/documentation/xdocs/who.xml +++ /dev/null @@ -1,62 +0,0 @@ - - - - -
- Who we are - - - - -
- - - -
-

- The Poi Project operates on a meritocracy: the more you do, the more - responsibility you will obtain. This page lists all of the people who have - gone the extra mile and are Committers. If you would like to get involved, - the first step is to join the mailing lists. -

- -

- We ask that you please do not send us emails privately asking for support. - We are non-paid volunteers who help out with the project and we do not - necessarily have the time or energy to help people on an individual basis. - Instead, we have set up mailing lists which often contain hundreds of - individuals who will help answer detailed requests for help. The benefit of - using mailing lists over private communication is that it is a shared - resource where others can also learn from common mistakes and as a - community we all grow together. -

- -
-
    -
  • Stefano Mazzocchi (stefano at apache dot org) -
  • -
-
- -
-
    -
  • Andrew C. Oliver (acoliver at apache dot org)
  • -
  • Marc Johnson (mjohnson at apache dot org)
  • -
  • Glen Stampoultzis (glens at apache.org)
  • -
  • Rainer Klute (klute at apache dot org)
  • -
  • Nicola Ken Barozzi (barozzi at nicolaken dot com)
  • -
  • Ryan Ackley (sackley at apache dot org)
  • -
  • Avik Sengupta (avik at apache dot org)
  • -
  • Shawn Laubach (slaubach at apache dot org)
  • -
-
-
-
    -
  • Jason Height (jheight at chariot dot net dot au)
  • -
  • Agustin Martin (agusmba at terra dot es)
  • -
-
-
- - -