diff options
-rw-r--r-- | docs/design/architecture.xml | 277 | ||||
-rw-r--r-- | docs/design/fotree.xml | 62 | ||||
-rw-r--r-- | docs/design/intro.xml | 26 |
3 files changed, 101 insertions, 264 deletions
diff --git a/docs/design/architecture.xml b/docs/design/architecture.xml index f76374ae7..a9a19e318 100644 --- a/docs/design/architecture.xml +++ b/docs/design/architecture.xml @@ -1,11 +1,11 @@ <?xml version="1.0" standalone="no"?> -<!-- by Arved Sandstrom --> <document> <header> <title>Architecture</title> <subtitle>Architecture information for FOP</subtitle> <authors> + <person name="Arved Sandstrom"/> </authors> </header> @@ -14,20 +14,23 @@ <s1 title="FOP Mechanics"> <s2 title="Introduction"> - <p> -The overall process is controlled by <em>org.apache.fop.apps.Driver</em>. In -this class, a typical sequence is:</p> - -<source>Driver driver = new Driver();<br/> -driver.setRenderer("org.apache.fop.render.pdf.PDFRenderer", version);<br/> -driver.setOutputStream(new FileOutputStream(args[1]));<br/> -driver.render(parser, inputHandler.getInputSource());</source> +The overall process is controlled by <em>org.apache.fop.apps.Driver</em>. +This class handles the FO Tree building, renderers, output and logging. +</p> +<p> +The process in general is that the FO document is sent to the tree +builder via SAX events. This creates an FO Tree. The FO Tree is then +handled by the layout processor which converts the FO Tree into an area +tree. This area tree is then given to the renderer and the renderer converts +the area tree into a stream of data containing the output document. +</p> </s2> <s2 title="Formatting Object Tree"> -<p>The class <em>org.apache.fop.fo.FOTreeBuilder</em> is responsible for actually -constructing the FO tree. The key SAX events used are </p> +<p> +The class <em>org.apache.fop.fo.FOTreeBuilder</em> is responsible for +actually constructing the FO tree. The key SAX events used are </p> <p><code>startElement()</code>,</p> <p><code>endElement()</code> and <code>characters()</code>.</p> @@ -35,160 +38,9 @@ constructing the FO tree. The key SAX events used are </p> <em>org.apache.fop.fo.FONode</em>. The other FO classes inherit from <em>FONode</em> as follows:</p> -<p><code>            FONode</code></p> -<p><code>               |</code></p> -<p><code>     __________|________</code></p> -<p><code>    |                   |</code></p> -<p><code>   FObj               FOText</code></p> -<p><code>    |</code></p> -<p><code>    |___________________</code></p> -<p><code>    |                   |</code></p> -<p><code>  FObjMixed      SequenceSpecifier -</code></p> - -<p><code>Block</code></p> -<p><code>Inline</code></p> -<p><code>BasicLink</code></p> - -</s2> - -<s2 title="FONode"> -<p> -The class inheritance described above only describes the nature of the -content. Every FO in FOP also has a parent, and a Vector of children. The -parent attribute (in the Java sense), in particular, is used to enforce -constraints required by the FO hierarchy. -</p> - -<p> -FONode, among other things, ensures that FO's have a parent, that they -have children, that they maintain a marker of where the layout was up to -(for FObj's it is the child number, and for FOText's it is the character -number), and that they have a <code>layout()</code> method. -</p> -</s2> - -<s2 title="Making FO's"> - -<p> -Every FO class has code that looks something like this: -</p> - -<p><code>public static class Maker extends FObj.Maker {</code></p> -<p><code>   public FObj make(FObj parent, PropertyList propertyList)</code></p> -<p><code>     throws FOPException</code></p> -<p><code>   {</code></p> -<p><code>     return new SimplePageMaster(parent, propertyList);</code></p> -<p><code>   }</code></p> -<p><code>}</code></p> - - -<p> -The class also has a static method that resembles -</p> - -<p><code>public static FObj.Maker maker()</code></p> -<p><code>   {</code></p> -<p><code>     return new PageSequence.Maker();</code></p> -<p><code>   }</code></p> - -<p> -A hash 'fobjTable' exists in <em>FOTreeBuilder</em>, and maps the FO names (such as -'fo:table') to object references to the appropriate factories -(such as <em>Table.Maker</em>). -</p> - -<p> -Properties (recall that FO's have properties, areas have traits, and XML -nodes have attributes) are also a concern of <em>FOTreeBuilder</em>. It -accomplishes this by using a <em>PropertyListBuilder</em>. There is a -separate <em>PropertyListBuilder</em> for each namespace encountered -while building the FO tree. Each Builder object contains a hash of -property names and <ref>their</ref> respective makers. It may also -contain element-specific property maker hashes; these are based on the -<em>local name</em> of the flow object, ie. <em>table-row</em>, not -<em>fo:table-row</em>. If an element-specific property mapping exists, -it is preferred to the generic mapping.</p> -<p>The base class for all -properties is <em>Property</em>, and all the property makers extend -<em>Property.Maker</em>. A more complete discussion of the property -architecture may be found in <jump href="properties.html">Properties</jump>. -</p> -</s2> -<s2 title="FO Formatting"> - -<p> -<em>FOTreeBuilder</em> calls <code>format()</code> on the root FO, passing -it the <em>AreaTree</em> -reference. In turn, <em>Root</em> calls <code>format()</code> on each -<em>PageSequence</em>, passing <ref>it</ref> -the <em>AreaTree</em> reference. -</p> - -<p> -The <em>PageSequence</em> <code>format()</code> method does the following things: -</p> - -<ol> -<li>Makes a <em>Page</em>, using <em>PageMasterFactory</em> to produce a -<em>PageMaster</em>, and -using <code>makePage()</code> in the latter class. In the simplest picture, -a <em>Page</em> has -5 areas represented by <em>AreaContainers</em>;</li> - -<li>Handles layout for <em>StaticContent</em> objects in the 'before' and 'after' -regions, if set. This uses the <code>layout()</code> method in -<em>StaticContent</em>;</li> - -<li>If a page break is not forced, it will continue to layout the flow into -the body area (<em>AreaContainer</em>) of the current page;</li> - -<li>It continues with (1) when layout into the current page is done, but -the flow is not empty.</li> -</ol> -</s2> - -<s2 title="Area Layout"> - -<p> -FO's that represent actual areas, starting with <em>Flow</em> and -<em>StaticContent</em>, have -a <code>layout()</code> method, with the following signature: -</p> - -<p> -<code> - public Status layout(Area area) -</code> -</p> - -<p> -The fundamental role of the <code>layout()</code> method is to manage the layout of -children and/or to generate new areas. -</p> - -<p> -<em>Example</em>: the <code>layout()</code> method for <em>Flow</em> generates no new areas - it manages the -layout of the flow children. -</p> - -<p> -<em>Example</em>: the <code>layout()</code> method for <em>Block</em> -generates a new <em>BlockArea</em> in and of -itself, and also manages the layout of the block children, which are added -to the <em>BlockArea</em> before that is itself added to its parent -<em>Area</em>. -</p> - -<p> -<code>Layout()</code> methods are subject to the general constraint that possibly not -all of their children can be accommodated, and they report back accordingly -with an appropriate <em>Status</em>. -</p> </s2> <s2 title="Rendering"> - <p> This is a separate process. The <code>render()</code> method in <em>Driver</em> is invoked (say, @@ -198,109 +50,8 @@ This actually calls the <code>render()</code> method in a specific implementatio the <em>Renderer</em> interface, typically <em>PDFRenderer</em> or <em>AWTRenderer</em>. </p> - -<p> -At the highest level <em>PDFRenderer</em>, for example, begins by rendering each -<em>Page</em>. The <code>render()</code> method in <em>Page</em> (as is the case for other areas), -invokes a particular method in the renderer of choice, e.g. -<code>renderPage()</code>. -<em>NOTE</em>: this system is bypassed for <em>Page</em>, incidentally. -</p> - -</s2> - - -<!-- by Art Welch --> -<s2 title="Renderers"> - <s3 title="PrintRenderer"> - <p>The PrintRenderer is an abstract base class for print type renderers. Currently the PCL, PDF, and TXT renderers extend from this. This allows as much common functionality to be contained in one place as possible (at least as much as I could consolidate fairly quickly). Unfortunately I have not yet been able to make the renderPage and renderWordArea methods common. This is unfortunate because these methods seem to experience the most activity. Maybe soneone else will have a clever solution to this (without breaking them into a bunch of little bits).</p> - <p>It is my hope that this base class will be useful for other renderers as well.</p> - </s3> - - <s3 title="PCLRenderer"> - <p>The PCLRenderer is a FOP renderer that should produce output as close to identical as possible to the printed output of the PDFRenderer within the limitations of the renderer, and output device.</p> - - <p>The output created by the PCLRenderer is generic PCL 5 as documented in the "HP PCL 5 Printer Language Technical Reference Manual" (copyright 1990). This should allow any device fully supporting PCL 5 to be able to print the output generated by the PCLRenderer.</p> - - <s4 title="Limitations"> - <ul> - <li>Text or graphics outside the left or top of the printable area are not rendered properly. In general things that should print to the left of the printable area are shifted to the right so that they start at the left edge of the printable area and an error message is generated.</li> - <li>The Helvetica and Times fonts are not well supported among PCL printers so Helvetica is mapped to Arial and Times is mapped to Times New. This is done in the PCLRenderer, no changes are required in the FO's. The metrics and appearance for Helvetica/Arial and Times/Times New are nearly identical, so this has not been a problem so far.</li> - <li>Only the original fonts built into FOP are supported.</li> - <li>For the non-symbol fonts, the ISO 8859/1 symbol set is used (PCL set "0N").</li> - <li>Multibyte characters are not supported.</li> - <li>SVG support is limited. Currently only lines, rectangles (may be rounded), circles, ellipses, text, simple paths, and images are supported. Colors are supported (dithered black and white) but not gradients.</li> - <li>Images print black and white only (not dithered). When the renderer prints a color image it uses a threshold value, colors above the threshold are printed as white and below are black. If you need to print a non-monochrome image you should dither it first.</li> - <li>Image scaling is accomplished by modifying the effective resolution of the image data. The available resolutions are 75, 100, 150, 300, and 600 DPI.</li> - <li>Color printing is not supported. Colors are rendered by mapping the color intensity to one of the PCL fill shades (from white to black in 9 steps).</li> - <li>SVG clipping is not supported.</li> - </ul> - </s4> - - <s4 title="Additional Features"> - <p>There are some special features that are controlled by some public variables on the PCLRenderer class.</p> - - <dl> - <dt>orientation</dt> - <dd><p>The logical page orientation is controlled by the public orientation variable. Legal values are:</p> - <ul> - <li>0 Portrait</li> - <li>1 Landscape</li> - <li>2 Reverse Portrait</li> - <li>3 Reverse Landscape</li> - </ul> - </dd> - <dt>curdiv, paperheight</dt> - <dd>The curdiv and paperheight variables allow multiple virtual pages to be printed on a piece of paper. This allows a standard laser printer to use perforated paper where every perforation will represent an individual page. The paperheight sets the height of a piece of paper in decipoints. This will be divided by the page.getHeight() to determine the number of equal sized divisions (pages) that will fit on the paper. The curdiv variable may be read/written to get/set the current division on the page (to set the starting division and read the ending division for multiple invocations).</dd> - <dt>topmargin, leftmargin</dt> - <dd>The topmargin and leftmargin may be used to increase the top and left margins for printing.</dd> - </dl> - </s4> - </s3> - - <s3 title="TXTRenderer"> - <p>The TXTRenderer is a FOP renderer that produces plain ASCII text output that attempts to match the output of the PDFRenderer as closely as possible. This was originally developed to accommodate an archive system that could only accept plain text files. Of course when limited to plain fixed pitch text the output does not always look very good.</p> - <p>The TXTRenderer works with a fixed size page buffer. The size of this buffer is controlled with the textCPI and textLPI public variables. The textCPI is the effective horizontal characters per inch to use. The textLPI is the vertical lines per inch to use. From these values and the page width and height the size of the buffer is calculated. The formatting objects to be rendered are then mapped to this grid. Graphic elements (lines, borders, etc) are assigned a lower priority than text, so text will overwrite any graphic element representations.</p> - </s3> -</s2> - - -<s2 title="UML Diagrams"> -<p> -You can find UML diagramms for all Fop packages (latest release version) -<jump href="http://xml.apache.org/dist/fop/fop-uml.zip">here</jump>.</p> </s2> -<s2 title="SVG"> - -<p> -FOP supports svg rendering. SVG is supported as an instream-foreign-object -embedded in an FO document or as an external SVG image. -</p> - -<p> -If the svg is embedded in an instream-foreign-object then all the elements and -attributes are read directly and converted into an SVG DOM representation -using the Batik library. This is then stored as a DOM until required for rendering. -The rendering process depends on the what type of renderer is being used. -</p> - -<p> -The SVG DOM is rendered in the PDF renderer by using the abitlity of Batik to render -DOM to a Graphics2D. First the DOM is converted into an intermediate representation -then this is rendered to a PDFGraphics2D graphic object which writes the drawing -instructions directly as PDF markup. -</p> - -<p> -The AWTRenderer and the PrintRenderer use Batik directly to draw the SVG image -into the current java Graphics2D context. -</p> - -<p> -For more information see the SVG documentation. -</p> -</s2> </s1> </body> </document> diff --git a/docs/design/fotree.xml b/docs/design/fotree.xml index 9558f2fc2..1f8ed6a3b 100644 --- a/docs/design/fotree.xml +++ b/docs/design/fotree.xml @@ -23,6 +23,68 @@ The FO Tree is used as an intermediatory structure which is converted into the area tree. The complete FO tree should not be held in memory since FOP should be able to handle FO documents of any size. </p> + +<s2 title="FONode"> +<p> +The class inheritance described above only describes the nature of the +content. Every FO in FOP also has a parent, and a Vector of children. The +parent attribute (in the Java sense), in particular, is used to enforce +constraints required by the FO hierarchy. +</p> + +<p> +FONode, among other things, ensures that FO's have a parent, that they +have children, that they maintain a marker of where the layout was up to +(for FObj's it is the child number, and for FOText's it is the character +number), and that they have a <code>layout()</code> method. +</p> +</s2> + +<s2 title="Making FO's"> +<p> +An FO maker is read from a hashmap lookup using the namespace and +element name. This maker is then used to create a new class that +represents an FO element. This is then added to the FO tree as a child +of the current parent. +</p> + +<p> +Properties (recall that FO's have properties, areas have traits, and XML +nodes have attributes) are also a concern of <em>FOTreeBuilder</em>. It +accomplishes this by using a <em>PropertyListBuilder</em>. There is a +separate <em>PropertyListBuilder</em> for each namespace encountered +while building the FO tree. Each Builder object contains a hash of +property names and <ref>their</ref> respective makers. It may also +contain element-specific property maker hashes; these are based on the +<em>local name</em> of the flow object, ie. <em>table-row</em>, not +<em>fo:table-row</em>. If an element-specific property mapping exists, +it is preferred to the generic mapping.</p> +<p>The base class for all +properties is <em>Property</em>, and all the property makers extend +<em>Property.Maker</em>. A more complete discussion of the property +architecture may be found in <jump href="properties.html">Properties</jump>. +</p> +</s2> + +<s2 title="Foreign XML"> +<p> +FOP supports the handlingof foreign XML. +The XML is converted internally into a DOM, this is then available to +the FO tree to convert the DOM into another format which can be rendered. +In the case of SVG the DOM needs to be created with Batik, so an element +mapping is used to read all elements in the SVG namespace and pass them +into the Batik DOM. +</p> +</s2> + +<s2 title="Extensions"> +<p> +It is possible to add extensions to FOP so that you can extend the ability of +FOP with respect to render output, document specific information or extended +layout functionality. +</p> +</s2> + </s1> </body> diff --git a/docs/design/intro.xml b/docs/design/intro.xml index b1aa5db67..f05c42f13 100644 --- a/docs/design/intro.xml +++ b/docs/design/intro.xml @@ -14,6 +14,11 @@ <body> <s1 title="Introduction"> <p> +<note><p> +The information here is still in the early stage, some information +may be wrong or misleading. In general there is not a broad enough +input and it is lacking in depth of information. +</p></note> The information here describes the design and architecture details for FOP. Currently this is part of a redesign process for some of the core parts of FOP. @@ -22,9 +27,28 @@ FOP. The redesign is mainly focusing on some particular process involved with the layout process when converting the FO tree into the Area Tree. </p> +<s2 title="Aims"> <p> - +The main aim for FOP is to comply with the spec and to be able to +process files of any size. + </p> + <p> +In achieving this aim we need to understand the problem and break it +into small problems that can be solved. + </p> + <p> +<ul> +<li>use SAX as input</li> +<li>process FO elements ASAP</li> +<li>dispose of unused memory, keep memory minimal</li> +<li>layout handles floats, footnotes and keeps in a simple straight forward way</li> +<li>id references are kept simple</li> +<li>pages are rendered ASAP, can be cached until resolved</li> +<li>renderers are totally responsible for their output format</li> +<li>output is sent to a stream</li> +</ul> </p> +</s2> </s1> </body> |