You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

complexscripts.xml 32KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647648649650651652653654655656657658659660661662663664665666667668669670671672673674675676677678679680681682683684685686687688689690691692693694
  1. <?xml version="1.0" encoding="UTF-8" standalone="no"?>
  2. <!--
  3. Licensed to the Apache Software Foundation (ASF) under one or more
  4. contributor license agreements. See the NOTICE file distributed with
  5. this work for additional information regarding copyright ownership.
  6. The ASF licenses this file to You under the Apache License, Version 2.0
  7. (the "License"); you may not use this file except in compliance with
  8. the License. You may obtain a copy of the License at
  9. http://www.apache.org/licenses/LICENSE-2.0
  10. Unless required by applicable law or agreed to in writing, software
  11. distributed under the License is distributed on an "AS IS" BASIS,
  12. WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  13. See the License for the specific language governing permissions and
  14. limitations under the License.
  15. -->
  16. <!-- $Id$ -->
  17. <!DOCTYPE document PUBLIC "-//APACHE//DTD Documentation V2.0//EN" "document-v20.dtd">
  18. <document>
  19. <header>
  20. <title>Apache™ FOP: Complex Scripts</title>
  21. </header>
  22. <body>
  23. <section id="overview">
  24. <title>Overview</title>
  25. <p>
  26. This page describes the
  27. <link href="http://en.wikipedia.org/wiki/Complex_scripts">complex scripts</link>
  28. features of Apache™ FOP, which include:
  29. </p>
  30. <ul>
  31. <li>Support for languages written with right-to-left scripts, such as Arabic and Hebrew scripts.</li>
  32. <li>Support for languages written with South Asian and Southeast Asian scripts, such as Devanagari,
  33. Khmer, Tamil, Thai, and others.</li>
  34. <li>Support for advanced substitution, reordering, and positioning of glyphs according to language
  35. and script sensitive rules.</li>
  36. <li>Support for advanced number to string formatting.</li>
  37. </ul>
  38. </section>
  39. <section>
  40. <title>Disabling complex scripts</title>
  41. <p>Complex script features are enabled by default. If some application of FOP does not
  42. require this support, then it can be disabled in three ways:</p>
  43. <ol>
  44. <li>
  45. <strong>Command line:</strong> The command line option <code>-nocs</code> turns off complex script
  46. features: <code>fop -nocs -fo mydocument.fo -pdf mydocument.pdf</code>
  47. </li>
  48. <li>
  49. <strong>Embedding:</strong> <code>userAgent.setComplexScriptFeaturesEnabled(false);</code>
  50. </li>
  51. <li>
  52. <strong>Optional setting in fop.xconf file:</strong>
  53. <pre>
  54. &lt;fop version="1.0"&gt;
  55. &lt;complex-scripts disabled="true"/&gt;
  56. ...
  57. &lt;/fop&gt;
  58. </pre>
  59. </li>
  60. </ol>
  61. <p>
  62. When complex scripts features are enabled, additional information related to bidirectional
  63. level resolution, the association between characters and glyphs, and glyph position adjustments
  64. are added to the internal, parsed representation of the XSL-FO tree and its corresponding
  65. formatted area tree. This additional information will somewhat increase the memory requirements for
  66. processing documents that use these features.
  67. </p>
  68. <note>A document author need not make explicit use of any complex scripts feature in order
  69. for this additional information to be created. For example, if the author makes use of a font
  70. that contains OpenType GSUB and/or GPOS tables, then those tables will be automatically used
  71. unless complex scripts features are disabled.</note>
  72. </section>
  73. <section id="source">
  74. <title>Changes to your XSL-FO input files</title>
  75. <p>
  76. In most circumstances, XSL-FO content does not need to change in order to make use of
  77. complex scripts features; however, in certain contexts, fully automatic processing is not
  78. sufficient. In these cases, an author may make use of the following XSL-FO constructs:
  79. </p>
  80. <ul>
  81. <li>The <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#script"><code>script</code></link> property.</li>
  82. <li>The <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#language"><code>language</code></link> property.</li>
  83. <li>The <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#writing-mode"><code>writing-mode</code></link> property.</li>
  84. <li>The number to string conversion properties:
  85. <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#format"><code>format</code></link>,
  86. <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#grouping-separator"><code>grouping-separator</code></link>,
  87. <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#grouping-size"><code>grouping-size</code></link>,
  88. <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#letter-value"><code>letter-value</code></link>,
  89. and <code>fox:number-conversion-features</code>.</li>
  90. <li>The <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#fo_bidi-override"><code>fo:bidi-override</code></link> element.</li>
  91. <li>Explicit bidirectional control characters: U+200E LRM, U+200F RLM, U+202A LRE,
  92. U+202B RLE, U+202C PDF, U+202D LRO, U+202E RLO.</li>
  93. <li>Explicit join control characters: U+200C ZWNJ and U+200D ZWJ.</li>
  94. </ul>
  95. </section>
  96. <section id="details">
  97. <title>Authoring Details</title>
  98. <p>The complex scripts related effects of the above enumerated XSL-FO constructs are more
  99. fully described in the following sub-sections.</p>
  100. <section id="script_property">
  101. <title>Script Property</title>
  102. <p>In order to apply font specific complex script features, it is necessary to know
  103. the script that applies to the text undergoing layout processing. This script is determined
  104. using the following algorithm:
  105. </p>
  106. <ol>
  107. <li>If the FO element that governs the text specifies a
  108. <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#script"><code>script</code></link>
  109. property and its value is not the empty string or <code>"auto"</code>, then that script is used.</li>
  110. <li>Otherwise, the dominant script of the text is determined automatically by finding the
  111. script whose constituent characters appear most frequently in the text.</li>
  112. </ol>
  113. <p>In case the automatic algorithm does not produce the desired results, an author may
  114. explicitly specify a <code>script</code> property with the desired script. If specified,
  115. it must be one of the four-letter script code specified in
  116. <link href="http://unicode.org/iso15924/iso15924-codes.html">ISO 15924 Code List</link> or
  117. in the <link href="#extended_script_codes">Extended Script Codes</link> table. Comparison
  118. of script codes is performed in a case-insensitive manner, so it does not matter what case
  119. is used when specifying these codes in an XSL-FO document.</p>
  120. <section id="standard_script_codes">
  121. <title>Standard Script Codes</title>
  122. <p>The following table enumerates the standard ISO 15924 4-letter codes recognized by FOP.</p>
  123. <table class="ForrestTable tw30">
  124. <tr>
  125. <th style="text-align:center">Code</th>
  126. <th style="text-align:left">Script</th>
  127. </tr>
  128. <tr>
  129. <td style="text-align:center"><code>arab</code></td>
  130. <td>Arabic</td>
  131. </tr>
  132. <tr>
  133. <td style="text-align:center"><code>beng</code></td>
  134. <td>Bengali</td>
  135. </tr>
  136. <tr>
  137. <td style="text-align:center"><code>bopo</code></td>
  138. <td>Bopomofo</td>
  139. </tr>
  140. <tr>
  141. <td style="text-align:center"><code>cyrl</code></td>
  142. <td>Cyrillic</td>
  143. </tr>
  144. <tr>
  145. <td style="text-align:center"><code>deva</code></td>
  146. <td>Devanagari</td>
  147. </tr>
  148. <tr>
  149. <td style="text-align:center"><code>ethi</code></td>
  150. <td>Ethiopic</td>
  151. </tr>
  152. <tr>
  153. <td style="text-align:center"><code>geor</code></td>
  154. <td>Georgian</td>
  155. </tr>
  156. <tr>
  157. <td style="text-align:center"><code>grek</code></td>
  158. <td>Greek</td>
  159. </tr>
  160. <tr>
  161. <td style="text-align:center"><code>gujr</code></td>
  162. <td>Gujarati</td>
  163. </tr>
  164. <tr>
  165. <td style="text-align:center"><code>guru</code></td>
  166. <td>Gurmukhi</td>
  167. </tr>
  168. <tr>
  169. <td style="text-align:center"><code>hang</code></td>
  170. <td>Hangul</td>
  171. </tr>
  172. <tr>
  173. <td style="text-align:center"><code>hani</code></td>
  174. <td>Han</td>
  175. </tr>
  176. <tr>
  177. <td style="text-align:center"><code>hebr</code></td>
  178. <td>Hebrew</td>
  179. </tr>
  180. <tr>
  181. <td style="text-align:center"><code>hira</code></td>
  182. <td>Hiragana</td>
  183. </tr>
  184. <tr>
  185. <td style="text-align:center"><code>kana</code></td>
  186. <td>Katakana</td>
  187. </tr>
  188. <tr>
  189. <td style="text-align:center"><code>knda</code></td>
  190. <td>Kannada</td>
  191. </tr>
  192. <tr>
  193. <td style="text-align:center"><code>khmr</code></td>
  194. <td>Khmer</td>
  195. </tr>
  196. <tr>
  197. <td style="text-align:center"><code>laoo</code></td>
  198. <td>Lao</td>
  199. </tr>
  200. <tr>
  201. <td style="text-align:center"><code>latn</code></td>
  202. <td>Latin</td>
  203. </tr>
  204. <tr>
  205. <td style="text-align:center"><code>mlym</code></td>
  206. <td>Malayalam</td>
  207. </tr>
  208. <tr>
  209. <td style="text-align:center"><code>mymr</code></td>
  210. <td>Burmese</td>
  211. </tr>
  212. <tr>
  213. <td style="text-align:center"><code>mong</code></td>
  214. <td>Mongolian</td>
  215. </tr>
  216. <tr>
  217. <td style="text-align:center"><code>orya</code></td>
  218. <td>Oriya</td>
  219. </tr>
  220. <tr>
  221. <td style="text-align:center"><code>sinh</code></td>
  222. <td>Sinhalese</td>
  223. </tr>
  224. <tr>
  225. <td style="text-align:center"><code>taml</code></td>
  226. <td>Tamil</td>
  227. </tr>
  228. <tr>
  229. <td style="text-align:center"><code>telu</code></td>
  230. <td>Telugu</td>
  231. </tr>
  232. <tr>
  233. <td style="text-align:center"><code>thai</code></td>
  234. <td>Thai</td>
  235. </tr>
  236. <tr>
  237. <td style="text-align:center"><code>tibt</code></td>
  238. <td>Tibetan</td>
  239. </tr>
  240. <tr>
  241. <td style="text-align:center"><code>zmth</code></td>
  242. <td>Math</td>
  243. </tr>
  244. <tr>
  245. <td style="text-align:center"><code>zsym</code></td>
  246. <td>Symbol</td>
  247. </tr>
  248. <tr>
  249. <td style="text-align:center"><code>zyyy</code></td>
  250. <td>Undetermined</td>
  251. </tr>
  252. <tr>
  253. <td style="text-align:center"><code>zzzz</code></td>
  254. <td>Uncoded</td>
  255. </tr>
  256. </table>
  257. </section>
  258. <section id="extended_script_codes">
  259. <title>Extended Script Codes</title>
  260. <p>The following table enumerates a number of non-standard extended script codes recognized by FOP.</p>
  261. <table class="ForrestTable tw70">
  262. <colgrp>
  263. <col style="width:10%"/>
  264. <col style="width:20%"/>
  265. <col style="width:80%"/>
  266. </colgrp>
  267. <tr>
  268. <th>Code</th>
  269. <th>Script</th>
  270. <th>Comments</th>
  271. </tr>
  272. <tr>
  273. <td style="text-align:center"><code>bng2</code></td>
  274. <td>Bengali</td>
  275. <td>OpenType Indic Version 2 (May 2008 and following) behavior.</td>
  276. </tr>
  277. <tr>
  278. <td style="text-align:center"><code>dev2</code></td>
  279. <td>Devanagari</td>
  280. <td>OpenType Indic Version 2 (May 2008 and following) behavior.</td>
  281. </tr>
  282. <tr>
  283. <td style="text-align:center"><code>gur2</code></td>
  284. <td>Gurmukhi</td>
  285. <td>OpenType Indic Version 2 (May 2008 and following) behavior.</td>
  286. </tr>
  287. <tr>
  288. <td style="text-align:center"><code>gjr2</code></td>
  289. <td>Gujarati</td>
  290. <td>OpenType Indic Version 2 (May 2008 and following) behavior.</td>
  291. </tr>
  292. <tr>
  293. <td style="text-align:center"><code>knd2</code></td>
  294. <td>Kannada</td>
  295. <td>OpenType Indic Version 2 (May 2008 and following) behavior.</td>
  296. </tr>
  297. <tr>
  298. <td style="text-align:center"><code>mlm2</code></td>
  299. <td>Malayalam</td>
  300. <td>OpenType Indic Version 2 (May 2008 and following) behavior.</td>
  301. </tr>
  302. <tr>
  303. <td style="text-align:center"><code>ory2</code></td>
  304. <td>Oriya</td>
  305. <td>OpenType Indic Version 2 (May 2008 and following) behavior.</td>
  306. </tr>
  307. <tr>
  308. <td style="text-align:center"><code>tml2</code></td>
  309. <td>Tamil</td>
  310. <td>OpenType Indic Version 2 (May 2008 and following) behavior.</td>
  311. </tr>
  312. <tr>
  313. <td style="text-align:center"><code>tel2</code></td>
  314. <td>Telugu</td>
  315. <td>OpenType Indic Version 2 (May 2008 and following) behavior.</td>
  316. </tr>
  317. </table>
  318. <warning>
  319. Explicit use of one of the above extended script codes is not portable,
  320. and should be limited to use with FOP only.
  321. </warning>
  322. <note>
  323. When performing automatic script determination, FOP selects the OpenType Indic
  324. Version 2 script codes by default. If the author requires Version 1 behavior, then
  325. an explicit, non-extension script code should be specified in a governing <code>script</code>
  326. property.
  327. </note>
  328. </section>
  329. </section>
  330. <section id="language_property">
  331. <title>Language Property</title>
  332. <p>Certain fonts that support complex script features can make use of language information in order for
  333. language specific processing rules to be applied. For example, a font designed for the Arabic script may support
  334. typographic variations according to whether the written language is Arabic, Farsi (Persian), Sindhi, Urdu, or
  335. another language written with the Arabic script. In order to apply these language specific features, the author
  336. may explicitly mark the text with a <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#language"><code>language</code></link>
  337. property.</p>
  338. <p>When specifying the <code>language</code> property, the value of the property must be either an
  339. <link href="http://en.wikipedia.org/wiki/List_of_ISO_639-2_codes">ISO639-2 3-letter code</link> or an
  340. <link href="http://en.wikipedia.org/wiki/List_of_ISO_639-1_codes">ISO639-1 2-letter code</link>. Comparison of language
  341. codes is performed in a case-insensitive manner, so it does not matter what case is used when specifying these
  342. codes in an XSL-FO document.</p>
  343. </section>
  344. <section id="writing_mode_property">
  345. <title>Writing Mode Property</title>
  346. <p>The <code>writing-mode</code> property is used to determine the axes and direction of the inline
  347. progression direction, the block progression direction, the column progression direction (in tables
  348. and flows), the shift direction, region placement, the resolution of writing-mode relative property
  349. values (such as start, end, before, after), and the default block (paragraph) bidirectionality
  350. level.
  351. </p>
  352. <p>The <code>writing-mode</code> property is inherited, so it can appear on any XSL-FO element type; however,
  353. it applies (semantically) only to the following element types:
  354. </p>
  355. <ul>
  356. <li><code>fo:page-sequence</code></li>
  357. <li><code>fo:simple-page-master</code></li>
  358. <li><code>fo:region-*</code></li>
  359. <li><code>fo:block-container</code></li>
  360. <li><code>fo:inline-container</code></li>
  361. <li><code>fo:table</code></li>
  362. </ul>
  363. <p>If it is not specified on one of these element types, but is specified on an ancestor element, then the value specified on that
  364. ancestor element (the inherited value) is used; otherwise, the initial value <code>lr-tb</code> is used.</p>
  365. <p>At present, only the following values of the <code>writing-mode</code> property are supported:</p>
  366. <ul>
  367. <li><code>lr-tb</code></li>
  368. <li><code>rl-tb</code></li>
  369. <li><code>lr</code></li>
  370. <li><code>rl</code></li>
  371. </ul>
  372. <p>Writing modes that employ a vertical inline progression direction are not yet supported.</p>
  373. </section>
  374. <!-- [TBD] - NEED TO DOCUMENT
  375. <section id="number_conversion_properties">
  376. <title>Number Conversion Properties</title>
  377. </section>
  378. -->
  379. <section id="bidi_override_element">
  380. <title>Bidi Override Element</title>
  381. <p>The <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#fo_bidi-override"><code>fo:bidi-override</code></link>
  382. element may be used to override default bidirectional processing behavior,
  383. including default embedding levels and default character directionality. In the absence of either this element
  384. or use of explicit <link href="#bidi_controls">Bidi Control Characters</link>, the default behavior prescribed
  385. by the <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#fo_bidi-override">Unicode Bidirectional Algorithm</link>
  386. applies.
  387. </p>
  388. </section>
  389. <section id="bidi_controls">
  390. <title>Bidi Control Characters</title>
  391. <p>In addition to the use of the <link href="#bidi_override_element">Bidi Override Element</link>, an author
  392. may make use of the following explicit Unicode Bidi Control Characters:</p>
  393. <ul>
  394. <li>U+200E - LEFT-TO-RIGHT MARK (LRM)</li>
  395. <li>U+200F - RIGHT-TO-LEFT MARK (RLM)</li>
  396. <li>U+202A - LEFT-TO-RIGHT EMBEDDING (LRE)</li>
  397. <li>U+202B - RIGHT-TO-LEFT EMBEDDING (RLE)</li>
  398. <li>U+202C - POP DIRECTIONAL FORMATTING (PDF)</li>
  399. <li>U+202D - LEFT-TO-RIGHT OVERRIDE (LRO)</li>
  400. <li>U+202E - RIGHT-TO-LEFT OVERRIDE (RLO)</li>
  401. </ul>
  402. <p>If an embedding or override is not terminated (using U+202C PDF) prior to the end of a
  403. <link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#d0e4879">delimited text range</link>,
  404. then it is automatically terminated by FOP.
  405. </p>
  406. </section>
  407. <section id="join_controls">
  408. <title>Join Control Characters</title>
  409. <p>In order to prevent joining behavior in contexts where joining occurs by default,
  410. for example, between U+0628 ARABIC LETTER BEH and U+0646 ARABIC LETTER NOON,
  411. an author may used a U+200C ZERO WIDTH NON-JOINER (ZWNJ).
  412. </p>
  413. <p>Conversely, in order to force joining behavior in contexts where joining does not occur by default,
  414. for example, between U+0628 ARABIC LETTER BEH and U+0020 SPACE, an author may used a U+200D ZERO WIDTH JOINER (ZWJ).
  415. </p>
  416. <p>The behavior of ZWNJ and ZWJ is script specific. See
  417. <link href="http://www.unicode.org/versions/Unicode6.1.0/ch08.pdf">The Unicode Standard, Chapter 8,
  418. Middle Eastern Scripts</link> for information on the use of these control characters with the Arabic
  419. script. See
  420. <link href="http://www.unicode.org/versions/Unicode6.1.0/ch09.pdf">The Unicode Standard, Chapter 9,
  421. South Asian Scripts - I</link> for information on the use of these control characters with common
  422. Indic scripts.
  423. </p>
  424. </section>
  425. </section>
  426. <section id="supported_scripts">
  427. <title>Supported Scripts</title>
  428. <p>Support for specific complex scripts is enumerated in the following table. Support
  429. for those marked as not being supported is expected to be added in future revisions.</p>
  430. <table>
  431. <colgrp>
  432. <col style="width: 15%"/>
  433. <col style="width: 10%"/>
  434. <col style="width: 10%"/>
  435. <col style="width: 65%"/>
  436. </colgrp>
  437. <tr>
  438. <th style="text-align:left">Script</th>
  439. <th>Support</th>
  440. <th>Tested</th>
  441. <th style="text-align:left">Comments</th>
  442. </tr>
  443. <tr>
  444. <td><link href="http://en.wikipedia.org/wiki/Arabic_alphabet">Arabic</link></td>
  445. <td class="yes">full</td>
  446. <td class="yes">full</td>
  447. <td></td>
  448. </tr>
  449. <tr>
  450. <td><link href="http://en.wikipedia.org/wiki/Bengali_alphabet">Bengali</link></td>
  451. <td class="no">none</td>
  452. <td class="no">none</td>
  453. <td></td>
  454. </tr>
  455. <tr>
  456. <td><link href="http://en.wikipedia.org/wiki/Burmese_alphabet">Burmese</link></td>
  457. <td class="no">none</td>
  458. <td class="no">none</td>
  459. <td></td>
  460. </tr>
  461. <tr>
  462. <td><link href="http://en.wikipedia.org/wiki/Devanagari_alphabet">Devanagari</link></td>
  463. <td class="partial">partial</td>
  464. <td class="partial">partial</td>
  465. <td style="text-align:left">join controls (ZWJ, ZWNJ) not yet supported</td>
  466. </tr>
  467. <tr>
  468. <td><link href="http://en.wikipedia.org/wiki/Khmer_alphabet">Khmer</link></td>
  469. <td class="no">none</td>
  470. <td class="no">none</td>
  471. <td></td>
  472. </tr>
  473. <tr>
  474. <td><link href="http://en.wikipedia.org/wiki/Gujarati_alphabet">Gujarati</link></td>
  475. <td class="partial">partial</td>
  476. <td class="no">none</td>
  477. <td style="text-align:left">pre-alpha</td>
  478. </tr>
  479. <tr>
  480. <td><link href="http://en.wikipedia.org/wiki/Gurmukhī_alphabet">Gurmukhi</link></td>
  481. <td class="partial">partial</td>
  482. <td class="no">none</td>
  483. <td style="text-align:left">pre-alpha</td>
  484. </tr>
  485. <tr>
  486. <td><link href="http://en.wikipedia.org/wiki/Hebrew_alphabet">Hebrew</link></td>
  487. <td class="yes">full</td>
  488. <td class="partial">partial</td>
  489. <td></td>
  490. </tr>
  491. <tr>
  492. <td><link href="http://en.wikipedia.org/wiki/Kannada_alphabet">Kannada</link></td>
  493. <td class="no">none</td>
  494. <td class="no">none</td>
  495. <td></td>
  496. </tr>
  497. <tr>
  498. <td><link href="http://en.wikipedia.org/wiki/Lao_alphabet">Lao</link></td>
  499. <td class="no">none</td>
  500. <td class="no">none</td>
  501. <td></td>
  502. </tr>
  503. <tr>
  504. <td><link href="http://en.wikipedia.org/wiki/Malayalam_alphabet">Malayalam</link></td>
  505. <td class="no">none</td>
  506. <td class="no">none</td>
  507. <td></td>
  508. </tr>
  509. <tr>
  510. <td><link href="http://en.wikipedia.org/wiki/Mongolian_script">Mongolian</link></td>
  511. <td class="no">none</td>
  512. <td class="no">none</td>
  513. <td></td>
  514. </tr>
  515. <tr>
  516. <td><link href="http://en.wikipedia.org/wiki/Oriya_script">Oriya</link></td>
  517. <td class="no">none</td>
  518. <td class="no">none</td>
  519. <td></td>
  520. </tr>
  521. <tr>
  522. <td><link href="http://en.wikipedia.org/wiki/Tamil_alphabet">Tamil</link></td>
  523. <td class="no">none</td>
  524. <td class="no">none</td>
  525. <td></td>
  526. </tr>
  527. <tr>
  528. <td><link href="http://en.wikipedia.org/wiki/Telugu_alphabet">Telugu</link></td>
  529. <td class="no">none</td>
  530. <td class="no">none</td>
  531. <td></td>
  532. </tr>
  533. <tr>
  534. <td><link href="http://en.wikipedia.org/wiki/Tibetan_alphabet">Tibetan</link></td>
  535. <td class="no">none</td>
  536. <td class="no">none</td>
  537. <td></td>
  538. </tr>
  539. <tr>
  540. <td><link href="http://en.wikipedia.org/wiki/Thai_alphabet">Thai</link></td>
  541. <td class="no">none</td>
  542. <td class="no">none</td>
  543. <td></td>
  544. </tr>
  545. </table>
  546. </section>
  547. <section id="supported_fonts">
  548. <title>Supported Fonts</title>
  549. <p>Support for specific fonts is enumerated in the following sub-sections. If a given
  550. font is not listed, then it has not been tested with these complex scripts features.</p>
  551. <section id="fonts_arabic">
  552. <title>Arabic Fonts</title>
  553. <table>
  554. <colgrp>
  555. <col style="width: 15%"/>
  556. <col style="width: 10%"/>
  557. <col style="width: 10%"/>
  558. <col style="width: 65%"/>
  559. </colgrp>
  560. <tr>
  561. <th style="text-align:left">Font</th>
  562. <th style="text-align:left">Version</th>
  563. <th style="text-align:center">Glyphs</th>
  564. <th style="text-align:left">Comments</th>
  565. </tr>
  566. <tr>
  567. <td><link href="http://www.microsoft.com/typography/fonts/family.aspx?FID=24">Arial Unicode MS</link></td>
  568. <td class="yes" style="text-align:left"><link href="http://www.microsoft.com/typography/fonts/font.aspx?FMID=1081">1.01</link></td>
  569. <td style="text-align:center">50377</td>
  570. <td>limited GPOS support</td>
  571. </tr>
  572. <tr>
  573. <td><link href="http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&amp;id=ArabicFonts">Lateef</link></td>
  574. <td class="yes" style="text-align:left">1.0</td>
  575. <td style="text-align:center">1147</td>
  576. <td>language features for Kurdish (KUR), Sindhi (SND), Urdu (URD)</td>
  577. </tr>
  578. <tr>
  579. <td><link href="http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&amp;id=ArabicFonts">Scheherazade</link></td>
  580. <td class="yes" style="text-align:left">1.0</td>
  581. <td style="text-align:center">1197</td>
  582. <td>language features for Kurdish (KUR), Sindhi (SND), Urdu (URD)</td>
  583. </tr>
  584. <tr>
  585. <td><link href="http://www.microsoft.com/typography/fonts/family.aspx?FID=261">Simplified Arabic</link></td>
  586. <td class="no" style="text-align:left"><link href="http://www.microsoft.com/typography/fonts/font.aspx?FMID=867">1.01</link></td>
  587. <td/>
  588. <td>contains invalid, out of order coverage table entries</td>
  589. </tr>
  590. <tr>
  591. <td><link href="http://www.microsoft.com/typography/fonts/font.aspx?FID=261">Simplified Arabic</link></td>
  592. <td class="yes" style="text-align:left"><link href="http://www.microsoft.com/typography/fonts/font.aspx?FMID=1645">5.00</link></td>
  593. <td style="text-align:center">414</td>
  594. <td>lacks GPOS support</td>
  595. </tr>
  596. <tr>
  597. <td><link href="http://www.microsoft.com/typography/fonts/font.aspx?FID=261">Simplified Arabic</link></td>
  598. <td class="yes" style="text-align:left">5.92</td>
  599. <td style="text-align:center">473</td>
  600. <td>includes GPOS for advanced position adjustment</td>
  601. </tr>
  602. <tr>
  603. <td><link href="http://www.microsoft.com/typography/fonts/family.aspx?FID=264">Traditional Arabic</link></td>
  604. <td class="yes" style="text-align:left"><link href="http://www.microsoft.com/typography/fonts/family.aspx?FMID=877">1.01</link></td>
  605. <td style="text-align:center">530</td>
  606. <td>lacks GPOS support</td>
  607. </tr>
  608. <tr>
  609. <td><link href="http://www.microsoft.com/typography/fonts/font.aspx?FID=264">Traditional Arabic</link></td>
  610. <td class="yes" style="text-align:left"><link href="http://www.microsoft.com/typography/fonts/font.aspx?FMID=1658">5.00</link></td>
  611. <td style="text-align:center">530</td>
  612. <td>lacks GPOS support</td>
  613. </tr>
  614. <tr>
  615. <td><link href="http://www.microsoft.com/typography/fonts/font.aspx?FID=264">Traditional Arabic</link></td>
  616. <td class="yes" style="text-align:left">5.92</td>
  617. <td style="text-align:center">589</td>
  618. <td>includes GPOS for advanced position adjustment</td>
  619. </tr>
  620. </table>
  621. </section>
  622. <section id="devanagari_fonts">
  623. <title>Devanagari Fonts</title>
  624. <table>
  625. <colgrp>
  626. <col style="width: 15%"/>
  627. <col style="width: 10%"/>
  628. <col style="width: 10%"/>
  629. <col style="width: 65%"/>
  630. </colgrp>
  631. <tr>
  632. <th style="text-align:left">Font</th>
  633. <th style="text-align:left">Version</th>
  634. <th style="text-align:center">Glyphs</th>
  635. <th style="text-align:left">Comments</th>
  636. </tr>
  637. <tr>
  638. <td><link href="http://www.microsoft.com/typography/fonts/family.aspx?FID=370">Aparajita</link></td>
  639. <td class="yes" style="text-align:left"><link href="http://www.microsoft.com/typography/fonts/font.aspx?FMID=1700">1.00</link></td>
  640. <td style="text-align:center">706</td>
  641. <td/>
  642. </tr>
  643. <tr>
  644. <td><link href="http://www.microsoft.com/typography/fonts/family.aspx?FID=374">Kokila</link></td>
  645. <td class="yes" style="text-align:left"><link href="http://www.microsoft.com/typography/fonts/font.aspx?FMID=1749">1.00</link></td>
  646. <td style="text-align:center">706</td>
  647. <td/>
  648. </tr>
  649. <tr>
  650. <td><link href="http://www.microsoft.com/typography/fonts/family.aspx?FID=243">Mangal</link></td>
  651. <td class="yes" style="text-align:left"><link href="http://www.microsoft.com/typography/fonts/font.aspx?FMID=1759">5.01</link></td>
  652. <td style="text-align:center">885</td>
  653. <td>designed for use in user interfaces</td>
  654. </tr>
  655. <tr>
  656. <td><link href="http://www.microsoft.com/typography/fonts/family.aspx?FID=384">Utsaah</link></td>
  657. <td class="yes" style="text-align:left"><link href="http://www.microsoft.com/typography/fonts/font.aspx?FMID=1811">1.00</link></td>
  658. <td style="text-align:center">706</td>
  659. <td/>
  660. </tr>
  661. </table>
  662. </section>
  663. </section>
  664. <section id="limitations">
  665. <title>Other Limitations</title>
  666. <p>
  667. Complex scripts support in Apache FOP is relatively new, so there are certain
  668. limitations. Please help us identify and close any gaps.
  669. </p>
  670. <ul>
  671. <li>Only the PDF output format fully supports complex scripts features at the present time.</li>
  672. <li>Shaping context does not extend across an element boundary. This limitation prevents the use of
  673. <code>fo:character</code>, <code>fo:inline</code> or <code>fo:wrapper</code> in order to colorize
  674. individual Arabic letters without affecting shaping behavior across the element boundary.</li>
  675. </ul>
  676. </section>
  677. <section id="links">
  678. <title>Related Links</title>
  679. <p>
  680. In addition to the XSL-FO specification, a number of external resources provide
  681. guidance about authoring documents that employ complex scripts and the features
  682. described above:
  683. </p>
  684. <ul>
  685. <li><link href="http://www.unicode.org/">The Unicode Standard</link></li>
  686. <li><link href="http://www.w3.org/TR/2006/REC-xsl11-20061205/#fo_bidi-override">Unicode Bidirectional Algorithm</link></li>
  687. <li><link href="http://www.microsoft.com/typography/otspec/ttochap1.htm">OpenType Advanced Typographic Extensions</link></li>
  688. <li><link href="http://scripts.sil.org/cms/scripts/page.php?site_id=nrsi&amp;item_id=CmplxRndExamples">Examples of Complex Rendering</link></li>
  689. </ul>
  690. </section>
  691. </body>
  692. </document>