<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>http://okapiframework.org/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Jhargraveiii</id>
	<title>Okapi Framework - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="http://okapiframework.org/wiki/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Jhargraveiii"/>
	<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php/Special:Contributions/Jhargraveiii"/>
	<updated>2026-04-18T22:45:02Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.38.2</generator>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Distributions&amp;diff=1056</id>
		<title>Distributions</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Distributions&amp;diff=1056"/>
		<updated>2026-01-16T20:43:15Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Main Project */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Main Project ==&lt;br /&gt;
&lt;br /&gt;
This includes platform-specific distributions for the applications (Rainbow, Tikal, Ratel, etc.) as well as the libraries-only package (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.48.0 - Jan 16, 2026&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/main/1.47.0 https://okapiframework.org/binaries/main/1.47.0]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/main https://okapiframework.org/binaries/main]&lt;br /&gt;
* Release artifacts on Maven Central: https://search.maven.org/search?q=net.sf.okapi&lt;br /&gt;
* Changes log: [https://okapiframework.org/binaries/main/changes/files/1.47.0.html https://okapiframework.org/binaries/main/changes/files/1.47.0.html]&lt;br /&gt;
&lt;br /&gt;
Snapshots:&lt;br /&gt;
&lt;br /&gt;
* Latest Development Snapshots (nightly builds): [https://gitlab.com/okapiframework/Okapi/-/pipelines https://gitlab.com/okapiframework/Okapi/-/pipelines]&lt;br /&gt;
&lt;br /&gt;
== Longhorn ==&lt;br /&gt;
&lt;br /&gt;
This includes the Longhorn distributions (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.44.0 - Jan 16, 2023&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.44.0.zip okapi-longhorn_all-platforms_1.44.0.zip]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/longhorn https://okapiframework.org/binaries/longhorn]&lt;br /&gt;
&lt;br /&gt;
== OmegaT Filter Plugin ==&lt;br /&gt;
&lt;br /&gt;
This includes the Filters Plugin for OmegaT (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.13-1.45.0 - Feb 26, 2023&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.13-1.45.0-dist.zip okapiFiltersForOmegaT-1.13-1.45.0-dist.zip]&lt;br /&gt;
* All releases: [https://okapiframework.org/binaries/omegat-plugin https://okapiframework.org/binaries/omegat-plugin]&lt;br /&gt;
&lt;br /&gt;
== Ocelot ==&lt;br /&gt;
&lt;br /&gt;
This includes the  Review Workbench application Ocelot (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;3.0 - Oct 17, 2017&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/ocelot/Ocelot-3.0.jar Ocelot-3.0.jar]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/ocelot https://okapiframework.org/binaries/ocelot]&lt;br /&gt;
&lt;br /&gt;
== Archives ==&lt;br /&gt;
&lt;br /&gt;
Older distributions that are not included above.&lt;br /&gt;
&lt;br /&gt;
* [https://okapiframework.org/binaries/archives Archives (https://okapiframework.org/binaries/archives)]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=1055</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=1055"/>
		<updated>2026-01-16T20:42:20Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Downloads &amp;amp;mdash; Stable Release */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; padding:5px; padding-left:10px; background:#ffffff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
The Okapi Framework is a '''free''', '''open-source''' and '''cross-platform''' set of components and applications designed to help your '''localization''' and '''translation''' processes.&amp;lt;br&amp;gt;&lt;br /&gt;
The project uses and promotes '''[[Open Standards|open standards]]''' such as [[XLIFF]], [[TMX]], [[SRX]], [[ITS]] and other formats. It is an [[Open_Standards#OAXAL_1.0_Conformance_Statement|OAXAL 1.0 implementation]].&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; margin-top: 0px; padding:5px; padding-left:10px; background-color: yellow; color: red !important; font-weight: bold; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
[[About the Log4j2 vulnerabilities (2021)]]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-right:5px; padding:10px; padding-top:0; background:#ffffdf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Users Help and Documentation==&lt;br /&gt;
* '''[[Getting Started]]''' &amp;amp;mdash; all you need to know to get started using the Okapi tools&lt;br /&gt;
* '''[[FAQ]]''' &amp;amp;mdash; the answers to some of the most asked questions&lt;br /&gt;
* '''[[Online Help]]''' &amp;amp;mdash; the applications online help pages&lt;br /&gt;
* '''[[Knowledge Base]]''' &amp;amp;mdash; various articles on how to work with the Okapi tools&lt;br /&gt;
* '''[[Tutorials]]''' &amp;amp;mdash; step-by-step tutorials on using the Okapi tools&lt;br /&gt;
* '''[[Glossary]]''' &amp;amp;mdash; list of the main terms used in the help and documentation&lt;br /&gt;
* '''[[Screenshots]]''' &amp;amp;mdash; of the applications on different platforms3&lt;br /&gt;
&lt;br /&gt;
Additional support:&lt;br /&gt;
* '''[https://groups.google.com/forum/#!forum/okapi-users Users Group]''' &amp;amp;mdash; users group and mailing list (or '''[http://twitter.com/OkapiFramework follow us on Twitter]''')&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/Okapi/-/issues Bug Report &amp;amp; Feature Request]''' &amp;amp;mdash; list of bug reports and feature requests(Login required to post one)&lt;br /&gt;
* '''[mailto:okapitools@opentag.com&amp;amp;subject=Feedback Send Feedback]''' &amp;amp;mdash; email some feedback (no login required)&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;padding:10px; margin-right:5px; padding-top:0; background:#f9f9ff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Applications==&lt;br /&gt;
* '''[[Rainbow]]''' &amp;amp;mdash; a toolbox to launch a large variety of localization tasks&lt;br /&gt;
* '''[[CheckMate]]''' &amp;amp;mdash; an application to perform quality checks on bilingual files&lt;br /&gt;
* '''[[Tikal]]''' &amp;amp;mdash; a command-line tool for basic localization tasks&lt;br /&gt;
* '''[[Ratel]]''' &amp;amp;mdash; a WYSIWYG editor to create, test and maintain SRX segmentation rules&lt;br /&gt;
* '''[[Longhorn]]''' &amp;amp;mdash; a batch processing server&lt;br /&gt;
* '''[http://github.com/vistatec/ocelot/blob/master/README.md Ocelot]''' &amp;amp;mdash; a XLIFF translator/reviewer workbench&lt;br /&gt;
&lt;br /&gt;
==Components==&lt;br /&gt;
* '''[[Filters]]''' &amp;amp;mdash; extract translatable text from various file formats&lt;br /&gt;
* '''[[Steps|Pipelines and Steps]]''' &amp;amp;mdash; perform a chain of user-defined tasks on input documents&lt;br /&gt;
* '''[[Connectors|Translation Resource Connectors]]''' &amp;amp;mdash; provide access to MT, TM, term-bases, etc.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-left:5px; padding:10px; padding-top:0; background:#dfffe8; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Downloads &amp;amp;mdash; Stable Release==&lt;br /&gt;
The latest stable release of Rainbow, CheckMate, Tikal, Ratel, etc. ('''1.48.0 - Jan 16, 2026'''):&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.47.0/okapi-apps_win32-x86_64_1.47.0.zip Windows 64-bit]''' &amp;amp;mdash; Windows platform&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.47.0/okapi-apps_gtk2-linux-x86_64_1.47.0.zip Linux 64-bit]''' &amp;amp;mdash; Linux platform (gtk2, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.47.0/okapi-apps_cocoa-macosx-x86_64_1.47.0.dmg Macintosh Intel 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.47.0/okapi-apps_cocoa-macosx-aarch64_1.47.0.dmg Macintosh Arm 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Apple M1 chip)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.43.0.zip Longhorn]''' &amp;amp;mdash; batch processing server ([[Longhorn|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.15-1.47.0-dist.zip Plugin for OmegaT]''' &amp;amp;mdash; filters plugin for OmegaT ([[Okapi Filters Plugin for OmegaT|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/ocelot Ocelot (v3.0)]''' &amp;amp;mdash; XLIFF reviewer workbench ([http://github.com/vistatec/ocelot/blob/master/README.md More information...)]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/changes Changes Log]''' &amp;amp;mdash; list of the changes between versions&lt;br /&gt;
* '''https://okapiframework.org/wiki/index.php/Distributions''' &amp;amp;mdash; all downloads&lt;br /&gt;
* '''https://search.maven.org/search?q=net.sf.okapi''' &amp;amp;mdash; release artifacts on Maven Central&lt;br /&gt;
&lt;br /&gt;
==Downloads &amp;amp;mdash; Development Release==&lt;br /&gt;
For testers and advanced users, the latest development snapshots are here:&lt;br /&gt;
* Maven artifacts snapshots, nightly build: '''https://oss.sonatype.org/content/repositories/snapshots'''&lt;br /&gt;
* Latest snapshot of the distributions: '''[https://gitlab.com/okapiframework/Okapi/-/pipelines Nightly build on GitLab]'''&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-left:5px; padding:10px; padding-top:0; background:#ffe8bf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Developers==&lt;br /&gt;
{| style=&amp;quot;background:#ffe8bf;&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot; padding=&amp;quot;0&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/Okapi/ Project Git repository]''' at GitLab&lt;br /&gt;
* '''[http://okapiframework.org/devguide/index.html Developer's Guide]'''&lt;br /&gt;
* '''[http://okapiframework.org/javadoc/index.html API Documentation]'''&lt;br /&gt;
* '''[http://groups.google.com/group/okapi-devel/ Developers Group and Mailing List]''' &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;&lt;br /&gt;
* '''[[About the Log4j2 vulnerabilities (2021)]]'''&lt;br /&gt;
|&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/okapi/pipelines Continuous Build on GitLab]'''&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/Okapi/-/wikis/Consultancy Consultancy resources]'''&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/Okapi/-/issues Bugs and Requests]''' tracking list&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Post-segmentation_Inline_Codes_Removal_Step&amp;diff=1046</id>
		<title>Post-segmentation Inline Codes Removal Step</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Post-segmentation_Inline_Codes_Removal_Step&amp;diff=1046"/>
		<updated>2025-09-24T02:09:56Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Limitations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Steps Header}}&lt;br /&gt;
__TOC__&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
This step attempts to simplify (trim and merge) as many inline codes as possible by looking at each linguistically distinct segment in a TextUnit.&lt;br /&gt;
&lt;br /&gt;
'''The step must be run after segmentation.''' Joins adjacent inline codes inside segments, and optionally moves leading and trailing codes from the segment to an inter-segment Textpart. Original (un-merged) codes are saved as okp:merged attributes inside the generated XLIFF file. Trimmed codes are simply written outside the &amp;quot;mrk&amp;quot; elements.&lt;br /&gt;
&lt;br /&gt;
Takes: Filter Events. Sends: Filter Events.&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Remove leading and trailing codes&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to remove leading and trailing inline codes from the text units and place them outside the segment.&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
Currently bi-lingual formats such as XLIFF, TMX, TTX etc. will not have their codes simplified as the codes may differ in source and target. Codes must align with id's across source and target.&lt;br /&gt;
&lt;br /&gt;
[[Category:Steps]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=MIF_Filter&amp;diff=1026</id>
		<title>MIF Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=MIF_Filter&amp;diff=1026"/>
		<updated>2025-02-19T16:44:14Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Limitations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
This filter allows you to process MIF documents. MIF (Maker Interchange Format) generated and read by Adobe FrameMaker. The specification for MIF 9.0 can be found [http://help.adobe.com/en_US/FrameMaker/9.0/MIF_Reference/MIF_Reference.pdf on the Adobe Web site].&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
===Input Encoding===&lt;br /&gt;
&lt;br /&gt;
The encoding of the input MIF document is automatically detected based on the version of the file and different other information in the document.&lt;br /&gt;
&lt;br /&gt;
MIF v8 and above use UTF-8.&lt;br /&gt;
&lt;br /&gt;
===Output Encoding===&lt;br /&gt;
&lt;br /&gt;
MIF v8 and above are automatically output in UTF-8.&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
===Options Tab===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract variables&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the definitions of the variables.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract index markers&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the index markers in the extractable pages. The text of each index entry is extracted in a separate text unit, before the text unit that contains the index marker.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract links&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract URLs of the the links in the extractable pages. Each URL is extracted in a separate text unit, before the text unit that contains the hypertext marker.&lt;br /&gt;
&lt;br /&gt;
====Type of page to extract====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Body pages&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the body pages.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Hidden pages&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the hidden pages.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Master pages&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the master pages.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Reference pages&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the reference pages. Note that by default FrameMaker creates its new documents with several reference pages that contain text.&lt;br /&gt;
&lt;br /&gt;
===Inline Codes Tab===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Has inline codes as defined below&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to use the specified regular expressions on the text of the extracted items. Any match will be converted to an inline code. By default the expression is:&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;\$.*?&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{{CodeFinder Help}}&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
* Versions &amp;lt; MIF 8.0 are not supported.&lt;br /&gt;
* You may run into Java heap memory issue if the document includes very large embedded insets (e.g. images). The workaround for this is to link to external objects rather than embed them.&lt;br /&gt;
* The filter does not do font mapping yet, so if the translated file is in a language not supported by the fonts used in the source document, you need to update the paragraph and character catalogs to use fonts providing the proper support.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=MIF_Filter&amp;diff=1025</id>
		<title>MIF Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=MIF_Filter&amp;diff=1025"/>
		<updated>2025-02-19T16:43:08Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Processing Details */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
This filter allows you to process MIF documents. MIF (Maker Interchange Format) generated and read by Adobe FrameMaker. The specification for MIF 9.0 can be found [http://help.adobe.com/en_US/FrameMaker/9.0/MIF_Reference/MIF_Reference.pdf on the Adobe Web site].&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
===Input Encoding===&lt;br /&gt;
&lt;br /&gt;
The encoding of the input MIF document is automatically detected based on the version of the file and different other information in the document.&lt;br /&gt;
&lt;br /&gt;
MIF v8 and above use UTF-8.&lt;br /&gt;
&lt;br /&gt;
===Output Encoding===&lt;br /&gt;
&lt;br /&gt;
MIF v8 and above are automatically output in UTF-8.&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
===Options Tab===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract variables&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the definitions of the variables.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract index markers&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the index markers in the extractable pages. The text of each index entry is extracted in a separate text unit, before the text unit that contains the index marker.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract links&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract URLs of the the links in the extractable pages. Each URL is extracted in a separate text unit, before the text unit that contains the hypertext marker.&lt;br /&gt;
&lt;br /&gt;
====Type of page to extract====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Body pages&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the body pages.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Hidden pages&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the hidden pages.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Master pages&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the master pages.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Reference pages&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the reference pages. Note that by default FrameMaker creates its new documents with several reference pages that contain text.&lt;br /&gt;
&lt;br /&gt;
===Inline Codes Tab===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Has inline codes as defined below&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to use the specified regular expressions on the text of the extracted items. Any match will be converted to an inline code. By default the expression is:&lt;br /&gt;
&lt;br /&gt;
 &amp;lt;\$.*?&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{{CodeFinder Help}}&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
* This filter is BETA.&lt;br /&gt;
* Support for MIF 7.0 with non-Latin-1 language is limited: you may get corrupted extended characters in some cases.&lt;br /&gt;
* You may run into Java heap memory issue if the document includes very large embedded insets (e.g. images). The workaround for this is to link to external objects rather than embed them.&lt;br /&gt;
* The filter does not do font mapping yet, so if the translated file is in a language not supported by the fonts used in the source document, you need to update the paragraph and character catalogs to use fonts providing the proper support.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=1000</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=1000"/>
		<updated>2024-01-03T00:23:04Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Downloads &amp;amp;mdash; Stable Release (master branch) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; padding:5px; padding-left:10px; background:#ffffff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
The Okapi Framework is a '''free''', '''open-source''' and '''cross-platform''' set of components and applications designed to help your '''localization''' and '''translation''' processes.&amp;lt;br&amp;gt;&lt;br /&gt;
The project uses and promotes '''[[Open Standards|open standards]]''' such as [[XLIFF]], [[TMX]], [[SRX]], [[ITS]] and other formats. It is an [[Open_Standards#OAXAL_1.0_Conformance_Statement|OAXAL 1.0 implementation]].&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; margin-top: 0px; padding:5px; padding-left:10px; background-color: yellow; color: red !important; font-weight: bold; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
[[About the Log4j2 vulnerabilities]]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-right:5px; padding:10px; padding-top:0; background:#ffffdf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Users Help and Documentation==&lt;br /&gt;
* '''[[Getting Started]]''' &amp;amp;mdash; all you need to know to get started using the Okapi tools&lt;br /&gt;
* '''[[FAQ]]''' &amp;amp;mdash; the answers to some of the most asked questions&lt;br /&gt;
* '''[[Online Help]]''' &amp;amp;mdash; the applications online help pages&lt;br /&gt;
* '''[[Knowledge Base]]''' &amp;amp;mdash; various articles on how to work with the Okapi tools&lt;br /&gt;
* '''[[Tutorials]]''' &amp;amp;mdash; step-by-step tutorials on using the Okapi tools&lt;br /&gt;
* '''[[Glossary]]''' &amp;amp;mdash; list of the main terms used in the help and documentation&lt;br /&gt;
* '''[[Screenshots]]''' &amp;amp;mdash; of the applications on different platforms3&lt;br /&gt;
&lt;br /&gt;
Additional support:&lt;br /&gt;
* '''[https://groups.google.com/forum/#!forum/okapi-users Users Group]''' &amp;amp;mdash; users group and mailing list (or '''[http://twitter.com/OkapiFramework follow us on Twitter]''')&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bug Report]''' &amp;amp;mdash; list of bug reports (Login required to post one)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Feature Request]''' &amp;amp;mdash; list of feature requests (Login required to post one)&lt;br /&gt;
* '''[mailto:okapitools@opentag.com&amp;amp;subject=Feedback Send Feedback]''' &amp;amp;mdash; email some feedback (no login required)&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;padding:10px; margin-right:5px; padding-top:0; background:#f9f9ff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Applications==&lt;br /&gt;
* '''[[Rainbow]]''' &amp;amp;mdash; a toolbox to launch a large variety of localization tasks&lt;br /&gt;
* '''[[CheckMate]]''' &amp;amp;mdash; an application to perform quality checks on bilingual files&lt;br /&gt;
* '''[[Tikal]]''' &amp;amp;mdash; a command-line tool for basic localization tasks&lt;br /&gt;
* '''[[Ratel]]''' &amp;amp;mdash; a WYSIWYG editor to create, test and maintain SRX segmentation rules&lt;br /&gt;
* '''[[Longhorn]]''' &amp;amp;mdash; a batch processing server&lt;br /&gt;
* '''[http://github.com/vistatec/ocelot/blob/master/README.md Ocelot]''' &amp;amp;mdash; a XLIFF translator/reviewer workbench&lt;br /&gt;
&lt;br /&gt;
==Components==&lt;br /&gt;
* '''[[Filters]]''' &amp;amp;mdash; extract translatable text from various file formats&lt;br /&gt;
* '''[[Steps|Pipelines and Steps]]''' &amp;amp;mdash; perform a chain of user-defined tasks on input documents&lt;br /&gt;
* '''[[Connectors|Translation Resource Connectors]]''' &amp;amp;mdash; provide access to MT, TM, term-bases, etc.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-left:5px; padding:10px; padding-top:0; background:#dfffe8; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Downloads &amp;amp;mdash; Stable Release ([https://bitbucket.org/okapiframework/okapi/src/master master branch])==&lt;br /&gt;
The latest stable release of Rainbow, CheckMate, Tikal, Ratel, etc. ('''1.46.0 - Jan 2, 2024'''):&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_win32-x86_64_1.46.0.zip Windows 64-bit]''' &amp;amp;mdash; Windows platform&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_gtk2-linux-x86_64_1.46.0.zip Linux 64-bit]''' &amp;amp;mdash; Linux platform (gtk2, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_cocoa-macosx-x86_64_1.46.0.dmg Macintosh Intel 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_cocoa-macosx-aarch64_1.46.0.dmg Macintosh Arm 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Apple M1 chip)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.43.0.zip Longhorn]''' &amp;amp;mdash; batch processing server ([[Longhorn|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.13-1.46.0-dist.zip Plugin for OmegaT]''' &amp;amp;mdash; filters plugin for OmegaT ([[Okapi Filters Plugin for OmegaT|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/ocelot Ocelot (v3.0)]''' &amp;amp;mdash; XLIFF reviewer workbench ([http://github.com/vistatec/ocelot/blob/master/README.md More information...)]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/changes Changes Log]''' &amp;amp;mdash; list of the changes between versions&lt;br /&gt;
* '''https://okapiframework.org/wiki/index.php/Distributions''' &amp;amp;mdash; all downloads&lt;br /&gt;
* '''https://search.maven.org/search?q=net.sf.okapi''' &amp;amp;mdash; release artifacts on Maven Central&lt;br /&gt;
&lt;br /&gt;
==Downloads &amp;amp;mdash; Development Release ([https://bitbucket.org/okapiframework/okapi/src/dev/ dev branch])==&lt;br /&gt;
For testers and advanced users, the latest development snapshots are here:&lt;br /&gt;
* Maven artifacts snapshots, nightly build: '''https://oss.sonatype.org/content/repositories/snapshots'''&lt;br /&gt;
* Latest snapshot of the distributions: '''[https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done?job=verification:jdk11 Nightly build on GitLab]'''&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-left:5px; padding:10px; padding-top:0; background:#ffe8bf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Developers==&lt;br /&gt;
{| style=&amp;quot;background:#ffe8bf;&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot; padding=&amp;quot;0&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/ Project Home]''' at Bitbucket&lt;br /&gt;
* '''[http://okapiframework.org/devguide/index.html Developer's Guide]'''&lt;br /&gt;
* '''[http://okapiframework.org/javadoc/index.html API Documentation]'''&lt;br /&gt;
* '''[http://groups.google.com/group/okapi-devel/ Developers Group and Mailing List]''' &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/src Git repository]''' of the source code&lt;br /&gt;
* '''[[About the Log4j2 vulnerabilities]]'''&lt;br /&gt;
|&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/okapi/pipelines Continuous Build on GitLab]'''&lt;br /&gt;
* List of the '''[https://bitbucket.org/okapiframework/okapi/src/dev/ Latest Code Changes]''' (dev branch)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/wiki/Consultancy Consultancy resources]'''&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bugs and Requests]''' tracking list&lt;br /&gt;
* Notifications: '''[https://bitbucket.org/okapiframework/okapi/commits/all Commits]''' and '''[http://groups.google.com/group/okapi-builds CB Errors]'''&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=999</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=999"/>
		<updated>2024-01-02T19:03:13Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Downloads &amp;amp;mdash; Stable Release (master branch) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; padding:5px; padding-left:10px; background:#ffffff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
The Okapi Framework is a '''free''', '''open-source''' and '''cross-platform''' set of components and applications designed to help your '''localization''' and '''translation''' processes.&amp;lt;br&amp;gt;&lt;br /&gt;
The project uses and promotes '''[[Open Standards|open standards]]''' such as [[XLIFF]], [[TMX]], [[SRX]], [[ITS]] and other formats. It is an [[Open_Standards#OAXAL_1.0_Conformance_Statement|OAXAL 1.0 implementation]].&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; margin-top: 0px; padding:5px; padding-left:10px; background-color: yellow; color: red !important; font-weight: bold; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
[[About the Log4j2 vulnerabilities]]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-right:5px; padding:10px; padding-top:0; background:#ffffdf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Users Help and Documentation==&lt;br /&gt;
* '''[[Getting Started]]''' &amp;amp;mdash; all you need to know to get started using the Okapi tools&lt;br /&gt;
* '''[[FAQ]]''' &amp;amp;mdash; the answers to some of the most asked questions&lt;br /&gt;
* '''[[Online Help]]''' &amp;amp;mdash; the applications online help pages&lt;br /&gt;
* '''[[Knowledge Base]]''' &amp;amp;mdash; various articles on how to work with the Okapi tools&lt;br /&gt;
* '''[[Tutorials]]''' &amp;amp;mdash; step-by-step tutorials on using the Okapi tools&lt;br /&gt;
* '''[[Glossary]]''' &amp;amp;mdash; list of the main terms used in the help and documentation&lt;br /&gt;
* '''[[Screenshots]]''' &amp;amp;mdash; of the applications on different platforms3&lt;br /&gt;
&lt;br /&gt;
Additional support:&lt;br /&gt;
* '''[https://groups.google.com/forum/#!forum/okapi-users Users Group]''' &amp;amp;mdash; users group and mailing list (or '''[http://twitter.com/OkapiFramework follow us on Twitter]''')&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bug Report]''' &amp;amp;mdash; list of bug reports (Login required to post one)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Feature Request]''' &amp;amp;mdash; list of feature requests (Login required to post one)&lt;br /&gt;
* '''[mailto:okapitools@opentag.com&amp;amp;subject=Feedback Send Feedback]''' &amp;amp;mdash; email some feedback (no login required)&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;padding:10px; margin-right:5px; padding-top:0; background:#f9f9ff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Applications==&lt;br /&gt;
* '''[[Rainbow]]''' &amp;amp;mdash; a toolbox to launch a large variety of localization tasks&lt;br /&gt;
* '''[[CheckMate]]''' &amp;amp;mdash; an application to perform quality checks on bilingual files&lt;br /&gt;
* '''[[Tikal]]''' &amp;amp;mdash; a command-line tool for basic localization tasks&lt;br /&gt;
* '''[[Ratel]]''' &amp;amp;mdash; a WYSIWYG editor to create, test and maintain SRX segmentation rules&lt;br /&gt;
* '''[[Longhorn]]''' &amp;amp;mdash; a batch processing server&lt;br /&gt;
* '''[http://github.com/vistatec/ocelot/blob/master/README.md Ocelot]''' &amp;amp;mdash; a XLIFF translator/reviewer workbench&lt;br /&gt;
&lt;br /&gt;
==Components==&lt;br /&gt;
* '''[[Filters]]''' &amp;amp;mdash; extract translatable text from various file formats&lt;br /&gt;
* '''[[Steps|Pipelines and Steps]]''' &amp;amp;mdash; perform a chain of user-defined tasks on input documents&lt;br /&gt;
* '''[[Connectors|Translation Resource Connectors]]''' &amp;amp;mdash; provide access to MT, TM, term-bases, etc.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-left:5px; padding:10px; padding-top:0; background:#dfffe8; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Downloads &amp;amp;mdash; Stable Release ([https://bitbucket.org/okapiframework/okapi/src/master master branch])==&lt;br /&gt;
The latest stable release of Rainbow, CheckMate, Tikal, Ratel, etc. ('''1.46.0 - Jan 2, 2024'''):&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_win32-x86_1.46.0.zip Windows 32-bit]''' or '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_win32-x86_64_1.46.0.zip Windows 64-bit]''' &amp;amp;mdash; Windows platform&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_gtk2-linux-x86_1.46.0.zip Linux 32-bit]''' or '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_gtk2-linux-x86_64_1.46.0.zip Linux 64-bit]''' &amp;amp;mdash; Linux platform (gtk2, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_cocoa-macosx-x86_64_1.46.0.dmg Macintosh Intel 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_cocoa-macosx-aarch64_1.46.0.dmg Macintosh Arm 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Apple M1 chip)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.43.0.zip Longhorn]''' &amp;amp;mdash; batch processing server ([[Longhorn|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.13-1.46.0-dist.zip Plugin for OmegaT]''' &amp;amp;mdash; filters plugin for OmegaT ([[Okapi Filters Plugin for OmegaT|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/ocelot Ocelot (v3.0)]''' &amp;amp;mdash; XLIFF reviewer workbench ([http://github.com/vistatec/ocelot/blob/master/README.md More information...)]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/changes Changes Log]''' &amp;amp;mdash; list of the changes between versions&lt;br /&gt;
* '''https://okapiframework.org/wiki/index.php/Distributions''' &amp;amp;mdash; all downloads&lt;br /&gt;
* '''https://search.maven.org/search?q=net.sf.okapi''' &amp;amp;mdash; release artifacts on Maven Central&lt;br /&gt;
&lt;br /&gt;
==Downloads &amp;amp;mdash; Development Release ([https://bitbucket.org/okapiframework/okapi/src/dev/ dev branch])==&lt;br /&gt;
For testers and advanced users, the latest development snapshots are here:&lt;br /&gt;
* Maven artifacts snapshots, nightly build: '''https://oss.sonatype.org/content/repositories/snapshots'''&lt;br /&gt;
* Latest snapshot of the distributions: '''[https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done?job=verification:jdk11 Nightly build on GitLab]'''&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-left:5px; padding:10px; padding-top:0; background:#ffe8bf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Developers==&lt;br /&gt;
{| style=&amp;quot;background:#ffe8bf;&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot; padding=&amp;quot;0&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/ Project Home]''' at Bitbucket&lt;br /&gt;
* '''[http://okapiframework.org/devguide/index.html Developer's Guide]'''&lt;br /&gt;
* '''[http://okapiframework.org/javadoc/index.html API Documentation]'''&lt;br /&gt;
* '''[http://groups.google.com/group/okapi-devel/ Developers Group and Mailing List]''' &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/src Git repository]''' of the source code&lt;br /&gt;
* '''[[About the Log4j2 vulnerabilities]]'''&lt;br /&gt;
|&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/okapi/pipelines Continuous Build on GitLab]'''&lt;br /&gt;
* List of the '''[https://bitbucket.org/okapiframework/okapi/src/dev/ Latest Code Changes]''' (dev branch)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/wiki/Consultancy Consultancy resources]'''&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bugs and Requests]''' tracking list&lt;br /&gt;
* Notifications: '''[https://bitbucket.org/okapiframework/okapi/commits/all Commits]''' and '''[http://groups.google.com/group/okapi-builds CB Errors]'''&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=998</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=998"/>
		<updated>2024-01-02T19:02:33Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Downloads &amp;amp;mdash; Stable Release (master branch) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; padding:5px; padding-left:10px; background:#ffffff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
The Okapi Framework is a '''free''', '''open-source''' and '''cross-platform''' set of components and applications designed to help your '''localization''' and '''translation''' processes.&amp;lt;br&amp;gt;&lt;br /&gt;
The project uses and promotes '''[[Open Standards|open standards]]''' such as [[XLIFF]], [[TMX]], [[SRX]], [[ITS]] and other formats. It is an [[Open_Standards#OAXAL_1.0_Conformance_Statement|OAXAL 1.0 implementation]].&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; margin-top: 0px; padding:5px; padding-left:10px; background-color: yellow; color: red !important; font-weight: bold; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
[[About the Log4j2 vulnerabilities]]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-right:5px; padding:10px; padding-top:0; background:#ffffdf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Users Help and Documentation==&lt;br /&gt;
* '''[[Getting Started]]''' &amp;amp;mdash; all you need to know to get started using the Okapi tools&lt;br /&gt;
* '''[[FAQ]]''' &amp;amp;mdash; the answers to some of the most asked questions&lt;br /&gt;
* '''[[Online Help]]''' &amp;amp;mdash; the applications online help pages&lt;br /&gt;
* '''[[Knowledge Base]]''' &amp;amp;mdash; various articles on how to work with the Okapi tools&lt;br /&gt;
* '''[[Tutorials]]''' &amp;amp;mdash; step-by-step tutorials on using the Okapi tools&lt;br /&gt;
* '''[[Glossary]]''' &amp;amp;mdash; list of the main terms used in the help and documentation&lt;br /&gt;
* '''[[Screenshots]]''' &amp;amp;mdash; of the applications on different platforms3&lt;br /&gt;
&lt;br /&gt;
Additional support:&lt;br /&gt;
* '''[https://groups.google.com/forum/#!forum/okapi-users Users Group]''' &amp;amp;mdash; users group and mailing list (or '''[http://twitter.com/OkapiFramework follow us on Twitter]''')&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bug Report]''' &amp;amp;mdash; list of bug reports (Login required to post one)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Feature Request]''' &amp;amp;mdash; list of feature requests (Login required to post one)&lt;br /&gt;
* '''[mailto:okapitools@opentag.com&amp;amp;subject=Feedback Send Feedback]''' &amp;amp;mdash; email some feedback (no login required)&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;padding:10px; margin-right:5px; padding-top:0; background:#f9f9ff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Applications==&lt;br /&gt;
* '''[[Rainbow]]''' &amp;amp;mdash; a toolbox to launch a large variety of localization tasks&lt;br /&gt;
* '''[[CheckMate]]''' &amp;amp;mdash; an application to perform quality checks on bilingual files&lt;br /&gt;
* '''[[Tikal]]''' &amp;amp;mdash; a command-line tool for basic localization tasks&lt;br /&gt;
* '''[[Ratel]]''' &amp;amp;mdash; a WYSIWYG editor to create, test and maintain SRX segmentation rules&lt;br /&gt;
* '''[[Longhorn]]''' &amp;amp;mdash; a batch processing server&lt;br /&gt;
* '''[http://github.com/vistatec/ocelot/blob/master/README.md Ocelot]''' &amp;amp;mdash; a XLIFF translator/reviewer workbench&lt;br /&gt;
&lt;br /&gt;
==Components==&lt;br /&gt;
* '''[[Filters]]''' &amp;amp;mdash; extract translatable text from various file formats&lt;br /&gt;
* '''[[Steps|Pipelines and Steps]]''' &amp;amp;mdash; perform a chain of user-defined tasks on input documents&lt;br /&gt;
* '''[[Connectors|Translation Resource Connectors]]''' &amp;amp;mdash; provide access to MT, TM, term-bases, etc.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-left:5px; padding:10px; padding-top:0; background:#dfffe8; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Downloads &amp;amp;mdash; Stable Release ([https://bitbucket.org/okapiframework/okapi/src/master master branch])==&lt;br /&gt;
The latest stable release of Rainbow, CheckMate, Tikal, Ratel, etc. ('''1.46.0 - Aug 28, 2022'''):&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_win32-x86_1.46.0.zip Windows 32-bit]''' or '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_win32-x86_64_1.46.0.zip Windows 64-bit]''' &amp;amp;mdash; Windows platform&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_gtk2-linux-x86_1.46.0.zip Linux 32-bit]''' or '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_gtk2-linux-x86_64_1.46.0.zip Linux 64-bit]''' &amp;amp;mdash; Linux platform (gtk2, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_cocoa-macosx-x86_64_1.46.0.dmg Macintosh Intel 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.46.0/okapi-apps_cocoa-macosx-aarch64_1.46.0.dmg Macintosh Arm 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Apple M1 chip)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.43.0.zip Longhorn]''' &amp;amp;mdash; batch processing server ([[Longhorn|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.13-1.46.0-dist.zip Plugin for OmegaT]''' &amp;amp;mdash; filters plugin for OmegaT ([[Okapi Filters Plugin for OmegaT|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/ocelot Ocelot (v3.0)]''' &amp;amp;mdash; XLIFF reviewer workbench ([http://github.com/vistatec/ocelot/blob/master/README.md More information...)]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/changes Changes Log]''' &amp;amp;mdash; list of the changes between versions&lt;br /&gt;
* '''https://okapiframework.org/wiki/index.php/Distributions''' &amp;amp;mdash; all downloads&lt;br /&gt;
* '''https://search.maven.org/search?q=net.sf.okapi''' &amp;amp;mdash; release artifacts on Maven Central&lt;br /&gt;
&lt;br /&gt;
==Downloads &amp;amp;mdash; Development Release ([https://bitbucket.org/okapiframework/okapi/src/dev/ dev branch])==&lt;br /&gt;
For testers and advanced users, the latest development snapshots are here:&lt;br /&gt;
* Maven artifacts snapshots, nightly build: '''https://oss.sonatype.org/content/repositories/snapshots'''&lt;br /&gt;
* Latest snapshot of the distributions: '''[https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done?job=verification:jdk11 Nightly build on GitLab]'''&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-left:5px; padding:10px; padding-top:0; background:#ffe8bf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Developers==&lt;br /&gt;
{| style=&amp;quot;background:#ffe8bf;&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot; padding=&amp;quot;0&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/ Project Home]''' at Bitbucket&lt;br /&gt;
* '''[http://okapiframework.org/devguide/index.html Developer's Guide]'''&lt;br /&gt;
* '''[http://okapiframework.org/javadoc/index.html API Documentation]'''&lt;br /&gt;
* '''[http://groups.google.com/group/okapi-devel/ Developers Group and Mailing List]''' &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/src Git repository]''' of the source code&lt;br /&gt;
* '''[[About the Log4j2 vulnerabilities]]'''&lt;br /&gt;
|&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/okapi/pipelines Continuous Build on GitLab]'''&lt;br /&gt;
* List of the '''[https://bitbucket.org/okapiframework/okapi/src/dev/ Latest Code Changes]''' (dev branch)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/wiki/Consultancy Consultancy resources]'''&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bugs and Requests]''' tracking list&lt;br /&gt;
* Notifications: '''[https://bitbucket.org/okapiframework/okapi/commits/all Commits]''' and '''[http://groups.google.com/group/okapi-builds CB Errors]'''&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Steps&amp;diff=996</id>
		<title>Steps</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Steps&amp;diff=996"/>
		<updated>2023-12-15T18:46:29Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Steps are components that execute one specific task. You use them by chaining them into '''pipelines'''. See for example &amp;quot;[[How to Create a Pipeline in Rainbow]]&amp;quot; to see how steps can be used. Rainbow also [[Rainbow - Utilities|comes with several pre-defined pipelines]] using some of these steps.&lt;br /&gt;
&lt;br /&gt;
The Okapi Framework comes with several ready-to-use steps:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Batch Translation Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[BOM Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Character Count Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Cleanup Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Copy Or Move Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Combined Xliff Merger Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Create Target Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Desegmentation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Diff Leverage Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Encoding Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Enrycher Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[External Command Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Extraction Verification Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Filter Events to Raw Document Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Format Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Full-Width Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Generate SimpleTM Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[GTT Batch Translation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Id-Based Copy Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Id-Based Aligner Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Image Modification Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Inconsistency Check Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Inline Codes Removal Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Inline Codes Simplifier Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[LanguageTool Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Leveraging Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Line-Break Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Localizables Check Step]]&lt;br /&gt;
|&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Microsoft Batch Translation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Microsoft Batch Submission Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Moses InlineText Extraction Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Moses InlineText Leveraging Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[MS Word Resaver Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[MS Word Search and Replace Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Original Document Xliff Merger Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Paragraph Alignment Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Post-segmentation Inline Codes Removal Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Properties Setting Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[QuEst Quality Estimation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[QuEst SVM Model Builder Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Quality Check Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Quality Check Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Raw Document to Filter Events Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='hi'&amp;gt;RD/FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Rainbow Translation Kit Creation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='hi'&amp;gt;RD/FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Rainbow Translation Kit Merging Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[RTF Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Remove Target Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Repetition Analysis Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Resource Simplifier Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Scoping Report Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Search and Replace Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Search and Replace Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Segmentation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Segments to Text Units Converter Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Sentence Alignment Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[SimpleTM to TMX Step]]&lt;br /&gt;
|&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Space Check Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Skeleton Xliff Merger Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Term Extraction Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Terminology Leveraging Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Text Modification Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[TM Import Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Tokenization Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Translation Comparison Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[URI Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Used Characters Listing Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Word Count Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Simple Word Count Step]]&lt;br /&gt;
* &amp;lt;span class='green&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Whitespace Correction Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Analysis Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Cleanup Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Export Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Import Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Translation Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[TTX Joiner Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[TTX Splitter Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XLIFF Joiner Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XLIFF Splitter Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XML Analysis Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XML Characters Fixing Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XML Validation Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XSL Transformation Step]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Legend:&lt;br /&gt;
: &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt; = the step takes raw document and sends raw document &lt;br /&gt;
: &amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span the step class='green'&amp;gt;FE&amp;lt;/span&amp;gt; = the step takes raw document and sends filter events&lt;br /&gt;
: &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt; = the step takes filter events and sends filter events&lt;br /&gt;
: &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt; = the step takes filter events and sends raw document&lt;br /&gt;
: &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='hi'&amp;gt;RD/FE&amp;lt;/span&amp;gt; = the step takes filter events and sends either raw document or filter events&lt;br /&gt;
&lt;br /&gt;
[[Category:Steps]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=995</id>
		<title>Message Format Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=995"/>
		<updated>2023-12-14T21:44:07Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The MessageFormatFilter is designed to handle message formats commonly found in software applications. This includes formats such as ICU Message Format and Java MessageFormat. This filter is meant to be used as a subfilter and the message strings can be in various container formats: JSON, YAML XML etc..&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&amp;lt;code&amp;gt;choice&amp;lt;/code&amp;gt; syntax is not supported. For example: &amp;quot;&amp;lt;code&amp;gt;The value {count, choice, 0 #is none |1 #is one |1 &amp;lt;is more than one}&amp;lt;/code&amp;gt;&amp;quot;. This syntax is deprecated and should be converted to &amp;lt;code&amp;gt;select&amp;lt;/code&amp;gt; or &amp;lt;code&amp;gt;plural&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
*addPluralForms - false by default. If enabled new plural forms will be added based on the target locale. The source string is modified and then refiltered to provide the new plural forms to the translator.&lt;br /&gt;
&lt;br /&gt;
*normalize - false by default. If enabled the source will be normalized to move leading and trailing text inside each complex (plural, select etc..) variant. This makes translation easier by forcing each variant to be a complete phrase or sentence. Note: This option can increase word and character counts for the source!  &lt;br /&gt;
&lt;br /&gt;
*prettyPrint - false by default. If true the output is formatted to enhance readability. Any whitespace added is not significant. The default option returns a compact string (normally a single line) &lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Translation Formats]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=994</id>
		<title>Message Format Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=994"/>
		<updated>2023-12-14T21:40:38Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Overview */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The MessageFormatFilter is designed to handle message formats commonly found in software applications. This includes formats such as ICU Message Format and Java MessageFormat. This filter is meant to be used as a subfilter and the message strings can be in various container formats: JSON, YAML XML etc..&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
choice syntax is not supported. For example: &amp;quot;The value {count, choice, 0 #is none |1 #is one |1 &amp;lt;is more than one}&amp;quot;. This syntax is deprecated and should be converted to select or plural.&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
*addPluralForms - false by default. If enabled new plural forms will be added based on the target locale. The source string is modified and then refiltered to provide the new plural forms to the translator.&lt;br /&gt;
&lt;br /&gt;
*normalize - false by default. If enabled the source will be normalized to move leading and trailing text inside each complex (plural, select etc..) variant. This makes translation easier by forcing each variant to be a complete phrase or sentence. Note: This option can increase word and character counts for the source!  &lt;br /&gt;
&lt;br /&gt;
*prettyPrint - false by default. If true the output is formatted to enhance readability. Any whitespace added is not significant. The default option returns a compact string (normally a single line) &lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Translation Formats]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=992</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=992"/>
		<updated>2023-11-16T19:54:32Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* List of the Filters */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[EPUB Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[Message Format Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[WSXZ Package Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| EPUB || .epub || &amp;lt;code&amp;gt;okf_epub&amp;lt;/code&amp;gt; || [[EPUB Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WSXZ Package Filter || .wsxz || &amp;lt;code&amp;gt;okf_wsxzpackage&amp;lt;/code&amp;gt; || [[WSXZ Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Message Format (ICU Message Format Filter) || Any container format that supports subfilters || &amp;lt;code&amp;gt;okf_messageformat&amp;lt;/code&amp;gt; || [[Message Format Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi/core/src/main/javacc/SimplifierRules.jj&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as a filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX, PPTX and XLSX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source locale.&lt;br /&gt;
* The target locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any target locale.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When source locale, target locale and source font are omitted:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And this is the same as the abovementioned:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=991</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=991"/>
		<updated>2023-11-13T21:54:48Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* General Syntax */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[EPUB Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[WSXZ Package Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
* [[Message Format Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| EPUB || .epub || &amp;lt;code&amp;gt;okf_epub&amp;lt;/code&amp;gt; || [[EPUB Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WSXZ Package Filter || .wsxz || &amp;lt;code&amp;gt;okf_wsxzpackage&amp;lt;/code&amp;gt; || [[WSXZ Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Message Format (ICU Message Format Filter) || Any container format that supports subfilters || &amp;lt;code&amp;gt;okf_messageformat&amp;lt;/code&amp;gt; || [[Message Format Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi/core/src/main/javacc/SimplifierRules.jj&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as a filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX, PPTX and XLSX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source locale.&lt;br /&gt;
* The target locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any target locale.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When source locale, target locale and source font are omitted:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And this is the same as the abovementioned:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=990</id>
		<title>Message Format Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=990"/>
		<updated>2023-11-12T17:37:36Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The MessageFormatFilter is designed to handle message formats commonly found in software applications. This includes formats such as ICU Message Format and Java MessageFormat. This filter is meant to be used as a subfilter and the message strings can be in various container formats: JSON, YAML XML etc.. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
*addPluralForms - false by default. If enabled new plural forms will be added based on the target locale. The source string is modified and then refiltered to provide the new plural forms to the translator.&lt;br /&gt;
&lt;br /&gt;
*normalize - false by default. If enabled the source will be normalized to move leading and trailing text inside each complex (plural, select etc..) variant. This makes translation easier by forcing each variant to be a complete phrase or sentence. Note: This option can increase word and character counts for the source!  &lt;br /&gt;
&lt;br /&gt;
*prettyPrint - false by default. If true the output is formatted to enhance readability. Any whitespace added is not significant. The default option returns a compact string (normally a single line) &lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Translation Formats]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=989</id>
		<title>Message Format Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=989"/>
		<updated>2023-11-10T21:46:06Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The MessageFormatFilter is designed to handle message formats commonly found in software applications. This includes formats such as ICU Message Format and Java MessageFormat. This filter is meant to be used as a subfilter and the message strings can be in various container formats: JSON, YAML XML etc.. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
*addPluralForms - false by default. If enabled new plural forms will be added based on the target locale. The source string is modified and then refiltered to provide the new plural forms to the translator.&lt;br /&gt;
&lt;br /&gt;
*normalize - false by default. If enabled the source will be normalized to move leading and trailing text inside each complex (plural, select etc..) variant. This makes translation easier by forcing each variant to be a complete phrase or sentence. Note: This option can increase word and character counts for the source!  &lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Translation Formats]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=988</id>
		<title>Message Format Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=988"/>
		<updated>2023-11-06T17:50:01Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The MessageFormatFilter is designed to handle message formats commonly found in software applications. This includes formats such as ICU Message Format and Java MessageFormat. This filter is meant to be used as a subfilter and the message strings can be in various container formats: JSON, YAML XML etc.. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
*addPluralForms - false by default. If enabled new plural forms will be added based on the target locale. The source string is modified and then refiltered to provide the new plural forms to the translator.&lt;br /&gt;
&lt;br /&gt;
*normalize - false by default. If enabled the source will be normalized to move leading and trailing text inside each complex (plural, select etc..) variant. This makes translation easier by forcing each variant to be a complete phrase or sentence. Note: This option can change word and character counts for the source!  &lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Translation Formats]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=987</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=987"/>
		<updated>2023-10-12T21:46:46Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[EPUB Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[WSXZ Package Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
* [[Message Format Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| EPUB || .epub || &amp;lt;code&amp;gt;okf_epub&amp;lt;/code&amp;gt; || [[EPUB Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WSXZ Package Filter || .wsxz || &amp;lt;code&amp;gt;okf_wsxzpackage&amp;lt;/code&amp;gt; || [[WSXZ Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Message Format (ICU Message Format Filter) || Any container format that supports subfilters || &amp;lt;code&amp;gt;okf_messageformat&amp;lt;/code&amp;gt; || [[Message Format Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as a filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX, PPTX and XLSX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source locale.&lt;br /&gt;
* The target locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any target locale.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When source locale, target locale and source font are omitted:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And this is the same as the abovementioned:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=986</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=986"/>
		<updated>2023-10-12T21:45:57Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[EPUB Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[WSXZ Package Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
* [[Message Format Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| EPUB || .epub || &amp;lt;code&amp;gt;okf_epub&amp;lt;/code&amp;gt; || [[EPUB Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WSXZ Package Filter || .wsxz || &amp;lt;code&amp;gt;okf_wsxzpackage&amp;lt;/code&amp;gt; || [[WSXZ Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Message Format (ICU Message Format Filter) || &amp;lt;code&amp;gt;okf_messageformat&amp;lt;/code&amp;gt; || [[Message Format Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as a filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX, PPTX and XLSX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source locale.&lt;br /&gt;
* The target locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any target locale.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When source locale, target locale and source font are omitted:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And this is the same as the abovementioned:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=985</id>
		<title>Message Format Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=985"/>
		<updated>2023-10-12T21:45:24Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The MessageFormatFilter is designed to handle message formats commonly found in software applications. This includes formats such as ICU Message Format and Java MessageFormat. This filter is meant to be used as a subfilter and the message strings can be in various container formats: JSON, YAML XML etc.. &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
`addPluralForms` - false by default. If enabled new plural forms will be added based on the target locale. The source string is modified and then refiltered to provide the new plural forms to the translator.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Translation Formats]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=984</id>
		<title>Message Format Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=984"/>
		<updated>2023-10-12T21:43:13Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The MessageFormatFilter is designed to handle message formats commonly found in software applications. This includes formats such as ICU Message Format and Java MessageFormat.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
`addPluralForms` - false by default. If enabled new plural forms will be added based on the target locale. The source string is modified and then refiltered to provide the new plural forms to the trnaslator.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Translation Formats]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=983</id>
		<title>Message Format Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Message_Format_Filter&amp;diff=983"/>
		<updated>2023-10-12T21:42:41Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: Created page with &amp;quot;{{Filters Header}} ==Overview==  The MessageFormatFilter is designed to handle message formats commonly found in software applications. This includes formats such as ICU Message Format and Java MessageFormat.   ==Parameters==  `addPluralForms` - false by default. If enabled new plural forms will be added based on the target locale. The source string is modified and then refiltered to provide the new plural forms to the trnaslator.  Category:Filters Category:Segment...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The MessageFormatFilter is designed to handle message formats commonly found in software applications. This includes formats such as ICU Message Format and Java MessageFormat.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
`addPluralForms` - false by default. If enabled new plural forms will be added based on the target locale. The source string is modified and then refiltered to provide the new plural forms to the trnaslator.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Segmentation]] [[Category:Translation Formats]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=982</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=982"/>
		<updated>2023-10-12T21:36:11Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[EPUB Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[WSXZ Package Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
* [[Message Format Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| EPUB || .epub || &amp;lt;code&amp;gt;okf_epub&amp;lt;/code&amp;gt; || [[EPUB Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WSXZ Package Filter || .wsxz || &amp;lt;code&amp;gt;okf_wsxzpackage&amp;lt;/code&amp;gt; || [[WSXZ Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Message Format (ICU Message Format Filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_messageformat&amp;lt;/code&amp;gt; || [[Message Format Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as a filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX, PPTX and XLSX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source locale.&lt;br /&gt;
* The target locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any target locale.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When source locale, target locale and source font are omitted:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And this is the same as the abovementioned:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Rainbow_-_Command_Line&amp;diff=979</id>
		<title>Rainbow - Command Line</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Rainbow_-_Command_Line&amp;diff=979"/>
		<updated>2023-07-19T15:06:14Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Rainbow Common Menu}}&lt;br /&gt;
&lt;br /&gt;
When starting Rainbow has different behaviors depending on the arguments it has when starting:&lt;br /&gt;
&lt;br /&gt;
* If Rainbow is started with just one argument: it starts in normal mode and takes the argument as a project file to be loaded.&lt;br /&gt;
&lt;br /&gt;
* If Rainbow is started with more than one argument: it starts in command-line mode and interprets the arguments as described in the table below.&lt;br /&gt;
&lt;br /&gt;
When running in batch mode, the log is saved into a file named &amp;lt;code&amp;gt;rainbowBatchLog.txt&amp;lt;/code&amp;gt; in the home directory of the user.&lt;br /&gt;
&lt;br /&gt;
Note that you can also use [[Tikal]] to execute various function from a command line.&lt;br /&gt;
&lt;br /&gt;
The arguments of the command-line can be the following:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;&amp;amp;lt;inputFile&amp;gt;[ -fc &amp;amp;lt;filterConfiguration&amp;gt;]&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the input file, and optionally sets the filter configuration to assign to it. You can specify an absolute or a local filename. The input file root is reset to the folder of the given input file. If a project was loaded, all input files in that projects are removed and the input file root reset.&lt;br /&gt;
&lt;br /&gt;
If you specify several input files (and their filter configurations) the first one will be assigned to the &amp;lt;cite&amp;gt;Input List 1&amp;lt;/cite&amp;gt;, the second to the &amp;lt;cite&amp;gt;Input List 2&amp;lt;/cite&amp;gt;, etc.&lt;br /&gt;
&lt;br /&gt;
If the filter configuration is not specified in the command line, the default filter (if one can be found) is used.&lt;br /&gt;
Input files must be specified prior to an output location being specified (via &amp;lt;code&amp;gt;-o&amp;lt;/code&amp;gt;), and the &amp;lt;code&amp;gt;-fc&amp;lt;/code&amp;gt; option must always follow an input file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-p &amp;amp;lt;projectFilename&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Loads an existing project file &amp;lt;code&amp;gt;&amp;amp;lt;projectFilename&amp;gt;.&amp;lt;/code&amp;gt;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-x &amp;amp;lt;Id&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Executes the [[Rainbow - Utilities|utility or the predefined pipeline]] with the ID &amp;lt;code&amp;gt;&amp;amp;lt;Id&amp;gt;&amp;lt;/code&amp;gt;. This is done after all arguments of the command line have been processed.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-pln &amp;amp;lt;pipelineFilename&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Loads and execute the specified pipeline stored in &amp;lt;code&amp;gt;&amp;amp;lt;pipelineFilename&amp;gt;&amp;lt;/code&amp;gt;. A pipeline file can be created by selecting Utilities -&amp;gt; Edit / Execute Pipeline from the menu bar, adding steps by clicking Add Step... button, and clicking the Save button.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-se &amp;amp;lt;encoding&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the default source encoding to &amp;lt;code&amp;gt;&amp;amp;lt;encoding&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-te &amp;amp;lt;encoding&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the default target encoding to &amp;lt;code&amp;gt;&amp;amp;lt;encoding&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-sl &amp;amp;lt;langCode&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the source language using &amp;lt;code&amp;gt;&amp;amp;lt;langCode&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-tl &amp;amp;lt;langCode&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the target language using &amp;lt;code&amp;gt;&amp;amp;lt;langCode&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-opt &amp;amp;lt;optionFilename&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the options file to use for the utility to execute. Use the &amp;lt;code&amp;gt;-np&amp;lt;/code&amp;gt; flag to be prompted or not to modify the options when the command line is executed. The options file must be for the utility defined with &amp;lt;code&amp;gt;-x&amp;lt;/code&amp;gt;. Note that option file are only for utilities, not predefined pipelines. If a non-default behavior of a predefined configuration is desired, define your own pipeline and then use &amp;lt;code&amp;gt;-pln &amp;amp;lt;pipelineFilename&amp;gt;&amp;lt;code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-log &amp;amp;lt;logFile&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the path to the log file. If not specified &amp;lt;code&amp;gt;{user.home}/rainbowBatchLog.txt&amp;lt;/code&amp;gt; is used.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-np&amp;lt;/code&amp;gt;&lt;br /&gt;
| No prompt for utility's options.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-o &amp;amp;lt;outputFile&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the output file. If this option is not used and an input file is specified, the output file path and name is build based on the output options of the project (loaded or default).  If this option is specified before an input file is provided, an error will be reported in the log.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-pd &amp;amp;lt;directory&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the parameters directory (the location where the filter parameters files are stored). You can use &amp;lt;code&amp;gt;.&amp;lt;/code&amp;gt; (dot) to specify the current directory. By default, if not project is loaded, the default parameters directory is the user home directory.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-ir &amp;amp;lt;directory&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the input root directory for the first input list. You can use &amp;lt;code&amp;gt;.&amp;lt;/code&amp;gt; (dot) to specify the current directory. This value is also used to set the &amp;lt;code&amp;gt;${inputRootDir}&amp;lt;/code&amp;gt; variable that can be used in some path parameters.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-rd &amp;amp;lt;directory&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the root directory. You can use &amp;lt;code&amp;gt;.&amp;lt;/code&amp;gt; (dot) to specify the current directory. This value is also used to set the &amp;lt;code&amp;gt;${rootDir}&amp;lt;/code&amp;gt; variable that can be used in some path parameters.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-? or -h&amp;lt;/code&amp;gt;&lt;br /&gt;
| Opens this help page.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Here are some example of command lines in '''Windows'''.&lt;br /&gt;
&lt;br /&gt;
 C:\&amp;gt;rainbow.exe -x TextRewriting -sl EN -tl FR myInput.xlf -o myOutput.xlf&lt;br /&gt;
&lt;br /&gt;
The command-line above executes the Text Rewriting predefined pipeline with the source language set to EN and the target language set to FR. The input document is the XLIFF file &amp;lt;code&amp;gt;myInput.xlf&amp;lt;/code&amp;gt;, and the modified file is saved as &amp;lt;code&amp;gt;myOutput.xlf&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
 C:\&amp;gt;rainbow.exe -x TranslationComparison -sl EN -tl FR -pd . myHumanTrans.xlf myMachineTrans.txt -fc okf_regex@myText&lt;br /&gt;
&lt;br /&gt;
The command-line above executes the Translation Comparison predefined pipeline with the source language set to EN and the target language set to FR. The current folder (&amp;lt;code&amp;gt;.&amp;lt;/code&amp;gt;) is specified as the parameters directory. The input file &amp;lt;code&amp;gt;myHumanTrans.xlf&amp;lt;/code&amp;gt; is the input document for the &amp;lt;cite&amp;gt;Input List 1&amp;lt;/cite&amp;gt;, and the default XLIFF filter configuration assigned to it. The input file &amp;lt;code&amp;gt;myMachineTrans.txt&amp;lt;/code&amp;gt; is the input document for the &amp;lt;cite&amp;gt;Input List 2&amp;lt;/cite&amp;gt;, and the custom filter parameters &amp;lt;code&amp;gt;okf_regex@myText.fprm&amp;lt;/code&amp;gt; is associated with it. No utility options are specified, so the use will be prompted to set the options.&lt;br /&gt;
&lt;br /&gt;
 C:\&amp;gt;rainbow.exe -h&lt;br /&gt;
&lt;br /&gt;
The command-line above opens this help page.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
On '''macOS''', go to /Applications, ~/Applications, or wherever you installed Okapi. For example:&lt;br /&gt;
&lt;br /&gt;
 Applications $ Rainbow.app/Contents/MacOS/rainbow.sh -h&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Rainbow]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Rainbow_-_Command_Line&amp;diff=978</id>
		<title>Rainbow - Command Line</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Rainbow_-_Command_Line&amp;diff=978"/>
		<updated>2023-07-19T15:05:13Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Rainbow Common Menu}}&lt;br /&gt;
&lt;br /&gt;
When starting Rainbow has different behaviors depending on the arguments it has when starting:&lt;br /&gt;
&lt;br /&gt;
* If Rainbow is started with just one argument: it starts in normal mode and takes the argument as a project file to be loaded.&lt;br /&gt;
&lt;br /&gt;
* If Rainbow is started with more than one argument: it starts in command-line mode and interprets the arguments as described in the table below.&lt;br /&gt;
&lt;br /&gt;
When running in batch mode, the log is saved into a file named &amp;lt;code&amp;gt;rainbowBatchLog.txt&amp;lt;/code&amp;gt; in the home directory of the user.&lt;br /&gt;
&lt;br /&gt;
Note that you can also use [[Tikal]] to execute various function from a command line.&lt;br /&gt;
&lt;br /&gt;
The arguments of the command-line can be the following:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;&amp;amp;lt;inputFile&amp;gt;[ -fc &amp;amp;lt;filterConfiguration&amp;gt;]&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the input file, and optionally sets the filter configuration to assign to it. You can specify an absolute or a local filename. The input file root is reset to the folder of the given input file. If a project was loaded, all input files in that projects are removed and the input file root reset.&lt;br /&gt;
&lt;br /&gt;
If you specify several input files (and their filter configurations) the first one will be assigned to the &amp;lt;cite&amp;gt;Input List 1&amp;lt;/cite&amp;gt;, the second to the &amp;lt;cite&amp;gt;Input List 2&amp;lt;/cite&amp;gt;, etc.&lt;br /&gt;
&lt;br /&gt;
If the filter configuration is not specified in the command line, the default filter (if one can be found) is used.&lt;br /&gt;
Input files must be specified prior to an output location being specified (via &amp;lt;code&amp;gt;-o&amp;lt;/code&amp;gt;), and the &amp;lt;code&amp;gt;-fc&amp;lt;/code&amp;gt; option must always follow an input file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-p &amp;amp;lt;projectFilename&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Loads an existing project file &amp;lt;code&amp;gt;&amp;amp;lt;projectFilename&amp;gt;.&amp;lt;/code&amp;gt;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-x &amp;amp;lt;Id&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Executes the [[Rainbow - Utilities|utility or the predefined pipeline]] with the ID &amp;lt;code&amp;gt;&amp;amp;lt;Id&amp;gt;&amp;lt;/code&amp;gt;. This is done after all arguments of the command line have been processed.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-pln &amp;amp;lt;pipelineFilename&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Loads and execute the specified pipeline stored in &amp;lt;code&amp;gt;&amp;amp;lt;pipelineFilename&amp;gt;&amp;lt;/code&amp;gt;. A pipeline file can be created by selecting Utilities -&amp;gt; Edit / Execute Pipeline from the menu bar, adding steps by clicking Add Step... button, and clicking the Save button.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-se &amp;amp;lt;encoding&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the default source encoding to &amp;lt;code&amp;gt;&amp;amp;lt;encoding&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-te &amp;amp;lt;encoding&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the default target encoding to &amp;lt;code&amp;gt;&amp;amp;lt;encoding&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-sl &amp;amp;lt;langCode&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the source language using &amp;lt;code&amp;gt;&amp;amp;lt;langCode&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-tl &amp;amp;lt;langCode&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the target language using &amp;lt;code&amp;gt;&amp;amp;lt;langCode&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-opt &amp;amp;lt;optionFilename&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the options file to use for the utility to execute. Use the &amp;lt;code&amp;gt;-np&amp;lt;/code&amp;gt; flag to be prompted or not to modify the options when the command line is executed. The options file must be for the utility defined with &amp;lt;code&amp;gt;-x&amp;lt;/code&amp;gt;. Note that option file are only for utilities, not predefined pipelines. If a non-default behavior of a predefined configuration is desired, define your own pipeline and then use &amp;lt;code&amp;gt;-pln &amp;amp;lt;pipelineFilename&amp;gt;&amp;lt;code&amp;gt;.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-log &amp;amp;lt;logFile&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the path to the log file. If not specified &amp;lt;code&amp;gt;{user.home}/rainbowBatchLog.txt&amp;lt;/code&amp;gt; is used.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-np&amp;lt;/code&amp;gt;&lt;br /&gt;
| No prompt for utility's options.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-o &amp;amp;lt;outputFile&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the output file. If this option is not used and an input file is specified, the output file path and name is build based on the output options of the project (loaded or default).  If this option is specified before an input file is provided, an error will be reported in the log.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-pd &amp;amp;lt;directory&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the parameters directory (the location where the filter parameters files are stored). You can use &amp;lt;code&amp;gt;.&amp;lt;/code&amp;gt; (dot) to specify the current directory. By default, if not project is loaded, the default parameters directory is the user home directory.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-ir &amp;amp;lt;directory&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the input root directory for the first input list. You can use &amp;lt;code&amp;gt;.&amp;lt;/code&amp;gt; (dot) to specify the current directory. This value is also used to set the &amp;lt;code&amp;gt;${inputRootDir}&amp;lt;/code&amp;gt; variable that can be used in some path parameters.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-rd &amp;amp;lt;directory&amp;gt;&amp;lt;/code&amp;gt;&lt;br /&gt;
| Sets the root directory. You can use &amp;lt;code&amp;gt;.&amp;lt;/code&amp;gt; (dot) to specify the current directory. This value is also used to set the &amp;lt;code&amp;gt;${rootDir}&amp;lt;/code&amp;gt; variable that can be used in some path parameters.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;code&amp;gt;-? or -h&amp;lt;/code&amp;gt;&lt;br /&gt;
| Opens this help page.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Here are some example of command lines in '''Windows'''.&lt;br /&gt;
&lt;br /&gt;
 C:\&amp;gt;rainbow.exe -x TextRewriting -sl EN -tl FR myInput.xlf -o myOutput.xlf&lt;br /&gt;
&lt;br /&gt;
The command-line above executes the Text Rewriting predefined pipeline with the source language set to EN and the target language set to FR. The input document is the XLIFF file &amp;lt;code&amp;gt;myInput.xlf&amp;lt;/code&amp;gt;, and the modified file is saved as &amp;lt;code&amp;gt;myOutput.xlf&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
 C:\&amp;gt;rainbow.exe -x TranslationComparison -sl EN -tl FR -pd . myHumanTrans.xlf myMachineTrans.txt -fc okf_regex@myText&lt;br /&gt;
&lt;br /&gt;
The command-line above executes the Translation Comparison predefined pipeline with the source language set to EN and the target language set to FR. The current folder (&amp;lt;code&amp;gt;.&amp;lt;/code&amp;gt;) is specified as the parameters directory. The input file &amp;lt;code&amp;gt;myHumanTrans.xlf&amp;lt;/code&amp;gt; is the input document for the &amp;lt;cite&amp;gt;Input List 1&amp;lt;/cite&amp;gt;, and the default XLIFF filter configuration assigned to it. The input file &amp;lt;code&amp;gt;myMachineTrans.txt&amp;lt;/code&amp;gt; is the input document for the &amp;lt;cite&amp;gt;Input List 2&amp;lt;/cite&amp;gt;, and the custom filter parameters &amp;lt;code&amp;gt;okf_regex@myText.fprm&amp;lt;/code&amp;gt; is associated with it. No utility options are specified, so the use will be prompted to set the options.&lt;br /&gt;
&lt;br /&gt;
 C:\&amp;gt;rainbow.exe -h&lt;br /&gt;
&lt;br /&gt;
The command-line above opens this help page.&lt;br /&gt;
&lt;br /&gt;
----&lt;br /&gt;
On '''macOS''', go to /Applications, ~/Applications, or wherever you installed Okapi, and replace the &amp;quot;java -jar \rnb\lib\rainbow.jar&amp;quot; part above with &amp;quot;Rainbow.app/Contents/MacOS/rainbow.sh&amp;quot;. For example:&lt;br /&gt;
&lt;br /&gt;
 Applications $ Rainbow.app/Contents/MacOS/rainbow.sh -h&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Rainbow]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=976</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=976"/>
		<updated>2023-03-09T15:49:56Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Supported File Formats */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[EPUB Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[WSXZ Package Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| EPUB || .epub || &amp;lt;code&amp;gt;okf_epub&amp;lt;/code&amp;gt; || [[EPUB Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WSXZ Package Filter || .wsxz || &amp;lt;code&amp;gt;okf_wsxzpackage&amp;lt;/code&amp;gt; || [[WSXZ Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as a filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX, PPTX and XLSX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source locale.&lt;br /&gt;
* The target locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any target locale.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When source locale, target locale and source font are omitted:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And this is the same as the abovementioned:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=WSXZ_Package_Filter&amp;diff=973</id>
		<title>WSXZ Package Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=WSXZ_Package_Filter&amp;diff=973"/>
		<updated>2023-02-19T18:24:41Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: Created page with &amp;quot;{{Filters Header}} ==Overview==  The WSXZ Package Filter is an Okapi component that implements the IFilter interface for WSXZ (Worldserver TKIT) files.   ==Processing Details==  This filter is an extension of the Archive Filter. It reads the input package, detects the SDLXLIFF files for the specified language pair, and uses the XLIFF Filter (with the &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt;) to process the content. Each SDLXLIFF file inside the package corresponds to a sub-doc...&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The WSXZ Package Filter is an Okapi component that implements the IFilter interface for WSXZ (Worldserver TKIT) files. &lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
This filter is an extension of the [[Archive Filter]]. It reads the input package, detects the SDLXLIFF files for the specified language pair, and uses the [[XLIFF Filter]] (with the &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt;) to process the content. Each SDLXLIFF file inside the package corresponds to a sub-document in the Okapi filter events.&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
* TMX and TBX files are ignored.&lt;br /&gt;
* See all the limitation for the [[XLIFF Filter]].&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=EPUB_Filter&amp;diff=972</id>
		<title>EPUB Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=EPUB_Filter&amp;diff=972"/>
		<updated>2023-02-19T18:21:54Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: Created page with &amp;quot;{{Filters Header}} ==Overview==  This filter allows you to process EPUB (Electronic Publication) documents.  ==Processing Details==  ===Input Encoding===  The filter decides which encoding to use for the input file using the following logic:  * If the file has a Unicode Byte-Order-Mark: ** Then, the corresponding encoding (e.g. UTF-8, UTF-16, etc.) is used.  * Otherwise, the input encoding used is the default encoding that was specified when setting the filter options....&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
This filter allows you to process EPUB (Electronic Publication) documents.&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
===Input Encoding===&lt;br /&gt;
&lt;br /&gt;
The filter decides which encoding to use for the input file using the following logic:&lt;br /&gt;
&lt;br /&gt;
* If the file has a Unicode Byte-Order-Mark:&lt;br /&gt;
** Then, the corresponding encoding (e.g. UTF-8, UTF-16, etc.) is used. &lt;br /&gt;
* Otherwise, the input encoding used is the default encoding that was specified when setting the filter options. &lt;br /&gt;
&lt;br /&gt;
===Output Encoding===&lt;br /&gt;
&lt;br /&gt;
If the output encoding is UTF-8:&lt;br /&gt;
* If the input encoding was also UTF-8, a Byte-Order-Mark is used for the output document only if one was detected in the input document. &lt;br /&gt;
* If the input encoding was not UTF-8, no Byte-Order-Mark is used in the output document. &lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
At this time, this filter does not have an editor to create or modify its configuration file. You need to use a text editor to edit custom configurations.&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
None known.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=971</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=971"/>
		<updated>2023-02-19T18:19:18Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Supported File Formats */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[EPUB Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[WSXZ Package Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| EPUB || .epub || &amp;lt;code&amp;gt;okf_epub&amp;lt;/code&amp;gt; || [[EPUB Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WSXZ Package Filter || .wsxz || &amp;lt;code&amp;gt;okf_wsxz&amp;lt;/code&amp;gt; || [[WSXZ Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as a filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX, PPTX and XLSX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source locale.&lt;br /&gt;
* The target locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any target locale.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When source locale, target locale and source font are omitted:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And this is the same as the abovementioned:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=970</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=970"/>
		<updated>2023-02-19T18:15:53Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* List of the Filters */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[EPUB Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[WSXZ Package Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as a filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX, PPTX and XLSX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source locale.&lt;br /&gt;
* The target locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any target locale.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When source locale, target locale and source font are omitted:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And this is the same as the abovementioned:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Distributions&amp;diff=966</id>
		<title>Distributions</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Distributions&amp;diff=966"/>
		<updated>2023-02-09T16:56:30Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Main Project */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Main Project ==&lt;br /&gt;
&lt;br /&gt;
This includes platform-specific distributions for the applications (Rainbow, Tikal, Ratel, etc.) as well as the libraries-only package (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.45.0 - Feb 9, 2023&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/main/1.45.0 https://okapiframework.org/binaries/main/1.45.0]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/main https://okapiframework.org/binaries/main]&lt;br /&gt;
* Release artifacts on Maven Central: https://search.maven.org/search?q=net.sf.okapi&lt;br /&gt;
* Changes log: [https://okapiframework.org/binaries/main/changes/files/1.45.0.html https://okapiframework.org/binaries/main/changes/files/1.45.0.html]&lt;br /&gt;
&lt;br /&gt;
Snapshots:&lt;br /&gt;
&lt;br /&gt;
* Latest Development Snapshots (nightly builds): [https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done?job=verification:jdk11 https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done]&lt;br /&gt;
&lt;br /&gt;
== Longhorn ==&lt;br /&gt;
&lt;br /&gt;
This includes the Longhorn distributions (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.44.0 - Jan 16, 2023&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.44.0.zip okapi-longhorn_all-platforms_1.44.0.zip]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/longhorn https://okapiframework.org/binaries/longhorn]&lt;br /&gt;
&lt;br /&gt;
== OmegaT Filter Plugin ==&lt;br /&gt;
&lt;br /&gt;
This includes the Filters Plugin for OmegaT (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.12-1.44.0 - Nov 23, 2022&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.12-1.44.0-dist.zip okapiFiltersForOmegaT-1.12-1.44.0-dist.zip]&lt;br /&gt;
* All releases: [https://okapiframework.org/binaries/omegat-plugin https://okapiframework.org/binaries/omegat-plugin]&lt;br /&gt;
&lt;br /&gt;
== Ocelot ==&lt;br /&gt;
&lt;br /&gt;
This includes the  Review Workbench application Ocelot (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;3.0 - Oct 17, 2017&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/ocelot/Ocelot-3.0.jar Ocelot-3.0.jar]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/ocelot https://okapiframework.org/binaries/ocelot]&lt;br /&gt;
&lt;br /&gt;
== Archives ==&lt;br /&gt;
&lt;br /&gt;
Older distributions that are not included above.&lt;br /&gt;
&lt;br /&gt;
* [https://okapiframework.org/binaries/archives Archives (https://okapiframework.org/binaries/archives)]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Distributions&amp;diff=965</id>
		<title>Distributions</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Distributions&amp;diff=965"/>
		<updated>2023-02-09T16:55:43Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Main Project */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Main Project ==&lt;br /&gt;
&lt;br /&gt;
This includes platform-specific distributions for the applications (Rainbow, Tikal, Ratel, etc.) as well as the libraries-only package (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.45.0 - Feb 9, 2023&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/main/1.45.0 https://okapiframework.org/binaries/main/1.45.0]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/main https://okapiframework.org/binaries/main]&lt;br /&gt;
* Release artifacts on Maven Central: https://search.maven.org/search?q=net.sf.okapi&lt;br /&gt;
* Changes log: [https://okapiframework.org/binaries/main/changes/index.html https://okapiframework.org/binaries/main/changes/index.html]&lt;br /&gt;
&lt;br /&gt;
Snapshots:&lt;br /&gt;
&lt;br /&gt;
* Latest Development Snapshots (nightly builds): [https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done?job=verification:jdk11 https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done]&lt;br /&gt;
&lt;br /&gt;
== Longhorn ==&lt;br /&gt;
&lt;br /&gt;
This includes the Longhorn distributions (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.44.0 - Jan 16, 2023&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.44.0.zip okapi-longhorn_all-platforms_1.44.0.zip]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/longhorn https://okapiframework.org/binaries/longhorn]&lt;br /&gt;
&lt;br /&gt;
== OmegaT Filter Plugin ==&lt;br /&gt;
&lt;br /&gt;
This includes the Filters Plugin for OmegaT (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.12-1.44.0 - Nov 23, 2022&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.12-1.44.0-dist.zip okapiFiltersForOmegaT-1.12-1.44.0-dist.zip]&lt;br /&gt;
* All releases: [https://okapiframework.org/binaries/omegat-plugin https://okapiframework.org/binaries/omegat-plugin]&lt;br /&gt;
&lt;br /&gt;
== Ocelot ==&lt;br /&gt;
&lt;br /&gt;
This includes the  Review Workbench application Ocelot (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;3.0 - Oct 17, 2017&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/ocelot/Ocelot-3.0.jar Ocelot-3.0.jar]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/ocelot https://okapiframework.org/binaries/ocelot]&lt;br /&gt;
&lt;br /&gt;
== Archives ==&lt;br /&gt;
&lt;br /&gt;
Older distributions that are not included above.&lt;br /&gt;
&lt;br /&gt;
* [https://okapiframework.org/binaries/archives Archives (https://okapiframework.org/binaries/archives)]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Distributions&amp;diff=964</id>
		<title>Distributions</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Distributions&amp;diff=964"/>
		<updated>2023-02-09T16:51:54Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;br /&gt;
== Main Project ==&lt;br /&gt;
&lt;br /&gt;
This includes platform-specific distributions for the applications (Rainbow, Tikal, Ratel, etc.) as well as the libraries-only package (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.45.0 - Feb 9, 2023&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/main/1.45.0 https://okapiframework.org/binaries/main/1.45.0]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/main https://okapiframework.org/binaries/main]&lt;br /&gt;
* Release artifacts on Maven Central: https://search.maven.org/search?q=net.sf.okapi&lt;br /&gt;
* Changes log: [https://okapiframework.org/binaries/main/changes.html https://okapiframework.org/binaries/main/changes.html]&lt;br /&gt;
&lt;br /&gt;
Snapshots:&lt;br /&gt;
&lt;br /&gt;
* Latest Development Snapshots (nightly builds): [https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done?job=verification:jdk11 https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done]&lt;br /&gt;
&lt;br /&gt;
== Longhorn ==&lt;br /&gt;
&lt;br /&gt;
This includes the Longhorn distributions (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.44.0 - Jan 16, 2023&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.44.0.zip okapi-longhorn_all-platforms_1.44.0.zip]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/longhorn https://okapiframework.org/binaries/longhorn]&lt;br /&gt;
&lt;br /&gt;
== OmegaT Filter Plugin ==&lt;br /&gt;
&lt;br /&gt;
This includes the Filters Plugin for OmegaT (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;1.12-1.44.0 - Nov 23, 2022&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.12-1.44.0-dist.zip okapiFiltersForOmegaT-1.12-1.44.0-dist.zip]&lt;br /&gt;
* All releases: [https://okapiframework.org/binaries/omegat-plugin https://okapiframework.org/binaries/omegat-plugin]&lt;br /&gt;
&lt;br /&gt;
== Ocelot ==&lt;br /&gt;
&lt;br /&gt;
This includes the  Review Workbench application Ocelot (all platforms).&lt;br /&gt;
&lt;br /&gt;
* Latest release ( &amp;lt;b&amp;gt;3.0 - Oct 17, 2017&amp;lt;/b&amp;gt; ) : [https://okapiframework.org/binaries/ocelot/Ocelot-3.0.jar Ocelot-3.0.jar]&lt;br /&gt;
* Recent releases: [https://okapiframework.org/binaries/ocelot https://okapiframework.org/binaries/ocelot]&lt;br /&gt;
&lt;br /&gt;
== Archives ==&lt;br /&gt;
&lt;br /&gt;
Older distributions that are not included above.&lt;br /&gt;
&lt;br /&gt;
* [https://okapiframework.org/binaries/archives Archives (https://okapiframework.org/binaries/archives)]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=963</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=963"/>
		<updated>2023-02-09T16:46:40Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Downloads &amp;amp;mdash; Stable Release (master branch) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; padding:5px; padding-left:10px; background:#ffffff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
The Okapi Framework is a '''free''', '''open-source''' and '''cross-platform''' set of components and applications designed to help your '''localization''' and '''translation''' processes.&amp;lt;br&amp;gt;&lt;br /&gt;
The project uses and promotes '''[[Open Standards|open standards]]''' such as [[XLIFF]], [[TMX]], [[SRX]], [[ITS]] and other formats. It is an [[Open_Standards#OAXAL_1.0_Conformance_Statement|OAXAL 1.0 implementation]].&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; margin-top: 0px; padding:5px; padding-left:10px; background-color: yellow; color: red !important; font-weight: bold; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
[[About the Log4j2 vulnerabilities]]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-right:5px; padding:10px; padding-top:0; background:#ffffdf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Users Help and Documentation==&lt;br /&gt;
* '''[[Getting Started]]''' &amp;amp;mdash; all you need to know to get started using the Okapi tools&lt;br /&gt;
* '''[[FAQ]]''' &amp;amp;mdash; the answers to some of the most asked questions&lt;br /&gt;
* '''[[Online Help]]''' &amp;amp;mdash; the applications online help pages&lt;br /&gt;
* '''[[Knowledge Base]]''' &amp;amp;mdash; various articles on how to work with the Okapi tools&lt;br /&gt;
* '''[[Tutorials]]''' &amp;amp;mdash; step-by-step tutorials on using the Okapi tools&lt;br /&gt;
* '''[[Glossary]]''' &amp;amp;mdash; list of the main terms used in the help and documentation&lt;br /&gt;
* '''[[Screenshots]]''' &amp;amp;mdash; of the applications on different platforms3&lt;br /&gt;
&lt;br /&gt;
Additional support:&lt;br /&gt;
* '''[https://groups.google.com/forum/#!forum/okapi-users Users Group]''' &amp;amp;mdash; users group and mailing list (or '''[http://twitter.com/OkapiFramework follow us on Twitter]''')&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bug Report]''' &amp;amp;mdash; list of bug reports (Login required to post one)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Feature Request]''' &amp;amp;mdash; list of feature requests (Login required to post one)&lt;br /&gt;
* '''[mailto:okapitools@opentag.com&amp;amp;subject=Feedback Send Feedback]''' &amp;amp;mdash; email some feedback (no login required)&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;padding:10px; margin-right:5px; padding-top:0; background:#f9f9ff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Applications==&lt;br /&gt;
* '''[[Rainbow]]''' &amp;amp;mdash; a toolbox to launch a large variety of localization tasks&lt;br /&gt;
* '''[[CheckMate]]''' &amp;amp;mdash; an application to perform quality checks on bilingual files&lt;br /&gt;
* '''[[Tikal]]''' &amp;amp;mdash; a command-line tool for basic localization tasks&lt;br /&gt;
* '''[[Ratel]]''' &amp;amp;mdash; a WYSIWYG editor to create, test and maintain SRX segmentation rules&lt;br /&gt;
* '''[[Longhorn]]''' &amp;amp;mdash; a batch processing server&lt;br /&gt;
* '''[http://github.com/vistatec/ocelot/blob/master/README.md Ocelot]''' &amp;amp;mdash; a XLIFF translator/reviewer workbench&lt;br /&gt;
&lt;br /&gt;
==Components==&lt;br /&gt;
* '''[[Filters]]''' &amp;amp;mdash; extract translatable text from various file formats&lt;br /&gt;
* '''[[Steps|Pipelines and Steps]]''' &amp;amp;mdash; perform a chain of user-defined tasks on input documents&lt;br /&gt;
* '''[[Connectors|Translation Resource Connectors]]''' &amp;amp;mdash; provide access to MT, TM, term-bases, etc.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-left:5px; padding:10px; padding-top:0; background:#dfffe8; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Downloads &amp;amp;mdash; Stable Release ([https://bitbucket.org/okapiframework/okapi/src/master master branch])==&lt;br /&gt;
The latest stable release of Rainbow, CheckMate, Tikal, Ratel, etc. ('''1.45.0 - Aug 28, 2022'''):&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_win32-x86_1.45.0.zip Windows 32-bit]''' or '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_win32-x86_64_1.45.0.zip Windows 64-bit]''' &amp;amp;mdash; Windows platform&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_gtk2-linux-x86_1.45.0.zip Linux 32-bit]''' or '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_gtk2-linux-x86_64_1.45.0.zip Linux 64-bit]''' &amp;amp;mdash; Linux platform (gtk2, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_cocoa-macosx-x86_64_1.45.0.dmg Macintosh Intel 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_cocoa-macosx-aarch64_1.45.0.dmg Macintosh Arm 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Apple M1 chip)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.43.0.zip Longhorn]''' &amp;amp;mdash; batch processing server ([[Longhorn|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.11-1.43.0-dist.zip Plugin for OmegaT]''' &amp;amp;mdash; filters plugin for OmegaT ([[Okapi Filters Plugin for OmegaT|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/ocelot Ocelot (v3.0)]''' &amp;amp;mdash; XLIFF reviewer workbench ([http://github.com/vistatec/ocelot/blob/master/README.md More information...)]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/changes Changes Log]''' &amp;amp;mdash; list of the changes between versions&lt;br /&gt;
* '''https://okapiframework.org/wiki/index.php/Distributions''' &amp;amp;mdash; all downloads&lt;br /&gt;
* '''https://search.maven.org/search?q=net.sf.okapi''' &amp;amp;mdash; release artifacts on Maven Central&lt;br /&gt;
&lt;br /&gt;
==Downloads &amp;amp;mdash; Development Release ([https://bitbucket.org/okapiframework/okapi/src/dev/ dev branch])==&lt;br /&gt;
For testers and advanced users, the latest development snapshots are here:&lt;br /&gt;
* Maven artifacts snapshots, nightly build: '''https://oss.sonatype.org/content/repositories/snapshots'''&lt;br /&gt;
* Latest snapshot of the distributions: '''[https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done?job=verification:jdk11 Nightly build on GitLab]'''&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-left:5px; padding:10px; padding-top:0; background:#ffe8bf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Developers==&lt;br /&gt;
{| style=&amp;quot;background:#ffe8bf;&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot; padding=&amp;quot;0&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/ Project Home]''' at Bitbucket&lt;br /&gt;
* '''[http://okapiframework.org/devguide/index.html Developer's Guide]'''&lt;br /&gt;
* '''[http://okapiframework.org/javadoc/index.html API Documentation]'''&lt;br /&gt;
* '''[http://groups.google.com/group/okapi-devel/ Developers Group and Mailing List]''' &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/src Git repository]''' of the source code&lt;br /&gt;
* '''[[About the Log4j2 vulnerabilities]]'''&lt;br /&gt;
|&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/okapi/pipelines Continuous Build on GitLab]'''&lt;br /&gt;
* List of the '''[https://bitbucket.org/okapiframework/okapi/src/dev/ Latest Code Changes]''' (dev branch)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/wiki/Consultancy Consultancy resources]'''&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bugs and Requests]''' tracking list&lt;br /&gt;
* Notifications: '''[https://bitbucket.org/okapiframework/okapi/commits/all Commits]''' and '''[http://groups.google.com/group/okapi-builds CB Errors]'''&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=962</id>
		<title>Main Page</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Main_Page&amp;diff=962"/>
		<updated>2023-02-09T16:44:35Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Downloads &amp;amp;mdash; Stable Release (master branch) */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; padding:5px; padding-left:10px; background:#ffffff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
The Okapi Framework is a '''free''', '''open-source''' and '''cross-platform''' set of components and applications designed to help your '''localization''' and '''translation''' processes.&amp;lt;br&amp;gt;&lt;br /&gt;
The project uses and promotes '''[[Open Standards|open standards]]''' such as [[XLIFF]], [[TMX]], [[SRX]], [[ITS]] and other formats. It is an [[Open_Standards#OAXAL_1.0_Conformance_Statement|OAXAL 1.0 implementation]].&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
__NOTOC__&lt;br /&gt;
{| border=&amp;quot;0&amp;quot; cellspacing=&amp;quot;0&amp;quot; cellpadding=&amp;quot;0&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
|colspan=&amp;quot;2&amp;quot;|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom: 10px; margin-top: 0px; padding:5px; padding-left:10px; background-color: yellow; color: red !important; font-weight: bold; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
[[About the Log4j2 vulnerabilities]]&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-right:5px; padding:10px; padding-top:0; background:#ffffdf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Users Help and Documentation==&lt;br /&gt;
* '''[[Getting Started]]''' &amp;amp;mdash; all you need to know to get started using the Okapi tools&lt;br /&gt;
* '''[[FAQ]]''' &amp;amp;mdash; the answers to some of the most asked questions&lt;br /&gt;
* '''[[Online Help]]''' &amp;amp;mdash; the applications online help pages&lt;br /&gt;
* '''[[Knowledge Base]]''' &amp;amp;mdash; various articles on how to work with the Okapi tools&lt;br /&gt;
* '''[[Tutorials]]''' &amp;amp;mdash; step-by-step tutorials on using the Okapi tools&lt;br /&gt;
* '''[[Glossary]]''' &amp;amp;mdash; list of the main terms used in the help and documentation&lt;br /&gt;
* '''[[Screenshots]]''' &amp;amp;mdash; of the applications on different platforms3&lt;br /&gt;
&lt;br /&gt;
Additional support:&lt;br /&gt;
* '''[https://groups.google.com/forum/#!forum/okapi-users Users Group]''' &amp;amp;mdash; users group and mailing list (or '''[http://twitter.com/OkapiFramework follow us on Twitter]''')&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bug Report]''' &amp;amp;mdash; list of bug reports (Login required to post one)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Feature Request]''' &amp;amp;mdash; list of feature requests (Login required to post one)&lt;br /&gt;
* '''[mailto:okapitools@opentag.com&amp;amp;subject=Feedback Send Feedback]''' &amp;amp;mdash; email some feedback (no login required)&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;padding:10px; margin-right:5px; padding-top:0; background:#f9f9ff; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Applications==&lt;br /&gt;
* '''[[Rainbow]]''' &amp;amp;mdash; a toolbox to launch a large variety of localization tasks&lt;br /&gt;
* '''[[CheckMate]]''' &amp;amp;mdash; an application to perform quality checks on bilingual files&lt;br /&gt;
* '''[[Tikal]]''' &amp;amp;mdash; a command-line tool for basic localization tasks&lt;br /&gt;
* '''[[Ratel]]''' &amp;amp;mdash; a WYSIWYG editor to create, test and maintain SRX segmentation rules&lt;br /&gt;
* '''[[Longhorn]]''' &amp;amp;mdash; a batch processing server&lt;br /&gt;
* '''[http://github.com/vistatec/ocelot/blob/master/README.md Ocelot]''' &amp;amp;mdash; a XLIFF translator/reviewer workbench&lt;br /&gt;
&lt;br /&gt;
==Components==&lt;br /&gt;
* '''[[Filters]]''' &amp;amp;mdash; extract translatable text from various file formats&lt;br /&gt;
* '''[[Steps|Pipelines and Steps]]''' &amp;amp;mdash; perform a chain of user-defined tasks on input documents&lt;br /&gt;
* '''[[Connectors|Translation Resource Connectors]]''' &amp;amp;mdash; provide access to MT, TM, term-bases, etc.&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-bottom:10px; margin-left:5px; padding:10px; padding-top:0; background:#dfffe8; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
==Downloads &amp;amp;mdash; Stable Release ([https://bitbucket.org/okapiframework/okapi/src/master master branch])==&lt;br /&gt;
The latest stable release of Rainbow, CheckMate, Tikal, Ratel, etc. ('''1.45.0 - Aug 28, 2022'''):&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_win32-x86_1.44.0.zip Windows 32-bit]''' or '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_win32-x86_64_1.44.0.zip Windows 64-bit]''' &amp;amp;mdash; Windows platform&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_gtk2-linux-x86_1.44.0.zip Linux 32-bit]''' or '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_gtk2-linux-x86_64_1.44.0.zip Linux 64-bit]''' &amp;amp;mdash; Linux platform (gtk2, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_cocoa-macosx-x86_64_1.44.0.dmg Macintosh Intel 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Intel)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/1.45.0/okapi-apps_cocoa-macosx-aarch64_1.44.0.dmg Macintosh Arm 64-bit]''' &amp;amp;mdash; Macintosh platform (Cocoa, Apple M1 chip)&lt;br /&gt;
* '''[https://okapiframework.org/binaries/longhorn/okapi-longhorn_all-platforms_1.43.0.zip Longhorn]''' &amp;amp;mdash; batch processing server ([[Longhorn|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/omegat-plugin/okapiFiltersForOmegaT-1.11-1.43.0-dist.zip Plugin for OmegaT]''' &amp;amp;mdash; filters plugin for OmegaT ([[Okapi Filters Plugin for OmegaT|More information...)]]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/ocelot Ocelot (v3.0)]''' &amp;amp;mdash; XLIFF reviewer workbench ([http://github.com/vistatec/ocelot/blob/master/README.md More information...)]&lt;br /&gt;
* '''[https://okapiframework.org/binaries/main/changes Changes Log]''' &amp;amp;mdash; list of the changes between versions&lt;br /&gt;
* '''https://okapiframework.org/wiki/index.php/Distributions''' &amp;amp;mdash; all downloads&lt;br /&gt;
* '''https://search.maven.org/search?q=net.sf.okapi''' &amp;amp;mdash; release artifacts on Maven Central&lt;br /&gt;
&lt;br /&gt;
==Downloads &amp;amp;mdash; Development Release ([https://bitbucket.org/okapiframework/okapi/src/dev/ dev branch])==&lt;br /&gt;
For testers and advanced users, the latest development snapshots are here:&lt;br /&gt;
* Maven artifacts snapshots, nightly build: '''https://oss.sonatype.org/content/repositories/snapshots'''&lt;br /&gt;
* Latest snapshot of the distributions: '''[https://gitlab.com/okapiframework/okapi/-/jobs/artifacts/dev/browse/deployment/maven/done?job=verification:jdk11 Nightly build on GitLab]'''&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;div style=&amp;quot;margin-left:5px; padding:10px; padding-top:0; background:#ffe8bf; border:1px solid #ccc;&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Developers==&lt;br /&gt;
{| style=&amp;quot;background:#ffe8bf;&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot; padding=&amp;quot;0&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/ Project Home]''' at Bitbucket&lt;br /&gt;
* '''[http://okapiframework.org/devguide/index.html Developer's Guide]'''&lt;br /&gt;
* '''[http://okapiframework.org/javadoc/index.html API Documentation]'''&lt;br /&gt;
* '''[http://groups.google.com/group/okapi-devel/ Developers Group and Mailing List]''' &amp;amp;nbsp;&amp;amp;nbsp;&amp;amp;nbsp;&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/src Git repository]''' of the source code&lt;br /&gt;
* '''[[About the Log4j2 vulnerabilities]]'''&lt;br /&gt;
|&lt;br /&gt;
* '''[https://gitlab.com/okapiframework/okapi/pipelines Continuous Build on GitLab]'''&lt;br /&gt;
* List of the '''[https://bitbucket.org/okapiframework/okapi/src/dev/ Latest Code Changes]''' (dev branch)&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/wiki/Consultancy Consultancy resources]'''&lt;br /&gt;
* '''[https://bitbucket.org/okapiframework/okapi/issues Bugs and Requests]''' tracking list&lt;br /&gt;
* Notifications: '''[https://bitbucket.org/okapiframework/okapi/commits/all Commits]''' and '''[http://groups.google.com/group/okapi-builds CB Errors]'''&lt;br /&gt;
|}&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Archive_Filter&amp;diff=961</id>
		<title>Archive Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Archive_Filter&amp;diff=961"/>
		<updated>2023-02-01T16:26:52Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Limitations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
This filter allows you to process different type of files stored in a ZIP archive file (e.g. &amp;lt;code&amp;gt;.zip&amp;lt;/code&amp;gt; or &amp;lt;code&amp;gt;.jar&amp;lt;/code&amp;gt; files).&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
This filter takes as input an archive document, for example a ZIP file, and extract from it any file that match the name or pattern specified in the &amp;lt;cite&amp;gt;File name&amp;lt;/cite&amp;gt; field, using the filter specified by the filter configuration identifier specified in the &amp;lt;cite&amp;gt;Filter configuration id&amp;lt;/cite&amp;gt; field.&lt;br /&gt;
&lt;br /&gt;
For example if you specify:&lt;br /&gt;
&lt;br /&gt;
* File name: &amp;lt;code&amp;gt;Res.properties&amp;lt;/code&amp;gt;&lt;br /&gt;
* Filter configuration id: &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
All the files named &amp;lt;code&amp;gt;Res.properties&amp;lt;/code&amp;gt; (in any directory inside the ZIP file) will be extracted using the [[Properties Filter]] with its default configuration.&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;MIME type of the filter's container format&amp;lt;/cite&amp;gt; &amp;amp;mdash; Enter the MIME type for the format of the ''archive'' file.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;File names&amp;lt;/cite&amp;gt; &amp;amp;mdash; Enter one or more file names. If several file names are listed they must be comma-delimited. You can use file wild-chars (&amp;lt;code&amp;gt;?&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;*&amp;lt;/code&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
You should have as many file names as filter configuration identifiers in the next field.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Filter configuration ids&amp;lt;/cite&amp;gt; &amp;amp;mdash; Enter one or more filter configuration identifier. They must been as many as there are file names, and they must be in the same order.&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
* None known.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Regex_Filter&amp;diff=948</id>
		<title>Regex Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Regex_Filter&amp;diff=948"/>
		<updated>2022-09-09T18:13:38Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The Regex Filter is an Okapi component that implements the IFilter interface for any type of text-based formats where the text can be captured using [[Regular Expressions|regular expressions]]. The filter is implemented in the &amp;lt;code&amp;gt;class net.sf.okapi.filters.regex.RegexFilter&amp;lt;/code&amp;gt; of the library.&lt;br /&gt;
&lt;br /&gt;
The filter can work with any text-based document. You define rules with regular expressions that indicate what part of the document to process. Each rule is associated with an action telling the filter what to do with the different capturing groups of its regular expression.&lt;br /&gt;
&lt;br /&gt;
For example, if you have the following input document:&lt;br /&gt;
&lt;br /&gt;
 [ID1]=Text for ID1&lt;br /&gt;
 [ID2]:Text for ID2&lt;br /&gt;
&lt;br /&gt;
...and a rule with the following regular expression:&lt;br /&gt;
&lt;br /&gt;
 ^\[(.*?)](=|:)(.*?)$&lt;br /&gt;
&lt;br /&gt;
...and that rule is set to the action &amp;lt;cite&amp;gt;Extract the content&amp;lt;/cite&amp;gt; and has the capturing group 3 assigned to the source group and the capturing group 1 assigned to the identifier group.&lt;br /&gt;
&lt;br /&gt;
...then:&lt;br /&gt;
&lt;br /&gt;
* Each line in the input document will match the rule. &lt;br /&gt;
* A new text unit will be created for each match, with its name set to the content of the capturing group 1, and its source text set to the content of the capturing group 3. &lt;br /&gt;
&lt;br /&gt;
 [&amp;lt;span class=&amp;quot;green&amp;quot;&amp;gt;ID1&amp;lt;/span&amp;gt;]=&amp;lt;span class=&amp;quot;hi&amp;quot;&amp;gt;Text for ID1&amp;lt;/span&amp;gt;&lt;br /&gt;
 [&amp;lt;span class=&amp;quot;green&amp;quot;&amp;gt;ID2&amp;lt;/span&amp;gt;]:&amp;lt;span class=&amp;quot;hi&amp;quot;&amp;gt;Text for ID2&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
 ^\[&amp;lt;span class=&amp;quot;green&amp;quot;&amp;gt;(.*?)&amp;lt;/span&amp;gt;](=|:)&amp;lt;span class=&amp;quot;hi&amp;quot;&amp;gt;(.*?)&amp;lt;/span&amp;gt;$&lt;br /&gt;
&lt;br /&gt;
And if you were to represent the parsed information in XLIFF, it would look something like this:&lt;br /&gt;
&lt;br /&gt;
 ...&lt;br /&gt;
 &amp;amp;lt;body&amp;gt;&lt;br /&gt;
  &amp;amp;lt;trans-unit id=&amp;quot;1&amp;quot; resname=&amp;quot;&amp;lt;span class=&amp;quot;green&amp;quot;&amp;gt;ID1&amp;lt;/span&amp;gt;&amp;quot; xml:space=&amp;quot;preserve&amp;quot;&amp;gt;&lt;br /&gt;
   &amp;amp;lt;source xml:lang=&amp;quot;en&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;hi&amp;quot;&amp;gt;Text for ID1&amp;lt;/span&amp;gt;&amp;lt;/source&amp;gt;&lt;br /&gt;
  &amp;amp;lt;/trans-unit&amp;gt;&lt;br /&gt;
  &amp;amp;lt;trans-unit id=&amp;quot;2&amp;quot; resname=&amp;quot;&amp;lt;span class=&amp;quot;green&amp;quot;&amp;gt;ID2&amp;lt;/span&amp;gt;&amp;quot; xml:space=&amp;quot;preserve&amp;quot;&amp;gt;&lt;br /&gt;
   &amp;amp;lt;source xml:lang=&amp;quot;en&amp;quot;&amp;gt;&amp;lt;span class=&amp;quot;hi&amp;quot;&amp;gt;Text for ID2&amp;lt;/span&amp;gt;&amp;lt;/source&amp;gt;&lt;br /&gt;
  &amp;amp;lt;/trans-unit&amp;gt;&lt;br /&gt;
 &amp;amp;lt;/body&amp;gt;&lt;br /&gt;
 ...&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
===Input Encoding===&lt;br /&gt;
&lt;br /&gt;
The filter decides which encoding to use for the input document using the following logic:&lt;br /&gt;
&lt;br /&gt;
* If the file has a Unicode Byte-Order-Mark:&lt;br /&gt;
** Then, the corresponding encoding (e.g. UTF-8, UTF-16, etc.) is used. &lt;br /&gt;
* Otherwise, the input encoding used is the default encoding that was specified when opening the document. &lt;br /&gt;
&lt;br /&gt;
===Output Encoding===&lt;br /&gt;
&lt;br /&gt;
The filter does not recognize any encoding declarations in the document, and therefore cannot update them.&lt;br /&gt;
&lt;br /&gt;
If the output encoding is UTF-8:&lt;br /&gt;
&lt;br /&gt;
* If the input encoding was also UTF-8, a Byte-Order-Mark is used for the output document only if one was detected in the input document. &lt;br /&gt;
* If the input encoding was not UTF-8, no Byte-Order-Mark is used in the output document. &lt;br /&gt;
&lt;br /&gt;
===Line-Breaks===&lt;br /&gt;
&lt;br /&gt;
The type of line-breaks of the output is the same as the one of the original input.&lt;br /&gt;
&lt;br /&gt;
===Parsing===&lt;br /&gt;
&lt;br /&gt;
Here is how an input document is parsed:&lt;br /&gt;
&lt;br /&gt;
# The filter sets the current search position at the top of the document. &lt;br /&gt;
# It searches for the first possible rule that has a match from a current position. &lt;br /&gt;
# It takes the match and applies whatever action is associated with the rule. &lt;br /&gt;
# It moves the current search position at the end of the match. &lt;br /&gt;
# The steps 2, 3, and 4 are repeated until no more matches are found or the search position reaches the end of the document. &lt;br /&gt;
&lt;br /&gt;
===Actions===&lt;br /&gt;
&lt;br /&gt;
Each rule is associated with one of several possible actions. Depending on the action, you can associate different parts of the text that matches the rule with a specific role. This is done with the capturing groups. The source group, the target group, the identifier group and the note group.&lt;br /&gt;
&lt;br /&gt;
A capturing group is a part of the regular expression between parentheses. The capturing group 0 is the whole match, then other capturing groups are numbered by counting their opening parentheses from left to right. For example, in the expression (A)(B(C)) there are three groups:&lt;br /&gt;
&lt;br /&gt;
# (A)&lt;br /&gt;
# (B(C))&lt;br /&gt;
# (C)&lt;br /&gt;
&lt;br /&gt;
The following table summarizes what each action does, and what the different groups it may use:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Action''' || '''Effect''' || '''Source''' || '''Target''' || '''Identifier''' || '''Note'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;cite&amp;gt;Extract the strings in the source group&amp;lt;/cite&amp;gt;&lt;br /&gt;
| Sends a &amp;lt;code&amp;gt;TEXT_UNIT&amp;lt;/code&amp;gt; event for each string found in the source capturing group.&lt;br /&gt;
| Must be defined. It is where the string or strings to extract are taken from.&lt;br /&gt;
| Not used.&lt;br /&gt;
| If defined: It is the name for the first text unit. If there is more than one string to extract, a sequential number (starting at 2) is appended to it, and used as the name of the other text units.&lt;br /&gt;
| If defined: It is the ''note'' property associated to each text unit corresponding to each extracted string.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;cite&amp;gt;Extract the content of the source group&amp;lt;/cite&amp;gt;&lt;br /&gt;
| Sends a single &amp;lt;code&amp;gt;TEXT_UNIT&amp;lt;/code&amp;gt; event based on the different capturing groups.&lt;br /&gt;
| Must be defined. It is the source text of the text unit.&lt;br /&gt;
| If defined: It is the target text of the text unit.&lt;br /&gt;
| If defined: It is the name of the text unit.&lt;br /&gt;
| If defined: It is the note property associated to the text unit.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;cite&amp;gt;Treat the source group as comment&amp;lt;/cite&amp;gt;&lt;br /&gt;
| Process the source capturing group for localization directives (if requested) and leaves the content of the whole expression's match untouched.&lt;br /&gt;
| Must be defined. It is processed for localization directives if that option is set.&lt;br /&gt;
| Not used.&lt;br /&gt;
| Not used.&lt;br /&gt;
| Not used.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;cite&amp;gt;Do not extract&amp;lt;/cite&amp;gt;&lt;br /&gt;
| Leaves the content of the whole expression's match untouched.&lt;br /&gt;
| Not used.&lt;br /&gt;
| Not used.&lt;br /&gt;
| Not used.&lt;br /&gt;
| Not used.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;cite&amp;gt;Start a section&amp;lt;/cite&amp;gt;&lt;br /&gt;
| Sends a &amp;lt;code&amp;gt;START_GROUP&amp;lt;/code&amp;gt; event. If the option &amp;lt;cite&amp;gt;Auto-close previous section when a new one starts&amp;lt;/cite&amp;gt; is set, you '''must not''' define a corresponding end section. If that option is not set, you '''must''' define a rule to close this section.&lt;br /&gt;
| Not used.&lt;br /&gt;
| Not used.&lt;br /&gt;
| If defined: It is the name of the section being opened. A section corresponds to a &amp;lt;code&amp;gt;&amp;amp;lt;group&amp;gt;&amp;lt;/code&amp;gt; in XLIFF.&lt;br /&gt;
| If defined: It is the note property associated to the section being opened.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| &amp;lt;cite&amp;gt;End a section&amp;lt;/cite&amp;gt;&lt;br /&gt;
| Sends an &amp;lt;code&amp;gt;END_GROUP&amp;lt;/code&amp;gt; event. &lt;br /&gt;
| Not used.&lt;br /&gt;
| Not used.&lt;br /&gt;
| Not used.&lt;br /&gt;
| Not used.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
===Rules Tab===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Add&amp;lt;/cite&amp;gt; &amp;amp;mdash; Click this button to add a new rule to the list. This opens the &amp;lt;cite&amp;gt;Edit Rule&amp;lt;/cite&amp;gt; dialog box with the new rule.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Rename&amp;lt;/cite&amp;gt; &amp;amp;mdash; Click this button to rename the rule currently selected. Note that two rules can have the same name, but this is obviously not recommended.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Remove&amp;lt;/cite&amp;gt; &amp;amp;mdash; Click this button to delete the rule currently selected from the list. No confirmation is asked.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Edit&amp;lt;/cite&amp;gt; &amp;amp;mdash; Click this button to edit the rule currently selected. This opens the &amp;lt;cite&amp;gt;Edit Rule&amp;lt;/cite&amp;gt; dialog box.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Move Up&amp;lt;/cite&amp;gt; &amp;amp;mdash; Click this button to move the rule currently selected up in the list. Rules are evaluated in the order of the list.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Move Down&amp;lt;/cite&amp;gt; &amp;amp;mdash; Click this button to move the rule currently selected down in the list. Rules are evaluated in the order of the list.&lt;br /&gt;
&lt;br /&gt;
====Rule properties====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Preserve white spaces&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to preserve all white spaces of the extracted text. If this option is not set the extracted content is unwrapped: That is any sequence of consecutive white spaces is replaced by a single space character, and any white space character at the start or the end of the content is trimmed out. White spaces here are: spaces, tabs, carriage returns, and line-feeds.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Has inline codes&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to enable the conversion of some part of the extracted text into inline codes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Edit Inline Codes Patterns&amp;lt;/cite&amp;gt; &amp;amp;mdash; Click this button to open the &amp;lt;cite&amp;gt;Inline Codes Patterns&amp;lt;/cite&amp;gt; dialog box where you can define rules for converting parts of text into inline codes.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Auto-close previous section when a new one starts&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to automatically close any opened section when a new one is starting. Section are defined with the &amp;lt;cite&amp;gt;Start a section&amp;lt;/cite&amp;gt; action. This option allows you to define only start of sections. If this option is not set, each &amp;lt;cite&amp;gt;Start a section&amp;lt;/cite&amp;gt; action must have a corresponding &amp;lt;cite&amp;gt;End a section&amp;lt;/cite&amp;gt; action.&lt;br /&gt;
&lt;br /&gt;
====Regular expressions options====&lt;br /&gt;
&lt;br /&gt;
This set of options are used for all rules defined in the list. If you need to overwrite an option for a given rule, use the &amp;lt;code&amp;gt;(?idmsux-idmsux)&amp;lt;/code&amp;gt; construct in the pattern for that rule.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Dot also matches line-feed&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to enable the dot operator to match line-feeds.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Multi-line&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option so the expressions &amp;lt;code&amp;gt;^&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;$&amp;lt;/code&amp;gt; match just after or just before, respectively, a line terminator or the end of the input sequence. If this option is not set these expressions only match at the beginning and the end of the entire input sequence.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Ignore case differences&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to ignore differences between letter cases. If this option is set &amp;quot;&amp;lt;code&amp;gt;abc&amp;lt;/code&amp;gt;&amp;quot; is seen as identical as &amp;quot;&amp;lt;code&amp;gt;Abc&amp;lt;/code&amp;gt;&amp;quot;. If this option is not set, both strings are seen as different.&lt;br /&gt;
&lt;br /&gt;
===Options Tab===&lt;br /&gt;
&lt;br /&gt;
====Localization directives====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Use localization directives when they are present&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to enable the filter to recognize localization directives. If this option is not set, any localization directive in the input file will be ignored.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract items outside the scope of localization directives&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract any translatable item that is not within the scope of a localization directive. Selecting to extract or not outside localization directives allows you to mark up fewer parts of the source document. This option is enabled only when the Use localization directives when they are present option is set.&lt;br /&gt;
&lt;br /&gt;
====Strings====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Beginning of string&amp;lt;/cite&amp;gt; &amp;amp;mdash; Enter the character specifying the start of a string. Entering several characters defines several ways to start a string.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;End of string&amp;lt;/cite&amp;gt; &amp;amp;mdash; Enter the character specifying the end of a string. If you have defined several beginning characters, you must defined an equal number of end characters, and the position of each end character must correspond to the position of its corresponding beginning character.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Escaped characters use back-slash prefix&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option if the way to escape a character is to have a back-slash prefix (e.g. &amp;lt;code&amp;gt;\&amp;quot;&amp;lt;/code&amp;gt;). &lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Escaped characters are doubled&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option if the way to escape a character is to double it (e.g. &amp;lt;code&amp;gt;&amp;quot;&amp;quot;&amp;lt;/code&amp;gt;).&lt;br /&gt;
&lt;br /&gt;
====Content type====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;MIME type of the document&amp;lt;/cite&amp;gt; &amp;amp;mdash; Enter the MIME type value to use when extracting content with this parameters. The value is used to identify the type of document. It may also change the way the text is written back into the original format. Most of the time &amp;lt;code&amp;gt;text/plain&amp;lt;/code&amp;gt; should be fine.&lt;br /&gt;
&lt;br /&gt;
====FPRM Only Parameters (not accessible in UI)====&lt;br /&gt;
&amp;lt;cite&amp;gt;METADATA Rules&amp;lt;/cite&amp;gt; &amp;amp;mdash; Rule name and matched regex is added as metadata to the TextUnit.&lt;br /&gt;
====Example:====&lt;br /&gt;
&amp;lt;code&amp;gt; &lt;br /&gt;
metaRuleCount.i=2&amp;lt;/br&amp;gt;&lt;br /&gt;
metaRule0.ruleName=meta1&amp;lt;/br&amp;gt;&lt;br /&gt;
metaRule0.expr=(\d\d:\d\d:\d\d)&amp;lt;/br&amp;gt;&lt;br /&gt;
metaRule1.ruleName=meta2&amp;lt;/br&amp;gt;&lt;br /&gt;
metaRule1.expr=(\d\d:\d\d:\d\d)&amp;lt;/br&amp;gt;&lt;br /&gt;
&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
* The whole document is loaded in memory to apply the regular expressions. This may cause issues with very large documents.&lt;br /&gt;
* The option &amp;lt;cite&amp;gt;Extract strings outside the rules&amp;lt;/cite&amp;gt; is not yet implemented.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Properties_Filter&amp;diff=947</id>
		<title>Properties Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Properties_Filter&amp;diff=947"/>
		<updated>2022-09-09T18:01:25Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Key filtering */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The Properties Filter is an Okapi component that implements the IFilter interface for properties files. The filter is implemented in the class net.sf.okapi.filters.properties.PropertiesFilter of the Okapi library.&lt;br /&gt;
&lt;br /&gt;
The implementation is based on the specification found in the Java java.util.Properties class documentation. Support for a few additional features is also provided for compatibility with other type of properties files.&lt;br /&gt;
&lt;br /&gt;
The following is an example of a very simple properties file. The translatable text is highlighted:&lt;br /&gt;
&lt;br /&gt;
 # Example of Java properties&lt;br /&gt;
 &lt;br /&gt;
 labelOK= &amp;lt;span class='hi'&amp;gt;OK&amp;lt;/span&amp;gt;&lt;br /&gt;
 msgBadFile: &amp;lt;span class='hi'&amp;gt;Invalid input file&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Note that Java properties can also be represented in XML. To process such document, use the [[XML Filter]] or the [[XML Stream Filter]].&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
===Input Encoding===&lt;br /&gt;
&lt;br /&gt;
The filter decides which encoding to use for the input file using the following logic:&lt;br /&gt;
&lt;br /&gt;
* If the file has a Unicode Byte-Order-Mark:&lt;br /&gt;
** Then, the corresponding encoding (e.g. UTF-8, UTF-16, etc.) is used. &lt;br /&gt;
* Otherwise, the input encoding used is the default encoding that was specified when setting the filter options. &lt;br /&gt;
&lt;br /&gt;
===Output Encoding===&lt;br /&gt;
&lt;br /&gt;
If the output encoding is UTF-8:&lt;br /&gt;
&lt;br /&gt;
* If the input encoding was also UTF-8, a Byte-Order-Mark is used for the output document only if one was detected in the input document. &lt;br /&gt;
* If the input encoding was not UTF-8, no Byte-Order-Mark is used in the output document.&lt;br /&gt;
&lt;br /&gt;
===Line-Breaks===&lt;br /&gt;
&lt;br /&gt;
The type of line-breaks of the output is the same as the one of the original input.&lt;br /&gt;
&lt;br /&gt;
===Mapping===&lt;br /&gt;
&lt;br /&gt;
Each entry of the property file is mapped to a text unit resource as follow:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;4&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
! Properties&lt;br /&gt;
! Okapi Resources&lt;br /&gt;
|-&lt;br /&gt;
|The key of the entry&lt;br /&gt;
|The name of the text unit &lt;br /&gt;
|-&lt;br /&gt;
|The text of the entry&lt;br /&gt;
|The source content of the text unit &lt;br /&gt;
|-&lt;br /&gt;
|Comments (before the entry)&lt;br /&gt;
|The note property of the text unit (if the option is set)&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
===Options Tab===&lt;br /&gt;
&lt;br /&gt;
====Localization directives====&lt;br /&gt;
&lt;br /&gt;
Localization directives are special comments you can use to override the default behavior of the filter regarding the parts to extract. The syntax and behavior of the directives are the same across all Okapi filters.Note that the directives override key conditions.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Use localization directives when they are present&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to enable the filter to recognize localization directives. If this option is not set, any localization directive in the input file will be ignored.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract items outside the scope of localization directives&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract any translatable item that is not within the scope of a localization directive. Selecting to extract or not outside localization directives allows you to mark up fewer parts of the source document. This option is enabled only when the &amp;lt;cite&amp;gt;Use localization directives when they are present&amp;lt;/cite&amp;gt; option is set.&lt;br /&gt;
&lt;br /&gt;
====Key filtering====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Use the following key condition:&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract items based on their keys. You specify a regular expression pattern, if the key matches the pattern, the item is extracted or not depending on the action you specify. Note that directives have precedence over key condition.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract only the items with a key matching the given expression&amp;lt;/cite&amp;gt; &amp;amp;mdsk; Select this option to extract only the items with keys that match the specified pattern.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Do not extract the items with a key matching the given expression&amp;lt;/cite&amp;gt; &amp;amp;mdash; Select this option to not extract the items with keys that match the specified pattern.&lt;br /&gt;
&lt;br /&gt;
Enter the pattern to test against the key. The pattern must be a valid regular expression. For example, with the following settings:&lt;br /&gt;
&lt;br /&gt;
* Use the following key condition = set&lt;br /&gt;
* Extract only the items with a key matching the specified pattern = set&lt;br /&gt;
* Pattern = .*text.*&lt;br /&gt;
 &lt;br /&gt;
The extracted text is highlighted:&lt;br /&gt;
&lt;br /&gt;
 key1 = Text for key1&lt;br /&gt;
 text.err1 = &amp;lt;span class='hi'&amp;gt;Text for text.err1&amp;lt;/span&amp;gt;&lt;br /&gt;
 menu_text_file = &amp;lt;span class='hi'&amp;gt;Text for menu_text_file&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Configuration identifier of the sub-filter to use on the content&amp;lt;/cite&amp;gt; &amp;amp;mdash; Enter the filter configuration identifier of the sub-filter to use on the extracted content. For exmple: &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt;. You should leave the entry empty if no sub-filter is to be used. For sub-filter you  can use the [[HTML Filter]], the [[XML Stream Filter]] or other filters derived from the &amp;lt;code&amp;gt;AbstractMarkupFilter&amp;lt;/code&amp;gt; class.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Recognize additional comment markers&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to take into account other comment styles in addition to the strict Java comments (single-line starting with '#' or '!'). When this option is set, the filter also recognizes comments single-lines starting with ';', as well as single-line where &amp;quot;//&amp;quot; is the first no-whitespace sequence. Note that // after a = are considered part of the value of the entry, not a comment.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract comments to note properties&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to include the comments before each entry as a note property on the text unit of the corresponding entry. All comments lines are grouped into a single note.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Convert \n and \t to line-break and tab&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to convert the escaped codes \n and \t to true line-breaks and tabs. All the other escaped characters remain escaped.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Use the key value for the text-unit ID value&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to use the keys of the extracted entries as ID for the text-units. Note that the key is already extracted into the name property (&amp;lt;code&amp;gt;resname&amp;lt;/code&amp;gt; in XLIFF). Note also that the value of a property key may not be a valid string for some output format, for example in XLIFF2 the unit ID must be a NMTOKEN value. This option is available starting in M31.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Use Java Properties Escaping Conventions&amp;lt;/cite&amp;gt; &amp;amp;mdash; Extracted text retains Java Property escapes (default false)&lt;br /&gt;
&lt;br /&gt;
===Inline Codes Tab===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Has inline codes as defined below:&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to use the specified regular expressions on the text of the extracted items. Any match will be converted to an inline code. By default the expression is:&lt;br /&gt;
&lt;br /&gt;
 ((%(([-0+#]?)[-0+#]?)((\d\$)?)(([\d\*]*)(\.[\d\*]*)?)[dioxXucsfeEgGpn])&lt;br /&gt;
 |((\\r\\n)|\\a|\\b|\\f|\\n|\\r|\\t|\\v)&lt;br /&gt;
 |(\{\d.*?\}))&lt;br /&gt;
&lt;br /&gt;
{{CodeFinder Help}}&lt;br /&gt;
&lt;br /&gt;
===Output Tab===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Escape all extended characters&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to convert all characters above U+007F into Unicode escape sequences (\uHHHH). When this option is not set, only the characters not supported by the output encoding are escaped.&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
None known.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=OpenOffice_Filter&amp;diff=946</id>
		<title>OpenOffice Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=OpenOffice_Filter&amp;diff=946"/>
		<updated>2022-09-09T17:49:18Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The OpenOffice Filter is an Okapi component that implements the IFilter interface for OpenOffice.org documents: ODT (text), ODS (spreadsheet), ODP (slides), ODG (graphics), and their corresponding template formats.&lt;br /&gt;
&lt;br /&gt;
These documents use the [http://en.wikipedia.org/wiki/OpenDocument OpenDocument format (ODF)]. If you need to process directly an XML ODF file, you can use the ODF Filter that the OpenOffice Filter uses internally.&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
===Encodings===&lt;br /&gt;
&lt;br /&gt;
The input encoding is automatically detected.&lt;br /&gt;
&lt;br /&gt;
Any user-specified output encoding is ignored by these filters. They always use UTF-8.&lt;br /&gt;
&lt;br /&gt;
===Line-Breaks===&lt;br /&gt;
&lt;br /&gt;
The type of line-breaks of the output is always set to a simple linefeed (LF).&lt;br /&gt;
&lt;br /&gt;
===Sub-Documents===&lt;br /&gt;
&lt;br /&gt;
An OpenOffice documents is a ZIP file with several documents inside. The main one (&amp;lt;code&amp;gt;content.xml&amp;lt;/code&amp;gt;) contains the body of the data. But other files may also contain translatable text: &amp;lt;code&amp;gt;meta.xml&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;style.xml&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
All the different embedded files are treated as sub-documents by the filter. This means that, for example, when represented in XLIFF, a single ODT extracted to a single XLIFF document is made up three XLIFF &amp;lt;code&amp;gt;&amp;amp;lt;file&amp;gt;&amp;lt;/code&amp;gt; elements: One for &amp;lt;code&amp;gt;content.xml&amp;lt;/code&amp;gt;, one for &amp;lt;code&amp;gt;style.xml&amp;lt;/code&amp;gt;, and one for &amp;lt;code&amp;gt;meta.xml&amp;lt;/code&amp;gt;. Note that very often, only &amp;lt;code&amp;gt;content.xml&amp;lt;/code&amp;gt; has extracted text.&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract notes&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the content of &amp;lt;code&amp;gt;&amp;amp;lt;office:annotation&amp;gt;&amp;lt;/code&amp;gt; elements (notes) as translatable text. If this option is not set, notes are not extracted.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract references&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the content of &amp;lt;code&amp;gt;&amp;amp;lt;text:bookmark-ref&amp;gt;&amp;lt;/code&amp;gt; elements. the content of these element is only a copy of the content of the referent. It is updated automatically within OpenOffice, so any translation done for these content will be automatically overwritten as soon as the document is updated. However, in some cases it may be useful to be able to have the referenced text as part of the segment where it is inserted.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract Metadata&amp;lt;/cite&amp;gt; &amp;amp;mdash; Extract metadata in meta.xml (default true)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Encode Character Entity Reference Glyphs&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default true)&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
* Some deleted text may get extracted. Make sure you have accepted or rejected the revisions changes before processing the input document, as currently some text marked as deleted is still extracted.&lt;br /&gt;
* The options to extract or not the notes and the references is not working yet. They wll be extracted regardless of the option settings.&lt;br /&gt;
* Sequential tabs may get reduced to a single tab during an extraction and merge round trip: The elements for spaces and tabs are supported in output but still incorrectly handled on input.&lt;br /&gt;
* The target (output) encoding must be set to UTF-8 when extracting the documents to merge them back properly.&lt;br /&gt;
&lt;br /&gt;
Please, report any other issues to the [http://code.google.com/p/okapi/issues/list Issues List of the project].&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=JSON_Filter&amp;diff=945</id>
		<title>JSON Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=JSON_Filter&amp;diff=945"/>
		<updated>2022-09-09T17:38:23Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* New Extraction Rules &amp;gt;= version M39 */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The JSON Filter is an Okapi component that implements the IFilter interface for JSON (Javascript Object Notation).&lt;br /&gt;
&lt;br /&gt;
The implementation is based on the JSON specifications: http://www.json.org/&lt;br /&gt;
&lt;br /&gt;
The following is an example of a very simple JSON file. The translatable text is highlighted:&lt;br /&gt;
&lt;br /&gt;
 {&amp;quot;menu&amp;quot;: {&lt;br /&gt;
   &amp;quot;value&amp;quot;: &amp;quot;&amp;lt;span class=&amp;quot;hi&amp;quot;&amp;gt;File&amp;lt;/span&amp;gt;&amp;quot;,&lt;br /&gt;
    &amp;quot;popup&amp;quot;: {&lt;br /&gt;
       &amp;quot;menuitem&amp;quot;: [&lt;br /&gt;
          {&amp;quot;value&amp;quot;: &amp;quot;&amp;lt;span class=&amp;quot;hi&amp;quot;&amp;gt;New&amp;lt;/span&amp;gt;&amp;quot;},&lt;br /&gt;
          {&amp;quot;value&amp;quot;: &amp;quot;&amp;lt;span class=&amp;quot;hi&amp;quot;&amp;gt;Open&amp;lt;/span&amp;gt;&amp;quot;},&lt;br /&gt;
          {&amp;quot;value&amp;quot;: &amp;quot;&amp;lt;span class=&amp;quot;hi&amp;quot;&amp;gt;Close&amp;lt;/span&amp;gt;&amp;quot;}&lt;br /&gt;
       ]&lt;br /&gt;
    }&lt;br /&gt;
 }}&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
===Input Encoding===&lt;br /&gt;
&lt;br /&gt;
JSON files are normally in one of the Unicode encoding, but the filter supports any encoding. It decides which encoding to use for the input file using the following logic:&lt;br /&gt;
&lt;br /&gt;
* If the file has a Unicode Byte-Order-Mark:&lt;br /&gt;
** Then, the corresponding encoding (e.g. UTF-8, UTF-16, etc.) is used.&lt;br /&gt;
* Else, if a header entry with a &amp;lt;code&amp;gt;charset&amp;lt;/code&amp;gt; declaration exists in the first 1000 characters of the file:&lt;br /&gt;
** If the value of the &amp;lt;code&amp;gt;charset&amp;lt;/code&amp;gt; is &amp;quot;&amp;lt;code&amp;gt;charset&amp;lt;/code&amp;gt;&amp;quot; (case insensitive):&lt;br /&gt;
*** Then the file is likely to be a template with no encoding declared, so the current encoding (auto-detected or default) is used.&lt;br /&gt;
*** Else, the declared encoding is used. Note that if the encoding has been detected from a Byte-Order-Mark and the encoding declared in the header entry does not match, a warning is generated and the encoding of the Byte-Order-Mark is used.&lt;br /&gt;
* Otherwise, the input encoding used is the default encoding that was specified when setting the filter options.&lt;br /&gt;
&lt;br /&gt;
===Output Encoding===&lt;br /&gt;
&lt;br /&gt;
If the output encoding is UTF-8:&lt;br /&gt;
&lt;br /&gt;
* If the input encoding was also UTF-8, a Byte-Order-Mark is used for the output document only if one was detected in the input document.&lt;br /&gt;
* If the input encoding was not UTF-8, no Byte-Order-Mark is used in the output document.&lt;br /&gt;
&lt;br /&gt;
===Line-Breaks===&lt;br /&gt;
&lt;br /&gt;
The type of line-breaks of the output is the same as the one of the original input.&lt;br /&gt;
&lt;br /&gt;
===Comments===&lt;br /&gt;
&lt;br /&gt;
Though not technically legal in JSON these comment types are supported:&lt;br /&gt;
&amp;lt;code&amp;gt;&lt;br /&gt;
* // comment&lt;br /&gt;
* # comment&lt;br /&gt;
* /* comment */&lt;br /&gt;
* &amp;amp;lt;!-- comment --&amp;gt;&lt;br /&gt;
&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
=== Options Tab===&lt;br /&gt;
&lt;br /&gt;
====Stand-alone strings====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract strings without associated key&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract string that are not associated directly to a key value.&lt;br /&gt;
&lt;br /&gt;
====Strings with keys====&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract all key/strings pairs&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract all strings that have a key associated. If a regular expression for exceptions is defined, the strings that have a key matching the expression are not extracted.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Do not extract key/string pairs&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set the option to not extract any string that has an associated key. If a regular expression for exceptions is defined, the strings that have a key matching the expression are extracted.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Excepted when the key matches the following regular expression&amp;lt;/cite&amp;gt; &amp;amp;mdash; Enter a regular expression that correspond to the keys that should have a behavior inverse to the default behavior you have selected for the key/strings pairs.&lt;br /&gt;
For example, you could exclude a key-value with &amp;lt;code&amp;gt;key&amp;lt;/code&amp;gt;.&lt;br /&gt;
In combination with &amp;lt;code&amp;gt;Use the full key path&amp;lt;/code&amp;gt; you can exclude all nested elements in a JSON structure with &amp;lt;code&amp;gt;^.*?/excludedStructure/.*&amp;lt;/code&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Use the key as the resname&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to use the value of the key as the value of the name of the extracted item (&amp;lt;code&amp;gt;resname&amp;lt;/code&amp;gt; in XLIFF).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Use the full key path&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to use the full key path in the &amp;lt;code&amp;gt;resname&amp;lt;/code&amp;gt;. For example: &amp;lt;code&amp;gt;/menu/value/popup/menuitem/value&amp;lt;/code&amp;gt;. The use key name as resname option must be set for this option to take effect. If enabled, exception regular expressions apply to the full path.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Include leading &amp;quot;/&amp;quot; on key path&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to have a leading character '/' in the full key path.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Regex matching keys that are notes, values of which to appear as &amp;lt;note&amp;gt; in XLIFF&amp;lt;/cite&amp;gt; &amp;amp;mdash; Specify regular expression. The values of the matching keys will be transferred to &amp;amp;lt;note&amp;gt; elements in XLIFF.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Regex matching keys who's values are added as TextUnit Metadata&amp;lt;/cite&amp;gt; &amp;amp;mdash; Specify regular expression. The values of the matching keys will be written out as &amp;amp;lt;context-group&amp;gt; elements in XLIFF.&lt;br /&gt;
&lt;br /&gt;
===New Extraction Rules &amp;gt;= version M39===&lt;br /&gt;
&amp;lt;b&amp;gt;If specified these will override the corresponding rules above.&amp;lt;/b&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Regex matching keys who's values are extracted (overrides extraction exceptions)&amp;lt;/cite&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Regex matching keys that are notes, values of which to appear as &amp;lt;note&amp;gt; in XLIFF&amp;lt;/cite&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Regex matching keys which are ID's (resname in XLIFF), overrides &amp;quot;use key as resname&amp;quot;&amp;lt;/cite&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Regex matching keys who's values are added as TextUnit Metadata&amp;lt;/cite&amp;gt;&lt;br /&gt;
&lt;br /&gt;
====Example FPRM Settings:====&lt;br /&gt;
Regex rules apply to key names.&lt;br /&gt;
&lt;br /&gt;
'''&lt;br /&gt;
&amp;lt;b&amp;gt;extraction rules (use instead of rule exceptions):&amp;lt;/b&amp;gt;&lt;br /&gt;
extractionRules=/widgets/body.*&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;note rules (add values to TextUnits as notes):&amp;lt;/b&amp;gt;&lt;br /&gt;
noteRules=/widgets/name.*&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;id rules (overrides useKeyAsName):&amp;lt;/b&amp;gt;&lt;br /&gt;
idRules=/widgets/id.*&lt;br /&gt;
&lt;br /&gt;
&amp;lt;b&amp;gt;generic metadata (matched key:values are added as metadata to TextUnit):&amp;lt;/b&amp;gt;&lt;br /&gt;
genericMetaRules=/widgets/image.*'''&lt;br /&gt;
&lt;br /&gt;
===Content Processing Tab===&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Process text content with this sub-filter&amp;lt;/cite&amp;gt; &amp;amp;mdash; Specify an Okapi filter ID (e.g. &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt;) to process the content of all translatable text with that filter. Leave this field blank for default behavior.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Find inline codes by patterns defined below&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to use the specified regular expressions on the text of the extracted items. Any match will be converted to an inline code.&lt;br /&gt;
&lt;br /&gt;
'''Note:''' This option cannot be used together with the sub-filtering option.&lt;br /&gt;
&lt;br /&gt;
By default the expression is:&lt;br /&gt;
&lt;br /&gt;
 ((%(([-0+#]?)[-0+#]?)((\d\$)?)(([\d\*]*)(\.[\d\*]*)?)[dioxXucsfeEgGpn])&lt;br /&gt;
 |((\\r\\n)|\\a|\\b|\\f|\\n|\\r|\\t|\\v)&lt;br /&gt;
 |(\{\d.*?\}))&lt;br /&gt;
&lt;br /&gt;
{{CodeFinder Help}}&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
Comments within a JSON string are parsed as part of the string content, not as comments. A configured subfilter will then process these as true comments (they will become part of the skeleton or whatever the filter is configured to do).&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=ICML_Filter&amp;diff=944</id>
		<title>ICML Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=ICML_Filter&amp;diff=944"/>
		<updated>2022-09-09T17:04:19Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
This filter allows you to process WCML documents. ICML (InCopy Markup Language) is an XML-based format.&lt;br /&gt;
&lt;br /&gt;
This filter is very similar to the [[IDML Filter]]&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
TODO&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract notes&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the content of notes (&amp;lt;code&amp;gt;&amp;amp;lt;Note&amp;gt;&amp;lt;/code&amp;gt; elements).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract master spreads&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the content of the master spreads if they exist. If this option is not set only the normal spreads are extracted.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Simplify inline codes when possible&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to reduce the number of inline codes by re-grouping adjacent codes when it is possible.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Maximum spread size&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set the maximum size for the spread files (in KBytes). Any spread file above the given value will either generate an error or will be skipped from extraction depending on the specified option. This allows you to skip over large spread files that may contain only graphics and require too much memory to be opened. Note that the skipped file are not checked for translatable text.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;newTuOnBr&amp;lt;/cite&amp;gt; &amp;amp;mdash; Should a Br (break) tag create a new segment? (default false)&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
TODO&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=IDML_Filter&amp;diff=943</id>
		<title>IDML Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=IDML_Filter&amp;diff=943"/>
		<updated>2022-09-09T16:55:18Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
This filter allows you to process IDML documents. IDML (InDesign Markup Language) is an XML-based format, introduced in Adobe InDesign CS4, for representing InDesign content. IDML is used in several InDesign and InCopy file types. The specification can be found [http://www.adobe.com/content/dam/Adobe/en/devnet/indesign/cs5_docs/idml/idml-specification.pdf on the Adobe Web site].&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
When processing an IDML filter, the filter looks at all the spreads in the document, and for each of them, gather the list of the stories used in &amp;lt;code&amp;gt;&amp;amp;lt;TextFrame&amp;gt;&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;&amp;amp;lt;TextPath&amp;gt;&amp;lt;/code&amp;gt;. The text is extracted by spread, and for each spread by story in the order the appear in the spread.&lt;br /&gt;
&lt;br /&gt;
Stories embedded inside other stories and not declared at a spread level are extracted in a special group.&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Untag XML Structures&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to skip embedded XML structural information when extracting translatable content.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract notes&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the content of notes (&amp;lt;code&amp;gt;&amp;amp;lt;Note&amp;gt;&amp;lt;/code&amp;gt; elements).&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract master spreads&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract the content of the master spreads if they exist. If this option is not set only the normal spreads are extracted.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Extract hidden layers&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to extract also the hidden layers.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;maxAttributeSize&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set the size in MB for the attribute buffer. Default is 4MB (4 * 1024 * 1024)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;extractHiddenPasteboardItems&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default is false)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;skipDiscretionaryHyphens&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default is false)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;extractBreaksInline&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default is false)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;extractCustomTextVariables&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default is false)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;extractIndexTopics&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default is false)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;ignoreCharacterKerning&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default is false)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;ignoreCharacterTracking&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default is false)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;ignoreCharacterLeading&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default is false)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;ignoreCharacterBaselineShift&amp;lt;/cite&amp;gt; &amp;amp;mdash; (default is false)&lt;br /&gt;
&lt;br /&gt;
==Deprecated Parameters==&lt;br /&gt;
&lt;br /&gt;
Prior to release M34, the filter supported several additional parameters.  The behavior of these has been subsumed by the more intelligent content processing performed by the updated version of the filter in versions M34 and later.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Simplify inline codes when possible&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to reduce the number of inline codes by re-grouping adjacent codes when it is possible.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Create new text units on hard returns&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to create separate text units when a hard return element (&amp;lt;code&amp;gt;&amp;amp;lt;Br/&amp;gt;&amp;lt;/code&amp;gt;) is found.&amp;lt;br/&amp;gt; '''IMPORTANT: This option is not completed yet. Setting it may create extracted documents you will not be able to merge back. Always test merge before use this for production.'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Maximum spread size&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set the maximum size for the spread files (in KBytes). Any spread file above the given value will either generate an error or will be skipped from extraction depending on the specified option. This allows you to skip over large spread files that may contain only graphics and require too much memory to be opened. Note that the skipped file are not checked for translatable text.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Generate an error when a spread is larger than the specified value&amp;lt;/cite&amp;gt; &amp;amp;mdash; Set this option to generate an error if a spread size is above the specified &amp;lt;cite&amp;gt;Maximum spread size&amp;lt;/cite&amp;gt;. If this option is not set, the spread is skipped with a warning message.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=HTML_Filter&amp;diff=942</id>
		<title>HTML Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=HTML_Filter&amp;diff=942"/>
		<updated>2022-09-09T16:39:24Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The HTML Filter is an Okapi component that implements the IFilter interface for HTML and XHTML documents.&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
===Input Encoding===&lt;br /&gt;
&lt;br /&gt;
The filter decides which encoding to use for the input document using the following logic:&lt;br /&gt;
&lt;br /&gt;
* If the document has an encoding declaration it is used. &lt;br /&gt;
* Otherwise, the input encoding used is the default encoding that was specified when setting the filter options. &lt;br /&gt;
&lt;br /&gt;
===Output Encoding===&lt;br /&gt;
&lt;br /&gt;
If the output encoding is UTF-8:&lt;br /&gt;
&lt;br /&gt;
* If the input encoding was also UTF-8, a Byte-Order-Mark is used for the output document only if one was detected in the input document. &lt;br /&gt;
* If the input encoding was not UTF-8, no Byte-Order-Mark is used in the output document. &lt;br /&gt;
&lt;br /&gt;
If the input file has no declared encoding, the filter tries to add one in output. A &amp;lt;code&amp;gt;&amp;amp;lt;meta&amp;gt;&amp;lt;/code&amp;gt; tag for HTML files, or a &amp;lt;code&amp;gt;&amp;amp;lt;meta /&amp;gt;&amp;lt;/code&amp;gt; tag for XHTML files. The potential addition is done only if there is a &amp;lt;code&amp;gt;&amp;amp;lt;head&amp;gt;&amp;lt;/code&amp;gt; element in the file.&lt;br /&gt;
&lt;br /&gt;
===Line-Breaks===&lt;br /&gt;
&lt;br /&gt;
The type of line-breaks of the output is the same as the one of the original input.&lt;br /&gt;
&lt;br /&gt;
===Entities===&lt;br /&gt;
&lt;br /&gt;
Character and numeric entities are converted to Unicode. Entities defined in a DTD or schema are passed through without change.&lt;br /&gt;
&lt;br /&gt;
Note that text entity declarations can be processed by the [[DTD Filter]].&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
===Built-in Configuration===&lt;br /&gt;
&lt;br /&gt;
The HTML filter does not currently have a user interface to modify its configuration files. By default the HTML filter uses a minimalist configuration file that does not create structural groupings. For example, a table group or list group will never be created.&lt;br /&gt;
&lt;br /&gt;
There is a predefined maximalist configuration (&amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt;) that can be used if structural groupings are needed. The caveat is that any structural tags that map to groups must be well formed, that is, they must have a start and end tag. Otherwise the filter return an error.&lt;br /&gt;
&lt;br /&gt;
===HTML Configuration Syntax===&lt;br /&gt;
&lt;br /&gt;
For the truly brave, you can create your own HTML configuration files. These configurations are written in [http://www.yaml.org/ YAML].  See the &amp;lt;code&amp;gt;[https://bitbucket.org/okapiframework/okapi/src/master/okapi/filters/html/src/main/resources/net/sf/okapi/filters/html/wellformedConfiguration.yml wellformedConfiguration.yml]&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;[https://bitbucket.org/okapiframework/okapi/src/master/okapi/filters/html/src/main/resources/net/sf/okapi/filters/html/nonwellformedConfiguration.yml nonwellformedConfiguration.yml]&amp;lt;/code&amp;gt; for examples. &lt;br /&gt;
&lt;br /&gt;
HTML tags are associated with rules. These rules are used by the filter to process the input document.&lt;br /&gt;
&lt;br /&gt;
Notes:&lt;br /&gt;
&lt;br /&gt;
* All attributes and elements names should be in '''lowercase''' in the configuration file, regardless of their casing in the document.&lt;br /&gt;
* Element or attributes with a prefix should be declared with the prefix (and between single quotes) in the configuration (e.g. &amp;lt;code&amp;gt;'xml:lang'&amp;lt;/code&amp;gt;)&lt;br /&gt;
&lt;br /&gt;
==== Configuring Element Rules ====&lt;br /&gt;
&lt;br /&gt;
The &amp;lt;code&amp;gt;elements&amp;lt;/code&amp;gt; section of the configuration consists of a set of key-value pairs.  Each key is an element name, and the value is the rules for that element, represented as another set of key-value pairs.  An element declaration should include one or more of the available element rules:&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;ruleTypes&amp;lt;/code&amp;gt;&lt;br /&gt;
| Basic description of how the filter treats this tag.  See [[#Rule Types]].&lt;br /&gt;
|-  &lt;br /&gt;
| &amp;lt;code&amp;gt;idAttributes&amp;lt;/code&amp;gt;&lt;br /&gt;
| A list containing attributes which may provide the segment ID for text contained within this element. &lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;conditions&amp;lt;/code&amp;gt;&lt;br /&gt;
| A condition that further restricts this rule.  For example, to indicate that the element should only be handled if it contains an attribute with a certain value.  See [[#Condition Syntax]].&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;translatableAttributes&amp;lt;/code&amp;gt;&lt;br /&gt;
| Contains information about translatable attributes in this element. See [[#Configuring Translatable Attributes]].&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;elementType&amp;lt;/code&amp;gt;&lt;br /&gt;
| Indicates the corresponding XLIFF 1.2 &amp;lt;code&amp;gt;type&amp;lt;/code&amp;gt; value for this element.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;writableLocalizationAttributes&amp;lt;/code&amp;gt;&lt;br /&gt;
| Specifies attributes which are writable, but not translatable. (TODO) &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==== Rule Types ====&lt;br /&gt;
The rules types are the following:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;INLINE&amp;lt;/code&amp;gt;&lt;br /&gt;
| A tag which may occur inside a text run. For example &amp;lt;code&amp;gt;&amp;amp;lt;b&amp;gt;&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;&amp;amp;lt;i&amp;gt;&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;&amp;amp;lt;u&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|- &lt;br /&gt;
| &amp;lt;code&amp;gt;GROUP&amp;lt;/code&amp;gt;&lt;br /&gt;
| Defines a group of elements that are structurally bound. For example &amp;lt;code&amp;gt;&amp;amp;lt;table&amp;gt;&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;&amp;amp;lt;div&amp;gt;&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;&amp;amp;lt;menu&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;EXCLUDE&amp;lt;/code&amp;gt;&lt;br /&gt;
| Prevents extraction of any text until the end tag of the same element is found. For example, if the content between a &amp;lt;code&amp;gt;&amp;amp;lt;script&amp;gt;&amp;lt;/code&amp;gt; element should not be extracted then define &amp;lt;code&amp;gt;&amp;amp;lt;script&amp;gt;&amp;lt;/code&amp;gt; as &amp;lt;code&amp;gt;EXCLUDE&amp;lt;/code&amp;gt;.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;INCLUDE&amp;lt;/code&amp;gt;&lt;br /&gt;
| Overrides any current exclusions. This allows exceptions for children of &amp;lt;code&amp;gt;EXCLUDE&amp;lt;/code&amp;gt;d elements.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;TEXTUNIT&amp;lt;/code&amp;gt;&lt;br /&gt;
| A tag that starts a complex text unit. Examples include &amp;lt;code&amp;gt;&amp;amp;lt;p&amp;gt;&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;&amp;amp;lt;title&amp;gt;&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;&amp;amp;lt;h1&amp;gt;&amp;lt;/code&amp;gt;. Complex text units carry their surrounding tags along with any extracted text.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;PRESERVE_WHITESPACE&amp;lt;/code&amp;gt;&lt;br /&gt;
| A tag that must preserve its white spaces as-is. For example &amp;lt;code&amp;gt;&amp;amp;lt;pre&amp;gt;&amp;lt;/code&amp;gt;.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;ATTRIBUTES_ONLY&amp;lt;/code&amp;gt;&lt;br /&gt;
| A tag that has localizable or translatable attributes but does not have translatable content. &lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;ATTRIBUTE_TRANS&amp;lt;/code&amp;gt;&lt;br /&gt;
| A translatable attribute. &lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;ATTRIBUTE_WRITABLE&amp;lt;/code&amp;gt;&lt;br /&gt;
| A writable or modifiable attribute, but not translatable.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;ATTRIBUTE_READONLY&amp;lt;/code&amp;gt;&lt;br /&gt;
| A read-only attribute, extracted but that cannot be modified. &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==== Configuring Translatable Attributes ====&lt;br /&gt;
Translatable attributes may be specified in two ways, depending on the level of complexity needed.  &lt;br /&gt;
&lt;br /&gt;
If all the specified attributes should always be translated, they can be exposed as a simple list.  For example, the definition for the &amp;lt;code&amp;gt;&amp;amp;lt;area&amp;amp;gt;&amp;lt;/code&amp;gt; element specifies that &amp;lt;code&amp;gt;accesskey&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;area&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;alt&amp;lt;/code&amp;gt; attributes are translatable:&lt;br /&gt;
 &amp;lt;nowiki&amp;gt;  area:&lt;br /&gt;
    ruleTypes: [ATTRIBUTES_ONLY]&lt;br /&gt;
    translatableAttributes: [accesskey, area, alt]&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
&lt;br /&gt;
However, if additional restrictions on translatable attributes are present, the &amp;lt;code&amp;gt;translatableAttributes&amp;lt;/code&amp;gt; rule may be specified as a set of key-value pairs, with each key being a translatable attribute and each value being an (optional) list of conditions, using the [[#Condition Syntax]].  For example, this snippet defines the handling of the &amp;lt;code&amp;gt;&amp;amp;lt;input&amp;amp;gt;&amp;lt;/code&amp;gt; element in the built-in configurations:&lt;br /&gt;
 &amp;lt;nowiki&amp;gt;&lt;br /&gt;
  input:&lt;br /&gt;
    ruleTypes: [INLINE]&lt;br /&gt;
    translatableAttributes:&lt;br /&gt;
      alt: [type, NOT_EQUALS, [file, hidden, image, password]]&lt;br /&gt;
      value: [type, NOT_EQUALS, [file, hidden, image, password]]&lt;br /&gt;
      accesskey: [type, NOT_EQUALS, [file, hidden, image, password]]&lt;br /&gt;
      title: [type, NOT_EQUALS, [file, hidden, image, password]]&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
&lt;br /&gt;
This specifies that there are four attributes (&amp;lt;code&amp;gt;alt&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;value&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;accesskey&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;title&amp;lt;/code&amp;gt;) that are translatable.  The translatability of each of these attributes is conditional on the &amp;lt;code&amp;gt;&amp;amp;lt;input&amp;amp;gt;&amp;lt;/code&amp;gt; element not having particular &amp;lt;code&amp;gt;type&amp;lt;/code&amp;gt; values.&lt;br /&gt;
&lt;br /&gt;
==== Condition Syntax ====&lt;br /&gt;
&lt;br /&gt;
Rule conditions are expressed as a list of the form&lt;br /&gt;
 &amp;lt;nowiki&amp;gt;[attribute, operation, value]&amp;lt;/nowiki&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;5&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;attribute&amp;lt;/code&amp;gt;&lt;br /&gt;
| The name of the attribute which the condition applies to.&lt;br /&gt;
|- &lt;br /&gt;
| &amp;lt;code&amp;gt;operation&amp;lt;/code&amp;gt;&lt;br /&gt;
| Available operations are &amp;lt;code&amp;gt;EQUALS&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;NOT_EQUALS&amp;lt;/code&amp;gt;, and &amp;lt;code&amp;gt;MATCHES&amp;lt;/code&amp;gt;.  &amp;lt;code&amp;gt;EQUALS&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;NOT_EQUALS&amp;lt;/code&amp;gt; test for (case-insensitive) string matches, while &amp;lt;code&amp;gt;MATCHES&amp;lt;/code&amp;gt; uses a regular expression.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;lt;code&amp;gt;value&amp;lt;/code&amp;gt;&lt;br /&gt;
| The value of the attribute to be compared using the operation.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Inline Code Finder===&lt;br /&gt;
&lt;br /&gt;
You can define a set of regular expressions to capture span of extracted text that should be treated as inline codes. For example, some element content may have variables that need to be protected from modification and treated as codes. Use the &amp;lt;code&amp;gt;useCodeFinder&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;codeFinderRules&amp;lt;/code&amp;gt; options for this.&lt;br /&gt;
&lt;br /&gt;
 useCodeFinder: true&lt;br /&gt;
 codeFinderRules: &amp;quot;#v1\ncount.i=1\nrule0=\\bVAR\\d\\b&amp;quot;&lt;br /&gt;
&lt;br /&gt;
Note that the regular expression is &amp;quot;&amp;lt;code&amp;gt;\bVAR\d\b&amp;lt;/code&amp;gt;&amp;quot; but you must escape the backslash in the YAML notation as well.&lt;br /&gt;
&lt;br /&gt;
You can also use this alternate syntax, which is slightly easier to read:&lt;br /&gt;
&lt;br /&gt;
 useCodeFinder: true&lt;br /&gt;
 codeFinderRules: |-&lt;br /&gt;
    #v1&lt;br /&gt;
    count.i=1&lt;br /&gt;
    rule0=\bVAR\d\b&lt;br /&gt;
&lt;br /&gt;
The options above will set the text &amp;quot;&amp;lt;code&amp;gt;VAR1&amp;lt;/code&amp;gt;&amp;quot; as in-line code in the following HTML:&lt;br /&gt;
&lt;br /&gt;
 &amp;amp;lt;p&amp;gt;Number of files = VAR1&amp;amp;lt;/p&amp;gt;&lt;br /&gt;
&lt;br /&gt;
To facilitate the creation of code finder rules [[Rainbow - Code Finder Editor|Rainbow provides the Code Finder Editor]].&lt;br /&gt;
&lt;br /&gt;
===Character Entity References in Output===&lt;br /&gt;
&lt;br /&gt;
By default extended characters are not using character entity references in output (e.g. &amp;lt;code&amp;gt;&amp;amp;amp;copy;&amp;lt;/code&amp;gt; for the character '&amp;amp;copy').&lt;br /&gt;
&lt;br /&gt;
You can change this by specifying the &amp;lt;code&amp;gt;escapeCharacters&amp;lt;/code&amp;gt; rule with a string of all the characters you wish to see output as character entity reference. Any specified character that is not extended or has no HTML character entity defined is processed like a normal character.&lt;br /&gt;
&lt;br /&gt;
For example, given the following rule:&lt;br /&gt;
&lt;br /&gt;
 escapeCharacters: &amp;quot;© €µÆĄ&amp;quot;&lt;br /&gt;
&lt;br /&gt;
The output of &amp;lt;code&amp;gt;&amp;amp;lt;p&amp;gt;© €µÆĄ&amp;amp;lt;/p&amp;gt;&amp;lt;/code&amp;gt; (assuming the output encoding is UTF-8) will be:&lt;br /&gt;
&lt;br /&gt;
 &amp;amp;lt;p&amp;gt;&amp;amp;amp;copy;&amp;amp;amp;nbsp;&amp;amp;amp;euro;&amp;amp;amp;micro;&amp;amp;amp;AElig;Ą&amp;amp;lt;/p&amp;gt; &lt;br /&gt;
&lt;br /&gt;
Only the character &amp;lt;code&amp;gt;Ą&amp;lt;/code&amp;gt; (U+0104) is not represented as an entity reference because there is no HTML character entity defined for it.&lt;br /&gt;
&lt;br /&gt;
===Inline CDATA===&lt;br /&gt;
&lt;br /&gt;
For formats that use CDATA in ways that undesirably break the flow of text, you can set the filter to treat CDATA as if it was an inline element like so:&lt;br /&gt;
&lt;br /&gt;
  inlineCdata: true&lt;br /&gt;
&lt;br /&gt;
Then markup such as &amp;lt;code&amp;gt;&amp;amp;lt;p&amp;gt;Text with &amp;amp;lt;![CDATA[inline]]&amp;gt; CDATA&amp;amp;lt;/p&amp;gt;&amp;lt;/code&amp;gt; will be extracted as if &amp;lt;code&amp;gt;&amp;amp;lt;![CDATA[&amp;lt;/code&amp;gt; was a regular inline opening tag and &amp;lt;code&amp;gt;]]&amp;gt;&amp;lt;/code&amp;gt; was a regular inline closing tag.&lt;br /&gt;
&lt;br /&gt;
===Excluding By Default===&lt;br /&gt;
&lt;br /&gt;
Normally, there is an implicit &amp;quot;default rule&amp;quot; to include elements.  If the filter configuration contained no tag information at all, the default behavior of the filter would be to expose all PCDATA for translation.  Sometimes it is useful to change this behavior in order to make your configuration more concise.  This can be done by setting the &amp;lt;code&amp;gt;exclude_by_default&amp;lt;/code&amp;gt; option in your config.&lt;br /&gt;
&lt;br /&gt;
For example, if you wished to have a custom configuration that exposed the translation of the &amp;lt;code&amp;gt;&amp;amp;lt;title&amp;amp;gt;&amp;lt;/code&amp;gt; element but nothing else.  You could specify this as&lt;br /&gt;
&lt;br /&gt;
 exclude_by_default: true&lt;br /&gt;
 // .... other configuration&lt;br /&gt;
 elements:&lt;br /&gt;
    title:&lt;br /&gt;
      ruleTypes: [TEXTUNIT]&lt;br /&gt;
&lt;br /&gt;
===Quote Mode===&lt;br /&gt;
Escaping of quote and apostrophe (single quote) characters can be changed by adding these lines to the config file:&lt;br /&gt;
&lt;br /&gt;
 quoteModeDefined: true&lt;br /&gt;
 quoteMode: 3&lt;br /&gt;
&lt;br /&gt;
'''Current quote modes:'''&lt;br /&gt;
&lt;br /&gt;
* Do not escape single or double quotes: '''UNESCAPED = 0'''&lt;br /&gt;
* Escape single and double quotes to a named entity: '''ALL = 1'''&lt;br /&gt;
* Escape double quotes to a named entity, and single quotes to a numeric entity: '''NUMERIC_SINGLE_QUOTES = 2'''&lt;br /&gt;
* Escape double quotes only: '''DOUBLE_QUOTES_ONLY = 3'''&lt;br /&gt;
&lt;br /&gt;
===Miscellaneous Options===&lt;br /&gt;
* '''cleanupHtml: false''' - turn off post-processing cleanup of input file. The filter attempts to clean up common syntax errors such as unquoted attributes. This option turns off this feature.&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
* In the current version of the filter the content of &amp;lt;code&amp;gt;&amp;amp;lt;style&amp;gt;&amp;lt;/code&amp;gt; and &amp;lt;code&amp;gt;&amp;amp;lt;script&amp;gt;&amp;lt;/code&amp;gt; elements is not extracted.&lt;br /&gt;
* Tags from server-side scripts such as PHP, ASPX, JSP, etc. are not formally supported and will be treated as non-translatable.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=941</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=941"/>
		<updated>2022-09-08T17:06:15Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as a filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX, PPTX and XLSX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source locale.&lt;br /&gt;
* The target locale regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any target locale.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be ommited to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
When source locale, target locale and source font are omitted:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
And this is the same as the abovementioned:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=1&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=XLIFF-2_Filter&amp;diff=930</id>
		<title>XLIFF-2 Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=XLIFF-2_Filter&amp;diff=930"/>
		<updated>2022-01-26T18:26:14Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Parameters */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The XLIFF-2 Filter is an Okapi component that implements the IFilter interface for XLIFF 2.x (XML Localisation Interchange File Format) documents.&lt;br /&gt;
&lt;br /&gt;
XLIFF v2 is an OASIS Standard that defines a file format for transporting translatable text and localization-related information across a chain of translation and localization tools.&lt;br /&gt;
&lt;br /&gt;
The XLIFF 2.0 specification is at http://docs.oasis-open.org/xliff/xliff-core/v2.0/xliff-core-v2.0.html.&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
Basic support for XLIFF 2.x core&lt;br /&gt;
&lt;br /&gt;
* Extended Attributes and Namespaces&lt;br /&gt;
* Segments and Ignorables&lt;br /&gt;
* Inline Codes&lt;br /&gt;
* Notes&lt;br /&gt;
* Groups&lt;br /&gt;
* XLIFF 2.x Metadata Module&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Perform maximum validation when parsing&amp;lt;/cite&amp;gt; &amp;amp;mdash; Select this option to ensure the XLIFF-2 parser used by the filter does the maximum verification of the format.&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;cite&amp;gt;Needs Segmentation&amp;lt;/cite&amp;gt; &amp;amp;mdash; Select this option to resegment any unit that is marked with canSegment=&amp;amp;quot;yes&amp;amp;quot;. Units with an existing target cannot be segmented.&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
* Skeleton not supported.&lt;br /&gt;
* Comments are lost in the merged document.&lt;br /&gt;
* Original XML formatting lost in merged document.&lt;br /&gt;
* Attributes can be reordered.&lt;br /&gt;
* Attributes may be removed/added compared to the original depending on the default values and logic in the XLIFF 2 Toolkit.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Translation Formats]] [[Category:XLIFF]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=XLIFF-2_Filter&amp;diff=929</id>
		<title>XLIFF-2 Filter</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=XLIFF-2_Filter&amp;diff=929"/>
		<updated>2022-01-26T17:55:27Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Limitations */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;{{Filters Header}}&lt;br /&gt;
==Overview==&lt;br /&gt;
&lt;br /&gt;
The XLIFF-2 Filter is an Okapi component that implements the IFilter interface for XLIFF 2.x (XML Localisation Interchange File Format) documents.&lt;br /&gt;
&lt;br /&gt;
XLIFF v2 is an OASIS Standard that defines a file format for transporting translatable text and localization-related information across a chain of translation and localization tools.&lt;br /&gt;
&lt;br /&gt;
The XLIFF 2.0 specification is at http://docs.oasis-open.org/xliff/xliff-core/v2.0/xliff-core-v2.0.html.&lt;br /&gt;
&lt;br /&gt;
==Processing Details==&lt;br /&gt;
&lt;br /&gt;
Basic support for XLIFF 2.x core&lt;br /&gt;
&lt;br /&gt;
* Extended Attributes and Namespaces&lt;br /&gt;
* Segments and Ignorables&lt;br /&gt;
* Inline Codes&lt;br /&gt;
* Notes&lt;br /&gt;
* Groups&lt;br /&gt;
* XLIFF 2.x Metadata Module&lt;br /&gt;
&lt;br /&gt;
==Parameters==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;cite&amp;gt;Perform maximum validation when parsing&amp;lt;/cite&amp;gt; &amp;amp;mdash; Select this option to ensure the XLIFF-2 parser used by the filter does the maximum verification of the format.&lt;br /&gt;
&amp;lt;br/&amp;gt;&lt;br /&gt;
&amp;lt;cite&amp;gt;Needs Segmentation&amp;lt;/cite&amp;gt; &amp;amp;mdash; Select this option to resegment any unit that is marked with canSegment=&amp;amp;quot;yes&amp;amp;quot;.&lt;br /&gt;
&lt;br /&gt;
==Limitations==&lt;br /&gt;
&lt;br /&gt;
* Skeleton not supported.&lt;br /&gt;
* Comments are lost in the merged document.&lt;br /&gt;
* Original XML formatting lost in merged document.&lt;br /&gt;
* Attributes can be reordered.&lt;br /&gt;
* Attributes may be removed/added compared to the original depending on the default values and logic in the XLIFF 2 Toolkit.&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]] [[Category:Translation Formats]] [[Category:XLIFF]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=860</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=860"/>
		<updated>2021-04-07T19:58:08Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: /* Filter Config Examples */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DocBook v5.0 || .xml || &amp;lt;code&amp;gt;okf_xml-docbook&amp;lt;/code&amp;gt; || [[XML Filter]] || Since Okapi 1.42. &amp;amp;lt;footnote&amp;gt; is not handled properly.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if the Code is a linebreak if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Font Mapping==&lt;br /&gt;
&lt;br /&gt;
The font mapping can be considered as filter's ability to automatically substitute font information in the target document on the fly, according to a provided configuration - this helps to reduce the amount of reformatting and post-translation DTP. It is supported by IDML and OpenXML (DOCX documents) filters at the moment.&lt;br /&gt;
&lt;br /&gt;
The following font mapping configuration options are available:&lt;br /&gt;
* The source language regular expression pattern: &amp;lt;code&amp;gt;en.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;en-UK&amp;lt;/code&amp;gt;, etc. It can be left empty to apply the mapping to any source language.&lt;br /&gt;
* The target language regular expression pattern: &amp;lt;code&amp;gt;ru.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;ru-RU&amp;lt;/code&amp;gt;, etc. It can be left empty to apply the mapping to any target language.&lt;br /&gt;
* The source font name regular expression pattern: &amp;lt;code&amp;gt;Arial.*&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It can be left empty to apply the mapping to any source font name found.&lt;br /&gt;
* The target font name: &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;, etc. It should not be empty. And if it is made so, the mapping configuration is ignored.&lt;br /&gt;
&lt;br /&gt;
Also, the configured font mappings are applied in the order they are stated. And the final target font value is determined by a sequential&lt;br /&gt;
substitution of the source font values. I.e. if there is more than one mapping:&lt;br /&gt;
# &amp;lt;code&amp;gt;Arial&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt;&lt;br /&gt;
# &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; -&amp;gt; &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;&lt;br /&gt;
then the first mapping will produce &amp;lt;code&amp;gt;Times New Roman&amp;lt;/code&amp;gt; replacement and the second one will be applied to this new value, thus, ending up with the &amp;lt;code&amp;gt;Sans Serif&amp;lt;/code&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
The parameters serialisation format can look like that:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
fontMappings.0.sourceLocalePattern=en.*&lt;br /&gt;
fontMappings.0.targetLocalePattern=ru.*&lt;br /&gt;
fontMappings.0.sourceFontPattern=Times.*&lt;br /&gt;
fontMappings.0.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.1.sourceLocalePattern=ru&lt;br /&gt;
fontMappings.1.targetLocalePattern=fr&lt;br /&gt;
fontMappings.1.sourceFontPattern=The Sims Sans&lt;br /&gt;
fontMappings.1.targetFont=Arial Unicode MS&lt;br /&gt;
fontMappings.number.i=2&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Steps&amp;diff=852</id>
		<title>Steps</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Steps&amp;diff=852"/>
		<updated>2021-03-09T16:38:37Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: Reverted edits by Jhargraveiii (talk) to last revision by Okapifra&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Steps are components that execute one specific task. You use them by chaining them into '''pipelines'''. See for example &amp;quot;[[How to Create a Pipeline in Rainbow]]&amp;quot; to see how steps can be used. Rainbow also [[Rainbow - Utilities|comes with several pre-defined pipelines]] using some of these steps.&lt;br /&gt;
&lt;br /&gt;
The Okapi Framework comes with several ready-to-use steps:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Batch Translation Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[BOM Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Character Count Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Cleanup Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Copy Or Move Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Combined Xliff Merger Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Create Target Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Desegmentation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Diff Leverage Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Encoding Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Enrycher Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[External Command Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Extraction Verification Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Filter Events to Raw Document Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Format Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Full-Width Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Generate SimpleTM Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[GTT Batch Translation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Id-Based Copy Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Id-Based Aligner Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Image Modification Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Inconsistency Check Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Inline Codes Removal Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Inline Codes Simplifier Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[LanguageTool Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Leveraging Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Line-Break Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Localizables Check Step]]&lt;br /&gt;
|&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Microsoft Batch Translation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Microsoft Batch Submission Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Moses InlineText Extraction Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Moses InlineText Leveraging Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[MS Word Resaver Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[MS Word Search and Replace Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Original Document Xliff Merger Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Paragraph Alignment Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Post-segmentation Inline Codes Removal Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Properties Setting Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[QuEst Quality Estimation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[QuEst SVM Model Builder Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Quality Check Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Quality Check Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Raw Document to Filter Events Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='hi'&amp;gt;RD/FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Rainbow Translation Kit Creation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='hi'&amp;gt;RD/FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Rainbow Translation Kit Merging Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[RTF Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Remove Target Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Repetition Analysis Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Resource Simplifier Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Scoping Report Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Search and Replace Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Search and Replace Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Segmentation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Segments to Text Units Converter Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Sentence Alignment Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[SimpleTM to TMX Step]]&lt;br /&gt;
|&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Space Check Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Skeleton Xliff Merger Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Term Extraction Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Terminology Leveraging Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Text Modification Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[ThreadedWorkQueue Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[TM Import Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Tokenization Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Translation Comparison Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[URI Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Used Characters Listing Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Word Count Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Simple Word Count Step]]&lt;br /&gt;
* &amp;lt;span class='green&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Whitespace Correction Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Analysis Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Cleanup Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Export Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Import Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Translation Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[TTX Joiner Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[TTX Splitter Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XLIFF Joiner Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XLIFF Splitter Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XML Analysis Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XML Characters Fixing Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XML Validation Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XSL Transformation Step]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Legend:&lt;br /&gt;
: &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt; = the step takes raw document and sends raw document &lt;br /&gt;
: &amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span the step class='green'&amp;gt;FE&amp;lt;/span&amp;gt; = the step takes raw document and sends filter events&lt;br /&gt;
: &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt; = the step takes filter events and sends filter events&lt;br /&gt;
: &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt; = the step takes filter events and sends raw document&lt;br /&gt;
: &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='hi'&amp;gt;RD/FE&amp;lt;/span&amp;gt; = the step takes filter events and sends either raw document or filter events&lt;br /&gt;
&lt;br /&gt;
[[Category:Steps]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Steps&amp;diff=851</id>
		<title>Steps</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Steps&amp;diff=851"/>
		<updated>2021-03-09T16:37:42Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Steps are components that execute one specific task. You use them by chaining them into '''pipelines'''. See for example &amp;quot;[[How to Create a Pipeline in Rainbow]]&amp;quot; to see how steps can be used. Rainbow also [[Rainbow - Utilities|comes with several pre-defined pipelines]] using some of these steps.&lt;br /&gt;
&lt;br /&gt;
The Okapi Framework comes with several ready-to-use steps:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=&amp;quot;100%&amp;quot;&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Batch Translation Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[BOM Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Character Count Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Cleanup Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Copy Or Move Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Combined Xliff Merger Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Create Target Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Desegmentation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Diff Leverage Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Encoding Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Enrycher Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[External Command Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Extraction Verification Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Filter Events to Raw Document Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Format Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Full-Width Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Generate SimpleTM Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[GTT Batch Translation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Id-Based Copy Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Id-Based Aligner Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Image Modification Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Inconsistency Check Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Inline Codes Removal Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Inline Codes Simplifier Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[LanguageTool Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Leveraging Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Line-Break Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Localizables Check Step]]&lt;br /&gt;
|&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Microsoft Batch Translation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Microsoft Batch Submission Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Moses InlineText Extraction Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Moses InlineText Leveraging Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[MS Word Resaver Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[MS Word Search and Replace Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Original Document Xliff Merger Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Paragraph Alignment Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Post-segmentation Inline Codes Simplifier Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Properties Setting Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[QuEst Quality Estimation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[QuEst SVM Model Builder Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Quality Check Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Quality Check Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Raw Document to Filter Events Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='hi'&amp;gt;RD/FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Rainbow Translation Kit Creation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='hi'&amp;gt;RD/FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Rainbow Translation Kit Merging Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[RTF Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Remove Target Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Repetition Analysis Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Resource Simplifier Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Scoping Report Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Search and Replace Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Search and Replace Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Segmentation Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Segments to Text Units Converter Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Sentence Alignment Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[SimpleTM to TMX Step]]&lt;br /&gt;
|&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Space Check Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Skeleton Xliff Merger Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Term Extraction Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Terminology Leveraging Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Text Modification Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[ThreadedWorkQueue Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[TM Import Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Tokenization Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Translation Comparison Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[URI Conversion Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Used Characters Listing Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Word Count Step]]&lt;br /&gt;
* &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Simple Word Count Step]]&lt;br /&gt;
* &amp;lt;span class='green&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Whitespace Correction Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Analysis Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Cleanup Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Export Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Import Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[Trados Translation Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[TTX Joiner Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[TTX Splitter Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XLIFF Joiner Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XLIFF Splitter Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XML Analysis Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XML Characters Fixing Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XML Validation Step]]&lt;br /&gt;
* &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt;&amp;amp;nbsp; [[XSL Transformation Step]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Legend:&lt;br /&gt;
: &amp;lt;span class='blue'&amp;gt;RD-&amp;gt;RD&amp;lt;/span&amp;gt; = the step takes raw document and sends raw document &lt;br /&gt;
: &amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span the step class='green'&amp;gt;FE&amp;lt;/span&amp;gt; = the step takes raw document and sends filter events&lt;br /&gt;
: &amp;lt;span class='green'&amp;gt;FE-&amp;gt;FE&amp;lt;/span&amp;gt; = the step takes filter events and sends filter events&lt;br /&gt;
: &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='blue'&amp;gt;RD&amp;lt;/span&amp;gt; = the step takes filter events and sends raw document&lt;br /&gt;
: &amp;lt;span class='green'&amp;gt;FE&amp;lt;/span&amp;gt;-&amp;gt;&amp;lt;span class='hi'&amp;gt;RD/FE&amp;lt;/span&amp;gt; = the step takes filter events and sends either raw document or filter events&lt;br /&gt;
&lt;br /&gt;
[[Category:Steps]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=829</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=829"/>
		<updated>2019-12-16T19:34:15Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]]&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if the Code is a linebreak if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifyCodes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
	<entry>
		<id>http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=828</id>
		<title>Filters</title>
		<link rel="alternate" type="text/html" href="http://okapiframework.org/wiki/index.php?title=Filters&amp;diff=828"/>
		<updated>2019-12-16T19:33:33Z</updated>

		<summary type="html">&lt;p&gt;Jhargraveiii: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Filters are the components that convert input documents from their native file format into a common internal set of [[Glossary#Resource|resources]] that all Okapi components use. The extracted content can be re-written into the original file format. When using the steps, the extraction is done by the [[Raw Document to Filter Events Step]] and the re-writing by the [[Filter Events to Raw Document Step]].&lt;br /&gt;
&lt;br /&gt;
Note: The [[Okapi Filters Plugin for OmegaT]] allows you to use some of the filters directly from [http://www.omegat.org OmegaT].&lt;br /&gt;
&lt;br /&gt;
==List of the Filters==&lt;br /&gt;
&lt;br /&gt;
The framework distribution comes with the following filters:&lt;br /&gt;
&lt;br /&gt;
{| cellpadding=&amp;quot;8&amp;quot; width=100%&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
|&lt;br /&gt;
* [[Archive Filter]]&lt;br /&gt;
* [[DTD Filter]]&lt;br /&gt;
* [[Doxygen Filter]]&lt;br /&gt;
* [[HTML Filter]]&lt;br /&gt;
* [[HTML5-ITS Filter]]&lt;br /&gt;
* [[ICML Filter]]&lt;br /&gt;
* [[IDML Filter]]&lt;br /&gt;
* [[JSON Filter]]&lt;br /&gt;
* [[Markdown Filter]]&lt;br /&gt;
* [[MIF Filter]]&lt;br /&gt;
* [[Moses Text Filter]]&lt;br /&gt;
* [[Multi-Parsers Filter]]&lt;br /&gt;
* [[OpenOffice Filter]]&lt;br /&gt;
* [[OpenXML Filter|OpenXML (MS Office) Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[PDF Filter]]&lt;br /&gt;
* [[Pensieve TM Filter]]&lt;br /&gt;
* [[PHP Content Filter]]&lt;br /&gt;
* [[Plain Text Filter]]&lt;br /&gt;
* [[PO Filter]]&lt;br /&gt;
* [[Properties Filter]]&lt;br /&gt;
* [[Rainbow Translation Kit Filter]]&lt;br /&gt;
* [[Regex Filter]]&lt;br /&gt;
* [[SDL Trados Package Filter]]&lt;br /&gt;
* [[Simplification Filter]]&lt;br /&gt;
* [[Table Filter]]&lt;br /&gt;
* [[TMX Filter]]&lt;br /&gt;
* [[Trados-Tagged RTF Filter]]&lt;br /&gt;
|&lt;br /&gt;
* [[Transifex Filter]]&lt;br /&gt;
* [[TS Filter]]&lt;br /&gt;
* [[TTX Filter]]&lt;br /&gt;
* [[TXML Filter]]&lt;br /&gt;
* [[Wiki Filter]]&lt;br /&gt;
* [[Vignette Filter]]&lt;br /&gt;
* [[XLIFF Filter]]&lt;br /&gt;
* [[XLIFF-2 Filter]]&lt;br /&gt;
* [[XML Filter]]&lt;br /&gt;
* [[XML Stream Filter]]&lt;br /&gt;
* [[YAML Filter]]&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Supported File Formats==&lt;br /&gt;
&lt;br /&gt;
The following is a list of some of the file formats supported by the distribution through [[Understanding Filter Configurations|pre-defined configurations]]:&lt;br /&gt;
&lt;br /&gt;
{| border=&amp;quot;1&amp;quot; cellpadding=&amp;quot;6&amp;quot; cellspacing=&amp;quot;0&amp;quot;&lt;br /&gt;
|+&lt;br /&gt;
| '''Format''' || '''Extensions''' || '''Pre-Defined Configuration''' || '''Filter''' || '''Notes'''&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Android Strings || .xml || &amp;lt;code&amp;gt;okf_xml-AndroidStrings&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Apple Stringsdict || .stringsdict || &amp;lt;code&amp;gt;okf_xml-AppleStringsdict&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Archive || .zip || &amp;lt;code&amp;gt;okf_archive&amp;lt;/code&amp;gt; || [[Archive Filter]] || Meta filter that processes zip files with various formats as one file.&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Auto Xliff || .xlf, .xliff || &amp;lt;code&amp;gt;okf_autoxliff&amp;lt;/code&amp;gt; || [[Auto Xliff Filter]] || Detects the version of an XLIFF file and then hands parsing off to the appropriate filter &lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Comma-separated values files) || .csv, .txt || &amp;lt;code&amp;gt;okf_table_csv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| CSV (Multiple complex sub-formats) || .csv || &amp;lt;code&amp;gt;okf_multiparsers&amp;lt;/code&amp;gt; || [[Multi-Parsers Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DITA || .dita, .ditamap, .xml || &amp;lt;code&amp;gt;okf_xmlstream-dita&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DokuWiki pages || .txt || &amp;lt;code&amp;gt;okf_wiki&amp;lt;/code&amp;gt; || [[Wiki Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Doxygen-commented files || .c, .h, cpp || &amp;lt;code&amp;gt;okf_doxygen&amp;lt;/code&amp;gt; || [[Doxygen Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| DTD || .dtd || &amp;lt;code&amp;gt;okf_dtd&amp;lt;/code&amp;gt; || [[DTD Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Fixed-Width Columns Table || .txt || &amp;lt;code&amp;gt;okf_table_fwc&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Idiom WorldServer XLIFF || .xlf || &amp;lt;code&amp;gt;okf_xliff-iws&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InCopy ICML || .wcml || &amp;lt;code&amp;gt;okf_icml&amp;lt;/code&amp;gt; || [[ICML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| InDesign IDML || .idml || &amp;lt;code&amp;gt;okf_idml&amp;lt;/code&amp;gt; || [[IDML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| iOS/Mac Strings|| .strings || &amp;lt;code&amp;gt;okf_regex-macStrings&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties || .properties || &amp;lt;code&amp;gt;okf_properties&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java Properties (Output not escaped) || .properties || &amp;lt;code&amp;gt;okf_properties-outputNotEscaped&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties || .xml || &amp;lt;code&amp;gt;okf_xml-JavaProperties&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Java XML Properties (HTML strings) || .xml || &amp;lt;code&amp;gt;okf_xmlstream-JavaPropertiesHTML&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| JSON || .json || &amp;lt;code&amp;gt;okf_json&amp;lt;/code&amp;gt; || [[JSON Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Haiku CatKeys || .catkeys || &amp;lt;code&amp;gt;okf_table_catkeys&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (any) || .html, .htm || &amp;lt;code&amp;gt;okf_html&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML (Well-formed, and XHTML) || .html, .htm|| &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| HTML5 (and XHTML5) || .html, .htm|| &amp;lt;code&amp;gt;okf_itshtml5&amp;lt;/code&amp;gt; || [[HTML5-ITS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Markdown || .md || &amp;lt;code&amp;gt;okf_markdown&amp;lt;/code&amp;gt; || [[Markdown Filter]]&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Excel 2007/2010 || .xlsx, .xlsm, .xltx, .xltm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft PowerPoint 2007/2010 || .pptx, .pptm, .potx, .potm, .ppsx, .ppsm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Visio || .vsdx, .vsdm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Microsoft Word 2007/2010 || .docx, .docm, .dotx, .dotm || &amp;lt;code&amp;gt;okf_openxml&amp;lt;/code&amp;gt; || [[OpenXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| MIF || .mif || &amp;lt;code&amp;gt;okf_mif&amp;lt;/code&amp;gt; || [[MIF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Moses Text || .txt || &amp;lt;code&amp;gt;okf_mosestext&amp;lt;/code&amp;gt; || [[Moses Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Calc || .ods, .ots || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Draw || .odg, .otg || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Impress || .odp, .otp || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| OpenOffice.org Writer || .odt, .ott || &amp;lt;code&amp;gt;okf_odf&amp;lt;/code&amp;gt; || [[OpenOffice Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PDF || .pdf || &amp;lt;code&amp;gt;okf_pdf&amp;lt;/code&amp;gt; || [[PDF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[Pensieve TM]] || .pentm || &amp;lt;code&amp;gt;okf_pensieve&amp;lt;/code&amp;gt; || [[Pensieve TM Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PHP Content || .php || &amp;lt;code&amp;gt;okf_phpcontent&amp;lt;/code&amp;gt; || [[PHP Content Filter]] || Can be used as a subfilter only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Line = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext&amp;lt;/code&amp;gt; || [[ Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Plain Text (Paragraph = text unit) || .txt || &amp;lt;code&amp;gt;okf_plaintext_paragraphs&amp;lt;/code&amp;gt; || [[Plain Text Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO || .po || &amp;lt;code&amp;gt;okf_po&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| PO (Monolingual style) || .po || &amp;lt;code&amp;gt;okf_po-monolingual&amp;lt;/code&amp;gt; || [[PO Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Rainbow Translation Kit manifests || .rkm || &amp;lt;code&amp;gt;okf_rainbowkit&amp;lt;/code&amp;gt; || [[Rainbow Translation Kit Filter]] || Used as a tkit reader only&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Regex (Any text-based format) || .txt || &amp;lt;code&amp;gt;okf_regex&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RDF (Mozilla RDF) || .rdf || &amp;lt;code&amp;gt;okf_xml-MozillaRDF&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| RESX || .resx || &amp;lt;code&amp;gt;okf_xml-resx&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLPPX || .sdlppx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDLRPX || .sdlrpx || &amp;lt;code&amp;gt;okf_sdlpackage&amp;lt;/code&amp;gt; || [[SDL Trados Package Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SDL[[XLIFF]] || .sdlxlf || &amp;lt;code&amp;gt;okf_xliff-sdl&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Skype Language Files || .lang || &amp;lt;code&amp;gt;okf_properties-skypeLang&amp;lt;/code&amp;gt; || [[Properties Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| SRT (Sub-Rip Text, sub-titles files) || .srt || &amp;lt;code&amp;gt;okf_regex-srt&amp;lt;/code&amp;gt; || [[Regex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tab-Delimiter files || .tsv, .txt || &amp;lt;code&amp;gt;okf_table_tsv&amp;lt;/code&amp;gt; || [[Table Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Tex files || .tex || &amp;lt;code&amp;gt;okf_tex&amp;lt;/code&amp;gt; || [[TEX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[TMX]] || .tmx || &amp;lt;code&amp;gt;okf_tmx&amp;lt;/code&amp;gt; || [[TMX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Transifex project || .txp || &amp;lt;code&amp;gt;okf_transifex&amp;lt;/code&amp;gt; || [[Transifex Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Trados-Tagged RTF || .rtf || &amp;lt;code&amp;gt;okf_tradosrtf&amp;lt;/code&amp;gt; || [[Trados-Tagged RTF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TS - Qt TS files || .ts || &amp;lt;code&amp;gt;okf_ts&amp;lt;/code&amp;gt; || [[TS Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TTX - Trados TagEditor TTX files || .ttx || &amp;lt;code&amp;gt;okf_ttx&amp;lt;/code&amp;gt; || [[TTX Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| TXML - Wordfast Pro TXML files || .txml || &amp;lt;code&amp;gt;okf_txml&amp;lt;/code&amp;gt; || [[TXML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Versified Text || .vrsz || &amp;lt;code&amp;gt;okf_versifiedtxt&amp;lt;/code&amp;gt; || [[Versified Text Filter]] || Deprecated&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| Vignette Export/Import Content || .xml || &amp;lt;code&amp;gt;okf_vignette&amp;lt;/code&amp;gt; || [[Vignette Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XHTML || .html, .htm || &amp;lt;code&amp;gt;okf_html-wellFormed&amp;lt;/code&amp;gt; || [[HTML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| WIX (Windows Installer XML) localization files || .wix || &amp;lt;code&amp;gt;okf_xml-WixLocalization&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v1.2 || .xlf, .xliff || &amp;lt;code&amp;gt;okf_xliff&amp;lt;/code&amp;gt; || [[XLIFF Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| [[XLIFF]] v2 || .xlf || &amp;lt;code&amp;gt;okf_xliff2&amp;lt;/code&amp;gt; || [[XLIFF-2 Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using [[ITS]] defaults) || .xml || &amp;lt;code&amp;gt;okf_xml&amp;lt;/code&amp;gt; || [[XML Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| XML (Generic, using stream reader) || .xml || &amp;lt;code&amp;gt;okf_xmlstream&amp;lt;/code&amp;gt; || [[XML Stream Filter]] ||&lt;br /&gt;
|- valign=&amp;quot;top&amp;quot;&lt;br /&gt;
| YAML (Generic YAML filter) || .yml, .yaml || &amp;lt;code&amp;gt;okf_yaml&amp;lt;/code&amp;gt; || [[YAML Filter]] ||&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Note that most filters allow you to [[Understanding Filter Configurations|create your own configurations]] to support more file formats.&lt;br /&gt;
&lt;br /&gt;
==Code Simplification Rules==&lt;br /&gt;
&lt;br /&gt;
All filters support code simplification rules. By default the [[Inline Codes Simplifier Step]], [[Simplification Filter]] and [[Post-segmentation Inline Codes Removal Step]] maximize the trimming and merging (aka simplification) of inline codes. In some cases this may not be desired. The simplification rules allow you to override the default behavior and prevent specific codes from being trimmed or merged.&lt;br /&gt;
&lt;br /&gt;
===General Syntax===&lt;br /&gt;
&lt;br /&gt;
The rules parser ignores irrelevant whitespace. Rules can be separated by spaces, newlines or nothing. This makes it easier to accommodate various container formats and their whitespace normalization rules. When a rule applies it means &amp;quot;do not simplify the match code&amp;quot;. Uppercase tokens are constants and predefined by the rule parser. Multiple rules are always OR'ed together.&lt;br /&gt;
&lt;br /&gt;
For more details see the JavaCC grammar: &amp;lt;code&amp;gt;../okapi-core/src/main/java/net/sf/okapi/core/simplifierrules/SimplifierRules.jj&amp;lt;/code&amp;gt; &lt;br /&gt;
&lt;br /&gt;
===Rule Examples===&lt;br /&gt;
&lt;br /&gt;
If Code has any of these flags then don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DELETABLE or ADDABLE or CLONEABLE;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;=&amp;quot; is string match&lt;br /&gt;
Match basic TAGTYPE opening, closing or standalone &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA = &amp;quot;a&amp;quot; and TAGTYPE = OPENING;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;quot;~&amp;quot; is regex match&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA ~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
You can negate any of the match operators &lt;br /&gt;
Don't simplify if the DATA does not match the regex &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if DATA !~ &amp;quot;a.*&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Match on type, linebreak in this case, don't simplify &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if the Code is a linebreak if TYPE = &amp;quot;lb&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Don't simplify any rich text types&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or TYPE = &amp;quot;italic&amp;quot; or TYPE = &amp;quot;underline&amp;quot;;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Expressions can be recursive (supports embedded parens)&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;if TYPE = &amp;quot;bold&amp;quot; or (DATA = &amp;quot;bar&amp;quot; or (DATA = &amp;quot;foo&amp;quot; and TYPE = &amp;quot;underline&amp;quot;));&amp;lt;/pre&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
&lt;br /&gt;
===Filter Config Examples===&lt;br /&gt;
&lt;br /&gt;
Examples of using simplifier rules within the filter config formats used by Okapi.&lt;br /&gt;
&lt;br /&gt;
'''YAML:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
simplifierRules: |&lt;br /&gt;
  if ADDABLE or DELETABLE or CLONEABLE;&lt;br /&gt;
  if DATA = &amp;quot;&amp;lt;br/&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/font&amp;gt;&amp;quot; or DATA = &amp;quot;&amp;lt;/a&amp;gt;&amp;quot;;&lt;br /&gt;
  if DATA ~ &amp;quot;\\&amp;lt;font.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;img.+&amp;quot; or DATA ~ &amp;quot;\\&amp;lt;a.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''ITS:'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
&amp;lt;?xml version=&amp;quot;1.0&amp;quot; encoding=&amp;quot;UTF-8&amp;quot;?&amp;gt;&lt;br /&gt;
&amp;lt;its:rules xmlns:its=&amp;quot;http://www.w3.org/2005/11/its&amp;quot; version=&amp;quot;1.0&amp;quot; xmlns:itsx=&amp;quot;http://www.w3.org/2008/12/its-extensions&amp;quot; xmlns:okp=&amp;quot;okapi-framework:xmlfilter-options&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;!-- See ITS specification at: http://www.w3.org/TR/its/ --&amp;gt;&lt;br /&gt;
 &amp;lt;its:translateRule selector=&amp;quot;//*&amp;quot; translate=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//codeph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;its:withinTextRule selector=&amp;quot;//ph&amp;quot; withinText=&amp;quot;yes&amp;quot;/&amp;gt;&lt;br /&gt;
 &amp;lt;okp:simplifierRules&amp;gt;&lt;br /&gt;
 if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
 &amp;lt;/okp:simplifierRules&amp;gt;&lt;br /&gt;
&amp;lt;/its:rules&amp;gt;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
'''FPRM (Parameters):'''&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&lt;br /&gt;
#v1&lt;br /&gt;
extractNotes.b=true&lt;br /&gt;
simplifyCodes.b=true&lt;br /&gt;
simplifierRules=if ADDABLE or DELETABLE or CLONEABLE; if DATA ~ &amp;quot;.+&amp;quot;;&lt;br /&gt;
&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[[Category:Filters]]&lt;/div&gt;</summary>
		<author><name>Jhargraveiii</name></author>
	</entry>
</feed>