Multi-Parsers Filter

From Okapi Framework
Revision as of 07:03, 9 August 2018 by Okapifra (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Overview

The Multi-Parsers Filter is an Okapi component for extracting translatable text from two-levels complex formats. For example: a CSV file some some columns in Markdown, some in HTML and some in plain text. By defaults, the filter settings are defined to process a CSV file where all columns are translatable and plain text.

Processing Details

Input Encoding

The filter decides which encoding to use for the input file using the following logic:

If the file has a Unicode Byte-Order-Mark: Then, the corresponding encoding (e.g. UTF-8, UTF-16, etc.) is used. Otherwise, the input encoding used is the default encoding that was specified when setting the filter options.

Parameters

TODO

Limitations

This filter is BETA