Text Modification Step

From Okapi Framework
Revision as of 18:20, 4 June 2016 by Ysavourel (talk | contribs) (1 revision imported)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Overview

This step modifies the content of the text units.

Takes: Filter events. Sends: Filter events.

Text units set as non-translatable are not modified.

Text units with existing translations are modified only if requested.

Parameters

Type of change to perform — Select a kind of change to apply. Several are available:

  • Keep the original text
  • Replace letters with Xs and digits with Ns.
  • Remove text but keep inline codes.
  • Replace selected ASCII characters with Extended Latin characters.
  • Replace selected ASCII characters with Cyrillic characters.
  • Replace selected ASCII characters with Arabic characters.
  • Replace selected ASCII characters with Chinese characters.

Note that the result of the character substitution is not meant have any specific meaning beyond being a set of characters in a given script. This function does not perform a translation, or a transliteration, or any other meaningful linguistic operation.

Add the following prefix — Set this option to add a prefix at the start of each text unit. Enter the text of the prefix.

Add the following suffix — Set this option to add a suffix at the end of each text unit. Enter the text of the suffix.

Append the name of the item — Set this option to add the name of each text unit at the end of its value. If the text unit has no name associated, the extraction ID is added instead.

Append the extraction ID of the item — Set this option to add the extraction ID of each text unit at the end of its value. Extraction IDs are filter-specific.

Marks segments with '[' and ']' delimiters — Set this option to add delimiters to the segments in each text unit. The delimiters are just around the text (after the prefix if one is added, and before the item name, extraction ID, and suffix if they are added). If the text unit is not segmented, the delimiters are added at the front and back of the full content of the text unit.

Expand the text — Set this option to expand the text. If the content is less than 31 characters it is expanded by 50% or at least one character. If it is longer than 30 characters, it is expanded by 100%. Empty strings are not expanded.

Modify also the items without text — Set this option to apply the changes also to text units that have no text (i.e. are empty, or contain only white spaces or codes).

Modify also the items with an existing translation — Set this option to apply the changes also to text units that already have a translation (e.g. for multilingual files).

Limitations

None known.