Okapi Framework - Steps
Table of Contents |
|
If you are using an Okapi Tool after the M9 release, you should be using the wiki online help:
http://www.opentag.com/okapi/wiki/index.php?title=Steps
The default distribution of the framework comes with the pre-defined steps
listed below. Note that some of them may be Beta or Alpha versions. See the
documentation of each step for more information.
The letters between parenthesis indicates what type of data the step
takes and what type it sends. RD=Raw Document, FE=Filter Events.
- Batch Translation
(RD > RD) -
Creates translation memories using external tools.
- Desegmentation (FE >
FE) - Joins all the segments of each source and/or target text of each text
units into a single content.
- Diff Leverage (FE > FE)
- Performs context-based translation leveraging between two sets of input
files.
- External Command (RD
> RD) - Executes an external command.
- Filter Events to Raw
Document (FE > RD) - Merges filter events back into the original
file format.
- Format Conversion
(FE > FE) - Converts extracted data to a given file format.
- Full-Width Conversion
(FE > FE)
- Converts to or from full-width characters
- Generate SimpleTM (FE >
FE) Creates a SimpleTM TM from the text units of a set of bilingual
documents.
- Image Modification (RD
> RD) - Changes the size and/or the format of an image file.
-
Id-Based Aligner (FE > FE) - Aligns text units of two input
documents based on their unique ids (resname).
- Inline Codes Removal (FE > FE)
- Removes markers and/or content of inline codes from text units.
- Leveraging (FE > FE) -
Leverages existing translations into extracted text units.
- Line-Break Conversion
(RD > RD) - Converts line-breaks to a specified type.
- Quality Check (FE > FE) -
Detect potential quality issues by comparing the translation with their
source content.
- Raw Document to
Filter Events (RD > FE) - Extracts translatable data from a raw document
and convert it into filter events.
-
Resource Simplifier (FE > FE) - Creates multiple events from
events with complex resources.
- Search and Replace
(RD/FE > RD/FE) - Performs search and replace actions.
- Segmentation (FE > FE)
- Segments extracted text.
- Sentence Alignment
(RD > RD) - Align the sentences of the text units of two documents.
- SimpleTM to TMX (RD >
RD) - Converts a SimpleTM TM into a TMX document.
- Term Extraction (FE >
FE) - Creates a list of possible terms from the input documents.
- Text Modification
(FE > FE) - Modifies the content of the text units.
- TM Import (FE > FE) -
Imports text units into a Pensieve TM.
- Tokenization (FE > FE)
- Tokenizes the text units content into different forms.
- Translation Comparison
(FE > FE) - Compares the translations of the same entries between several files.
- URI Conversion (FE >
FE) - Converts text into or from URI escaped notation.
- Used Characters Listing
(FD > FD) - Generates a list of all characters used in the translatable text
of a given set of documents,
- Word Count (FE > FE) -
Perfoms word counts on text units.
-
XLIFF Splitter (RD > RD) - Splits the <file> elements of XLIFF
input into separate documents.
-
XML Validation (RD > RD) - Performs well-formness check and,
optionally, DTD or schema validation on XML files.
- XSL Transform (RD > RD)
- Applies an XSLT template on the input documents.
You may have additional steps available as plug-ins. For more information
about those steps, look in the dropins
directory of your installation.