Okapi Framework - Steps

Translation Comparison Step

- Overview
- Parameters

If you are using an Okapi Tool after the M9 release, you should be using the wiki online help:
http://www.opentag.com/okapi/wiki/index.php?title=Translation_Comparison_Step

Overview

This step compares the translation between two (or three) documents.

Takes: Filter events
Sends: Filter events

The input is two or three documents, the first document contains the base translations, the second one (and optionally the third one, contain the translations to compare to the base translations. The input documents must have the same number of text units and the text units must be in the same order (but you can have different events between text units).

For each entry the step compares the translated texts and calculates a score that measures how similar the text of the second document (and optionally the third one) is to the text of the baseline translation. The score is a number between 0 and 100. The score is based on the Levenstein distance and additional processing. You can find more information on that metric at http://www.merriampark.com/ld.htm and http://en.wikipedia.org/wiki/Levenshtein_distance.

The utility outputs an HTML document for each input documents. The HTML document lists the source text, the translation text and the resulting scores. You can also generate a TMX document containing the same information.

Parameters

Generate output tables in HTML -- Set this option to create one HTML file for each input document, where the results of the comparisons are listed. The name of the HTML result file is the name of the first document to compare with the base document, plus an .html extension.

Open the first HTML output after completion -- Set this option to open the first HTML result file created when the process is completed.

Generate a TMX output document -- Set this option to create one single TMX document that contains all the translations compared and their scores, for all input documents.

Enter the full path of the TMX document to generate.

Suffix for the target language code of the document 2 -- Enter the suffix to append to the target language code for the second set of target entries. For example, enter -mt to indicate that the second target language is a machine-translation generated text. If you do not enter a suffix, the language code for both target entries will be the same and the TMX document will include only the last one.

Suffix for the target language code of the document 3 -- Enter the suffix to append to the target language code for the third set of target entries.

Label for the document 1 -- Enter an optional label that is used in the report to label the base translation.

Label for the document 2 -- Enter an optional label that is used in the report to label the second translation.

Label for the document 3 -- Enter an optional label that is used in the report to label the optional third translation.

Take into account case differences -- Set this option to take into account case differences when calculating the scores.

Take into account white-space differences -- Set this option to take into account white-space differences when calculating the scores.

Take into account punctuation differences -- Set this option to take into account punctuation differences when calculating the scores.