Post-segmentation Inline Codes Removal Step

From Okapi Framework

Revision as of 19:20, 4 June 2016

Overview

This step attempts to simplify (trim and merge) as many inline codes as possible by looking at each linguistically distinct segment in a TextUnit.

The step must be run after segmentation. It joins adjacent inline codes inside segments and optionally moves leading and trailing codes from the segment to an inter-segment TextPart. The original (unmerged) codes are saved as okp:merged attributes in the generated XLIFF file. Trimmed codes are simply written outside the "mrk" elements.
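The merge and trim behavior described above can be illustrated with a minimal sketch. This is not Okapi code: the `Code` class, the token-list model of a segment, the function names, and the `remove_leading_trailing` flag are all hypothetical, and the order of operations (merge first, then trim) is an assumption made for the example.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union


@dataclass
class Code:
    """Hypothetical stand-in for an inline code; not an Okapi class."""
    data: str                     # raw markup, e.g. "<b>"
    merged: Tuple[str, ...] = ()  # original codes, if this is a merge result


Token = Union[str, Code]  # a segment is modeled as a list of text runs and codes


def merge_adjacent(tokens: List[Token]) -> List[Token]:
    """Join runs of adjacent codes into one code, remembering the originals."""
    out: List[Token] = []
    for tok in tokens:
        if isinstance(tok, Code) and out and isinstance(out[-1], Code):
            prev = out.pop()
            originals = (prev.merged or (prev.data,)) + (tok.merged or (tok.data,))
            out.append(Code(prev.data + tok.data, originals))
        else:
            out.append(tok)
    return out


def trim_edges(tokens: List[Token]) -> Tuple[List[Token], List[Token], List[Token]]:
    """Split a segment into (leading codes, body, trailing codes)."""
    start, end = 0, len(tokens)
    while start < end and isinstance(tokens[start], Code):
        start += 1
    while end > start and isinstance(tokens[end - 1], Code):
        end -= 1
    return tokens[:start], tokens[start:end], tokens[end:]


def simplify_segment(tokens: List[Token], remove_leading_trailing: bool = True):
    """Merge adjacent codes, then optionally move edge codes out of the segment."""
    merged = merge_adjacent(tokens)
    if not remove_leading_trailing:
        return [], merged, []
    return trim_edges(merged)


# Illustrative segment content: <b><i>Hello </i><u>world</u></b>
segment = [Code("<b>"), Code("<i>"), "Hello ", Code("</i>"), Code("<u>"),
           "world", Code("</u>"), Code("</b>")]
leading, body, trailing = simplify_segment(segment)
```

In this sketch, `leading` and `trailing` play the role of the codes that end up outside the segment, while `body` keeps the merged code between the two words together with a record of the codes it replaced.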

Takes: Filter Events. Sends: Filter Events.

Parameters

Remove leading and trailing codes: Set this option to move leading and trailing inline codes out of each segment and place them in the inter-segment content.

Limitations

Currently, pre-segmented formats such as XLIFF, TMX, and TTX do not have their codes simplified.