Changes from 1.47.0 to 1.48.0

Core

  • Changed: TextFragmet renumberCodes now assigns a new code id for closing tags without a corresponding opening tag.
  • Removed: XLIFF Writer: includeNoTranslate parameter.
  • Improved: the generation of code IDs based on string values is consistent with versions 1.42.0 and earlier: issue#1366.

Connectors

  • Apertium MT

    • Fixed: update the API and endpoint: Issue#656.

Filters

  • CSV Filter

    • Fixed: escape qualifiers in unqualified fields.
  • HTML Filter

    • Improved: start tags with forbidden end tags handled with well-formed configuration: issue#1401.
  • IDML Filter

    • Fixed: correctly display codefinder rules in the filter config UI
    • Added: math zones conditional extraction: issue#1412.
    • Added: adjacent codes merge: issue#1415.
    • Added: story elements isolation by style: issue#1418.
    • Fixed: the segmented content handling clarified: issue#1426.
    • Fixed: font mapping for names with processing instructions: issue#1432.
  • ITS Filter

    • Fixed: within text handling crash: issue#1402.
    • Fixed: inline non-translatable text extraction: issue#1404.
    • Improved: the codes simplification can be performed on the filter level: issue#1427.
    • Fixed a case where subfiltering an empty CDATA node corrupted the target file.
  • JSON Filter

    • Added: an option to set the size-unit property when specifying maxwidth.
  • Markdown Filter

    • Fixed: appearance of extra whitespace before a blockquote: issue#1439.
  • OpenXML Filter

    • Fixed: PPTX: a line break prepended by a run with empty text (extended cases): issue#1373.
    • Fixed: XLSX: tinted colors handling clarified: issue#1378.
    • Improved: PPTX: graphic metadata translation supported: issue#1380.
    • Improved: whitespace styles ignorance supported: issue#1385.
    • Added: XLSX: non-empty maxwidth and size-unit properties become available per worksheet and target column: issue#1386.
    • Improved: styles and highlight colors exclusion and inclusion: issue#1394.
    • Improved: DOCX: phonetic guide and base texts extracted as nested text units.
    • Added: XLSX: source and target columns content joining on extraction: issue#1399.
    • Added: PPTX: cached chart strings for translation: issue#1405.
    • Added: PPTX: cached chart numbers and formatCode for translation.
    • Added: embedded Excel package removal (related values extraction must be turned on - cached chart strings or numbers).
    • Added: inline code finding: issue#1406.
    • Added: DOCX: ASCII and High Ansi font categories conditional preservation issue#1411.
    • Fixed: XLSX: cells with omitted values supported: issue#1419.
    • Added: XLSX: extraction of explicitly specified worksheets: issue#1420.
    • Fixed: DOCX: line breaks merging: issue#1421.
    • Improved: PPTX: block level run properties parsing.
    • Fixed: block level leading codes mapping.
    • Fixed: DOCX: hidden tables and columns created with Apache POI ≥ 5.x now recognised (support for on/off vanish values in addition to true/false). issue#1424.
    • Fixed: XLSX: source to target column extraction with hidden content clarified: issue#1428.
    • Added: XLSX: when configuring worksheet columns, allow the ability to exclude all columns that are not specified as part of the configuration. issue#1429.
    • Added: XLSX: source column cell styles conditional treatment for exclusion: issue#1430.
    • Improved: nested textual unit IDs generation and handling: issue#1431.
    • Added: DOCX and XLSX: code display text contains excluded run content: issue#1433.
    • Improved: DOCX: exclusion by default highlight and font color values: issue#1437.
    • Fixed: DOCX: default run formatting optimised conditionally: issue#1440.
    • Improved: DOCX: code type information of highlight colors always represented as RGB values.
    • Improved: DOCX: font color empty ignorance thresholds always considered as default minimum (000000) and maximum (FFFFFF).
    • Deprecated: DOCX: the bPreferenceTranslateWordExcludeGraphicMetaData string parameter: please use the translateWordGraphicName one instead. Automatic migration is available on loading.
    • Deprecated: PPTX: the bReorderPowerpointNotesAndComments string parameter: please use the bPreferenceReorderPowerpointNotes and reorderPowerpointComments ones instead. Automatic migration is available on loading.
  • PDF Filter

    • Added: Option to use SortByPosition to order results by position on page
  • Subtitle Filter

    • Added: Option to not merge captions
    • Added: Codefinder support
    • Added: (TTML filter) an option to toggle escape br mode
    • Improved: Handling of automatic splitting in CJK languages
    • Modified: Improved automatic splitting no longer applies to Korean, since Korean translations usually contain spaces.
  • TTX Filter

    • Fixed: fuzzy match maximum value definition.

Libraries

  • Merge Library

    • Added: preserveWhitespace parameter.
  • XLIFF 2.0 Library

    • Added a new validation option to allow for less strict processing of “monolingual” files. When disabled, empty <target> elements will be allowed, even if no trgLang is declared, and a trgLang attribute consisting of the empty string will be treated as not present, rather than as a validation error.

    • Added a validation option which, when not set, allows for <file> elements containing no <unit> or <group> content to be accepted.

    • Tweaked validation such that enforcement of NMTOKEN types in element ids is only enforced by schema validation, and is no longer redundantly enforced by the code.

  • Serialization Library

    • Added: text part (segment) level alternative translations.

Steps

  • Code Simplifier Step

    • Improved: the rules of step parameters can be overridden by the rules of filter parameters: issue#1416.
    • Deprecated: the removeLeadingTrailingCodes boolean parameter. It will be automatically migrated to the moveLeadingAndTrailingCodesToSkeleton one in the next release.
    • Deprecated: the mergeCodes boolean parameter. It will be automatically migrated to the mergeAdjacentCodes one in the next release.
    • Deprecated: the rules string parameter. It will be automatically migrated to the simplifierRules one in the next release.
  • Merging Step

    • Improved: preserveWhiteSpaceByDefault replaced by preserveWhitespace parameter to align the names throughout the codebase.
  • Post Segmentation Code Simplifier Step

    • Deprecated: the removeLeadingTrailingCodes boolean parameter. It will be automatically migrated to the moveLeadingAndTrailingCodesToSkeleton one in the next release.
    • Deprecated: the mergeCodes boolean parameter. It will be automatically migrated to the mergeAdjacentCodes one in the next release.
    • Deprecated: the rules string parameter. It will be automatically migrated to the simplifierRules one in the next release.
  • Rainbow Kit Step

    • Added: preserveWhitespace parameter.