Transform Strings

Description

Perform various transformations on strings. Casing, stemming, removal of diacritics, etc.

Input

  • SOURCE [STRING]: Strings to transform

Output

  • PAIRS [STRING,STRING] : pairs with original and modified strings
  • RESULT [STRING]: the modified strings

Parameters

  • Normalize diacritics: transliterates non-ASCII characters into their closest ASCII form
  • Stemming: strings (single words) can be stemmed for a specific language or left as they are
  • Case: Convert all characters to lower-/upper-case

Output scores can be aggregated and/or normalized.