Transform Strings

Description

Performs transformations on strings: case conversion, stemming, and normalization of diacritics.

Input

  • SOURCE [STRING]: Strings to transform

Output

  • PAIRS [STRING, STRING]: pairs of each original string and its modified form

  • RESULT [STRING]: the modified strings

Parameters

  • Normalize diacritics: transliterates non-ASCII characters into their closest ASCII form

  • Stemming: strings (single words) can be stemmed for a specific language or left as they are

  • Case: converts all characters to lowercase or uppercase
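
The three parameters above can be illustrated with a minimal Python sketch. It is not the module's actual implementation: the function names (strip_diacritics, transform_strings), the argument names, and the order in which the steps are applied are all assumptions made for illustration. Diacritics are handled by Unicode decomposition, which only approximates a full transliteration (characters such as "ß" or "ø" have no decomposition and pass through unchanged). Stemming is left as a pluggable callable, since no particular stemmer is named here.

```python
import unicodedata
from typing import Callable, Iterable, List, Optional, Tuple


def strip_diacritics(text: str) -> str:
    """Approximate ASCII folding: decompose characters, then drop combining marks."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def transform_strings(
    source: Iterable[str],
    normalize_diacritics: bool = False,
    stemmer: Optional[Callable[[str], str]] = None,  # hypothetical hook for a language-specific stemmer
    case: Optional[str] = None,  # "lower", "upper", or None to leave case unchanged
) -> Tuple[List[Tuple[str, str]], List[str]]:
    """Return (PAIRS, RESULT) for the given SOURCE strings.

    The step order (diacritics, stemming, case) is an assumption of this sketch.
    """
    pairs: List[Tuple[str, str]] = []
    for original in source:
        modified = original
        if normalize_diacritics:
            modified = strip_diacritics(modified)
        if stemmer is not None:
            modified = stemmer(modified)
        if case == "lower":
            modified = modified.lower()
        elif case == "upper":
            modified = modified.upper()
        pairs.append((original, modified))
    result = [modified for _, modified in pairs]
    return pairs, result


pairs, result = transform_strings(
    ["Café", "Ångström"], normalize_diacritics=True, case="lower"
)
print(pairs)   # [('Café', 'cafe'), ('Ångström', 'angstrom')]
print(result)  # ['cafe', 'angstrom']
```

Passing stemmer=None corresponds to leaving the strings "as they are"; any function that maps a single word to its stem could be supplied instead.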

Output scores can be aggregated and/or normalised.