Normalize diacritics

Description

Normalize diacritics into the closest ASCII characters, from all strings in a [OBJ,STRING] input. E.g.:

  • Nguyễn Tấn Dũng : Nguyen Tan Dung

  • St.-Veit-Straße : St.-Veit-Strasse

Input

  • SOURCE [OBJ,STRING]: a 2-column input with an object-string pair. Typically obtained with the Extract string block

Output

  • RESULT [OBJ,STRING]: the pairs from SOURCE, where the string has been modified

  • STRINGS [STRING]: the modified strings, without the object they were paired to

Output scores can be aggregated and/or normalised.