Normalize diacritics¶
Description¶
Normalize diacritics into the closest ASCII characters, from all strings in a [OBJ,STRING]
input.
E.g.:
Nguyễn Tấn Dũng : Nguyen Tan Dung
St.-Veit-Straße : St.-Veit-Strasse
Input¶
SOURCE [OBJ,STRING]
: a 2-column input with an object-string pair. Typically obtained with theExtract string
block
Output¶
RESULT [OBJ,STRING]
: the pairs fromSOURCE
, where the string has been modifiedSTRINGS [STRING]
: the modified strings, without the object they were paired to
Output scores can be aggregated and/or normalised.