Normalize Diacritics
Description
Normalize diacritics into the closest ASCII characters, from all strings in a [OBJ,STRING] input.
Example:
- Nguyễn Tấn Dũng : Nguyen Tan Dung
- St.-Veit-Straße : St.-Veit-Strasse
Input
SOURCE [OBJ,STRING]: a 2-column input with an object-string pair. Typically obtained with the Extract Strings block.
Output
RESULT [OBJ,STRING]: the pairs fromSOURCE, where the string has been modifiedSTRINGS [STRING]: the modified strings, without the object they were paired to
Output scores can be aggregated and/or normalized.