# Match Strings

### Description
Finds matches between the `STRING`-columns in the inputs.
Various comparison options can be chosen: equals, contains, startsWith, endsWith or edit-distance.
The result provides both the matching strings, as well as the strings from both inputs that didn't generate a match.

It is strongly recommended that the inputs are already de-duplicated. This block does not do that and duplicates can increase computation time.

### Input
- `A [STRING]`: a list of candidates
- `B [STRING]`: a list of candidates

### Output
- `RESULT [STRING,STRING]`: the matched strings from `A` and `B`
- `NOTA [STRING]`: the strings from A that did not match with a strings from `B`
- `NOTB [STRING]`: the strings from B that did not match with a strings from `A`

### Parameters
- `Comparison`: Comparison function to use
  - `equal`: the strings must be equal
  - `contains`: the string in `B` must be contained in `A`
  - `containsWholeWord`: the string in `B` must be contained in `A`, as a whole word (only punctuation/spaces around)
  - `startsWith`: the string in `A` must start with `B`
  - `endsWith`: the string in `A` must end with `B`
  - `prefix`: strings in `A` and `B` share a prefix of a given length
  - `levenshtein`: the string in `A` may not have more than `Max edit-distance` differences (character insertions or deletions) with `B`.
  - `jaro-winkler`: the strings in `A` and `B` must have a Jaro-Winkler similarity score not smaller than `Min similarity`.
- `Case-sensitive`: if set to `false`, upper/lower case is ignored
- `Exclude self-matches`: whether to emit the match if the objects in `A` and `B` are the same. Mostly useful when `A` and `B` come from the same source