Match by Double (Blocking)
Description
Finds matches between the DOUBLE columns of inputs A and B, considering only the pairs listed in Candidates.
Input
A [OBJ,DOUBLE]: a list of candidates, in which the DOUBLE column will be used for comparison and the OBJ column will be the result
Candidates [OBJ,OBJ]: candidate pairs; only As and Bs that are in Candidates will be matched
B [OBJ,DOUBLE]: a list of candidates, in which the DOUBLE column will be used for comparison and the OBJ column will be the result
Output
RESULT [OBJ,OBJ]: the matched objects from A and B
NOTA [OBJ]: the objects from A that did not match with an item from B
NOTB [OBJ]: the objects from B that did not match with an item from A
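The relationship between the three inputs and the three outputs can be sketched as follows. This is a minimal illustration, not the tool's actual implementation: the function name `match_by_double`, the representation of inputs as Python lists of tuples, the assumption that each object appears at most once in A and in B, and the rule that an object counts as unmatched when it appears in no emitted pair are all assumptions made for the example.

```python
from typing import Callable, Hashable, List, Tuple

Obj = Hashable


def match_by_double(
    a: List[Tuple[Obj, float]],
    candidates: List[Tuple[Obj, Obj]],
    b: List[Tuple[Obj, float]],
    compare: Callable[[float, float], bool],
) -> Tuple[List[Tuple[Obj, Obj]], List[Obj], List[Obj]]:
    """Hypothetical sketch: return (RESULT, NOTA, NOTB)."""
    # Look up the DOUBLE value of each object (assumes unique objects).
    a_values = dict(a)
    b_values = dict(b)

    # Blocking: only pairs listed in Candidates are ever compared.
    result = []
    for obj_a, obj_b in candidates:
        if obj_a in a_values and obj_b in b_values:
            if compare(a_values[obj_a], b_values[obj_b]):
                result.append((obj_a, obj_b))

    # Objects that ended up in no matched pair (assumed meaning of NOTA/NOTB).
    matched_a = {obj_a for obj_a, _ in result}
    matched_b = {obj_b for _, obj_b in result}
    not_a = [obj for obj, _ in a if obj not in matched_a]
    not_b = [obj for obj, _ in b if obj not in matched_b]
    return result, not_a, not_b


a = [("a1", 1.0), ("a2", 5.0)]
b = [("b1", 1.0), ("b2", 9.0)]
candidates = [("a1", "b1"), ("a2", "b2"), ("a1", "b2")]
result, not_a, not_b = match_by_double(a, candidates, b, lambda x, y: x == y)
print(result, not_a, not_b)
```

In this example only the pair ("a1", "b1") has equal values, so "a2" and "b2" land in the unmatched lists. A pair that is absent from Candidates is never compared, even if its values would satisfy the comparison.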
Parameters
Comparison: comparison function to use (=, !=, <, >, <=, >=, distance)
Max distance: when Comparison is set to distance, the match is valid only if the difference between the two doubles is not greater than this value
Slope: positive value that determines the slope of the ranking curve. The higher the slope, the closer together the resulting scores.
Exclude self-matches: whether to discard a match when the objects in A and B are the same. Mostly useful when A and B come from the same source
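One way to read the Comparison, Max distance and Exclude self-matches parameters is as a predicate applied to each candidate pair. The sketch below is an illustration under stated assumptions, not the tool's code: the names `make_predicate` and `is_match` are hypothetical, the value from A is assumed to be the left operand, and distance is assumed to mean the absolute difference. Slope is left out because the source does not specify the ranking curve.

```python
import operator
from typing import Callable, Hashable, Optional

# Map each documented comparison symbol to a standard-library operator.
_OPERATORS = {
    "=": operator.eq,
    "!=": operator.ne,
    "<": operator.lt,
    ">": operator.gt,
    "<=": operator.le,
    ">=": operator.ge,
}


def make_predicate(
    comparison: str, max_distance: Optional[float] = None
) -> Callable[[float, float], bool]:
    """Hypothetical helper: build the value comparison for one setting."""
    if comparison == "distance":
        if max_distance is None:
            raise ValueError("distance comparison requires a max distance")
        # Assumption: "difference" means the absolute difference.
        return lambda x, y: abs(x - y) <= max_distance
    return _OPERATORS[comparison]


def is_match(
    obj_a: Hashable,
    val_a: float,
    obj_b: Hashable,
    val_b: float,
    predicate: Callable[[float, float], bool],
    exclude_self_matches: bool = False,
) -> bool:
    """Hypothetical helper: decide whether one candidate pair matches."""
    # With Exclude self-matches enabled, identical objects never match.
    if exclude_self_matches and obj_a == obj_b:
        return False
    # Assumption: the value from A is the left operand.
    return predicate(val_a, val_b)


within_half = make_predicate("distance", max_distance=0.5)
print(within_half(1.0, 1.4))  # difference 0.4, within the limit
print(within_half(1.0, 1.6))  # difference 0.6, outside the limit
```

Separating the self-match check from the value comparison keeps the two parameters independent: excluding self-matches only matters when A and B can contain the same object, for instance when both inputs come from the same source.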