Category: Transform / Advanced
This action goes through the text values of a column in one table and finds matching values in another table (lookup table). Matched lookup values then replace original values. In the case of multiple matches, an arbitrary match is taken (in no particular order) and other matches are ignored.
The Fuzzy mode of this action can be used for matching text values that may have typos.
|Column||Select the column in the current dataset that contains the values to match in the lookup table.|
|Lookup table||Select the dataset to be use for the lookup.|
|Lookup column||This setting appears once the Lookup table has been selected. Select the column that contains the values
to be matched to.
|Match mode||Options: Contains, Starts with, Ends with, and Fuzzy. See the table below.|
|Case sensitive match||When checked, text case must match in the lookup values.|
|If not matched||Choose how unmatched values are handled. Options: Keep existing value, Make empty, or Default to (and choose
whether this value is an expression, or text or a number, and enter the expression or text/number value).
|Contains||A lookup value is returned if the source value contains it. For example, "tor" would be returned if the source value
|Starts with||A lookup value is returned if the source value starts with it. For example, "Moto" would be returned if the source value
|Ends with||A lookup value is returned if the source value ends with it. For example, "rola" would be returned if the source value
|Fuzzy||A lookup value is returned if the source value is within the specified edit distance.
The Damerau–Levenshtein edit distance measures the number of edit operations required to produce one word
|Max. edit distance||This setting appears when the Match mode is set to "Fuzzy". Choose the edit distance used by the Fuzzy-matching
process. See the Fuzzy mode, above.
Example: Match the source value to any of the lookup values. (Illustrates the example statements given above in the Match mode settings table)
Table 1: Source value
Table 2: Lookup table
Column is "Source text" (in Table 1)
Lookup table is "Table 2"
Lookup column is "Lookup values"
Case sensitive is unchecked.
If not matched, "Make empty"
Using the two tables and parameters above, the following tables show the results of the given settings.
Contains match mode:
Starts with match mode:
Ends with match mode:
Fuzzy match mode (max edit distance of 1):