Category: Transform / Filters
This action removes duplicate rows from the table, keeping one copy of each. Deduplication can be based on all columns or only on specific columns.
Use this action to remove records that may have been duplicated in the source dataset or during previous actions, and keep unique records only.
Setting | Description |
---|---|
Apply to | Select whether to base deduplication on all columns in the dataset or only on selected columns. Options: All columns, or Selected columns (then choose the columns to compare from the list). |
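When deduplicating based on specific columns, rows are compared on those columns only, so values in the remaining columns may differ between rows that count as duplicates. In the examples below, the first occurrence of each duplicate is the row that is kept.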
Find and remove duplicate rows based on the values in all columns.
Input:
River | Length (km) | Continent |
---|---|---|
Nile | 6650 | Africa |
Amazon | 6400 | South America |
Nile | 6650 | Africa |
Output:
River | Length (km) | Continent |
---|---|---|
Nile | 6650 | Africa |
Amazon | 6400 | South America |
Apply to: All columns
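For readers who want to reproduce this result outside the tool, the all-columns case can be sketched in plain Python. This is only an illustration of the idea, not the action's actual implementation; the function name `dedupe_all_columns` is hypothetical, and keeping the first occurrence is an assumption based on the example above.

```python
def dedupe_all_columns(rows):
    """Remove rows that are identical in every column.

    Hypothetical helper: rows are tuples, and the first copy of each
    row is kept in its original position (an assumption, not a
    documented guarantee of the action).
    """
    # dict.fromkeys preserves insertion order, so the first occurrence wins.
    return list(dict.fromkeys(rows))


rivers = [
    ("Nile", 6650, "Africa"),
    ("Amazon", 6400, "South America"),
    ("Nile", 6650, "Africa"),  # exact duplicate of the first row
]

unique_rivers = dedupe_all_columns(rivers)
for row in unique_rivers:
    print(row)
```

Rows must match in every column to be treated as duplicates here; a row that differs in even one value is kept.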
Remove duplicate rows based on the FirstName and LastName columns.
Input:
FirstName | LastName | MiddleInit |
---|---|---|
Leah | Boswell | A |
José | Silva | D |
Gino | Basso | E |
Leah | Boswell | F |
Gino | Basso | R |
Santa | Alegio | G |
Output:
FirstName | LastName | MiddleInit |
---|---|---|
Leah | Boswell | A |
José | Silva | D |
Gino | Basso | E |
Santa | Alegio | G |
Apply to: Selected columns (FirstName, LastName)
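The selected-columns case can be sketched the same way: build a key from the chosen columns only and keep the first row seen for each key. The function name `dedupe_by_columns` is hypothetical, and the keep-first behavior is an assumption inferred from the example output above rather than a documented rule.

```python
def dedupe_by_columns(rows, key_columns):
    """Keep the first row for each distinct combination of key_columns.

    Hypothetical helper: rows are dicts mapping column name to value.
    Columns outside key_columns are ignored when comparing rows.
    """
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row[col] for col in key_columns)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


people = [
    {"FirstName": "Leah", "LastName": "Boswell", "MiddleInit": "A"},
    {"FirstName": "José", "LastName": "Silva", "MiddleInit": "D"},
    {"FirstName": "Gino", "LastName": "Basso", "MiddleInit": "E"},
    {"FirstName": "Leah", "LastName": "Boswell", "MiddleInit": "F"},
    {"FirstName": "Gino", "LastName": "Basso", "MiddleInit": "R"},
    {"FirstName": "Santa", "LastName": "Alegio", "MiddleInit": "G"},
]

unique_people = dedupe_by_columns(people, ["FirstName", "LastName"])
for person in unique_people:
    print(person["FirstName"], person["LastName"], person["MiddleInit"])
```

Because MiddleInit is not part of the key, the later Leah Boswell (F) and Gino Basso (R) rows are dropped even though their middle initials differ from the rows that are kept.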