This is an old revision of the document!

DEDUPLICATE ROWS

Category: Transform / Advanced

Description

This action removes all duplicate rows in the entire table. Deduplication can be performed based on all columns, or only on specific columns.

Use Deduplicate rows to clean datasets of records that may have been duplicated in the source dataset, or during previous actions.

Setting	Description
Apply to	Select whether to base deduplication on all columns in the dataset, or only selected columns. Options: All columns or Selected columns (and select the columns to use from the list).

When deduplicating based on specific columns:

Objective: Find and remove duplicate rows.

Source table: The longest rivers in the world

Action parameters:

Apply to "All columns"

Result:

River	Length (km)	Continent
Nile	6650	Africa
Amazon	6400	South America