User Tools

Site Tools


transformations:deduplicate

Deduplicate

This transformation removes all duplicate rows in entire table. Deduplication can be performed on all columns, or only on specific columns. In the latter case uniqueness of values in non-selected columns will be ignored.

EXAMPLE

Source table: The longest rivers in the world

River Length (km) Continent
Nile 6650 Africa
Amazon 6400 South America
Nile 6650 Africa

Objective: Find and remove duplicate rows.

Output table:

River Length (km) Continent
Nile 6650 Africa
Amazon 6400 South America
transformations/deduplicate.txt · Last modified: 2016/06/12 15:12 by dmitry