User Tools

Site Tools


transformations:deduplicate

This is an old revision of the document!


Deduplicate

This transformation removes all duplicate rows in entire table. Deduplication can be performed on all columns, or only on specific columns. In the latter case uniqueness of values in non-selected columns will be ignored.

EXAMPLE

Source table: The longest rivers in the world

River Length (km) Continent
Nile 6650 Africa
Amazon 6400 South America
Nile 6650 Africa

Objective: Find and remove duplicate rows.

Output table:

River Length (km) Continent
Nile 6650 Africa
Amazon 6400 South America
transformations/deduplicate.1465744361.txt.gz · Last modified: 2016/06/12 11:12 by dmitry

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki