KEEP DUPLICATES

KEEP DUPLICATES

Category: Transform / Filters

Description

This action keeps all duplicate rows and removes all unique rows. This operation can be performed on all columns, or only on specific columns. In the latter case, the uniqueness of values in columns not selected will be ignored.

Use cases

In datasets where all records should be unique, this action helps clean them by looking for duplicate values/records and pulling them out for review - to determine if those records warrant removal or modification.

Action settings

Setting	Description
Apply to	Select whether to check the values in all columns for duplicates, or just specified columns. Options: All columns or Selected columns (and select the columns to check).

Examples

Example #1

Find duplicates in column "Continent".

Before (source table)

River	Length (km)	Continent
Nile	6650	Africa
Amazon	6400	South America
Yangtze	6300	Eurasia
Yellow River	5464	Eurasia

After (result table)

River	Length (km)	Continent
Yangtze	6300	Eurasia
Yellow River	5464	Eurasia

Action parameters

Apply to: Selected columns
Selected columns: Continent

Community examples

Compare two Excel sheets with EasyMorph (Project; Module: Main; Group: Compare data; Table: Matching rows; Action position: 3)

Table of Contents