Table of Contents
FILTER
Category: Transform / Filters
Description
This action filters a dataset based on selected values in a specified column. The action can be set to either keep the rows or remove them.
Use cases
Create subsets of a dataset based on values in a specific column.
Clean a dataset by removing records that may have "junk" data in a specific column.
Remove or keep records containing a "flag" value in a specific column.
Action settings
Setting | Description |
---|---|
Filter column | Select the column containing the values to be used for filtering. |
Keep/Remove columns | Select whether to keep the rows containing the selected values or remove them. Select the individual values to keep/remove from the list at the bottom of the settings pane. Options: Keep only selected values or Remove selected values. |
Remarks
This action keeps or removes rows containing selected whole values in a single column and does not support partial matches or multiple columns.
The choice to use keep vs. remove depends on how you want to handle new values that are introduced in the dataset. If you select specific values to keep, the new values will not appear in the dataset. If you select specific values to remove, the new values will appear in the dataset.
Examples
Example: Find out which are the longest rivers in both Americas.
Source table: The longest rivers in the world
River | Length (km) | Continent |
---|---|---|
Nile | 6650 | Africa |
Amazon | 6400 | South America |
Yangtze | 6300 | Asia |
Mississippi | 6275 | North America |
Yenisei | 5539 | Asia |
Yellow River | 5464 | Asia |
Ob | 5410 | Asia |
Paraná | 4880 | South America |
Action parameters:
Column is "Continent"
Filtering mode is "Keep only selected values"
Values selected: "South America"; "North America "
Result table:
River | Length (km) | Continent |
---|---|---|
Amazon | 6400 | South America |
Mississippi | 6275 | North America |
Paraná | 4880 | South America |
Community examples
- Finding all projects with particular action (Project; Module: Main; Group: Tab 1; Table: Documents; Action position: 2)
- Search string on input table (Project; Module: Main; Group: Tab 1; Table: INPUT TABLE.xlsx; Action position: 4)
- Iterate through a set of column (Project; Module: Main; Group: Tab 1; Table: in.csv; Action position: 7)
- Duplicating rows based on a condition (Project; Module: Main; Group: Tab 1; Table: Table 1 (2); Action position: 3)
- Collapsing multiple rows into one (Project; Module: Main; Group: Group 1; Table: Method 1; Action position: 3)
- How to verify uniqueness for a combination of fields (Project; Module: Main; Group: Group 1; Table: Check; Action position: 3)