User Tools

Site Tools


overview

This is an old revision of the document!


EasyMorph Overview

Data Import

In order to start transforming data in EasyMorph, all data subject to manipulations has to be imported into EasyMorph as tables. As of version 2.8 EasyMorph supports the following data sources:

  • Relational databases (Oracle, MS SQL Server, MySQL, PostgreSQL, SQLite)
  • ODBC data sources
  • Text files with delimiters (e.g. CSV)
  • Excel files (.xlsx, .xls)
  • QlikView's QVD files
  • SAS files (.sas7bdat)

Once tables are loaded into EasyMorph they can further be modified using transformations that are applied one after another.

Technically speaking, import is performed by special import transformations which are available from "Create table" menu rather than "Add transformation" menu where all other transformations reside.

Transformations

All manipulations with tables are done with the help of various transformations. E.g. renaming a column is a transformation. Filtering, joining, aggregations – all are transformations. Every transformation is an action that somehow changes a table.

Transformations are executed consequently, step-by-step. One transformation is one step. Every transformation is applied to the result (output) of previous transformation. Therefore sequence of transformations is important. The same transformations in different order would usually produce different result (although not always).

Final state (or result) of a table is the result of all transformations of this table applied consequently, one after another.

Some transformations (e.g. Append Table) require another table. These transformations have two inputs – one input is the result of previous transformation in the same table and the other input is the final state (i.e. the result of last transformation) of the other table.

Here is full list of all transformations.

Expressions

Some transformations can use expressions in order to calculate new columns or filter rows. These expressions can be composed using arithmetical operations, column names and functions. Read more about expressions.

Derived tables

Derived tables is a powerful feature that allows transforming data in a non-linear way. When one table is derived from another (source) table it replicates the final result of the latter. So if the source table changes, the derived table changes automatically. In everything else derived tables is just a regular table and can further be modified using various transformations.

Since you can derive any number of tables from a table, and then keep transforming every derived table in its own way, you can create non-linear transformation chains. For instance, you can produce two derived tables from a table, transform each of them in its own way and then append or merge them back into one table.

Circular dependencies are not allowed in EasyMorph. I.e. you can't append table A to table B if table A is derived from table B directly or indirectly.

Data Export

Exporting data is performed using export transformations. They can be inserted at any point of transformation sequence. Export transformations do not modify tables, they just export them into specified file format. Therefore output of export transformation is the same as input. I.e. it contains exactly the same number of rows and columns in exactly the same order.

Exporting data to databases is done using SQL Command transformation that instructs the target database to upload a file previously created in EasyMorph.

Auto-documentation

EasyMorph can automatically generate documentation from current transformation project. Read more about Auto-documentation.

Command-line execution

EasyMorph projects can be executed from command line thus allowing integration with third-party applications. Read more about command line parameters.

See also:

overview.1505384475.txt.gz · Last modified: 2017/09/14 06:21 by dmitry

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki