Aggregation Function

The Aggregation Function node is used create a new column based on an aggregation of multiple given numeric columns. This function supports the following numeric column types: Integer, Float 32, Float 64, and BigInteger.

The Aggregation Function function differs from defining aggregation from Data Model, in which case the aggregation type can be set for each measure and is used in the querying process only. Using the Aggregation Function node in the Data Flow creates a new aggregated column that is part of the schema of the underlying database.

The Aggregation Function node is useful if end users have a need to analyze aggregations of multiple columns. For example, there may be a need for a column summing the cost and overhead to show all expenses.

This can be creating by inputting both of those columns in the Aggregation Function node and setting the aggregation type to Sum.

Configure Aggregation Function

Connect the Aggregation Function node to the Select tables of the relevant table.

Go to the Properties panel and set the aggregation preferences:

  • Select Column: Choose the columns that you want to apply the aggregation to.
  • Function Type: Select the type of aggregation to use (see below for more information).
  • Column Output: Determine whether to keep the original column.
  • New Column Name: Name the new column.

Function Type

The Aggregation Function node enables the following aggregate functions:

  • Sum: Returns the sum of values in each row from the given numeric columns.
  • Average: Returns the average values for each row of the given numeric columns.
  • Minimum: Returns the lowest values in each row from the given numeric columns.
  • Maximum: Returns the highest values in each row from the given numeric columns.

Split Nodes

The Aggregation Function node can be used to produce one aggregated column. If you want to create multiple aggregated columns based on a single source table, you can connect multiple Aggregation Function nodes to the relevant table, thereby splitting the table node.

You can then use the merge function to combine the original table with the new aggregated columns. See the final example below for a demonstration of this process.

Related information

Common Properties

There are a number of fields that are present in the Properties panel when you have any of the preceding nodes selected on the canvas. These fields include Result Properties, Column Selection, Set Variable Values, and Metadata.

  • Click here for more details about the Common Properties