Celonis Product Documentation

PU_MEDIAN
Description

Calculates the median of the specified source column for each element in the given target table.

The median is the middle element of a group. If the group has an even number of elements, the upper value of the two middle values is taken as the median.
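The "upper value" rule can be illustrated outside of PQL. The following is a minimal Python sketch, not Celonis code; it assumes that Python's `statistics.median_high` is an acceptable stand-in for the rule described above, and the helper name `upper_median` is made up for illustration.

```python
import statistics
from datetime import date

def upper_median(values):
    """Return the middle element; for an even count, the upper of the two middle values."""
    # median_high sorts the data and picks an actual element, so the
    # result keeps the type of the input (int, float, or date).
    return statistics.median_high(values)

print(upper_median([600, 400, 200]))  # 400 (odd count: the middle element)
print(upper_median([200, 400]))       # 400 (even count: the upper middle value)
print(upper_median([date(2024, 1, 1), date(2024, 3, 1)]))  # 2024-03-01
```

Because the result is always one of the input elements, no averaging takes place, which is consistent with the result keeping the data type of the input column.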

Like the regular MEDIAN operator, PU_MEDIAN accepts an INT, FLOAT, or DATE column. The result has the same data type as the input column.

Syntax
 PU_MEDIAN ( target_table, source_table.column [, filter_expression] )
  • target_table: The table to which the aggregation result should be pulled. This can be:

  • source_table.column: The column which should be aggregated for every row of the target_table.

  • filter_expression (optional): A filter expression that specifies which values of the source_table.column are taken into account for the aggregation.

NULL handling

If no value in the source table column exists for the element in the target table (either because all values of the source table are filtered out, or because no corresponding value exists in the first place), NULL will be returned. NULL values in the source table column are treated as if the row does not exist.
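The NULL rules can be emulated on plain Python data. This is a hedged sketch rather than the actual implementation: the function `pu_median`, its parameters, and the use of `None` for NULL are assumptions made for illustration, and the sample rows mirror the case table used in the examples of this page.

```python
import statistics

def pu_median(target_keys, source_rows, key_column, value_column, row_filter=None):
    """Emulate PU_MEDIAN on plain Python rows; None stands in for NULL."""
    result = {}
    for key in target_keys:
        values = [
            row[value_column]
            for row in source_rows
            if row[key_column] == key
            and row[value_column] is not None            # NULL source values are skipped
            and (row_filter is None or row_filter(row))  # optional filter expression
        ]
        # No remaining value for this target element: return NULL (None)
        result[key] = statistics.median_high(values) if values else None
    return result

case_table = [
    {"caseId": 1, "companyCode": "001", "value": 600},
    {"caseId": 2, "companyCode": "001", "value": 400},
    {"caseId": 3, "companyCode": "001", "value": 200},
    {"caseId": 4, "companyCode": "002", "value": 300},
    {"caseId": 5, "companyCode": "002", "value": 300},
    {"caseId": 6, "companyCode": "003", "value": 200},
]
company_codes = ["001", "002", "003"]

medians = pu_median(company_codes, case_table, "companyCode", "value",
                    row_filter=lambda row: row["caseId"] > 3)
print(medians)  # {'001': None, '002': 300, '003': 200}
```

Both causes of a NULL result end in the same branch: whether the rows were filtered out or never existed, the list of values is empty and `None` is returned.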

Examples

[1]

Calculate the median of the case table values for each company code:

Query

Column1

         "companyDetail"."companyCode"
        

Column2

         PU_MEDIAN ( "companyDetail" , "caseTable"."value" )
        

Input

caseTable
caseId : int | companyCode : string | value : int
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail
companyCode : string | country : string
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys
caseTable.companyCode | companyDetail.companyCode

Output

Result
Column1 : string | Column2 : int
'001'            | 400
'002'            | 300
'003'            | 200

[2]

PU-functions can be used in a FILTER. In this example, the company codes are filtered such that the corresponding median case table value is smaller than 300:

Query

Filter

         FILTER PU_MEDIAN ( "companyDetail" , "caseTable"."value" ) < 300;
        

Column1

         "companyDetail"."companyCode"
        

Input

caseTable
caseId : int | companyCode : string | value : int
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail
companyCode : string | country : string
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys
caseTable.companyCode | companyDetail.companyCode

Output

Result
Column1 : string
'003'

[3]

PU-functions can be used inside another aggregation function. In this example, the median case table value is first calculated for each company code, and the maximum of these medians is then returned:

Query

Column1

         MAX ( PU_MEDIAN ( "companyDetail" , "caseTable"."value" ) )
        

Input

caseTable
caseId : int | companyCode : string | value : int
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail
companyCode : string | country : string
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys
caseTable.companyCode | companyDetail.companyCode

Output

Result
Column1 : int
400
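The nested aggregation in this example can be read as two steps. The Python sketch below is only an emulation of that reading on the sample data, not how the PQL engine evaluates the query; the variable names are illustrative.

```python
import statistics
from collections import defaultdict

# (companyCode, value) pairs taken from the caseTable above
case_table = [
    ("001", 600), ("001", 400), ("001", 200),
    ("002", 300), ("002", 300),
    ("003", 200),
]

# Group the source values by the key of the target table
values_by_code = defaultdict(list)
for company_code, value in case_table:
    values_by_code[company_code].append(value)

# Step 1: the inner PU_MEDIAN yields one value per company code
medians = {code: statistics.median_high(vals) for code, vals in values_by_code.items()}

# Step 2: the outer MAX aggregates those per-company medians
overall_max = max(medians.values())

print(medians)      # {'001': 400, '002': 300, '003': 200}
print(overall_max)  # 400
```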

[4]

Calculate the median of the case table values for each company code. Only consider cases with an ID larger than 2:

Query

Column1

         "companyDetail"."companyCode"
        

Column2

         PU_MEDIAN ( "companyDetail" , "caseTable"."value" , "caseTable"."caseId" > 2 )
        

Input

caseTable
caseId : int | companyCode : string | value : int
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail
companyCode : string | country : string
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys
caseTable.companyCode | companyDetail.companyCode

Output

Result
Column1 : string | Column2 : int
'001'            | 200
'002'            | 300
'003'            | 200

[5]

Calculate the median of the case table values for each company code. Only consider cases with an ID larger than 3. All case table values for companyCode '001' are filtered out, which means that in this case, NULL is returned.

Query

Column1

         "companyDetail"."companyCode"
        

Column2

         PU_MEDIAN ( "companyDetail" , "caseTable"."value" , "caseTable"."caseId" > 3 )
        

Input

caseTable
caseId : int | companyCode : string | value : int
1            | '001'                | 600
2            | '001'                | 400
3            | '001'                | 200
4            | '002'                | 300
5            | '002'                | 300
6            | '003'                | 200

companyDetail
companyCode : string | country : string
'001'                | 'DE'
'002'                | 'DE'
'003'                | 'US'

Foreign Keys
caseTable.companyCode | companyDetail.companyCode

Output

Result
Column1 : string | Column2 : int
'001'            | null
'002'            | 300
'003'            | 200

[6]

Example over three tables: For each entry in table B, calculate the median of the values that are larger than 100 in table C. Tables B and C do not have a direct connection, but are connected via table A:

Query

Column1

         "B"."B_KEY"
        

Column2

         PU_MEDIAN ( "B" , "C"."VALUE" , "C"."VALUE" > 100 )
        

Input

A
B_KEY : int | C_KEY : string | VALUE : int
1           | 'A'            | 100
1           | 'B'            | 200
2           | 'C'            | 300
2           | 'D'            | 400
3           | 'E'            | 500
3           | 'F'            | 600

B
B_KEY : int
1
2

C
C_KEY : string | VALUE : int
'A'            | 400
'A'            | 100
'A'            | 200
'B'            | 100
'C'            | 200
'D'            | 500

Foreign Keys
A.C_KEY | C.C_KEY
B.B_KEY | A.B_KEY

Output

Result
Column1 : int | Column2 : int
1             | 400
2             | 500
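To see how the values of table C reach table B through table A, the result can be traced with a small Python emulation. This is a sketch under assumptions: the helper `pu_median_via_a` and its parameter are hypothetical, the join is resolved by hand along the foreign keys listed above, and the VALUE column of table A is left out because the query does not use it.

```python
import statistics

table_a = [(1, "A"), (1, "B"), (2, "C"), (2, "D"), (3, "E"), (3, "F")]  # (B_KEY, C_KEY)
table_b = [1, 2]                                                         # B_KEY
table_c = [("A", 400), ("A", 100), ("A", 200),
           ("B", 100), ("C", 200), ("D", 500)]                           # (C_KEY, VALUE)

def pu_median_via_a(b_key, lower_bound=100):
    """Median of C.VALUE > lower_bound for all C rows linked to b_key via table A."""
    # B -> A: collect the C_KEYs of the A rows that belong to this B entry
    c_keys = {c_key for a_b_key, c_key in table_a if a_b_key == b_key}
    # A -> C: collect the C values for those keys and apply the filter
    values = [value for c_key, value in table_c
              if c_key in c_keys and value > lower_bound]
    return statistics.median_high(values) if values else None

result = {b_key: pu_median_via_a(b_key) for b_key in table_b}
print(result)  # {1: 400, 2: 500}
```

For B_KEY 1, the linked C values are 400, 100, 200 and 100; after the filter, 400 and 200 remain, and the upper of the two is 400. For B_KEY 2, the values 200 and 500 remain, so the result is 500.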
