join() function

Flux 0.7.0 – 0.172.0

join() merges two streams of tables into a single output stream based on columns with equal values. Null values are not considered equal when comparing column values. The resulting schema is the union of the input schemas. The resulting group key is the union of the input group keys.

Deprecated

join() is deprecated in favor of join.inner(). The join package provides support for multiple join methods.
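As a rough migration sketch, an inner join on `_time` and `tag` could be written with `join.inner()` as follows. The stream names `left` and `right` and the output column names `_value_left` and `_value_right` are illustrative assumptions, not names required by the package. Unlike join(), `join.inner()` takes a predicate (`on`) and a function that builds each output row (`as`), so the output columns are named explicitly rather than by a rename pattern.

```flux
import "generate"
import "join"

// Two small generated streams that share _time and tag values.
left =
    generate.from(
        count: 4,
        fn: (n) => n + 1,
        start: 2021-01-01T00:00:00Z,
        stop: 2021-01-05T00:00:00Z,
    )
        |> set(key: "tag", value: "foo")

right =
    generate.from(
        count: 4,
        fn: (n) => n * (-1),
        start: 2021-01-01T00:00:00Z,
        stop: 2021-01-05T00:00:00Z,
    )
        |> set(key: "tag", value: "foo")

join.inner(
    left: left,
    right: right,
    // Match rows whose _time and tag values are equal.
    on: (l, r) => l._time == r._time and l.tag == r.tag,
    // Build each output row; the column names here are illustrative.
    as: (l, r) => ({_time: l._time, tag: l.tag, _value_left: l._value, _value_right: r._value}),
)
```

Both input streams in this sketch have the same (empty) group key, which is what the join package expects of its inputs.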

Output data

The schema and group key of the joined output data are the union of the input schemas and group keys. Columns that exist in both input streams but are not specified as columns to join on are renamed using the pattern <column>_<table> to prevent ambiguity in joined tables.
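To illustrate the renaming pattern, here is a minimal sketch that joins on `_time` only, so that two other columns exist in both streams without being join columns. The stream names `a` and `b`, the `host` column, and the row values are made up for illustration.

```flux
import "array"

// Both streams contain _time, host, and _value columns.
a = array.from(rows: [{_time: 2021-01-01T00:00:00Z, host: "h1", _value: 1.0}])
b = array.from(rows: [{_time: 2021-01-01T00:00:00Z, host: "h2", _value: 2.0}])

// Only _time is a join column, so host and _value collide
// and are renamed with the keys of the tables record (a and b).
join(tables: {a: a, b: b}, on: ["_time"])
```

Following the rule described above, the shared non-join columns are expected to appear as `host_a`, `host_b`, `_value_a`, and `_value_b`, while `_time` keeps its name.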

Join vs union

join() creates new rows based on common values in one or more specified columns. Output rows also contain the differing values from each of the joined streams. union() does not modify data in rows, but unions separate streams of tables into a single stream of tables and groups rows of data based on existing group keys.
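The difference can be sketched by running both functions over the same two input streams. The yield names `joined` and `unioned` are arbitrary labels chosen for this example.

```flux
import "generate"

t1 =
    generate.from(
        count: 4,
        fn: (n) => n + 1,
        start: 2021-01-01T00:00:00Z,
        stop: 2021-01-05T00:00:00Z,
    )
        |> set(key: "tag", value: "foo")

t2 =
    generate.from(
        count: 4,
        fn: (n) => n * (-1),
        start: 2021-01-01T00:00:00Z,
        stop: 2021-01-05T00:00:00Z,
    )
        |> set(key: "tag", value: "foo")

// join(): builds new rows from rows with equal _time and tag values,
// placing the values from each stream side by side.
join(tables: {t1: t1, t2: t2}, on: ["_time", "tag"])
    |> yield(name: "joined")

// union(): passes rows through unmodified and combines the streams;
// rows are grouped by their existing group keys.
union(tables: [t1, t2])
    |> yield(name: "unioned")
```

With these inputs, the joined result has one row per matching timestamp, while the union result keeps every row from both inputs.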

Function type signature

(<-tables: A, ?method: string, ?on: [string]) => stream[B] where A: Record, B: Record

For more information, see Function type signatures.

Parameters

tables

Record containing two input streams to join.

on

List of columns to join on.

method

Join method. Default is inner.

Supported methods:

inner

Examples

Join two streams of tables

import "generate"

t1 =
    generate.from(
        count: 4,
        fn: (n) => n + 1,
        start: 2021-01-01T00:00:00Z,
        stop: 2021-01-05T00:00:00Z,
    )
        |> set(key: "tag", value: "foo")

t2 =
    generate.from(
        count: 4,
        fn: (n) => n * (-1),
        start: 2021-01-01T00:00:00Z,
        stop: 2021-01-05T00:00:00Z,
    )
        |> set(key: "tag", value: "foo")

join(tables: {t1: t1, t2: t2}, on: ["_time", "tag"])

Example output

_time	_value_t1	_value_t2	tag
2021-01-01T00:00:00Z	1	0	foo
2021-01-02T00:00:00Z	2	-1	foo
2021-01-03T00:00:00Z	3	-2	foo
2021-01-04T00:00:00Z	4	-3	foo

Join data from separate data sources

import "sql"

// Relational data queried from PostgreSQL.
sqlData =
    sql.from(
        driverName: "postgres",
        dataSourceName: "postgresql://username:password@localhost:5432",
        query: "SELECT * FROM example_table",
    )

// Time series data from InfluxDB, limited to rows that have a sensorID.
tsData =
    from(bucket: "example-bucket")
        |> range(start: -1h)
        |> filter(fn: (r) => r._measurement == "example-measurement")
        |> filter(fn: (r) => exists r.sensorID)

// Join on _time and sensorID, which are expected to exist in both streams.
join(tables: {sql: sqlData, ts: tsData}, on: ["_time", "sensorID"])
