Resolve high series cardinality

This page documents an earlier version of InfluxDB OSS. InfluxDB 3 Core is the latest stable version.

API token hashing is enabled by default in InfluxDB OSS 2.9.0

Stronger token security: tokens are stored as hashes on disk, so a copy of the database file doesn’t expose usable tokens. Existing tokens are hashed on first startup and the original strings can’t be recovered afterward — capture any plaintext tokens you still need before you upgrade.

For more information, see Token hashing.

If reads and writes to InfluxDB have started to slow down, high series cardinality (too many series) may be causing memory issues.

Take steps to understand and resolve high series cardinality.

Learn the causes of high cardinality
Measure series cardinality
Resolve high cardinality

Learn the causes of high series cardinality

InfluxDB indexes the following data elements to speed up reads:

Each unique set of indexed data elements forms a series key. Tags containing highly variable information like unique IDs, hashes, and random strings lead to a large number of series, also known as high series cardinality. High series cardinality is a primary driver of high memory usage for many database workloads.

Measure series cardinality

Use the following to measure series cardinality of your buckets:

influxdb.cardinality(): Flux function that returns the number of unique series keys in your data.
SHOW SERIES CARDINALITY: InfluxQL command that returns the number of unique series keys in your data.

Resolve high cardinality

To resolve high series cardinality, complete the following steps (for multiple buckets if applicable):

Review tags.
Improve your schema.
Delete high cardinality data.

Review tags

Review your tags to ensure each tag does not contain unique values for most entries:

Scan your tags for common tag issues.
Use the example Flux query below to count unique tag values.

Common tag issues

Look for the following common issues, which often cause many unique tag values:

Writing log messages to tags. If a log message includes a unique timestamp, pointer value, or unique string, many unique tag values are created.
Writing timestamps to tags. Typically done by accident in client code.
Unique tag values that grow over time For example, a user ID tag may work at a small startup, but may begin to cause issues when the company grows to hundreds of thousands of users.

Count unique tag values

The following example Flux query shows you which tags are contributing the most to cardinality. Look for tags with values orders of magnitude higher than others.

// Count unique values for each tag in a bucket
import "influxdata/influxdb/schema"

cardinalityByTag = (bucket) => schema.tagKeys(bucket: bucket)
    |> map(
        fn: (r) => ({
            tag: r._value,
            _value: if contains(set: ["_stop", "_start"], value: r._value) then
                0
            else
                (schema.tagValues(bucket: bucket, tag: r._value)
                    |> count()
                    |> findRecord(fn: (key) => true, idx: 0))._value,
        }),
    )
    |> group(columns: ["tag"])
    |> sum()

cardinalityByTag(bucket: "example-bucket")

If you’re experiencing runaway cardinality, the query above may timeout. If you experience a timeout, run the queries below—one at a time.

Generate a list of tags:

// Generate a list of tags
import "influxdata/influxdb/schema"

schema.tagKeys(bucket: "example-bucket")

Count unique tag values for each tag:

// Run the following for each tag to count the number of unique tag values
import "influxdata/influxdb/schema"

tag = "example-tag-key"

schema.tagValues(bucket: "my-bucket", tag: tag)
    |> count()

These queries should help identify the sources of high cardinality in each of your buckets. To determine which specific tags are growing, check the cardinality again after 24 hours to see if one or more tags have grown significantly.

Improve your schema

To minimize cardinality in the future, design your schema for easy and performant querying. Review best practices for schema design.

Delete data to reduce high cardinality

Consider whether you need the data that is causing high cardinality. If you no longer need this data, you can delete the whole bucket or delete a range of data.

Was this page helpful?

Thank you for your feedback!

Support and feedback

Thank you for being part of our community! We welcome and encourage your feedback and bug reports for InfluxDB OSS v2 and this documentation. To find support, use the following resources:

Customers with an annual or support contract can contact InfluxData Support.

Edit this page Submit docs issue Submit InfluxDB OSS v2 issue

Resolve high series cardinality

API token hashing is enabled by default in InfluxDB OSS 2.9.0

Learn the causes of high series cardinality

Measure series cardinality

Resolve high cardinality

Review tags

Common tag issues

Count unique tag values

Improve your schema

Delete data to reduce high cardinality

Support and feedback

InfluxDB OSS 2.9.0: API tokens are hashed by default

Key enhancements in Explorer 1.9

InfluxDB 3.10 is now available

InfluxDB 3.10 is now available

Telegraf Enterprise is now generally available

InfluxDB Docker latest tag changing to InfluxDB 3 Core

Resolve high series cardinality

API token hashing is enabled by default in InfluxDB OSS 2.9.0

Learn the causes of high series cardinality

Measure series cardinality

Resolve high cardinality

Review tags

Common tag issues

Count unique tag values

Improve your schema

Delete data to reduce high cardinality

Support and feedback

What is your InfluxDB OSS URL?

Default

Custom

Thank you for your feedback!

InfluxDB OSS 2.9.0: API tokens are hashed by default

Key enhancements in Explorer 1.9

InfluxDB 3.10 is now available

InfluxDB 3.10 is now available

Telegraf Enterprise is now generally available

InfluxDB Docker latest tag changing to InfluxDB 3 Core