Monitor the performance upgrade preview

Performance preview beta

The performance upgrade preview is available to InfluxDB 3 Enterprise Trial and Commercial users as a beta. These features are subject to breaking changes and should not be used for production workloads.

InfluxDB 3 Enterprise provides system tables and a query telemetry endpoint to monitor file status, query execution, and overall performance when using the performance upgrade preview.

System tables

The upgraded storage engine exposes internal state through system tables that you can query with SQL.

system.pt_ingest_wal

View WAL files and their partitions:

SELECT * FROM system.pt_ingest_wal;

Example output:

wal_file_id | partition_id | database_id | table_id | min_time             | max_time             | row_count | size_bytes
wal_001     | p_1          | db_1        | t_1      | 2024-01-01T00:00:00Z | 2024-01-01T00:10:00Z | 50000     | 2456789
wal_002     | p_1          | db_1        | t_1      | 2024-01-01T00:10:00Z | 2024-01-01T00:20:00Z | 48000     | 2345678

Use this table to monitor:

  • WAL accumulation: Track the number and size of unmerged WAL files
  • Partition distribution: See how data is distributed across partitions
  • Time coverage: Verify data time ranges

Monitor WAL backlog

Check for WAL accumulation that may indicate merging is falling behind:

SELECT
  COUNT(*) as wal_file_count,
  SUM(size_bytes) / 1024 / 1024 as total_size_mb,
  MIN(min_time) as oldest_data,
  MAX(max_time) as newest_data
FROM system.pt_ingest_wal;

system.pt_ingest_files

View Gen0 files with metadata:

SELECT * FROM system.pt_ingest_files;

Example output:

file_id  | generation | database_id | table_id | min_time             | max_time             | row_count | size_bytes
gen0_001 | 0          | db_1        | t_1      | 2024-01-01T00:00:00Z | 2024-01-01T01:00:00Z | 500000    | 45678901
gen0_002 | 0          | db_1        | t_1      | 2024-01-01T01:00:00Z | 2024-01-01T02:00:00Z | 480000    | 43567890

Use this table to monitor:

  • File counts per generation: Track compaction progress
  • File sizes: Verify files are within configured limits
  • Time ranges: Identify Gen0 files that span multiple compaction windows

Monitor file distribution

Check file distribution and compaction status:

SELECT
  generation,
  COUNT(*) as file_count,
  SUM(row_count) as total_rows,
  SUM(size_bytes) / 1024 / 1024 as total_size_mb,
  AVG(size_bytes) / 1024 / 1024 as avg_file_size_mb
FROM system.pt_ingest_files
GROUP BY generation
ORDER BY generation;

Parquet upgrade status

If you upgraded from Parquet, use these system tables to monitor migration progress.

system.upgrade_parquet_node

View per-node upgrade status:

SELECT * FROM system.upgrade_parquet_node;

Monitor this table to confirm that each node reaches the completed status. During the upgrade, nodes progress through detection, conversion, and finalization stages.

system.upgrade_parquet

View per-file migration progress:

SELECT * FROM system.upgrade_parquet;

Use this table to track individual file conversions during the migration. The status updates on a polling interval (default 5 seconds, configurable with --pt-upgrade-poll-interval).
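As a sketch of how you might automate this check, the following Python polls the node table until every node reports completion. The run_query callable is a placeholder for whatever client you use to execute SQL, and the status column name and its completed value are assumptions based on the description above.

```python
import time

def wait_for_upgrade(run_query, poll_seconds=5, max_polls=720):
    """Poll system.upgrade_parquet_node until every node reports 'completed'.

    run_query is any callable that executes SQL and returns rows as dicts;
    it stands in for your client library and is not part of InfluxDB.
    The 'status' column name and 'completed' value are assumptions.
    """
    for _ in range(max_polls):
        rows = run_query("SELECT * FROM system.upgrade_parquet_node")
        if rows and all(row.get("status") == "completed" for row in rows):
            return True
        time.sleep(poll_seconds)
    return False
```

With the default arguments this polls every 5 seconds, matching the default --pt-upgrade-poll-interval noted above.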

Query telemetry

The query telemetry endpoint provides detailed execution statistics for analyzing query performance.

Retrieve query telemetry

Query the telemetry endpoint after executing a query:

curl -X GET "http://localhost:8181/api/v3/query_sql_telemetry" \
  -H "Authorization: Bearer AUTH_TOKEN"

Replace AUTH_TOKEN with your authentication token.
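If you prefer to call the endpoint from code rather than curl, the request above can be sketched in Python with only the standard library. The host, port, and path come from the curl example; the function names are illustrative and not part of any InfluxDB client.

```python
import json
import urllib.request

def build_telemetry_request(host="http://localhost:8181", token="AUTH_TOKEN"):
    """Build the GET request for the query telemetry endpoint shown above."""
    return urllib.request.Request(
        f"{host}/api/v3/query_sql_telemetry",
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )

def fetch_telemetry(host="http://localhost:8181", token="AUTH_TOKEN"):
    """Fetch and decode the telemetry JSON (requires a running server)."""
    with urllib.request.urlopen(build_telemetry_request(host, token)) as resp:
        return json.load(resp)
```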

Telemetry response

The response includes:

Field             | Description
query_id          | Unique identifier for the query
execution_time_us | Total execution time in microseconds
chunks            | Per-chunk statistics
cache_stats       | Cache hit rates by type
file_stats        | File-level read statistics

Example telemetry output

{
  "query_id": "q_12345",
  "execution_time_us": 4523,
  "chunks": [
    {
      "chunk_id": "c_1",
      "files_scanned": 3,
      "blocks_processed": 12,
      "rows_read": 24000,
      "rows_returned": 150,
      "bytes_read": 1234567
    }
  ],
  "cache_stats": {
    "gen0_hits": 5,
    "gen0_misses": 1,
    "compacted_hits": 8,
    "compacted_misses": 2
  }
}
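As a sketch of how you might consume this response, the following Python reads the example payload above and derives two metrics used in the performance analysis below: the per-tier cache hit rate and the rows-read-to-rows-returned ratio. The helper names are illustrative.

```python
import json

# The example telemetry payload from above.
telemetry = json.loads("""
{
  "query_id": "q_12345",
  "execution_time_us": 4523,
  "chunks": [
    {"chunk_id": "c_1", "files_scanned": 3, "blocks_processed": 12,
     "rows_read": 24000, "rows_returned": 150, "bytes_read": 1234567}
  ],
  "cache_stats": {"gen0_hits": 5, "gen0_misses": 1,
                  "compacted_hits": 8, "compacted_misses": 2}
}
""")

def cache_hit_rate(stats, tier):
    """Hit rate for one cache tier, e.g. tier='gen0' or 'compacted'."""
    hits = stats[f"{tier}_hits"]
    misses = stats[f"{tier}_misses"]
    return hits / (hits + misses)

def read_ratio(chunk):
    """Rows read per row returned; high values suggest weak predicates."""
    return chunk["rows_read"] / chunk["rows_returned"]

gen0_rate = cache_hit_rate(telemetry["cache_stats"], "gen0")  # 5/6, about 0.83
ratio = read_ratio(telemetry["chunks"][0])                    # 24000/150 = 160
```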

Performance analysis

Query performance metrics

Track these key metrics for query performance:

Metric                      | Good   | Warning | Action
Cache hit rate              | >80%   | <60%    | Increase --pt-file-cache-size or --pt-file-cache-recency
Rows read vs returned ratio | <100:1 | >1000:1 | Add more selective predicates

Ingest performance metrics

Monitor these metrics for write performance:

Metric          | Healthy | Warning | Action
WAL file count  | <50     | >100    | Increase --pt-wal-flush-concurrency
Gen0 file count | <100    | >200    | Increase --pt-compactor-input-size-budget
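The warning thresholds and suggested actions from these two tables can be encoded in a small helper that flags metrics needing attention. The numeric limits and flag names come from the tables above; the function and metric key names are illustrative.

```python
# Warning thresholds mirroring the query and ingest metric tables above.
THRESHOLDS = {
    # metric: (warning predicate, suggested action)
    "cache_hit_rate": (lambda v: v < 0.60,
                       "Increase --pt-file-cache-size or --pt-file-cache-recency"),
    "read_ratio": (lambda v: v > 1000,
                   "Add more selective predicates"),
    "wal_file_count": (lambda v: v > 100,
                       "Increase --pt-wal-flush-concurrency"),
    "gen0_file_count": (lambda v: v > 200,
                        "Increase --pt-compactor-input-size-budget"),
}

def check_metrics(metrics):
    """Return (metric, action) pairs for values in the warning range."""
    warnings = []
    for name, value in metrics.items():
        predicate, action = THRESHOLDS[name]
        if predicate(value):
            warnings.append((name, action))
    return warnings
```

For example, check_metrics({"cache_hit_rate": 0.55, "wal_file_count": 40}) flags only the cache hit rate.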

Monitor with SQL

Create a performance summary query:

-- File generation summary
SELECT
  'Gen0 files' as metric,
  COUNT(*) as count,
  SUM(size_bytes) / 1024 / 1024 as size_mb
FROM system.pt_ingest_files
WHERE generation = 0

UNION ALL

SELECT
  'Compacted files' as metric,
  COUNT(*) as count,
  SUM(size_bytes) / 1024 / 1024 as size_mb
FROM system.pt_ingest_files
WHERE generation > 0

UNION ALL

SELECT
  'WAL files' as metric,
  COUNT(*) as count,
  SUM(size_bytes) / 1024 / 1024 as size_mb
FROM system.pt_ingest_wal;

Troubleshooting

High WAL file count

Symptom: system.pt_ingest_wal shows many accumulated files.

Possible causes:

  • Merge operations falling behind write rate
  • Insufficient flush concurrency
  • Object storage latency

Solutions:

  1. Increase flush concurrency:

    --pt-wal-flush-concurrency 8
  2. Increase the WAL flush interval to create fewer, larger files:

    --pt-wal-flush-interval 5s
  3. Increase the WAL buffer size so each flush produces a larger file:

    --pt-wal-max-buffer-size 30MB
  4. Check object storage performance and connectivity.

High cache miss rate

Symptom: cache_stats shows >40% miss rate.

Possible causes:

  • Cache size too small for working set
  • Cache recency window too narrow
  • Random access patterns across time ranges

Solutions:

  1. Increase cache size:

    --pt-file-cache-size 16GB
  2. Extend cache recency window:

    --pt-file-cache-recency 24h
  3. Extend eviction timeout:

    --pt-file-cache-evict-after 48h

Slow compaction

Symptom: Gen0 file count continues to grow.

Possible causes:

  • Compaction budget too low for write volume
  • High write rate overwhelming compaction
  • Snapshot size too large, creating oversized Gen0 files

Solutions:

  1. Increase the compaction input size budget:

    --pt-compactor-input-size-budget 12GB
  2. Reduce snapshot size to create smaller, more frequent Gen0 files:

    --pt-snapshot-size 125MB
  3. For distributed deployments, add a dedicated compactor node:

    influxdb3 serve \
      # ...
      --use-pacha-tree \
      --mode compact

Query node lag

Symptom: Query nodes return stale data.

Possible causes:

  • Replication falling behind
  • Network latency to object storage
  • Insufficient replica concurrency

Solutions:

  1. Increase replication concurrency:

    --pt-wal-replica-steady-concurrency 8
  2. Reduce the replication polling interval:

    --pt-wal-replication-interval 100ms
  3. Increase replica queue size:

    --pt-wal-replica-queue-size 200


