---
title: Telegraf Controller architecture
description: Architectural overview of the Telegraf Controller application.
url: https://docs.influxdata.com/telegraf/controller/reference/architecture/
estimated_tokens: 3804
product: Telegraf
version: v1
---

# Telegraf Controller architecture

#### Telegraf Controller is in Public Beta

Telegraf Controller is in public beta and will be part of the future Telegraf Enterprise offering. While in beta, Telegraf Controller is **not meant for production use**. The Telegraf Controller documentation is a work in progress, and we are actively working to improve it. If you have any questions or suggestions, please [submit an issue](https://github.com/influxdata/docs-v2/issues/new?labels=Telegraf%20Controller). We welcome any and all contributions.

Beta expectations

-   **No configuration or agent limits**  
    While in beta, Telegraf Controller doesn't place any limits on the number of configurations you can store or the number of Telegraf agents you can track. However, upon being generally available, the free distribution of Telegraf Controller will have limits introduced, with the option to increase limits through a Telegraf Enterprise license.
-   **Potential breaking changes**  
    While in beta, we will do our best to no longer make breaking changes to Telegraf Controller, however, they may be necessary. The majority of changes we make will be additive and non-breaking, and include any necessary migrations. When we do need to make breaking changes, we will do our best to communicate them clearly and in advance to minimize disruption.
-   **Flexible release schedule**  
    While in beta, we will continue to create new releases of Telegraf Controller, but likely at irregular intervals. We will provide [Telegraf Controller release notes](/telegraf/controller/reference/release-notes/) to make it easy to track updates.

Provide beta feedback

-   Use the **Feedback** feature in the Telegraf Controller UI.
-   [Join the InfluxDB Community Slack](https://influxdata.com/slack) and post feedback in the **#telegraf-enterprise-alpha** channel.
-   Post feedback in the [InfluxData Community](https://community.influxdata.com).

Join our public channels

-   [InfluxDB Community Slack *(Preferred)*](https://influxdata.com/slack)
-   [InfluxData Community](https://community.influxdata.com)
-   [InfluxDB Subreddit](https://reddit.com/r/influxdb)

Telegraf Controller is a standalone application that provides centralized management for Telegraf agents. It runs as a single binary that starts two separate servers: a web interface/API server and a dedicated high-performance heartbeat server for agent monitoring.

## Runtime Architecture

### Application Components

When you run the Telegraf Controller binary, it starts four main subsystems:

-   **Web Server**: Serves the management interface (default port: `8888`)
-   **API Server**: Handles configuration management and administrative requests (served on the same port as the web server)
-   **Heartbeat Server**: Dedicated high-performance server for agent heartbeats (default port: `8000`)
-   **Background Scheduler**: Monitors agent health every 60 seconds

### Process Model

-   **telegraf\_controller** *(single process, multiple servers)*
    -   **Main HTTP Server** *(port `8888`)*
        -   Web UI (`/`)
        -   API Endpoints (`/api/*`)
    -   **Heartbeat Server** (port `8000`)
        -   POST /heartbeat *(high-performance endpoint)*
    -   **Database Connection**
        -   SQLite or PostgreSQL
    -   **Background Tasks**
        -   Agent Status Monitor (60s interval)

The dual-server architecture separates high-frequency heartbeat traffic from regular management operations, ensuring that the web interface remains responsive even under heavy agent load.

## Configuration

Telegraf Controller configuration is controlled through command options and environment variables.

| Command Option | Environment Variable | Description |
| --- | --- | --- |
| --port | APP_PORT | API server port (default is 8888) |
| --heartbeat-port | HEARTBEAT_PORT | Heartbeat service port (default: 8000) |
| --database | DATABASE_URL | Database filepath or URL (default is SQLite path) |
| (none) | SSL_CERT_PATH | Path to SSL certificate |
| (none) | SSL_KEY_PATH | Path to SSL private key |

*For a full list of options, see the [Telegraf Controller configuration options reference](/telegraf/controller/reference/config-options/).*

To use environment variables, create a `.env` file in the same directory as the binary or export these environment variables in your terminal session.

### Database Selection

Telegraf Controller automatically selects the database type based on the `DATABASE_URL` string:

-   **SQLite** (default): Best for development and small deployments with less than 1000 agents. Database file created automatically.
-   **PostgreSQL**: Required for large deployments. Must be provisioned separately.

Example PostgreSQL configuration:

```bash
DATABASE_URL="postgresql://user:password@localhost:5432/telegraf_controller"
```

## Data Flow

### Agent registration and heartbeats

flowchart LR T\["Telegraf Agents  
(POST heartbeats)"\] --> H\["Port 8000  
Heartbeat Server"\] H --Direct Write--> D\[("Database")\] W\["Web UI/API  
"\] --> A\["Port 8888  
API Server"\] --View Agents (Read-Only)--> D R\["Rust Scheduler  
(Agent status updates)"\] --> D

1. **Agents send heartbeats**:
    
    Telegraf agents with the heartbeat output plugin send `POST` requests to the dedicated heartbeat server (port `8000` by default).
    
2. **Heartbeat server processes the heartbeat**:
    
    The heartbeat server is a high-performance Rust-based HTTP server that:
    
    -   Receives the `POST` request at `/agents/heartbeat`
    -   Validates the heartbeat payload
    -   Extracts agent information (ID, hostname, IP address, status, etc.)
    -   Uniquely identifies each agent using the `instance_id` in the heartbeat payload.
3. **Heartbeat server writes directly to the database**:
    
    The heartbeat server uses a Rust NAPI module that:
    
    -   Bypasses the application ORM (Object-Relational Mapping) layer entirely
    -   Uses `sqlx` (Rust SQL library) to write directly to the database
    -   Implements batch processing to efficiently process multiple heartbeats
    -   Provides much higher throughput than going through the API layer
    
    The Rust module performs these operations:
    
    -   Creates a new agent if it does not already exist
    -   Adds or updates the `last_seen` timestamp
    -   Adds or updates the agent status to the status reported in the heartbeat
    -   Updates other agent metadata (hostname, IP, etc.)
4. **API layer reads agent data**:
    
    The API layer has read-only access for agent data and performs the following actions:
    
    -   `GET /api/agents` - List agents
    -   `GET /api/agents/summary` - Agent status summary
    
    The API never writes to the agents table. Only the heartbeat server does.
    
5. **The Web UI displays updated agent data**:
    
    The web interface polls the API endpoints to display:
    
    -   Real-time agent status
    -   Last seen timestamps
    -   Agent health metrics
6. **The background scheduler evaluates agent statuses**:
    
    Every 60 seconds, a Rust-based scheduler (also part of the NAPI module):
    
    -   Scans all agents in the database
    -   Checks `last_seen` timestamps against the agent’s assigned reporting rule
    -   Updates agent statuses:
        -   ok → not\_reporting (if heartbeat missed beyond threshold)
        -   not\_reporting → ok (if heartbeat resumes)
    -   Auto-deletes agents that have exceeded the auto-delete threshold (if enabled for the reporting rule)

### Configuration distribution

1. **An agent requests a configuration**:
    
    Telegraf agents request their configuration from the main API server (port `8888`):
    
    ```bash
    telegraf --config "http://localhost:8888/api/configs/{config-id}/toml?location=datacenter1&env=prod"
    ```
    
    The agent makes a `GET` request with:
    
    -   **Config ID**: Unique identifier for the configuration template
    -   **Query Parameters**: Variables for parameter substitution
    -   **Accept Header**: Can specify `text/x-toml` or `application/octet-stream` for download
2. **The API server receives request**:
    
    The API server on port `8888` handles the request at `/api/configs/{id}/toml` and does the following:
    
    -   Validates the configuration ID
    -   Extracts all query parameters for substitution
    -   Checks the `Accept` header to determine response format
3. **The application retrieves the configuration from the database**:
    
    Telegraf Controller fetches configuration data from the database:
    
    -   **Configuration TOML**: The raw configuration with parameter placeholders
    -   **Configuration name**: Used for filename if downloading
    -   **Updated timestamp**: For the `Last-Modified` header
4. **Telegraf Controller substitutes parameters**:
    
    Telegraf Controller processes the TOML template and replaces parameters with parameter values specified in the `GET` request.
    
5. **Telegraf Controller sets response headers**:
    
    -   Content-Type
    -   Last-Modified
    
    Telegraf uses the `Last-Modified` header to determine if a configuration has been updated and, if so, download and use the updated configuration.
    
6. **Telegraf Controller delivers the response**:
    
    Based on the `Accept` header:
    
    <!-- Tabbed content: Select one of the following options -->
    
    
    **text/x-toml (TOML):**
    
    ```
    HTTP/1.1 200 OK
    Content-Type: text/x-toml; charset=utf-8
    Last-Modified: Mon, 05 Jan 2025 07:28:00 GMT
    
    [agent]
      hostname = "server-01"
      environment = "prod"
    ...
    ```
    
    
    
    **application/octet-stream (Download):**
    
    ```
    HTTP/1.1 200 OK
    Content-Type: application/octet-stream
    Content-Disposition: attachment; filename="config_name.toml"
    Last-Modified: Mon, 05 Jan 2025 07:28:00 GMT
    
    [agent]
      hostname = "server-01"
    ...
    ```
    
    
    
    <!-- End tabbed content -->
    
7. *(Optional)* **Telegraf regularly checks the configuration for updates**:
    
    Telegraf agents can regularly check Telegraf Controller for configuration updates and automatically load updates when detected. When starting a Telegraf agent, include the `--config-url-watch-interval` option with the interval that you want the agent to use to check for updates—for example:
    
    ```bash
    telegraf \
      --config http://localhost:8888/api/configs/xxxxxx/toml \
      --config-url-watch-interval 1h
    ```
    

## Reporting Rules

Telegraf Controller uses reporting rules to determine when agents should be marked as not reporting:

-   **Default Rule**: Created automatically on first run
-   **Heartbeat Interval**: Expected frequency of agent heartbeats (default: 60s)
-   **Threshold Multiplier**: How many intervals to wait before marking not\_reporting (default: 3x)

Access reporting rules via:

-   **Web UI**: Reporting Rules
-   **API**: `GET /api/reporting-rules`

#### Related

-   [Telegraf Controller configuration options](/telegraf/controller/reference/config-options/)
