Documentation

statsmodels.linearRegression() function

statsmodels.linearRegression() is a user-contributed function maintained by the package author.

statsmodels.linearRegression() performs a linear regression.

It calculates and returns ŷ (y_hat), and residual sum of errors (rse). Output data includes the following columns:

  • N: Number of points in the calculation.
  • slope: Slope of the calculated regression.
  • sx: Sum of x.
  • sxx: Sum of x squared.
  • sxy: Sum of x*y.
  • sy: Sum of y.
  • errors: Residual sum of squares. Defined by (r.y - r.y_hat) ^ 2 in this context
  • x: An index [1,2,3,4…n], with the assumption that the timestamps are regularly spaced.
  • y: Field value
  • y_hat: Linear regression values
Function type signature
(
    <-tables: stream[A],
) => stream[{
    B with
    y_hat: float,
    y: float,
    x: float,
    sy: H,
    sxy: G,
    sxx: F,
    sx: E,
    slope: D,
    errors: float,
    N: C,
}] where A: Record, D: Divisible + Subtractable

For more information, see Function type signatures.

Parameters

tables

Input data. Default is piped-forward data (<-).

Examples

Perform a linear regression on a dataset

import "contrib/anaisdg/statsmodels"
import "sampledata"

sampledata.float()
    |> statsmodels.linearRegression()

Was this page helpful?

Thank you for your feedback!


The future of Flux

Flux is going into maintenance mode. You can continue using it as you currently are without any changes to your code.

Read more

New in InfluxDB 3.4

Key enhancements in InfluxDB 3.4 and the InfluxDB 3 Explorer 1.2.

See the Blog Post

InfluxDB 3.4 is now available for both Core and Enterprise, which introduces offline token generation for use in automated deployments and configurable license type selection that lets you bypass the interactive license prompt. InfluxDB 3 Explorer 1.2 is also available, which includes InfluxDB cache management and other new features.

For more information, check out: