Skip to main content

Data Dictionary

This comprehensive reference describes all data fields available in Enertel forecast extracts and API responses. During your trial, you'll receive historical backcast data for your price nodes of interest that follows this schema.

Data Structure Overview

Our forecast data is structured around the core concepts described in our concepts documentation. Each row in your data extract represents a single forecast point with associated metadata and percentile values.

Field Reference

Object Information

ColumnTypeDescription
object_namestringName of the forecasted object, typically a price node from the ISO
object_typestringClassification of the object (e.g., 'price_node', 'hub', 'interface')
object_subtypestringAdditional classification details for the object
object_latitudenumberLatitude coordinate of the object location
object_longitudenumberLongitude coordinate of the object location
object_isostringISO code representing the geographic market area

Power Plant Information

Available when forecasting generation assets

ColumnTypeDescription
power_plantstringName of the power plant (if applicable)
plant_technologystringGeneration technology (e.g., 'natural_gas', 'solar', 'wind')
prime_mover_codestringPrimary equipment type code (e.g., turbine type)
plant_capacitynumberNameplate capacity of the power plant in megawatts

Target and Series Information

ColumnTypeDescription
target_idintegerUnique identifier for the forecast target
descriptionstringHuman-readable description of the target (e.g., "ERCOT DALMP")
timezonestringTimezone for all timestamps in this target
isostringIndependent System Operator code
series_namestringName of the data series being forecasted

Scenario and Batch Information

ColumnTypeDescription
scenario_idintegerUnique identifier for the input scenario
range_startstringStart of forecast time range (ISO 8601 format)
range_endstringEnd of forecast time range (ISO 8601 format)
schedule_idintegerIdentifier for the processing schedule
scheduled_atstringWhen the forecast batch was scheduled (ISO 8601 format)
batch_idintegerUnique identifier for the forecast batch
batch_created_atstringWhen the forecast batch was created (ISO 8601 format)

Model Information

ColumnTypeDescription
model_idintegerUnique identifier for the forecasting model
model_created_atstringWhen the model was trained/created (ISO 8601 format)

Forecast Data

ColumnTypeDescription
timestampstringThe exact time for which the forecast applies (ISO 8601 format)
valuenumberExpected value (mean) of the forecast distribution
feature_idintegerUnique identifier for the specific feature being forecasted

Probabilistic Forecasts

Our models provide complete probability distributions through percentile forecasts:

ColumnTypeDescription
p01number1st percentile - extreme low outcomes (1% chance of being lower)
p05number5th percentile - very low outcomes
p10number10th percentile - low outcomes
p20number20th percentile
p30number30th percentile
p40number40th percentile
p50numberMedian forecast - 50th percentile (most commonly used)
p60number60th percentile
p70number70th percentile
p80number80th percentile
p90number90th percentile - high outcomes
p95number95th percentile - very high outcomes
p99number99th percentile - extreme high outcomes (1% chance of being higher)

Validation Data

Available in historical backcasts

ColumnTypeDescription
actualnumberThe actual observed value for validation and accuracy assessment
errornumberDifference between forecasted value and actual value (forecast - actual)

Data Quality Notes

Timestamp Handling

  • All timestamps are provided in ISO 8601 format
  • Timestamps are automatically converted to the target's local timezone
  • Be aware of daylight saving time transitions when working with time series data

Missing Data

  • Our data extracts are cleaned and validated before delivery
  • Any missing values are explicitly handled and documented
  • Contact support if you encounter unexpected null values

Duplicate Detection

The composite primary key for forecast data is:

(timestamp, feature_id, scenario_id, model_id)

This combination should be unique across your dataset.

Common Usage Patterns

Basic Forecast Analysis

# Most common columns for basic analysis
essential_cols = [
'object_name', 'series_name', 'timestamp',
'scheduled_at', 'p50', 'actual'
]

Probabilistic Analysis

# All percentile columns for uncertainty analysis
percentile_cols = [f'p{i:02d}' for i in [1, 5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 95, 99]]

Model Comparison

# Key columns for comparing different models
comparison_cols = [
'model_id', 'model_created_at', 'feature_id',
'timestamp', 'p50', 'actual'
]

Units and Conventions

  • Prices: Typically in $/MWh
  • Generation: Typically in MW or MWh depending on the series
  • Timestamps: Always in the target's local timezone
  • Coordinates: Decimal degrees (WGS84)

For specific units in your data extract, refer to the series_name and description fields which provide context about the measurement type and units.