Capabilities — energyOS Documentation

AI foundations

energyOS is built on a layered machine learning architecture that combines several distinct paradigms. No single model is sufficient for energy market intelligence — the platform uses supervised learning as its foundation, extended by deep sequential modeling, ensemble methods, and an LLM reasoning layer that ties outputs together into interpretable language.

Supervised learning

Supervised learning is the core paradigm underlying the platform's price forecasting and fair value estimation capabilities. The system is trained on labeled historical data — input features (storage levels, weather deviations, PADD utilization rates, LMP spreads) mapped to known output values (historical settlement prices, forward curves). The model learns a function that maps inputs to outputs, then generalizes to unseen market states.

In energy markets, supervised learning is particularly powerful because fundamental drivers have well-understood relationships with price outcomes — the challenge is quantifying those relationships across non-linear, regime-dependent data. The platform's training pipeline feeds from historically labeled data, retrains on rolling windows to adapt for drift, and is evaluated continuously against withheld test sets to measure degradation before it affects the inference.

Why supervised learning matters for energy

Energy prices are not random walks — they have fundamental anchors. Storage versus five-year norms, heating degree days versus forecast, refinery utilization versus seasonal demand: all are quantifiable relationships. Supervised learning formalizes these relationships with mathematical precision, replacing analyst intuition with reproducible inference grounded in the full breadth of available data.

Huber regressor — penalized linear modeling

The Huber Regressor is used within the platform's linear model layer as a robust alternative to standard ordinary least squares regression. Standard regression is highly sensitive to outliers — a single extreme price spike (a common event in energy markets) can distort model coefficients and invalidate forecasts for the subsequent period. The Huber loss function addresses this by behaving like mean squared error for small residuals (where accuracy matters) but switching to mean absolute error for large residuals (where outliers must be down-weighted). The transition point, known as epsilon (ε), is tuned per commodity and contract horizon.

In practice, this means the fair value models remain stable through short-term supply shocks, SCADA events, or weather-driven spikes — producing estimates that reflect underlying fundamentals rather than being distorted by transient market stress. The linear layer feeds its output as a feature into the ensemble models, creating a composited estimate that combines interpretable fundamental relationships with non-linear learning.

Quantile gradient boosting — training on separate data quartiles

Gradient Boosting Machines (GBMs) underpin the platform's contract horizon forecasters. The platform uses quantile regression variants of gradient boosting, which train separate models for different quantiles of the target distribution — typically the 10th, 50th, and 90th percentiles. This produces a range estimate rather than a point forecast, which is critical for energy trading decisions where understanding the upside and downside tail matters as much as the central case.

Each quantile model is trained independently. The 90th percentile model learns the parameters of upside outcomes — high storage draws, cold snaps, plant outages — while the 10th percentile model learns downside scenarios. The 50th percentile (median) estimate becomes the platform's fair value anchor. All three outputs are surfaced to energyOS Chat and available via the LLM reasoning layer when you query a contract for its price range.

method / 01

Huber Regressor

Robust linear model for fundamental fair value estimation. Penalizes outliers via the epsilon parameter — stable through energy price spikes. Used as a feature in the ensemble layer.

method / 02

Quantile GBM

Separate gradient boosting models trained on the 10th, 50th, and 90th percentiles. Produces a price range forecast. Captures asymmetric upside and downside risk for each contract horizon.

method / 03

LSTM Networks

Long short-term memory networks for sequential time-series data. Preserve memory of relevant historical patterns — seasonal cycles, multi-week storage builds — without forgetting critical context.

method / 04

Autoencoder (Regime)

Unsupervised encoder-decoder trained to reconstruct normal market conditions. High reconstruction error signals a structural regime break. The reconstruction error score is surfaced in every energyOS Chat response.

method / 05

Standard Scaler

All input features are normalized via scikit-learn's StandardScaler prior to model ingestion. Removes magnitude bias across heterogeneous inputs — BTU values, dollar prices, and percentage storage norms require normalization before joint modeling.

method / 06

Ensemble Integration

Huber Regressor outputs, GBM quantile predictions, and LSTM sequence features are stacked into a meta-model that produces the final fair value estimate and uncertainty interval for each signal.

Non-linear deep learning — LSTMs and neural networks

Energy price data is inherently sequential — what happened over the past 14 days meaningfully informs what will happen over the next 7. Standard machine learning models treat each input vector as independent, discarding temporal ordering. Long Short-Term Memory (LSTM) networks address this by maintaining a memory cell that selectively retains or forgets information across time steps. This allows the model to learn multi-week patterns — winter build-into-draw transitions, end-of-season storage roll patterns, multi-month refinery turnaround cycles — that would be invisible to non-sequential models.

The feedforward neural networks within the platform are built using PyTorch, with ReLU activations in intermediate layers and sigmoid activations in the final layer for bounded outputs. Architecture depth and hidden unit counts are tuned per commodity vertical — gas storage models use deeper networks than crude oil price models due to the longer memory dependency in the underlying fundamentals.

On model selection

The platform does not rely on a single model class. Each commodity and contract horizon has a tuned combination of linear, gradient boosting, and deep learning components. The LLM layer is aware of which model produced each signal — and will cite the source model when answering questions about confidence and methodology.

Adaptive retraining — managing market drift

All supervised models are subject to concept drift: as market structure changes, historical training data becomes progressively less representative of current conditions. The platform implements a rolling-window retraining schedule that continuously updates model weights against recent data. Retraining frequency varies by model type — fast-moving price models retrain weekly, while structural models (PADD utilization regressors, LNG export correlators) retrain on a monthly cycle aligned with data release schedules.

Drift detection is monitored using population stability indices (PSI) computed on held-out validation sets. When PSI exceeds threshold, an automated retraining job is triggered in AWS SageMaker — producing a candidate model that is evaluated against the production model before replacement. No model update reaches production without automated evaluation approval.

Data scale & domains

The total breadth of data integrated into energyOS is one of its primary structural advantages. Rather than relying on a narrow set of price feeds, the platform ingests data across multiple energy domains — each with its own release cadence, column schema, and set of derivative analytics. The scale is not incidental: it is the prerequisite for the cross-domain correlation engine and the LLM's ability to reason over multi-market relationships.

Petroleum

PADD 1–5 · 40+ series

Crude runs, distillate stocks, refinery utilization, API gravity, heating oil stocks, gasoline stocks

Natural Gas

Storage · Flows · Prices

Henry Hub, storage vs. norm, regional basis, LNG exports, pipeline nominations, burn scores

Electricity

ISO / Grid · Real-time

ERCOT, PJM, MISO, CAISO LMP data. Nodal z-scores, dispatch, renewable penetration

Coal

Production · Transport

Regional production by basin, rail volumes, inventory at generators, fuel-switching signals

Weather & Climate

HDD / CDD · Satellite

NOAA HDD/CDD forecasts, satellite-derived snowpack, sea surface temperature, drought indices

News & NLP

Regulatory · Macro

EIA reports, FERC filings, geopolitical NLP signals, OPEC statement parsing, macro indicators

PADD data — a worked example of depth

To illustrate the depth of integration, consider only the Petroleum Administration for Defense Districts (PADD) dataset. PADD regions divide the United States into five districts for tracking refined product logistics. The platform integrates all five districts with their full column sets, updated in alignment with EIA release schedules.

PADD series — active integrations Autonomously updated · EIA cadence

padd1 padd1 crude runs padd1 distillate stocks padd1a/1b distillate padd1a heating oil stocks padd2 padd2 distillate stocks padd3 padd3 crude API gravity padd3 crude runs padd3 distillate stocks padd3 refinery utilization padd4 padd5

Each PADD series contains multiple columns — for example, PADD1 crude runs alone includes: duoarea, area-name, product, product-name, process, process-name, series, series-description, padd1_crude_runs_mbbld, units, distillate_stocks, gasoline_stocks, crude_stocks, and refinery_util. Every column is independently normalized, tested for stationarity, and fed into the feature pipeline.

Autonomous data integration — release cadences

Integration is batched at specific release intervals corresponding to each source's publication schedule. The system does not poll — it is event-driven, triggering ingestion jobs on confirmed release windows. This ensures the signal store reflects data as quickly as possible after publication while avoiding redundant processing.

Hourly

ERCOT real-time LMP, ISO dispatch, intraday weather updates

Daily

Henry Hub settlement, pipeline flow nominations, weather forecasts

Weekly

EIA natural gas storage report, crude inventory, PADD stocks

Monthly

PADD utilization, STEO updates, OPEC production data

Annual

Long-run infrastructure surveys, capacity expansion filings

ML models

The platform operates three parallel model layers, each targeting a different aspect of market intelligence. They run independently within the pipeline and their outputs are combined in the Intelligence Interface to support a complete analytical picture.

Layer / 01 — Supervised

Fair Value Models

Gradient boosting and Huber regression ensembles trained per commodity and contract horizon. Produce a median fair value estimate and quantile range (P10/P90). Updated on weekly retraining cycles.

scikit-learn XGBoost Huber Regressor StandardScaler Quantile GBM

Layer / 02 — Sequential

LSTM Forecasters

PyTorch LSTM networks trained on multi-week time-series sequences per commodity. Capture seasonal memory, multi-week storage transitions, and lagged supply response — invisible to non-sequential models.

PyTorch LSTM pandas numpy AWS SageMaker

Layer / 03 — Unsupervised

Regime Autoencoder

Encoder-decoder architecture trained on normal market conditions. Reconstruction error is the regime signal — high error indicates the current market state is outside the distribution the model was trained on. Used to set the regime flag surfaced in all chat responses.

PyTorch Autoencoder Anomaly detection Reconstruction error

Layer / 04 — Optimization

Custom Quant Development

Partner-specific quantitative models are developed and hosted on request. These include custom contract horizon estimators, proprietary feature integrations, and bespoke signal weighting for specific market verticals. Hosted within the partner's isolated workspace environment.

Custom development IBM Qiskit (quantum) Partner-hosted Impacts pricing

Custom model availability and pricing

The number and complexity of active models within a workspace directly influences platform pricing. The base tier includes all three standard model layers. Custom quantitative development — bespoke models, additional commodity verticals, or partner-specific forecasters — is scoped during the onboarding engagement and priced accordingly. Contact the REDR Labs team for details.

Infrastructure

All platform compute, inference, and storage runs within a dedicated AWS environment. The infrastructure is containerized using Docker, with model serving handled via Amazon Elastic Container Registry (ECR) and execution environments provisioned as ECS tasks. This architecture ensures isolated, reproducible execution with independent scaling across pipeline stages.

stack / 01 ML & data processing

scikit-learn PyTorch XGBoost pandas numpy IBM Qiskit SciPy statsmodels

↓ packaged as Docker images

stack / 02 Container & execution layer

Docker Amazon ECR Amazon ECS AWS SageMaker (training) Lambda (event triggers) Step Functions (pipeline orchestration)

↓ signals written to signal store

stack / 03 Storage & security

Amazon S3 (signal store) AWS CloudTrail AWS Config Security Hub Audit Manager (SOC 2) KMS encryption

↓ inferences computed at query time, not persisted

stack / 04 LLM layer & intelligence interface

LLM (hosted within AWS boundary) Prompt design — partner-configured energyOS Chat Signal context injection Session context window

LLM layer and prompt design

The language model layer is the final stage of the pipeline — the surface that translates scored signals into interpretable analytical language. The LLM does not access the internet, external APIs, or any data outside the current session context. Its knowledge is strictly bounded by the signals injected into its context window at query time.

Prompt design is tailored to each partner engagement during onboarding. The system prompt encodes the partner's specific market focus, preferred analytical frameworks, vocabulary, and response conventions. A trading desk prompt differs significantly from a grid operations prompt: the former emphasizes price discovery, spread relationships, and position context; the latter prioritizes dispatch readiness, load forecasting confidence, and anomaly severity.

This means two partners asking the same question — "What is the current market regime?" — will receive responses calibrated to their respective market contexts and analytical priorities. The LLM reasoning is not generic; it is a configured intelligence layer that reflects the specific commercial context of your team.

energyOS Chat — the intelligence interface

energyOS Chat is not a search engine or a data retrieval tool. It is a reasoning environment. Every response is grounded in the real signal context, constructed through deliberate analytical logic, and bounded by what the models can actually support. The platform is designed to think out loud — surfacing its reasoning, citing the signals it used, and explicitly flagging uncertainty when it exists.

Critical thinking, not display

The platform is built for analysts who bring hypotheses, not just questions. The most powerful use of energyOS Chat is not asking for data summaries — it is asking the platform to stress-test an assumption, find a relationship you hadn't considered, or explain why a correlation is breaking down at the current market state.

When you upload your own data, the EOS Scalar engine scores it against every active signal and surfaces named correlations with alignment scores between 0 and 1. This is where the platform earns its value — taking your proprietary position data, scheduling logs, or operational metrics and finding their hidden relationships to the broader market environment you did not already know to look for.

On designing your queries

Precision is the primary variable. "What is happening in gas?" is not a question the platform can answer usefully. "What is the relationship between current Henry Hub storage deviation and the basis spread at Waha over the next 7 days, given the current pipeline nomination picture?" is. The platform rewards specificity with precision. Vague questions will be answered with honest vagueness — the model will tell you what it cannot narrow down and why.

Identifying what your data is actually saying

One of the most valuable applications of the platform is using it to understand your own data more completely. Analysts often have proprietary datasets — trading logs, operational schedules, physical position summaries — that contain signals they have not fully decoded. The EOS Scalar Correlation engine scores your uploaded data against all active market signals, surfacing statistically significant alignments that may not have been visible in your internal analysis.

This process is not simply correlation reporting. The LLM layer interprets each alignment in market terms: if your position data shows a strong positive alignment with PADD3 refinery utilization and a negative alignment with ERCOT real-time LMP, the platform explains what that combination suggests about the structural position you are holding — and what market conditions would put it under stress.

Detecting potential shifts you may have overlooked

The regime autoencoder is specifically designed to surface structural breaks — moments when the market moves outside its historically normal behavior in ways that are not yet reflected in price. When the reconstruction error score exceeds threshold, the platform flags a potential regime shift and quantifies the deviation from baseline. This signal is surfaced in all chat responses as a header context, ensuring that every analytical question is answered in the context of whether the market is currently behaving normally.

In practice, regime flags have preceded major price events by one to three sessions — giving analytical teams the advance signal needed to re-evaluate positions before the broader market reprices. The flag does not tell you what to do; it tells you that the assumptions underlying your current analysis may no longer hold.

Distribution

Distribution refers to how market intelligence reaches the people and teams that need it. energyOS is not a broadcast platform — it is a precision instrument. Access is structured around partner workspaces, role-based delivery, and the session context model that ensures every team member receives intelligence relevant to their specific function.

Partner workspace model

Each partner organization is provisioned with an isolated workspace. Feed selection, model configurations, user roles, and system prompt parameters are set per workspace. This means a midstream operator's workspace and a commodity merchant's workspace on the same platform are analytically distinct environments — one is configured for logistics and nomination decisions, the other for directional price discovery.

Role	Default access	Distribution scope
Analyst / Trader	energyOS Chat, Pipeline Monitor, Data Ingestion	Full signal context, LLM interface, scalar correlation
Operations / Grid	Pipeline Monitor, energyOS Chat (ops-configured)	Dispatch-relevant signals, anomaly flags, load forecasting outputs
Admin	All areas including Workspace Settings & Audit Log	Configuration, user provisioning, evidence export
Read-only	Pipeline Monitor, Chat (read)	Cannot upload data or modify session context

Feed configuration and prioritization

Not every partner needs every data domain. Feed selection is configured at the workspace level during onboarding — a natural gas trading desk will have gas storage, weather, and NLP signals as primary feeds, with electricity and petroleum as secondary. An ISO operations team will invert this priority. Feed prioritization affects which signals are injected first into the session context window, ensuring the most relevant data is always at the top of the reasoning stack when queries are processed.

Admins can adjust feed priority at any time via Workspace Settings. Significant additions — new commodity verticals, new geographic regions — are handled in coordination with the REDR Labs team to ensure the model layer is appropriately tuned for the new signal space before it is activated.

energyOS updates

The platform is a living system. Model weights, signal schemas, and analytical capabilities evolve continuously — driven by retraining schedules, drift detection, partner feedback, and the ongoing development of the underlying research agenda. This section describes how updates reach the platform and how they affect your workspace.

Model retraining schedule

W

Weekly — price forecasting models

Fair value models for natural gas and power contracts retrain on a rolling weekly window. Updated model candidates are evaluated against held-out test sets before promotion to production. Partners on affected verticals receive no disruption — model promotion is seamless.
Automated · No action required
M

Monthly — structural models

PADD utilization models, LNG export correlators, and regional basis models retrain monthly, aligned with EIA release schedules. Monthly retrains incorporate new structural data that is only available at this cadence.
Automated · EIA-aligned
D

Drift-triggered — adaptive retraining

When PSI drift detection exceeds threshold on any active model, an out-of-schedule retraining job is automatically triggered in AWS SageMaker. Admins are notified via the Workspace Settings panel. The REDR Labs team monitors all drift events proactively.
Event-triggered
P

Partner-driven — roadmap development

Every partner engagement generates use-case feedback that directly shapes the platform roadmap. New signal integrations, model capabilities, and analytical workflows are routinely sourced from the partner community and developed collaboratively. Partners receive early access to capabilities developed from their own use cases.
Partnership standard

Version visibility

The current model version and last retraining timestamps for each active model are visible in the Pipeline Monitor. If you observe a change in signal behavior, check the monitor for recent model update events before raising a support query — the majority of behavioral changes are explained by scheduled retrains incorporating new fundamental data.

Use-case examples

The following examples demonstrate energyOS in operational use — real prompts across different analytical contexts, with representative platform outputs. The platform is as specific as the questions you bring to it. These examples are intended to illustrate how precision querying produces precision intelligence.

Non-linear correlation: WTI crude vs. Henry Hub natural gas Cross-commodity

Prompt

What is the non-linear correlation regarding that of WTI crude and Henry Hub natural gas prices? (3–5 sentences)

energyOS response (energy trading fundamentals)

Based on the current data snapshot, the WTI–Henry Hub relationship is deeply decoupled — WTI is at $112.95/bbl (shock-elevated) while Henry Hub sits at $2.87/MMBtu, which is 396% above model fair value on its own supply/demand basis but still historically cheap relative to crude on a BTU-adjusted basis, with the HO/NG BTU ratio at an extreme 10.92x against a switching threshold of ~1.1x. The non-linear dynamic here is the fuel-switching suppression effect: when the BTU ratio exceeds ~1.1x, industrial and power consumers theoretically switch from distillate to gas, which should bid up Henry Hub — but with NG storage at 3,117 Bcf (storage score 100/100) and burn score only 58.2/100, the market has enough gas supply buffer to absorb that switching pressure without transmitting the crude shock into gas prices. The correlation effectively breaks down at extremes: in normal regimes WTI and Henry Hub share a loose 0.3–0.5 correlation through refinery economics and fuel competition, but at current ratios the relationship becomes asymmetric — crude can rally further without pulling gas up, but a sudden gas supply disruption would transmit upward into distillate markets that are already structurally tight.

PADD3 refinery utilization and distillate spread implication Petroleum logistics

Prompt

Given current PADD3 refinery utilization and crude API gravity data, what is the model's implied distillate crack spread directional view for the next 14 days?

energyOS response (petroleum fundamentals)

PADD3 refinery utilization is currently at 87.4% (score: 62/100, below the 5-year seasonal average of 91.2%) — reflecting a combination of planned turnaround activity and two unscheduled outages flagged in this week's EIA Petroleum Status Report. Crude API gravity in PADD3 has trended lighter over the past 30 days (from 35.2° to 37.8°), suggesting a shift in crude slate composition toward lighter inputs that yield proportionally less middle distillate per barrel. The combined signal — below-average utilization running lighter crude — implies a structural drag on distillate supply at the margin over the next 14-day horizon. The model's P50 distillate crack spread estimate is +$4.20/bbl above current market pricing, with a P90 scenario of +$7.80 if either of the offline units remains down through the end of the cycle. Storage in PADD1 (heating oil) is at the 23rd percentile vs. five-year, which removes the typical seasonal buffer that would absorb a refinery supply shortfall. Directional view: structurally supportive for distillate cracks, with asymmetric upside risk on any further utilization degradation.

Regime detection — ERCOT real-time market Grid operations

Prompt

Are there any active anomalies in ERCOT real-time prices? Is the grid operating in a normal regime?

energyOS response (grid operations)

Regime state: SHIFTED — the autoencoder reconstruction error for ERCOT is currently 2.34 standard deviations above baseline (threshold: 1.8σ), indicating the grid is outside normal operating conditions. Two active anomaly flags are present: (1) HB_NORTH nodal LMP is trading at a 4.2σ deviation from its 30-day mean with a z-score of 4.21, driven by a confirmed 847 MW thermal generation outage as of the 14:00 dispatch interval; (2) Wind penetration in the West load zone has dropped to 11.2% vs. a forecast of 31.4% — a 20-point gap that has historically preceded rapid intraday price escalation in this zone. The burn score for the current interval is 72.4/100, which at current reserve margins leaves limited thermal headroom before ERCOT begins emergency protocols. Recommend close monitoring of the 16:00–20:00 peak window. The platform cannot predict whether the outage will be restored within this window — check ERCOT's Outage Scheduler for unit-specific timelines.

Scalar correlation — proprietary position data vs. market signals Data upload · EOS Scalar

Prompt

Using my uploaded position data, what correlations are present with current storage signals? Are there any relationships I may not have accounted for?

energyOS response (scalar correlation)

EOS Scalar has scored your uploaded position dataset against 47 active signals. Three significant alignments were identified. (1) Your Column C (long physical gas nominations, weeks 1–4) shows a 0.81 positive alignment score with the Henry Hub storage deviation signal — your current net long is structurally aligned with a storage-bearish position. If storage normalizes toward the 5-year average over the next 6 weeks, this position faces fundamental headwinds. (2) Column F (pipeline capacity utilization, Permian Basin) shows a 0.74 negative alignment with PADD3 refinery utilization — a relationship worth examining given that tight Permian pipe tends to correlate with crude supply constraints into Gulf Coast refineries. This relationship was not flagged in your file's internal annotations. (3) Column A (power hedge expiries, ERCOT North Hub) shows a 0.69 alignment with the current wind forecast deviation signal — your expiry schedule is concentrated in a window where the model currently flags elevated ERCOT price uncertainty. No other columns showed alignment scores above 0.40 threshold.

Market impact

energyOS is built around a thesis: that energy markets are structurally inefficient, and that the primary driver of that inefficiency is the gap between available information and analytical capacity to process it. The platform addresses four specific market dynamics where that gap is largest.

Four structural roles the platform addresses

Role / 01

Enhancing Physical Grid Reliability

Modern energy operations are increasingly reliant on renewable generation and battery storage — both inherently intermittent due to weather shifts and human dispatch decisions. The platform's LSTM and regime detection layers model the interaction between forecast uncertainty, reserve margins, and real-time dispatch conditions, surfacing reliability risks before they become operational events. Grid operators can see the probability distribution of generation shortfall, not just the point forecast.

Role / 02

Optimal Resource Allocation

Physical energy operations are constrained by logistics: pipeline capacity, refinery throughput, terminal scheduling, and transportation bottlenecks. The platform integrates PADD utilization, pipeline nomination data, and regional basis signals to surface the economic opportunity — and cost — of specific allocation decisions. Midstream operators can identify when arbitrage between regions is viable against their specific infrastructure constraints, rather than against market prices alone.

Role / 03

Accelerated Price Discovery

Energy markets are inefficient regarding information gaps. The time between data publication — EIA storage reports, PADD utilization releases, weather model updates — and market repricing represents an analytical window. The platform ingests and processes these releases immediately upon publication and injects updated signals into the session context within minutes of release. Teams using energyOS have a systematic advantage in the window between data release and market response.

Role / 04

Smoothing Volatility

Traditional energy markets are exposed to violent, rapid short-term changes — price spikes driven by human error, panic, and linear extrapolation in response to non-linear market conditions. The regime autoencoder and quantile forecasting layers quantify when current volatility is driven by fundamentals versus when it represents a transient behavioral overshoot. This distinction — fundamental repricing versus noise — is the most consistently valuable output the platform produces for trading teams.

Risk considerations

energyOS is designed as a primary resource for validation and verification of analytical results — not as an execution system. No output from the platform should be treated as a trade instruction, dispatch command, or final operational decision without independent review. Understanding the risk profile of an AI-powered intelligence layer is a prerequisite for using it responsibly.

Core principle

energyOS provides intelligence to support human decision-making. It does not execute trades, submit nominations, or dispatch resources. Any operational action must be reviewed, validated, and approved by a qualified human analyst before execution. The platform is a decision support system — not a decision execution system.

Risk register

High · Awareness required

LLM hallucination & Reward Hacking

Language models can generate plausible-sounding responses that are not grounded in available signals. energyOS mitigates this by binding the LLM strictly to its session context and configuring it to explicitly declare uncertainty rather than speculate. However, hallucination cannot be completely eliminated. All quantitative outputs should be cross-referenced against the Pipeline Monitor signal values before acting on them.

High · Security

Prompt injection & adversarial noise

Prompt injection attacks embed malicious instructions within user inputs or ingested data to manipulate model behavior. Adversarial noise can corrupt uploaded datasets to produce misleading correlation outputs. All uploaded data is validated and sanitized before ingestion. System prompts are protected and cannot be overridden by user inputs. If unusual model behavior is observed, contact the REDR Labs team immediately.

High · Data governance

Information security — external leakage

Proprietary data uploaded into the session context is processed in-memory only within the AWS boundary and discarded at session end. No external API calls are made during inference. However, partners should ensure that files uploaded to the platform are appropriately classified and that no confidential data is shared beyond what is necessary for the analytical query at hand.

Medium · Operational

Data latency

All signals have freshness timestamps visible in the Pipeline Monitor. Decisions made on signals with stale timestamps — particularly in fast-moving markets — carry the risk of acting on outdated information. Always check the freshness timestamp of the relevant signal category before acting on a platform response. The LLM will declare signal staleness when it is aware of it, but latency in the underlying feed may not always be visible to the model layer.

Medium · Model risk

Liquidity risk in thin markets

Fair value models are trained on historically liquid market conditions. In thin or illiquid contract horizons — particularly far-dated forwards or non-hub basis contracts — model outputs carry higher uncertainty than they do in liquid near-dated contracts. The quantile range (P10/P90) widens in thin markets, and this widening should be treated as a signal of model uncertainty, not a trading range.

Medium · Confidence

Overconfidence in model outputs

The platform's outputs are probabilistic estimates, not certainties. A fair value estimate is the model's best guess given available data — it is not a price prediction. The platform is most valuable as a structured framework for thinking about market conditions, not as a source of confident directional calls. Never provide serious and confidential operational decisions based solely on platform outputs without independent verification.

Confidential information handling

energyOS will not request, retain, or transmit confidential counterparty information, position details, or regulatory-sensitive data beyond what you explicitly upload for analysis within a session. Never upload data that carries a contractual confidentiality obligation unless you have confirmed that use of a cloud-hosted analytics platform is permitted under that obligation. When in doubt, contact the REDR Labs team — we can advise on appropriate data handling protocols.

Verification and validation

The platform is designed to surface interpretations that require verification. When energyOS identifies a correlation, flags an anomaly, or produces a fair value estimate, treat it as a hypothesis to investigate — not a conclusion to execute against. The platform's highest-value role is helping you ask better questions and find the relationships your current process was not systematically checking.

What to read next

→

Getting Started

Access, onboarding, platform orientation, and your first analysis walkthrough.

→

Platform Architecture

The five-stage pipeline in detail — from data ingestion to the Intelligence Interface.

→

Uploading Your Data

How to bring proprietary datasets into the session context and query against your signals.

→

Contact REDR Labs

Start a partnership conversation or request a custom quant development scoping call.

energyOS Capabilities

AI foundations

Supervised learning

Huber regressor — penalized linear modeling

Quantile gradient boosting — training on separate data quartiles

Non-linear deep learning — LSTMs and neural networks

Adaptive retraining — managing market drift

Data scale & domains

PADD data — a worked example of depth

Autonomous data integration — release cadences

ML models

Infrastructure

LLM layer and prompt design

energyOS Chat — the intelligence interface

Critical thinking, not display

Identifying what your data is actually saying

Detecting potential shifts you may have overlooked

Distribution

Partner workspace model

Feed configuration and prioritization

energyOS updates

Model retraining schedule

Use-case examples

Market impact

Risk considerations

Risk register

What to read next