Skip to content

Commit b17600f

Browse files
committed
chore(release): v2.92.1
Hotfix - Foundry quota preflight (last open Section 14 Priority D item). Phase-FOUNDRY-QUOTA-PREFLIGHT shipped via 5-slice chain in 35.6 minutes. Opt-in via PFORGE_FOUNDRY_QUOTA_PREFLIGHT=warn|block. Reads AOAI deployment quota via Azure Cognitive Services control-plane REST API. Fail-open invariant - control-plane outages NEVER block plan execution. Cost: 0.05 USD declared / 0.00 USD wall (gh-copilot subscription). costForLeg() byte-identical to v2.92.0; priceSlice() signature unchanged. 34 new tests pass; full test suite unaffected.
1 parent e4a4aad commit b17600f

3 files changed

Lines changed: 6 additions & 2 deletions

File tree

CHANGELOG.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,10 @@ Format follows [Keep a Changelog](https://keepachangelog.com/).
99

1010
---
1111

12+
## [2.92.1] — 2026-05-08 — Hotfix: Foundry quota preflight
13+
14+
> **One-liner**: Hotfix release adding opt-in AOAI quota preflight for plans running on the `microsoft-foundry` provider. Reads deployment TPM capacity via the Azure Cognitive Services control-plane API, compares against slice token estimates, and emits warnings (or optionally blocks execution) before a slice runs. Fail-open invariant: control-plane outages NEVER block execution. Closes the last open `docs/research/enterprise-fleet-readiness.md` §14 Priority-D item.
15+
1216
### Phase-FOUNDRY-QUOTA-PREFLIGHT — Azure AI Foundry deployment quota preflight
1317

1418
> **One-liner**: Adds a quota preflight step to `forge_run_plan` for Microsoft Foundry / BYO Azure OpenAI deployments — fetches TPM capacity from the Azure Cognitive Services control-plane REST API, caches results for 5 minutes, compares the slice token estimate against available headroom (safe ≥ 30 %, warning 10–30 %, critical < 10 %), and logs a structured `[foundry-quota]` annotation on every slice. Fail-open: any quota fetch error produces `status: "unknown"` and never blocks execution. Block mode (`PFORGE_FOUNDRY_QUOTA_PREFLIGHT=block`) halts execution on `critical` status.

VERSION

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
2.93.0-dev
1+
2.92.1

pforge-mcp/package.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
{
22
"name": "plan-forge-mcp",
3-
"version": "2.93.0-dev",
3+
"version": "2.92.1",
44
"description": "MCP server exposing Plan Forge operations as tools for AI agents",
55
"main": "server.mjs",
66
"type": "module",

0 commit comments

Comments
 (0)