CEO, Tortoise AI | 20 years getting AI into production
I build AI for environments where failure is not an option, defence, nuclear, aviation, and now live sports broadcasting!
When the frameworks I need don't exist, I build them open source.
ARMM (Agent Readiness Maturity Model)
Assessing organisational readiness to deploy AI agents in production.
Four dimensions. Five levels. Weakest-link principle. CC BY 4.0.
Decision Latency Framework (coming 2026)
A scoring methodology for knowing how much time a decision should take.
pda-platform
Open infrastructure for AI-enabled project delivery. Universal PM data
parser, MCP servers for Claude integration, AI reliability tooling. MIT.
agent-task-planning
AI reliability framework with confidence extraction and outlier mining.
Multi-provider. Production guardrails. MIT.
pm-data-tools
Universal parser for project management data. 8 formats + NISTA. MIT.
ARMM Assessment Tool
Interactive self-assessment across 251 criteria. AGPL-3.0.
Universal Dashboard Specification
Vendor-neutral declarative format for AI-native analytical dashboards.
Apache 2.0.
inspect_ai
Contributing bug fixes, evaluation utilities, and metric-factory features to the UK AI Safety Institute's LLM evaluation framework.
Verified Autonomy: A Field Guide to Engineering Trust in AI Systems (May 2026)
A field guide for engineering trust into production AI systems, covering calibration, conformal prediction, audit trails, and constrained autonomy. Ant Newman, Shanti Greene, Malia Hosseini, Philip Kitchener, Rainier Potgieter, Hadley Christoffels. Companion repo: verified-autonomy.
CC BY 4.0 (content), MIT (code).
From Policy to Practice: An Open Framework for AI-Ready Project Delivery (Feb 2026)
An open framework for AI-ready project delivery. Ant Newman. Companion repo: Project-Delivery-Toolkit.
CC BY 4.0.
Agent Readiness Maturity Model (ARMM) Framework v1.1 (Jan 2026)
A maturity model that scores whether an organisation is ready to deploy AI agents in production, applying a weakest-link rule across four dimensions and five levels. Ant Newman. Companion repo: armm-assessment.
CC BY 4.0.
The Sharon Instability Theorem: Generic Instability of the M-Invariant (Dec 2025)
Proves the M-invariant in multiparameter persistence is generically unstable, resolving an open question in the field. Ant Newman. Companion repo: sharon-instability.
CC BY 4.0.
I publish on AI deployment, decision-making under complexity, and what production-grade reliability actually requires.
LinkedIn · tortoiseai.co.uk/insights



