Ant Newman

CEO, Tortoise AI | 20 years getting AI into production

I build AI for environments where failure is not an option: defence, nuclear, aviation, and now live sports broadcasting.

When the frameworks I need don't exist, I build them open source.


Frameworks

ARMM (Agent Readiness Maturity Model)
Assessing organisational readiness to deploy AI agents in production.
Four dimensions. Five levels. Weakest-link principle. CC BY 4.0.
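
The weakest-link principle can be sketched in a few lines. This is an illustration only, not the ARMM scoring code: it assumes levels are numbered 1 to 5 and that the overall level is the lowest level across the four dimensions. The dimension names below are hypothetical placeholders, and the helper name `overall_level` is made up for this example.

```python
from typing import Mapping

# Assumption: the five levels are numbered 1 (lowest) to 5 (highest).
MIN_LEVEL, MAX_LEVEL = 1, 5
EXPECTED_DIMENSIONS = 4


def overall_level(dimension_levels: Mapping[str, int]) -> int:
    """Return the overall level under a weakest-link rule.

    The overall level is the minimum across dimensions, so one weak
    dimension caps the result regardless of how strong the others are.
    """
    if len(dimension_levels) != EXPECTED_DIMENSIONS:
        raise ValueError(f"expected {EXPECTED_DIMENSIONS} dimensions")
    for name, level in dimension_levels.items():
        if not MIN_LEVEL <= level <= MAX_LEVEL:
            raise ValueError(f"{name}: level {level} is outside 1 to 5")
    return min(dimension_levels.values())


# Hypothetical dimension names, for illustration only.
scores = {
    "dimension_a": 4,
    "dimension_b": 3,
    "dimension_c": 5,
    "dimension_d": 2,
}
print(overall_level(scores))  # 2
```

Averaging the same scores would give 3.5, which hides the weak dimension. Taking the minimum keeps it visible.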

Decision Latency Framework (coming 2026)
A scoring methodology for knowing how much time a decision should take.


Open Source Projects

pda-platform
Open infrastructure for AI-enabled project delivery. Universal PM data parser, MCP servers for Claude integration, AI reliability tooling. MIT.

agent-task-planning
AI reliability framework with confidence extraction and outlier mining. Multi-provider. Production guardrails. MIT.

pm-data-tools
Universal parser for project management data. 8 formats + NISTA. MIT.

ARMM Assessment Tool
Interactive self-assessment across 251 criteria. AGPL-3.0.

Universal Dashboard Specification
Vendor-neutral declarative format for AI-native analytical dashboards. Apache 2.0.


Contributions to AI safety tooling

inspect_ai
Contributing bug fixes, evaluation utilities, and metric-factory features to the UK AI Safety Institute's LLM evaluation framework.


Published Work

Verified Autonomy: A Field Guide to Engineering Trust in AI Systems (May 2026)
A field guide for engineering trust into production AI systems, covering calibration, conformal prediction, audit trails, and constrained autonomy. Ant Newman, Shanti Greene, Malia Hosseini, Philip Kitchener, Rainier Potgieter, Hadley Christoffels. Companion repo: verified-autonomy.
CC BY 4.0 (content), MIT (code).

From Policy to Practice: An Open Framework for AI-Ready Project Delivery (Feb 2026)
An open framework for AI-ready project delivery. Ant Newman. Companion repo: Project-Delivery-Toolkit.
CC BY 4.0.

Agent Readiness Maturity Model (ARMM) Framework v1.1 (Jan 2026)
A maturity model that scores whether an organisation is ready to deploy AI agents in production, applying a weakest-link rule across four dimensions and five levels. Ant Newman. Companion repo: armm-assessment.
CC BY 4.0.

The Sharon Instability Theorem: Generic Instability of the M-Invariant (Dec 2025)
Proves the M-invariant in multiparameter persistence is generically unstable, resolving an open question in the field. Ant Newman. Companion repo: sharon-instability.
CC BY 4.0.


Writing

I publish on AI deployment, decision-making under complexity, and what production-grade reliability actually requires.

LinkedIn · tortoiseai.co.uk/insights


Contact

antjsnewman@outlook.com · tortoiseai.co.uk

Pinned

  1. antnewman (Public)

    My personal repository

  2. pda-platform (Public)

    116 MCP tools for UK government IPA Gate Review assurance. Connects Claude to risk registers, earned value, benefits realisation, gate readiness, and IPA benchmarks.

    Python · 2 stars · 3 forks

  3. UKGovernmentBEIS/inspect_ai (Public)

    Inspect: A framework for large language model evaluations

    Python · 2.1k stars · 505 forks

  4. Tortoise-AI/sharon-instability (Public)

    Jupyter Notebook

  5. Tortoise-AI/uds (Public)

    Python

  6. SingularityAI-Dev/logic-md (Public)

    The declarative reasoning layer for AI agents. A portable, framework-agnostic file format for specifying how an agent thinks: strategy, step DAGs, contracts, quality gates, and fallback policies …

    TypeScript · 21 stars · 3 forks