Build Notes

Project stories and technical notes

Multi-agent systemsAI evaluationProduction AIResearch methodsFounder engineering

Reader-first notesTechnical depth, clear boundaries

Peter McCann Strain working at a laptop on a sunny Glasgow balcony — From the writing desk

All AI Evaluation Computer Vision Founder Engineering LLM Cost Optimization Legal Infrastructure Medical AI Multi-Agent Systems Production AI Recruitment AI Research Methods

Conversico: AI Voice Operations for Dental Practices

Multi-Agent Systems • Production AI

Start here

Start with this

Conversico: AI Voice Operations for Dental Practices

9 min read

An in-production founder case study on an AI voice receptionist for dental practices, built around missed-call recovery, booking support, privacy, and launch-gated operations.

In production · going to market · demo video coming soon

Enter article

Guided entry

Editorial shelf

Start Here

A good first wander through Peter's current ambition, systems depth, and delivery.

Aurum at SThree: Enterprise Agentic AI Engineering

Multi-Agent Systems • AI Evaluation

12 min

Aurum at SThree: Enterprise Agentic AI Engineering

An employer-IP-bounded case study on Aurum at SThree: enterprise agentic AI engineering, reviewable state, fairness diagnostics, red-team testing, production control, and the self-improving-evaluator research that moved scoring quality from a 0.627 baseline to 0.851 without learning the benchmark.

Demo video coming soon

Equal Pay as a Data Operation: A Union's Claims Base and a Valuation Engine It Can Argue With

Founder Engineering • Legal Infrastructure

12 min

Equal Pay as a Data Operation: A Union's Claims Base and a Valuation Engine It Can Argue With

How Datamise built the equal-pay work behind GMB Union's campaigns — an owned, relational claims base and the 38-finding audit that hardened it, plus SettleMise, a valuation engine where every number is traceable and a reviewer can take it apart.

20 May 2026

Editorial shelf

Enterprise Agentic AI

Aurum, scoring systems that learn to score better over time, and the cost trade-offs of running AI in production at SThree.

Multi-Agent Systems • AI Evaluation

12 min

Aurum at SThree: Enterprise Agentic AI Engineering

Demo video coming soon

Removing the LLM From the Hot Path: Building SThree’s Skills Taxonomy Pipeline

Production AI • LLM Cost Optimization

13 min

Removing the LLM From the Hot Path: Building SThree’s Skills Taxonomy Pipeline

A 30-experiment programme that replaced a per-skill LLM validator with a zero-LLM retrieval-and-filter cascade and a 7 kB classifier head — 180× cheaper, ~100× faster, and within ~2pp of the model it replaced.

18 May 2026

Editorial shelf

Datamise & SettleMise

Datamise client work and reviewable equal-pay valuation software.

Founder Engineering • Legal Infrastructure

12 min

Equal Pay as a Data Operation: A Union's Claims Base and a Valuation Engine It Can Argue With

20 May 2026

Editorial shelf

Deep Technical Work

Oxford medical AI, Astroscale satellite vision, and turning simulated images into realistic ones.

From Whole-Slide Images To Biological Structure: My Oxford DPhil In Medical AI

Medical AI • Computer Vision

10 min

From Whole-Slide Images To Biological Structure: My Oxford DPhil In Medical AI

A technical account of my Oxford DPhil: 3 million unlabelled cell nuclei from mouse placental histology, an 89% supervised classifier, near-perfect clustering of 8 cell types, and a self-supervised experiment that fell short of its goal.

20 May 2026

Satellite Pose Estimation and the Sim-to-Real Domain Gap

Computer Vision • Research Methods

14 min

Satellite Pose Estimation and the Sim-to-Real Domain Gap

How I approached spacecraft pose estimation at Astroscale: measuring the sim-to-real gap before modelling it, a preprocessing result that flipped sign with placement, the SPS-B architecture proposal, and AstroGAN — an image-translation model that closed the appearance gap while leaving the harder half open.

20 January 2026

Editorial shelf

Evaluation & Research

Benchmarking, evaluator research, analytics, public-sector strategy, and creative AI.

The Retrieval Bottleneck: A Controlled Comparison of 13 Deep-Research Architectures

AI Evaluation • Research Methods

13 min

The Retrieval Bottleneck: A Controlled Comparison of 13 Deep-Research Architectures

A controlled benchmark of 13 deep-research agent architectures on one shared tool layer — what happens to orchestration claims when you fix the model, the tools, and the judging protocol, and read the statistics honestly.

18 May 2026

Bringing Old Family Photos To Life With AI

Computer Vision • Production AI

17 min

Bringing Old Family Photos To Life With AI

A technical log of an AI-assisted family film: colour-managed restoration, image-to-video animation, identity preservation, and the local pipeline that got built and then bypassed.

21 May 2026

Personal Assistant AI: From Second Brain to Executive Assistant OS

Multi-Agent Systems • Production AI

10 min

Personal Assistant AI: From Second Brain to Executive Assistant OS

A private multi-agent second brain in daily use, with durable memory, proactive support, and quality monitoring behind a privacy-safe engineering trail.

Ongoing case study · demo video coming soon