Company-bound platform
Aurum at SThree
Controlled hiring simulations for fairness review, red-team testing, and audit records.
Recruitment AI safety
Recruitment AI is high-stakes because model outputs affect opportunity. Peter builds systems that test fairness before deployment and preserve human oversight where it matters.
Company-bound platform
Controlled hiring simulations for fairness review, red-team testing, and audit records.
Review surfaces
Review surfaces make disagreement and failure modes visible before deployment.
Fairness signals
Fairness findings are framed as risk signals, not legal certification.
Red-team workflow
Adversarial testing is integrated with provenance and human oversight.
Aurum creates controlled hiring simulations, then helps reviewers inspect how recruitment AI behaves under ordinary, fairness-sensitive, and adversarial conditions.
Recruitment AI safety needs more than a model score: traceability, audit artifacts, small-sample caveats, and explicit points where a human stays accountable.
The Aurum self-improvement research treats evaluator optimization as a governance problem as much as a modelling one. It works against a synthetic proxy oracle, not human hiring ground truth.
Continue exploring
The full enterprise agentic AI and recruitment evaluation narrative.
OpenEvaluator optimization studied against a synthetic proxy oracle, with negative results reported in full.
OpenThe agentic architecture behind audit workflows.
OpenSenior AI engineering context and the wider work.
Open