Operations in AI Safety: A One-Year Perspective and Advice
Jul 24, 2025
Building things and fixing problems
Building systems and running events to get more people working on critical AI Safety challenges.
Previously, I ran theatre venues and productions, helped students and student groups turn their ideas into impact, and solved problems with software.
I run programs and build operational infrastructure for AI Safety organisations including recruitment and selection pipelines, program logistics, CRMs, and the automations that hold them together. More about this side of me →
First employee at the incubator for new AI Safety organisations. Co-ran the incubation program end to end
Taught AI agents, adversarial attacks, and AI Safety strategy at a 10-day intensive bootcamp
I build internal systems, data pipelines, and developer tooling, including a growing toolkit for making AI agents verifiable enough to do real work. More about this side of me →
A hook, skill, and CLI that make AI agents screenshot and visually check their own UI changes
Claude Code hooks and skills that make AI coding agents verify their work before declaring it done
A chatbot that helps you decide which of the Edinburgh Fringe's 3,500 shows to actually go see
Listening goals and statistics for Spotify. Retired since Spotify stopped verifying apps from small developers
I compose for theatre and podcasts, and I hold a degree in Acoustics and Music Technology, including a dissertation on what stalactites sound like when you hit them with a mallet. More about this side of me →
Themes and scores for podcasts, theatre, and other narrative art
I run venues at the Edinburgh Fringe, sound-design professional shows, and learned it all running a student theatre. More about this side of me →
Freelance sound design, including "Mary: A Gig Theatre Show" across five runs from the Fringe to the Traverse
A chatbot that helps you decide which of the Edinburgh Fringe's 3,500 shows to actually go see
Web soundboard for live improv comedy
Over thirty productions and festivals as production manager, director, producer, and designer, plus two years leading the Edinburgh University Theatre Company
Get the latest updates from my blog directly to your inbox.
Catalyze Impact incubates new AI Safety organisations. I was their first employee, and for a year and a half I co-ran the incubation program:
I also built the systems everythin ran on using Airtable bases and Python scripts that automated the repetitive parts of recruitment and selection, and I made some really great spreadsheets.
ML4Good runs 10-day intensive bootcamps that take people from “interested in AI Safety” to actually working on it. I was a teaching assistant at the March 2025 Europe camp where I taught the sessions on AI Agents, Adversarial Attacks, and AI Safety strategy, and coached participants through the rest of the curriculum.
Because the atmosphere at a bootcamp is just as important as the curriculum, I also ran the social side, including a welcome event, a murder mystery, lightning talks, and a Scottish ceilidh.
Vischeck makes an AI agent screenshot its own UI changes and actually look at them before reporting back. Agents are happy to call a layout “good” based on a diff changes they never rendered. This package of hook, skill, and CLI tmakes them take an authenticated screenshot of the local dev server and make them look first.
The screenshot CLI handles dev-server auth automatically, and supports dark mode, mobile viewports, element-level captures, and batch screenshots of whole page sets. The hook fires whenever the agent edits a view or template file, so visual verification happens by default.
AI coding agents love saying “this should work now” about code that has never even been run. Dev Hooks is a collection of Claude Code hooks and skills that make sure they do check their work and apply best practices without having to prompt them yourself all the time.
Also includes my personal dev-env setup, built on mise.
The Edinburgh Festival Fringe programme lists around 3,500 shows. I used to read it cover-to-cover, but that still didn’t tell me much about the shows themselves, so I built the tool I wanted instead: a chat interface where you can just ask the programme what you should go see.
Under the hood it’s a RAG pipeline which does semantic search over the festival data with ChromaDB vector storage, category filtering, and a FastAPI backend serving a React/TypeScript frontend.
Spotify Tools allows you to set listening goals for your favorite artists, albums, and tracks. The application syncs with your Spotify listening history and helps you reach your goals by generating personalized playlists based on your preferences and what you need to listen to. It also gives you detailed statistics of your listening history.
Unfortunately, Spotify no longer verifies small apps from individual developers, so it is not publicly accessible. If you would like to try it, please contact me with your email and I can give you access.
Artwork from “Ethics Town” podcast
I compose music for theatre and other performance and narrative art.
Themes for audio fiction podcasts.
Short pieces I made for university assignments but did not develop into a full piece.
I wrote my dissertation on “Ancient Rock Music: the sound stalactites” make when hit by a rubber mallet. The project was inspired by a visit to Luray Caverns, Virginia, home of the “Great Stalacpipe Organ”: an instrument that plays the cave itself.
The work had three parts: a physical simulation of struck stalactites using finite-difference and modal methods (approximating them as cantilever bars of variable diameter, in Matlab), a field trip to record real stalactites, and a virtual instrument built in C++ with JUCE so anyone can play a cave from their Digital Audio Workstation.
Photo from “Mary: A Gig Theatre Show”. Credit: Carla Watson
I work as a freelance sound designer and general organiser in theatre. See also the theatre I did as a student.
ImpAmp is the soundboard Bedlam Theatre uses for live improv comedy, where a sound effect is only funny if you can find it before the performers move on. It supports multiple soundbanks, instant search, arming tracks, multiple sounds per pad, and Google Drive sync so you can share your soundbanks with the whole team
This third version was also an experiment in AI-driven development: I built it in Next.js using Claude 3.7 Sonnet and Gemini 2.5 back when agentic AI coding was just a month old. It has been running real shows ever since.
During university (and a little after), I took part in over thirty productions and festivals as production manager, producer, director, technician, composer, and actor. I also spent two years on the committee of the Edinburgh University Theatre Company, the student company that fully operates the Bedlam Theatre: as Business Manager and later President, I helped reopen the theatre after the covid-19 pandemic and led its return as an Edinburgh Fringe venue in 2023.
Shows marked * took place in the Bedlam Theatre
I was President (2022-2023) and Business Manager (2021-2022) for the Edinburgh University Theatre Company, who fully operate the Bedlam Theatre, and Treasurer (2022-2023), President (2021-2022), and Secretary (2020-2021) for Theatre Paradok. During my time, we reopened the theatre after the covid-19 pandemic and I led the efforts for our return as a venue to the Edinburgh Fringe Festival in 2023.