AUTONOMY DATA UNIT
Six data scientists and machine-learning engineers. We point frontier methods at power, and we have done it for unions, charities, newsrooms and campaigners since 2020.
The data arm of
the public interest.
Built inside the
Autonomy Institute.
Each one runs end to end, from raw filings to a thing you can actually use. We scope, build, and hand it over.
We scrape filings, donations, contracts and the open web, then use LLMs to pull out entities and the links between them. The output is a map of who is connected to whom, and how the money moves.
Microsimulation, input-output models, and bespoke indices. We build the number when the official statistics do not exist yet, or do not break down the way the question needs.
We read documents at a scale a research team never could. LLM extraction across millions of pages turns annual reports, registers and PDFs into a clean, queryable dataset.
Searchable databases, trackers and indexes that outlive the report. We ship the public-facing site, not just the spreadsheet behind it, and we keep it running.
A dozen projects from the last six years. Most are live and public. Click through.

Millions of pages scraped to map the modern far right and its links to power.

An LLM pipeline reads every UK annual report and surfaces the confirmed risk events.

Political donations linked to the government contracts that followed them.

Labour's shift toward business donors, traced from 2019 to 2024.

An AI-augmented index of the Heritage Foundation's 900-page plan.

An economic model of UK landlord returns, built for the Joseph Rowntree Foundation.

A searchable database of licensed care-visa sponsors, with the Bureau of Investigative Journalism.
Arts-council funding by constituency since 2014, built for Equity.

Mapping the corporate connections of the UK's entrepreneurial far right.

The origin project: the UK workforce scored by Covid exposure, featured on Peston.

30 million job ads tagged with LLMs on the Isambard supercomputer, with the UK AI Security Institute.

A co-mention network built for the global trade-union body, the ITUC.
Public bodies, unions, foundations and newsrooms. Some of the names from the last six years.
A small team inside the Autonomy Institute. Each of us ships work, not slides.
Lead. Machine learning and data engineering. Builds the pipelines and keeps them running.
Data science and ML research. Works out what the model should actually be doing.
NLP and network analysis. Turns scraped text into entities and the links between them.
Investigations and political data. Finds the story buried in the filings.
Economic modelling. Microsimulation, indices, and the maths behind the headline number.
Forecasting and statistics. Nowcasting models and the things that have not happened yet.