Enable javascript in your browser for better experience. Need to know to enable it? Go here.

Research for trusted, scalable AI deployments

Our AI research team advances model interoperability, control and robustness. Focusing on rigorous evaluation, interpretability and decision-making, we deliver insights through open-source solutions and peer-reviewed publications.

Why it matters

Too much of the market still treats evaluation, interpretability and reliability as secondary concerns. That works until teams need to confidently choose a model, mitigate hallucinations or explain how an AI decision was reached.

Our approach

We tackle these challenges from the ground up. Using a first-principles approach to evaluation, we build methods that rigorously assess AI systems at every stage: input, output and internal decision-making.

More from the labs

June 08, 2026

Anti-slopping — An innovation for rectifying LLM writing clichés

LLMs sometimes produce slop, repetitive or robotic output. See how the FTPO training framework helps maintain authentic, high-quality results.
April 04, 2026

Steering smarter

February 27, 2026

Concept consistency score

December 15, 2025

p-less sampling: A robust hyperparameter-free approach for LLM decoding

December 15, 2025

p-less sampling: A robust LLM decoding strategy

October 10, 2025

Evaluating LLM-generated summaries using the Lie algebra framework

September 07, 2025

Beyond I am sorry, I can’t: dissecting large language model refusal

August 29, 2025

Distribution-aware feature selection for SAEs

August 06, 2025

Towards transparent AI grading: Entropy as a signal for human-AI disagreement

June 04, 2025

The next frontiers in AI — according to industry leaders

May 30, 2025

Beyond linear steering: Unified multi-attribute control for language models

May 06, 2025

Calculating uncertainty in generative AI

March 17, 2025

TinySQL

March 07, 2025

Evaluating LLMs using semantic entropy

October 31, 2024

LLM benchmarks, evals and tests

July 01, 2024

Turning up the heat: Min-p samling for creative and coherent creative outputs

October 16, 2023

Decoding LLM uncertainties for better predictability

September 08, 2023

A surprisingly effective way to estimate token importance in LLM prompts

September 02, 2021

Probabilistic machine learning and weak supervision

September 01, 2021

A gentle introduction to machine teaching

Partners and collaborations

Thoughtworks AI labs sit within a wider network of organizations spanning public AI research, semiconductor innovation, cloud platforms, open source and AI engineering.

These relationships strengthen the lab’s ability to contribute to the methods, tools and technical standards shaping reliable AI.

 

For partnerships and collaboration inquiries

email ai-labs@thoughtworks.com