Reinforcement Learning · Featured Paper
We Trained a Small Model to Detect Sensitive
Data — Without Sending That Data Anywhere
A GRPO-trained Llama 3.1 8B matches GPT-4o at 83.4% F1 across 58 PII entity types,
at 11× lower cost, while running entirely on your infrastructure.
No data ever leaves your environment.