Research — Nablon

Featured

Reinforcement Learning · Featured Paper

We Trained a Small Model to Detect Sensitive Data — Without Sending That Data Anywhere

GRPO-trained Llama 3.1 8B matches GPT-4o at 83.4% F1 on 58 PII entity types, at 11× lower cost, running entirely on your infrastructure. No data ever leaves your environment.

March 30, 2026

Upcoming Research

Generalist Agents · Coming Soon

Agent Builder: Create Production-Ready Agents from a Prompt

How we architected our builder to translate natural language intents into robust, multi-stage agentic workflows ready for production use.

Evaluation

Sentinel: A Continuous Evaluation Service for Production Agentic Pipelines

5× cost reduction through prompt bundling. 71 scanners across 5 execution mechanisms. A full lightweight evaluation path under 500 ms.

April 6, 2026

Reinforcement Learning · Coming Soon

RL Feedback Systems for Self-Evolving Agents

Moving beyond static policies. How we implement continuous reinforcement learning loops that allow deployed agents to learn directly from execution failures.
