Question 1

What does Hermes Labs do?

Accepted Answer

Hermes Labs is an AI reliability engineering studio. We find the silent failures that standard evaluations miss in production AI and agent systems (dropped instructions, fabricated tool calls, distorted memory, and actions you cannot reconstruct), then engineer them out. Work spans agent reliability, memory and context integrity, evaluation and auditability, and runtime defense.

Question 2

What kinds of problems does Hermes Labs work on?

Accepted Answer

Agents that pass every test, then fail silently in production: a system prompt or guardrail quietly dropped mid-conversation, an agent fabricating a tool result instead of calling the tool, a memory or summary layer that compresses context and changes its meaning, and systems where you cannot prove what the agent actually did.
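To make one of these failure modes concrete, here is a minimal sketch of how a fabricated tool result might be caught: compare what the agent's transcript claims against what the runtime actually executed. This is an illustration only, not a Hermes Labs tool or method. The record shapes (`ToolCall`, `ClaimedResult`) and the function `find_fabricated` are hypothetical names invented for this example, and it assumes the runtime keeps an execution log keyed by call ID.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolCall:
    """A tool invocation the runtime actually executed (hypothetical record shape)."""
    call_id: str
    tool: str
    result: str


@dataclass(frozen=True)
class ClaimedResult:
    """A tool result the agent's transcript says it received (hypothetical record shape)."""
    call_id: str
    tool: str
    result: str


def find_fabricated(claims, executed):
    """Return every claim that has no matching executed call,
    or whose tool name or result differs from the executed record."""
    executed_by_id = {call.call_id: call for call in executed}
    fabricated = []
    for claim in claims:
        actual = executed_by_id.get(claim.call_id)
        if actual is None or actual.tool != claim.tool or actual.result != claim.result:
            fabricated.append(claim)
    return fabricated


# Assumed example data: one real call, plus one claim with no execution behind it.
executed = [ToolCall("c1", "get_weather", "12C, rain")]
claims = [
    ClaimedResult("c1", "get_weather", "12C, rain"),
    ClaimedResult("c2", "get_balance", "$1,240.00"),
]

flagged = find_fabricated(claims, executed)
print([claim.call_id for claim in flagged])  # prints ['c2']
```

The check only works if the execution log is written by the runtime rather than by the agent itself; a log the agent can author is subject to the same fabrication it is meant to detect.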

Question 3

What proof stands behind the work?

Accepted Answer

Two peer-reviewable papers on Zenodo (DOIs 10.5281/zenodo.19042469 and 10.5281/zenodo.18867694), merged fixes in LangChain and Microsoft Semantic Kernel, five US patent filings, and 18+ open-source reliability tools. The Semantic Kernel fix is a real, merged instance of a failure mode named in the taxonomy paper: Silent Instruction Relaxation.

Question 4

How do engagements start?

Accepted Answer

With a free 30-minute call to scope what is breaking. Tell us the system and the symptom; scope and terms are set on the call. Book at https://calendly.com/rbosch-lpci/30min.

Question 5

Is the software open source?

Accepted Answer

Yes. 18+ tools under github.com/hermes-labs-ai (Apache-2.0 and MIT, no telemetry), including lintlang, fidelis, little-canary, suy-sideguy, zer0dex, agent-gorgon, and hermes-rubric.

Your AI passed every test. Then it failed silently in production.

The guardrail that quietly disappears

The tool call that never happened

The memory that rewrote itself

The incident you cannot reconstruct

Tell us what’s breaking.