Tag
#agents
3 posts tagged agents.
- monitoring
A Lean 4 stability proof for tool-mediated LLM agents, and what it means for your runbook
A new arXiv paper certifies controllability and ISS robustness for an LLM-driven SOC agent using Lean 4. The MLOps takeaway is simpler than the math: monitor the action catalog, not the model.
- monitoring
Embedding-Based Agent Monitoring Has a Blind Spot. Here's What to Watch Instead.
A new paper demonstrates three attack patterns — Slow Drift, Benign Wrapper, Chaos Seeding — that defeat embedding-based detection of malicious agents in LLM multi-agent systems. The fix requires monitoring logit-level confidence, not just output embeddings.
- monitoring
The Authority Gap Is an Observability Problem: What MLOps Teams Should Borrow
A new framing of AI agent risk argues that delegation, not identity, is the missing telemetry. ML platform teams already have the substrate to fix it.