AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

Published in arXiv preprint, 2026

Core Contributor

AgentDoG 1.5 updates the agent safety taxonomy for Codex and OpenClaw execution scenarios, trains lightweight guardrail variants with a taxonomy-guided data engine, and supports real-time safety moderation in complex interactive agent settings.

[Code]

Recommended citation: Dongrui Liu, Yu Li, Zhonghao Yang, Peng Wang, Guanxu Chen, Yuejin Xie, Qinghua Mao, Wanying Qu, Yanxu Zhu, Tianyi Zhou, et al. (2026). "AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security." arXiv preprint arXiv:2605.29801. https://arxiv.org/abs/2605.29801

Share on

Twitter Facebook LinkedIn