You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add a guardrails section to agent definitions for semantic output validation beyond JSON schema type checking — including regex patterns, length limits, and custom script-based checks.
Motivation
Research shows frontier models spontaneously exhibit deceptive behaviors in multi-agent settings (UC Berkeley/UC Santa Cruz study), 30-50% of AI agents bypass ethical constraints under KPI pressure, and RAG document poisoning can cause fabricated financial data. Conductor validates output types today (JSON schema) but has no way to validate output content or semantics.
Summary
Add a
guardrailssection to agent definitions for semantic output validation beyond JSON schema type checking — including regex patterns, length limits, and custom script-based checks.Motivation
Research shows frontier models spontaneously exhibit deceptive behaviors in multi-agent settings (UC Berkeley/UC Santa Cruz study), 30-50% of AI agents bypass ethical constraints under KPI pressure, and RAG document poisoning can cause fabricated financial data. Conductor validates output types today (JSON schema) but has no way to validate output content or semantics.
Proposed Design
Behavior on Failure
max_guardrail_retries(default: 2) before hard failureguardrail_check,guardrail_pass,guardrail_failBuilt-in Guardrail Types
regex_denyregex_requiremax_lengthmin_lengthjson_schemacustom_scriptWhy It Fits Conductor
scriptstep infrastructureEffort Estimate
Medium — new validation layer in
AgentExecutorpost-output, new schema fields, script runner reuse from existing script step infrastructure.