SaFoLab : Security and Safe Foundation Model Systems
Pinned Loading
Repositories
- DynAuditClaw Public
DynAuditClaw — A security audit skill that dynamically discovers your OpenClaw agent's real configuration, designs targeted attack scenarios adapted to your specific setup, and executes them in isolated Docker containers to uncover vulnerabilities with a structured report.
SaFo-Lab/DynAuditClaw’s past year of commit activity - ROM Public
The official implementation of our paper "ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention"
SaFo-Lab/ROM’s past year of commit activity - PRISM Public
PRISM: Robust VLM Alignment with Principled Reasoning for Integrated Safety in Multimodality
SaFo-Lab/PRISM’s past year of commit activity - DRIFT Public
[NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents".
SaFo-Lab/DRIFT’s past year of commit activity - llm-armor Public
SaFo-Lab/llm-armor’s past year of commit activity - armor Public
SaFo-Lab/armor’s past year of commit activity - AgentDyn Public
The official implementation of the paper "AgentDyn: A Dynamic Open-Ended Benchmark for Evaluating Prompt Injection Attacks of Real-World Agent Security System".
SaFo-Lab/AgentDyn’s past year of commit activity - A2ASecBench Public
Official code repository for "A2ASecBench: A Protocol-Aware Security Benchmark for Agent-to-Agent Multi-Agent Systems" at ICLR 2026.
SaFo-Lab/A2ASecBench’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…