Enhance ChatAgent with file navigation, web browsing, scratchpad tools, and write security guardrails by kovtcharov · Pull Request #495 · amd/gaia

kovtcharov · 2026-03-11T20:22:46Z

Summary

This PR adds comprehensive file system navigation, web browsing tools, structured data analysis, and write security guardrails to the ChatAgent.

Write Security Guardrails (`src/gaia/security.py`)

Blocked system directories: Windows (C:\Windows, Program Files) and Unix (/etc, /bin, /usr/lib) system paths are blocked for writes
Sensitive file protection: .env, credentials.json, SSH keys (id_rsa, id_ed25519), certificates (.pem, .key, .crt), and other secrets are never writable
Write size limits: 10 MB maximum per write operation to prevent runaway file creation
Overwrite confirmation prompts: User is prompted before overwriting existing files
Timestamped backups: Automatic .bak copies created before file modification
Audit logging: All write operations logged to ~/.gaia/cache/file_audit.log with timestamp, operation type, path, size, and status
Symlink resolution: Paths resolved via os.path.realpath() to prevent TOCTOU bypass
Fixed ChatAgent write_file: Previously had zero security checks — now enforces full PathValidator + write guardrails
Fixed CodeAgent write_file/edit_file: Generic file tools were missing PathValidator — now enforced

File System Navigation Tools (`src/gaia/agents/tools/filesystem_tools.py`)

browse_directory: List folder contents with file sizes, dates, and type indicators
tree: Visual directory tree with configurable depth, exclusion patterns, and platform-aware defaults
find_files: Search by name, content, size, date, and file type with multi-scope search (current dir → common locations → full drives)
file_info: Detailed metadata — size, type, MIME, modification date, line counts, PDF page counts
read_file: Smart file reading with type detection — text, CSV (tabular), JSON (formatted), PDF (text extraction)
bookmark: Save, list, and remove bookmarks for quick access to important locations

File System Index Service (`src/gaia/filesystem/`)

FileSystemIndexService: Persistent SQLite-backed file index with FTS5 full-text search
auto_categorize: Automatic file categorization by extension (code, document, spreadsheet, image, video, audio, data, archive, config)
Supports incremental scanning and update-on-change for efficient re-indexing

Browser Tools (`src/gaia/agents/tools/browser_tools.py`)

fetch_page: Fetch web pages with content extraction modes — readable text, raw HTML, links, or tables as JSON
search_web: DuckDuckGo web search (no API key required) with configurable result count
download_file: Download files from the web to local disk with size limits and path validation

Web Client (`src/gaia/web/client.py`)

Rate limiting: Per-domain request throttling (configurable delay between requests)
SSRF prevention: Blocked schemes (file://, ftp://), blocked ports (SSH, SMTP, DB ports), private IP detection
Content extraction: BeautifulSoup-based text extraction with boilerplate removal (nav, footer, scripts stripped)
Table extraction: HTML tables parsed to structured JSON
Size limits: Configurable max download size (default 100 MB)
User-Agent rotation: Realistic browser user-agent strings

Scratchpad Tools (`src/gaia/agents/tools/scratchpad_tools.py`)

create_table: Create SQLite tables for structured data accumulation
insert_data: Insert rows from extracted document data
query_data: Run SQL queries (SELECT only) with formatted results — supports SUM, AVG, GROUP BY for analysis
list_tables: Show all scratchpad tables with row counts and schemas
drop_table: Clean up tables when analysis is complete

Scratchpad Service (`src/gaia/scratchpad/service.py`)

SQLite-backed working memory for multi-document data analysis
Table name prefixing (scratch_) to isolate scratchpad data
Read-only query enforcement (SELECT only) to prevent data mutation via query tool
Schema introspection and row count tracking

ChatAgent Integration (`src/gaia/agents/chat/agent.py`)

Integrated FileSystemToolsMixin, ScratchpadToolsMixin, and BrowserToolsMixin
Config toggles: enable_filesystem, enable_scratchpad, enable_browser (all default to True)
Updated system prompt with new tool workflows: file search + auto-index, data analysis pipeline, web research, download + analyze
Replaced legacy search_file/search_directory tools with enhanced find_files/browse_directory
Graceful degradation: each service initializes independently with fallback on import errors

CI Updates (`.github/workflows/test_unit.yml`)

Added beautifulsoup4 and requests to test dependencies for browser tool tests

New Modules

Module	Files	Description
`src/gaia/filesystem/`	`index.py`, `categorizer.py`	Persistent file index with FTS5 search and auto-categorization
`src/gaia/web/`	`client.py`	HTTP client with rate limiting, SSRF prevention, content extraction
`src/gaia/scratchpad/`	`service.py`	SQLite working memory for structured data analysis
`src/gaia/agents/tools/filesystem_tools.py`	—	File system navigation mixin (6 tools)
`src/gaia/agents/tools/browser_tools.py`	—	Web browsing mixin (3 tools)
`src/gaia/agents/tools/scratchpad_tools.py`	—	Data analysis mixin (5 tools)

Test Coverage

Test File	Focus
`test_file_write_guardrails.py`	Blocked directories, sensitive files, size limits, backups, audit logging, overwrite prompts
`test_security_edge_cases.py`	Symlink resolution, path traversal, platform-specific edge cases
`test_filesystem_tools_mixin.py`	browse_directory, tree, find_files, file_info, read_file, bookmarks
`test_filesystem_index.py`	FTS5 search, incremental scanning, categorization
`test_categorizer.py`	Extension-based file categorization
`test_browser_tools.py`	URL validation, SSRF prevention, content extraction, rate limiting
`test_web_client_edge_cases.py`	Timeout handling, redirect limits, encoding detection
`test_scratchpad_service.py`	Table CRUD, SQL injection prevention, schema introspection
`test_scratchpad_tools_mixin.py`	Tool registration, query formatting, error handling
`test_service_edge_cases.py`	Concurrent access, large datasets, cleanup
`test_chat_agent_integration.py`	End-to-end ChatAgent with all new mixins

Test plan

All unit tests pass (11 new test files, ~8000 lines of test code)
All 3 modified source files parse and import cleanly
Integration test: write to safe file succeeds, write to .env blocked, edit creates backup
Platform test: case-insensitive path comparison on Windows verified
Manual: run gaia chat and test file browsing, web search, scratchpad tools
Manual: verify audit log written to ~/.gaia/cache/file_audit.log after write operations

🤖 Generated with Claude Code

- Enhanced PathValidator with write guardrails: blocked system directories, sensitive file protection (.env, credentials, keys), size limits (10 MB), overwrite confirmation prompts, timestamped backups, and audit logging - Fixed ChatAgent write_file (had zero security checks) and added edit_file tool - Fixed CodeAgent generic write_file and edit_file (missing PathValidator) - Added FileSystemToolsMixin: browse_directory, tree, find_files, file_info, read_file with smart type detection, bookmarks - Added BrowserToolsMixin: fetch_page, search_web, download_file - Added ScratchpadToolsMixin: SQLite-backed data analysis tables - Added FileSystemIndexService: persistent file index with FTS5 full-text search - Added WebClient: HTTP client with rate limiting and content extraction - Integrated all new tools into ChatAgent with config toggles - 95 unit tests for write guardrails (all passing)

tests/unit/test_browser_tools.py

+    def test_rate_limit_tracks_domains(self):
+        """Rate limit state is per-domain."""
+        self.client._rate_limit_wait("example.com")
+        assert "example.com" in self.client._domain_last_request


tests/unit/test_browser_tools.py

+        """Different domains don't share rate limit state."""
+        self.client._rate_limit_wait("a.com")
+        self.client._rate_limit_wait("b.com")
+        assert "a.com" in self.client._domain_last_request


tests/unit/test_browser_tools.py

+        self.client._rate_limit_wait("a.com")
+        self.client._rate_limit_wait("b.com")
+        assert "a.com" in self.client._domain_last_request
+        assert "b.com" in self.client._domain_last_request


tests/unit/test_browser_tools.py

+        result = self.registered_tools["search_web"]("python tutorial")
+        assert "1. Python Docs" in result
+        assert "2. Real Python" in result
+        assert "https://docs.python.org" in result


Fix black/isort formatting across all modified files to pass CI lint checks. Address all 17 open CodeQL code scanning alerts: Python: Add path traversal validation with realpath/symlink checks (EMR server), sanitize API responses to strip stack traces, restrict returned fields from clear_database endpoint, redact URLs in Jira agent logs. JavaScript: Add final path validation in eval webapp server, sanitize redirect URLs to reject protocol-relative paths, add in-memory rate limiters to docs server and dev server, remove identity replacement no-op, add crossorigin attributes to CDN scripts, add HTML sanitizer for XSS prevention in Jira webui, replace innerHTML with safe DOM APIs for user messages. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

src/gaia/apps/jira/webui/public/js/modules/chat-ui.js


+    sanitizeHTML(html) {
+        const div = document.createElement('div');
+        div.innerHTML = html;


src/gaia/apps/jira/webui/public/js/modules/chat-ui.js

+        // Remove event handlers and javascript: URLs
+        div.querySelectorAll('*').forEach(el => {
+            [...el.attributes].forEach(attr => {
+                if (attr.name.startsWith('on') || (attr.name === 'href' && attr.value.trimStart().toLowerCase().startsWith('javascript:'))) {


docs/server.js

-      res.redirect(303, parsed.pathname);
+      // Sanitize pathname to prevent protocol-relative URLs (e.g., //evil.com)
+      const safePath = parsed.pathname.startsWith('/') && !parsed.pathname.startsWith('//') ? parsed.pathname : '/';
+      res.redirect(303, safePath);


src/gaia/apps/jira/webui/public/js/modules/chat-ui.js


+    sanitizeHTML(html) {
+        const div = document.createElement('div');
+        div.innerHTML = html;


src/gaia/eval/webapp/public/index.html

-    <script src="https://cdnjs.cloudflare.com/ajax/libs/html2canvas/1.4.1/html2canvas.min.js"></script>
-    <script src="https://cdnjs.cloudflare.com/ajax/libs/jspdf/2.5.1/jspdf.umd.min.js"></script>
+    <script src="https://cdnjs.cloudflare.com/ajax/libs/html2canvas/1.4.1/html2canvas.min.js" crossorigin="anonymous"></script>
+    <script src="https://cdnjs.cloudflare.com/ajax/libs/jspdf/2.5.1/jspdf.umd.min.js" crossorigin="anonymous"></script>