An agentic system for generating, running, and iteratively fixing simulation scripts. Currently supports libEnsemble, designed to extend to other ensemble tools.
Uses LangChain ReAct agents with tools exposed via MCP and local Python functions.
Dependencies (no package installer yet):
pip install langchain langchain-openai langchain-anthropic mcp
For the web UI:
pip install gradio fastapi uvicorn[standard] websockets
The script generator runs as an MCP server from a separate repository.
It is not required when user scripts are specified via the --scripts option.
sudo apt install nodejs
git clone https://github.com/Libensemble/script-creator.git
cd script-creator
npm install
export GENERATOR_MCP_SERVER="$(pwd)/mcp_server.mjs"
For all workflows, you will need a key to access an LLM.
For example, you can set an OpenAI key. This requires an OpenAI account. Make sure to check MODEL at the top of the agentic script and the usage rates.
Set user OpenAI API Key:
export OPENAI_API_KEY="sk-your-key-here"
Or if you use Anthropic, you can set:
export ANTHROPIC_API_KEY="sk-ant-your-key-here"
Optionally, you can set the LLM_MODEL environment variable to a model name.
However, if using the Web UI, there is a drop-down selection of available models.
See Model comparison below for model performance analysis.
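The LLM_MODEL override typically resolves like the sketch below; resolve_model and the default model name are hypothetical, not the agent's actual code:

```python
import os

# Hypothetical default; the actual default lives at the top of the agent script.
DEFAULT_MODEL = "claude-sonnet-4-5"

def resolve_model() -> str:
    """Return the model name, preferring the LLM_MODEL override if set."""
    return os.environ.get("LLM_MODEL", DEFAULT_MODEL)
```

Under this pattern, `LLM_MODEL=gpt-oss-120b` takes effect without editing the script.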
Using Argo gateway (optional)
If you have access to Argo at Argonne, you can use it to access Claude and/or OpenAI models.
For Claude models:
export ANTHROPIC_BASE_URL="https://apps-dev.inside.anl.gov/argoapi"
export ANTHROPIC_AUTH_TOKEN="your-anl-username"
For OpenAI models:
export OPENAI_BASE_URL="https://apps-dev.inside.anl.gov/argoapi/v1"
export OPENAI_API_KEY="your-anl-username"
Set both if you want access to both model families.
Using Argonne inference service (optional)
If you have an ALCF account, you can use the Argonne inference service instead of OpenAI.
Authenticate via Globus to obtain ALCF inference token:
pip install openai globus_sdk
wget https://raw.githubusercontent.com/argonne-lcf/inference-endpoints/main/inference_auth_token.py
python inference_auth_token.py authenticate # Enter globus authentication when prompted.
Set environment variables for the ALCF Inference service and model. Obtain the API key:
export OPENAI_BASE_URL=https://inference-api.alcf.anl.gov/resource_server/metis/api/v1
export LLM_MODEL=gpt-oss-120b
export OPENAI_API_KEY=$(python inference_auth_token.py get_access_token)
Please report (e.g., via Issues or support options) if you have a key setup that does not work with the agent.
The example script generator uses an application that can easily be built.
cd tests/six_hump_camel/
gcc six_hump_camel.c -o six_hump_camel.x -lm
cd ../../
Note that six_hump_camel is used directly in Python in many libEnsemble examples, but here an MPI application is used to demonstrate running ensembles with a user application.
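For reference, six_hump_camel is the standard six-hump camel test function. A minimal Python version is below; the repository builds a C/MPI variant, so this sketch only shows what is being evaluated:

```python
def six_hump_camel(x1: float, x2: float) -> float:
    """Six-hump camel test function.

    Has six local minima; the two global minima are near
    (0.0898, -0.7126) and (-0.0898, 0.7126), with value ~ -1.0316.
    """
    term1 = (4 - 2.1 * x1**2 + x1**4 / 3) * x1**2
    term2 = x1 * x2
    term3 = (-4 + 4 * x2**2) * x2**2
    return term1 + term2 + term3
```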
The easiest way to try the agent is via the web UI in web_ui/.
Note that the web UI will inherit any environment (e.g. conda) you are in.
pip install fastapi uvicorn[standard] gradio websockets
cd web_ui
python gradio_ui.py
Click on the URL shown in the terminal to open the web interface.
In the interface press the Start Agent button.
See web_ui/README.md for further details.
To run scripts on the command line, here are some examples:
# Fix existing scripts (does not use six_hump_camel.x)
python ensemble_agent.py --scripts tests/scripts_with_errors/
# To run with script generation pre-filled prompt
python ensemble_agent.py
# Generate scripts interactively
python ensemble_agent.py --interactive
# Generate from a prompt
python ensemble_agent.py --prompt "Create APOSMM scripts..."
# Enable debug logging
python ensemble_agent.py --debug
A user run script can also be run directly using the agent. However, as there is not yet an installable agent package, it must be run from the base ensemble_agent/ directory.
python -m ensemble_agent tests/scripts_with_errors/run_example.py
Scripts are saved to the generated_scripts/ directory.
Scripts will be run, fixes attempted on failure, and then rerun.
Each time scripts are modified by the agent, a version is stored under generated_scripts/versions.
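The run/fix/rerun cycle can be sketched as follows; run_with_fixes, apply_fix, and the retry budget are hypothetical names for illustration, not the agent's real API:

```python
import subprocess
import sys

def run_with_fixes(script: str, apply_fix, max_attempts: int = 3) -> int:
    """Run a script; on failure, hand it to a fixer (standing in for the
    LLM agent's fix step), then rerun, up to max_attempts times.

    Returns the attempt number on which the script succeeded.
    """
    for attempt in range(1, max_attempts + 1):
        result = subprocess.run(
            [sys.executable, script], capture_output=True, text=True
        )
        if result.returncode == 0:
            return attempt
        apply_fix(script, result.stderr)  # fixer sees the failure output
    raise RuntimeError(f"{script} still failing after {max_attempts} attempts")
```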
When the agent is re-started any existing generated_scripts/ directory is backed up
to an archive_runs/ dir.
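The per-modification versioning described above could look like this sketch; save_version and the timestamped layout are illustrative, not the agent's actual implementation:

```python
import shutil
import time
from pathlib import Path

def save_version(scripts_dir: str = "generated_scripts") -> Path:
    """Snapshot top-level scripts into <scripts_dir>/versions/<timestamp>."""
    src = Path(scripts_dir)
    dest = src / "versions" / time.strftime("%Y%m%d-%H%M%S")
    dest.mkdir(parents=True, exist_ok=True)
    for f in src.iterdir():
        if f.is_file():
            shutil.copy2(f, dest / f.name)  # preserve timestamps/permissions
    return dest
```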
To see all script options run
python ensemble_agent.py -h

| Variable | Description |
|---|---|
| OPENAI_API_KEY | OpenAI API key |
| ANTHROPIC_API_KEY | Anthropic API key |
| ANTHROPIC_AUTH_TOKEN | Argo gateway token (auto-mapped to ANTHROPIC_API_KEY) |
| OPENAI_BASE_URL | Custom OpenAI-compatible endpoint |
| ANTHROPIC_BASE_URL | Custom Anthropic endpoint |
| LLM_MODEL | Override default model |
| GENERATOR_MCP_SERVER | Path to mcp_server.mjs |
| AGENT_DEBUG | Enable debug logging |
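The ANTHROPIC_AUTH_TOKEN auto-mapping noted in the table could be implemented roughly as below; resolve_anthropic_key is an illustrative sketch, not the agent's actual code:

```python
import os

def resolve_anthropic_key() -> "str | None":
    """Prefer ANTHROPIC_API_KEY; fall back to the Argo ANTHROPIC_AUTH_TOKEN,
    mapping it into ANTHROPIC_API_KEY so downstream clients pick it up."""
    key = os.environ.get("ANTHROPIC_API_KEY") or os.environ.get("ANTHROPIC_AUTH_TOKEN")
    if key and "ANTHROPIC_API_KEY" not in os.environ:
        os.environ["ANTHROPIC_API_KEY"] = key
    return key
```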
As of March 9 2026, the model used primarily in testing has been Claude Opus 4.6. Opus shows significantly better script fixing than Haiku, and slight benefits over Sonnet (both at 4.6).
For example, the initial script generation from templates brings in a script using the SciPy Nelder-Mead optimizer. Using the default prompt, which requests an "optimizer that is good for smooth functions", Opus and Sonnet convert to using nlopt with "LN_BOBYQA", which is indicated in the reference docs. Haiku did not change the optimizer. Opus handled minor configuration details better than Sonnet and presented a more detailed analysis (e.g., a table of minima found). The Opus and Sonnet scripts both ran first time and produced good output.
Recommendation: For Claude, use Sonnet or Opus (compared at v4.6).
OpenAI models:
GPT-5.4 produced correct scripts and ran first time. It used nlopt with "LN_BOBYQA".
GPT-5.2 changed the optimizer and the script ran first time, but it did not add rk_const and found minima more slowly.
GPT-5.1 did not change the optimizer.
Recommendation: For OpenAI, use GPT-5.4+ (GPT-5.2 is usable).