
kixlab/Evalet


Teaser

🔬 Evalet

[CHI 2026] Evalet: Evaluating Large Language Models by Fragmenting Outputs into Functions

CHI 2026 Honorable Mention Award 🏆

Links: arXiv · Project website

Evalet helps practitioners understand and validate LLM-based evaluations by decomposing outputs into fragment-level functions. Instead of relying on opaque scores, practitioners can see exactly which parts of each output influenced the evaluation, and why.

System overview

Running locally

After cloning the repository or downloading it as a ZIP, follow the steps below.

Prerequisites

  • Node.js (LTS recommended) — React frontend
  • Python 3.9–3.12 — clustering and API backend (dependencies such as numba may not support Python 3.13+)
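To confirm your interpreter falls in the supported range before installing backend dependencies, a small version check can help. This is a sketch: the 3.9–3.12 bound mirrors the note above and may loosen as dependencies such as numba add support for newer Python releases.

```python
import sys

def is_supported_python(major: int, minor: int) -> bool:
    """Return True if this Python version is in the range the backend supports.

    The 3.9-3.12 bound reflects the README's note that dependencies
    such as numba may not yet support Python 3.13+.
    """
    return (3, 9) <= (major, minor) <= (3, 12)

if __name__ == "__main__":
    major, minor = sys.version_info[:2]
    if not is_supported_python(major, minor):
        print(f"Warning: Python {major}.{minor} may not be supported by the backend.")
```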

1. Environment variables

Create a .env file in the project root and set your OpenAI API key (see .env.example).

cp .env.example .env
# Edit REACT_APP_OPENAI_API_KEY in .env to your real key
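A minimal `.env` might look like the following; the key value is a placeholder, and `.env.example` in the repository is the authoritative template.

```
REACT_APP_OPENAI_API_KEY=sk-your-key-here
```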

2. Frontend (React)

From the project root, install dependencies and start the dev server.

npm install
npm start

The app is served at http://localhost:3000 by default. If you use Yarn, run yarn and yarn start instead.

3. Backend (Flask)

In a separate terminal, create and activate a Python virtual environment (recommended), install the dependencies, and start the server.

cd pipeline_flask
python -m venv .venv
source .venv/bin/activate   # Windows: .venv\Scripts\activate
pip install -r requirements.txt
python server.py

The API runs at http://localhost:8080. The frontend uses SERVER_BASE_URL in src/configs.ts to reach the backend; for local development, ensure it points to http://localhost:8080.
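If the frontend cannot reach the backend, it helps to first confirm that anything is answering at the configured URL. Below is a minimal reachability probe using only the standard library; `backend_reachable` and its default URL are illustrative helpers, not part of the repository.

```python
import urllib.request
import urllib.error

def backend_reachable(base_url: str = "http://localhost:8080", timeout: float = 2.0) -> bool:
    """Return True if anything answers HTTP at base_url (even with a 404/500)."""
    try:
        urllib.request.urlopen(base_url, timeout=timeout)
        return True
    except urllib.error.HTTPError:
        # The server responded with an error status, so it is up.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, DNS failure, or timeout: server not reachable.
        return False
```

Running `backend_reachable()` after `python server.py` starts should return `True`; if it returns `False`, check that port 8080 is free and that `SERVER_BASE_URL` matches.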

Summary

| Component        | Command (example)                      | URL                   |
| ---------------- | -------------------------------------- | --------------------- |
| React dev server | `npm start`                            | http://localhost:3000 |
| Flask API        | `cd pipeline_flask && python server.py` | http://localhost:8080 |

For clustering and related API features, both the React and Flask servers must be running.
