Semantris Plus

LLM-powered semantic arcade game inspired by the original Semantris concept, built as a maintainable Flask application with a TypeScript-powered browser client.

Overview

Semantris is still one of the most compelling word game ideas ever made. It turns semantic intuition into visible motion: you type a clue, language itself becomes the game state, and meaning decides what survives.

This project exists because I wanted a better Semantris-style experience for the LLM era.

The original idea is still excellent, but modern language models make it possible to revisit that loop with a much broader, more current semantic engine. Instead of relying on older ranking systems, this version uses a contemporary LLM stack to reorder words by association and create a more flexible, extensible foundation for a new arcade interpretation.

Semantris Plus is not trying to pretend it surpasses the original Semantris. The original was a real product built by a strong design and engineering team. This repository is an ambitious small-game project: a playable modern reinterpretation with stronger AI-era semantics, cleaner code structure, and a roadmap toward a more polished game.

Vision

The ambition for this repo is straightforward:

build a Semantris-like arcade game that feels modern instead of prototype-grade
use current LLMs where semantic ranking actually adds value
preserve the clarity and immediacy of the original idea
make the codebase maintainable enough for future contractors or contributors to extend safely
improve both the game feel and the engineering quality at the same time

This is intentionally both a game project and a software-structure project.

Current Modes

The game now ships with three playable modes, Version Code "0.3" :

Iteration Mode

The original tower-based arcade variant:

the game starts with a tower of words and one highlighted target word
the player enters a clue
the ranking engine orders the visible words from most related to least related
the tower is displayed so the bottom-most word is the most correlated result
the bottom four words form the destruction zone
if the target lands in that zone, the target and the words between it and the zone boundary are removed
score increases by the number of removed words
new words drop in from the top
the session tracks time, score, turns, and vocabulary progress

Restriction Mode

A harder tower variant where every clue must also obey a rotating rule:

the tower and target behave like Iteration Mode
an active rule is shown above the board
your clue must both satisfy the rule and still semantically pull the target into the destruction zone
if the clue passes the rule, the tower resolves like Iteration Mode
if the clue fails the rule, you take a strike and penalty words are inserted at the bottom of the tower
the run ends if you reach 3 strikes or a penalty insertion pushes the target out of the tower
the active rule rotates every 10 turns
some successful rule-compliant turns award a score multiplier bonus

Blocks Mode

A separate grid-based chain reaction mode:

the board is an 8 x 10 grid with up to 32 occupied cells at a time
you type a clue and the system picks the single best matching word as the primary hit
nearby words are scored for how strongly they relate to the clue
any orthogonally connected neighbor scoring 75 or higher can join the chain
the chain keeps expanding outward through qualifying neighbors
all chained words are removed together
score grows by combo size using an accelerating formula, starting at 10 points for a one-word clear
words above fall downward, and new words refill empty slots from the top while unseen vocabulary remains
the run ends in a win when the unseen pool is exhausted and the board has been fully cleared

How To Play

Starting a run

Run the app locally.
Open the landing page.
Choose a vocabulary pack.
Launch Iteration Mode, Restriction Mode, or Blocks Mode.

Playing Restriction Mode

Read the active rule before typing.
Enter a clue that obeys the rule and points toward the highlighted target word.
Submit the clue.
If the clue passes, the ranked tower resolves like normal tower play.
If the clue fails, you take a strike and extra penalty words are added to the bottom.
Survive the rotating rules and clear the tower before hitting 3 strikes.

Tips:

Shorter clues are often easier to keep rule-compliant.
Local-format rules are exact, so wording details matter.
A safe clue that misses is usually better than an illegal clue that adds a strike.

Playing Blocks Mode

Look for a small cluster of words that could all plausibly answer the same clue.
Enter one clue for that cluster.
The system chooses a primary word first.
The chain then spreads through adjacent words that also match strongly enough.
Cleared words disappear together, gravity pulls columns downward, and new words spawn in.
Repeat until the unseen pool is empty and the board is cleared.

Tips:

Think in connected neighborhoods, not isolated words.
A clue that strongly matches one word but weakly matches its neighbors usually produces only a short clear.
Broad category clues can be useful, but the best clears usually come from tight local themes.

Why LLMs Here

Modern LLMs are not deterministic ranking machines, and that matters. Even at low temperature, semantic ordering can still vary.

That said, for short clue-and-word ranking tasks, modern models are strong enough to make this design space genuinely fun again. Their broader world knowledge also makes the game more flexible across themed vocab packs and future content expansions.

This repo supports two remote ranking modes: Gemini through Google’s Gen AI SDK, and an OpenAI-compatible mode through the openai Python client. The selected provider is chosen at startup, and the game still includes a local heuristic fallback so the app remains playable if the configured model path fails.

Project Status

Current status: v0.2.4 active prototype with a modular Flask backend, a typed frontend client, openai - Gemini Dual Capatable and a dual-theme UI system with manual light/dark switching.

What is already in place:

modularized gameplay logic instead of one monolithic server file
explicit JSON API for session state and turns
three playable game modes with shared pack selection
no-repeat word handling until the unseen pool is exhausted
improved tower presentation and animation sequencing
a TypeScript frontend source tree compiled into a browser bundle
a shared theme controller with manual light/dark mode switching across the landing page and the game
a polished light mode plus a flatter, surface-led dark mode tuned for the playfield
frontend type-checking and Vitest coverage for key browser-side logic
fallback ranking path for resilience
automated tests for gameplay rules, API behavior, and provider fallback behavior

What is still unfinished:

final game feel polish
richer end-of-run UX
broader frontend test coverage across full interaction flows
leaderboard or persistence systems
stronger fallback ranking quality
model selection tuning across Gemini and OpenAI-compatible providers

Architecture

This repository is now structured around clear responsibilities instead of mixing UI, session state, LLM calls, build concerns, and game rules in a single file.

Runtime flow

Flask serves the landing page and game HTML shells.
The browser applies the stored or system theme and loads the compiled TypeScript bundles.
The game frontend loads current session state from the JSON API.
The player submits a clue.
The backend sends the visible board to the ranking provider.
The ranking result is validated and converted into board mutations.
The frontend animates reorder, removal, collapse, spawn events, and end-of-run UI state.

Key design choices

Gameplay rules are isolated so they can be tested without the web app.
LLM interaction is isolated so provider changes do not require rewriting the game loop.
The interactive frontend now lives in dedicated TypeScript modules compiled into served browser bundles.
Session state is explicit so the frontend can render the game from stable API payloads.
The frontend build and validation steps are small but formalized so browser code can evolve safely.
Theme state is handled in the frontend so both pages stay visually consistent while still respecting system preference by default.

Repository Structure

SemantrisPlus/
├── app.py                 # Flask app, route wiring, session serialization
├── game_logic.py          # Iteration-mode board/session rules
├── game_logic_restriction.py # Restriction-mode rules, strikes, and rule rotation
├── game_logic_blocks.py   # Blocks-mode grid, gravity, and chain resolution
├── llm_client.py          # Provider integration, validation, and fallback ranking
├── brief.md               # Contractor-facing project brief and roadmap
├── GeminiMoving.md        # Migration evaluation and decision record
├── README.md              # Project overview and setup
├── requirements.txt       # Python dependencies
├── .env.example           # Environment variable template
├── assets/                # Vocabulary packs
│   ├── aviation_1.txt
│   ├── basic_vocab.txt
│   ├── general_1.txt
│   ├── lite_1.txt
│   └── restriction_rules.json
├── PRDs/
│   ├── README.md          # Versioned iteration workflow
│   └── v0.3/
│       ├── v0.3.md        # Current version-scoped PRD packet
│       └── v0.3-demo.html # Static design demo for the iteration
├── docs/
│   ├── PRD.md             # Foundation product requirements document
│   ├── V0.1.md            # Structural cleanup release note
│   ├── V0.2.md            # Frontend TypeScript migration note
│   ├── 2newmodes.md       # Mode design notes
│   └── 2newmodes_technical.md # Implementation brief for the new modes
├── frontend/
│   └── src/               # TypeScript source for all interactive game clients
├── package.json           # Frontend scripts and dependencies
├── tsconfig.json          # TypeScript compiler configuration
├── vitest.config.ts       # Frontend test configuration
├── scripts/
│   └── build-frontend.mjs # esbuild entry for browser bundle output
├── static/
│   ├── css/app.css        # Visual system and layout styling
│   └── js/                # Compiled browser bundles served by Flask
├── templates/
│   ├── arcade.html        # HTML shell for Iteration Mode
│   ├── restriction.html   # HTML shell for Restriction Mode
│   ├── blocks.html        # HTML shell for Blocks Mode
│   └── home.html          # HTML shell for the landing page
├── testing/
│   └── api_latency.py     # Optional provider latency experiment
└── tests/
    ├── test_app.py        # API contract tests
    ├── test_game_logic.py # Gameplay rule tests
    ├── test_game_logic_restriction.py # Restriction-mode rule tests
    ├── test_game_logic_blocks.py # Blocks-mode grid and chain tests
    └── test_llm_client.py # Provider selection and fallback tests

Tech Stack

Python
Flask
Jinja templates
TypeScript for the interactive browser views
esbuild for frontend bundling
custom CSS
system-aware light/dark theming with manual override
Google Gemini API via the Google Gen AI SDK
OpenAI-compatible model access via the OpenAI Python client
unittest for automated tests
Vitest for frontend unit and DOM tests

Getting Started

1. Install dependencies

python3 -m pip install -r requirements.txt
npm install

2. Configure environment variables

Create a .env file in the project root based on the starter template is also available at .env.example.

3. Run the app

npm run build
python3 app.py

Open http://127.0.0.1:5001, then launch Iteration Mode from the landing page.

Light and dark mode are both available. The UI will follow the system theme by default, and the top-right toggle on the landing page and game page lets you override it manually.

4. Rebuild the frontend after TypeScript changes

If you change anything under frontend/src/, rebuild the browser bundle before running or testing the app:

npm run build

Running A Session

For a normal local play session:

npm run build
python3 app.py

For a validation pass before or after changes:

npm run check:frontend
npm run test:frontend
python3 -m unittest discover -s tests

Configuration

Vocabulary packs

Vocabulary packs are plain newline-separated .txt files under assets/.

The main webpage automatically populates the vocabulary-pack dropdown by scanning the assets/ directory for .txt files, so adding a new pack there makes it available in the UI.

Current included packs:

assets/general_1.txt
assets/lite_1.txt
assets/basic_vocab.txt
assets/aviation_1.txt

If you want to change the default pack shown on startup, update this parameter in app.py:

DEFAULT_VOCAB_FILE = ASSETS_DIR / "aviation_1.txt"

You can also override the startup default with an environment variable:

SEMANTRIS_VOCAB_FILE="assets/aviation_1.txt"

Ranking provider

Choose the active remote provider with:

SEMANTRIS_LLM_PROVIDER="gemini"

Supported values:

gemini
openai

When gemini mode is active, the backend uses Google’s supported google-genai client and requests structured JSON output with schema validation.

When openai mode is active, the backend uses the openai Python client and can target either OpenAI itself or any OpenAI-compatible endpoint through OPENAI_BASE_URL.

Only one remote provider is active per process. The app does not fail over from one remote provider to the other at runtime.

If the configured remote provider is unavailable(timed out), fails validation checks, or cannot initialize, the backend falls back to a deterministic local heuristic ranker so the session does not hard-fail.

This fallback is intentionally simple. It is a resilience feature, not a semantic replacement for the primary model.

Development

Run tests

npm run check:frontend
npm run test:frontend
python3 -m unittest discover -s tests

Frontend commands

npm run build: compile the TypeScript frontend into the browser bundles served by Flask
npm run check:frontend: run TypeScript type-checking without emitting files
npm run test:frontend: run frontend unit and DOM tests with Vitest

Supporting documents

PRDs/README.md: repeatable version-folder workflow for major iterations
PRDs/v0.3/v0.3.md: current version-scoped iteration PRD
PRDs/v0.3/v0.3-demo.html: static visual reference for the v0.3 pass
docs/PRD.md: stable product direction, scope, and engineering guardrails
brief.md: product brief for future contractors
docs/V0.1.md: implementation note for the structural cleanup release
docs/V0.2.md: implementation note for the frontend TypeScript migration
GeminiMoving.md: migration evaluation and recommendation memo

Code quality goals

This repo is aiming for a small but professional standard:

clear file ownership
testable game rules
readable API contracts
documented architecture
controlled session state
graceful failure behavior

Known Limitations

LLM ranking is probabilistic, so some rounds will feel less stable than deterministic puzzle logic
the fallback ranker is much weaker than Gemini
animation quality is improved but still not at final production polish
there is no persistent profile, save system, or leaderboard yet
the current set of modes is still an early structured version of the larger idea

Roadmap

Near-term priorities:

improve end-of-run states and summaries
strengthen visual polish and motion design
add richer difficulty and session options
improve fallback ranking quality
evaluate whether gemini-3.1-flash-lite feels better than gemini-2.5-flash-lite for ranking quality

Longer-term ideas:

seeded challenge mode
daily runs
local leaderboard support
theme-aware packs and presentation
additional Semantris-inspired game modes

Contributing

Pull requests and experiments are welcome across:

gameplay tuning
UI and animation polish
prompt engineering
provider integrations
fallback ranking strategies
vocabulary packs
tests and documentation

If you are extending the codebase structurally, start with docs/PRD.md, brief.md, docs/V0.1.md, and docs/V0.2.md so the product and architecture direction stay consistent.

Notes

This project is inspired by Semantris, but it is an independent fan reimagining.
Free-tier API access from Google AI Studio is usually enough for local experimentation.
The project is intentionally small in scope today, but it is being shaped like a repo that can scale cleanly.

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
PRDs		PRDs
assets		assets
docs		docs
frontend/src		frontend/src
scripts		scripts
static		static
templates		templates
testing		testing
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.impeccable.md		.impeccable.md
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
GeminiMoving.md		GeminiMoving.md
LLM_DEI.md		LLM_DEI.md
README.md		README.md
app.py		app.py
brief.md		brief.md
game_logic.py		game_logic.py
game_logic_blocks.py		game_logic_blocks.py
game_logic_restriction.py		game_logic_restriction.py
llm_client.py		llm_client.py
package-lock.json		package-lock.json
package.json		package.json
requirements.txt		requirements.txt
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Folders and files

Latest commit

History

Repository files navigation

Semantris Plus

Overview

Vision

Current Modes

Iteration Mode

Restriction Mode

Blocks Mode

How To Play

Starting a run

Playing Restriction Mode

Playing Blocks Mode

Why LLMs Here

Project Status

Architecture

Runtime flow

Key design choices

Repository Structure

Tech Stack

Getting Started

1. Install dependencies

2. Configure environment variables

3. Run the app

4. Rebuild the frontend after TypeScript changes

Running A Session

Configuration

Vocabulary packs

Ranking provider

Development

Run tests

Frontend commands

Supporting documents

Code quality goals

Known Limitations

Roadmap

Contributing

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages