Skip to content

Lucene Keyword index#58

Merged
kylebd99 merged 58 commits intomainfrom
lucene-keyword-index
Mar 3, 2026
Merged

Lucene Keyword index#58
kylebd99 merged 58 commits intomainfrom
lucene-keyword-index

Conversation

@albert-du
Copy link
Collaborator

@albert-du albert-du commented Jan 18, 2026

Implements a pylucene based keyword index and introduces a dev container first workflow.

CONTRIBUTOR.md updated to reflect workflow with dev container.

We build a base image based off python-3.11.14-trixie (Debian stable) that contains python, pylucene, and poetry. The base image is pushed to GitHub Container Register (ghcr) by a new actions workflow. Pushes to main result in a image tagged with the python, pylucene, and poetry versions.

We extend the base image to produce a built image that includes the all of the app code. This can be launched either as the server, or to run embedding scripts. A docker compose sample is included to demonstrate server capabilities.

The base image is also now used in the CI pipeline, to support testing the pylucene keyword index.

Finally, we extend the base image to create a dev container image that VS Code can attach to, with the same versioning and configuration as the CI pipeline and the built image.

Development without using a dev container is unaffected. The poetry environments are kept separate, and there are no other changes. Attempting to use lucene outside of the dev container causes an exception, so some tests will fail, but otherwise the original developer experience is unaffected.

A second PR will be necessary after this one is merged and the ci/cd pipeline completes to remove the branch name from all of the base image references. Currently they point back to images built on this working branch, they will be swapped for main, once a main image is built.

I combined the existing two embedding and server dockerfiles into a single primary dockerfile. There's no reason to build two basically identical images as we can specify the entrypoint when starting the container.

@albert-du albert-du marked this pull request as ready for review March 2, 2026 01:04
@albert-du albert-du requested a review from kylebd99 March 2, 2026 01:04
@kylebd99 kylebd99 merged commit 98028f3 into main Mar 3, 2026
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants