
llama integration with exception safety #2

Open
reeshabh90 wants to merge 13 commits into as-ascii:master from reeshabh90:llama-integration

Conversation

@reeshabh90

Features implemented:

  1. Llama.cpp integration as one of the engines in the docwire SDK.
  2. Ensured exception safety for the llama_runner class.
  3. Added a configurable model load/unload feature, which gives the SDK the
     option to either unload the model after pipeline usage or keep it
     persistent for the next use.
  4. Added files for local summarize and translate:
class DOCWIRE_LOCAL_AI_EXPORT local_translate : public model_chain_element
{
public:
explicit local_translate(const std::string& language, std::shared_ptr<ai_runner> runner);
Owner


I think we can have some default model runner to simplify things if the user does not care.

Author


For a default runner, we could redirect the prompt to the ct2 runner; otherwise, let's keep this as-is for now.

Once we have decided which models to use for each specific task, we can decide which default model runner to use when the user does not care.

Owner


Yes, it should use the default model that we choose for the particular task.

},
"local-ai":
{
"description": "Enable local AI runtime (ctranslate2 + sentencepiece)",
Owner


I think that local-ai should disable everything: the llama engine as well as embeddings. If it covers only ctranslate2, then maybe this feature should rather be called "ctranslate2", "ct2runner", or "local-ai-ct2" - something like that.

Author


Yes, it should ideally enable or disable everything.

"multilingual-e5-small-ct2-int8"
]
},
"text-gen":
Owner


There is some inconsistency between the "text-gen" feature and the "llm-qwen" feature. Both enable/disable a single LLM model, but the naming is different.

Author


Moved text-gen into local-ai only.

"flan-t5-large-ct2-int8"
]
},
"llm":
Owner


"llm" does not mean "local" (it can be the OpenAI API or Gemini API as well), so maybe a better name would be something like "llama-engine" or "local-ai-llama", something like that.

Author


Agreed. Renamed it.

set(MODEL_NAME "qwen2-7b-instruct")
set(MODEL_QUANT "q4_k_m")

set(MODEL_FILE "${MODEL_NAME}-${MODEL_QUANT}.gguf")
Owner


If MODEL_QUANT is significant and there is more than one quantization on Hugging Face, then the port name should probably follow.

Author


Yes, there is more than one. I will make amendments.

// Use a practical epsilon for the squared norm to check for zero vectors.
// This threshold is aligned with the one used for L2 normalization in
// c2t_runner.cpp (1e-6f). The squared value is 1e-12.
// ct2_runner.cpp (1e-6f). The squared value is 1e-12.
Owner


OK, this is an unexpected topic not to forget: cosine similarity should work with other engines as well. We need to check whether this is something specific to ct2_runner or whether it is the same for Llama.cpp.

{
"description": "Enable local AI runtime (ctranslate2 + sentencepiece)",
"dependencies": [
"ctranslate2",
Owner


If a dependency is disabled, the code will not compile correctly. We need some additional (small) code in portfile.cmake to support a feature, and similar code in CMake to conditionally disable, for example, building the docwire_local_ai library.
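The wiring this comment asks for could look roughly like the following. This is a hedged sketch, not DocWire's actual build files: the CMake option name `DOCWIRE_WITH_LOCAL_AI` and the `local_ai` subdirectory are assumptions; `vcpkg_check_features` is the standard vcpkg helper for mapping features to CMake options.

```cmake
# portfile.cmake: translate vcpkg features into CMake options (names illustrative)
vcpkg_check_features(OUT_FEATURE_OPTIONS FEATURE_OPTIONS
    FEATURES
        local-ai DOCWIRE_WITH_LOCAL_AI
)
vcpkg_cmake_configure(
    SOURCE_PATH "${SOURCE_PATH}"
    OPTIONS ${FEATURE_OPTIONS}
)

# CMakeLists.txt: skip the library entirely when the feature is off
if(DOCWIRE_WITH_LOCAL_AI)
    add_subdirectory(local_ai) # builds docwire_local_ai
endif()
```

With this shape, disabling the local-ai feature removes both the ctranslate2/sentencepiece dependencies and the code that would fail to compile without them.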
