E-gonito/dolphin3-optimisation

Optimising Dolphin 3

This guide explains how to create and run an optimised version of the Dolphin 3 model using Ollama.
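`ollama create` builds a model from a Modelfile. As a minimal sketch of what such a file might contain, the snippet below writes an example to `Modelfile.example`. The base tag `dolphin3`, the file name, and the parameter values are assumptions for illustration, not this repository's actual settings.

```shell
# Hypothetical Modelfile: the base model tag and the parameter values
# are illustrative assumptions, not the repository's real configuration.
cat > Modelfile.example <<'EOF'
FROM dolphin3
PARAMETER num_ctx 4096
PARAMETER temperature 0.7
EOF

# Build a model from it (requires a running Ollama server):
# ollama create dolphin-optimised -f Modelfile.example
```

A smaller `num_ctx` reduces the memory the KV cache needs, which is usually the first setting to lower on constrained hardware.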

Setup Commands

# Create the optimised model from the Modelfile in the current directory
ollama create dolphin-optimised -f Modelfile

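# Start the server with flash attention and an f16 KV cache
# (stop any Ollama instance that is already running so these settings take effect)
OLLAMA_FLASH_ATTENTION=true OLLAMA_KV_CACHE_TYPE=f16 ollama serve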

# Run the optimised model in a second terminal (--verbose prints timing statistics)
ollama run --verbose dolphin-optimised:latest

# Or use Open WebUI instead (Docker must be running); the UI is then served at http://localhost:3000
docker run -d -p 3000:8080 --add-host=host.docker.internal:host-gateway -v open-webui:/app/backend/data --name open-webui --restart always ghcr.io/open-webui/open-webui:main
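To get a feel for what `OLLAMA_KV_CACHE_TYPE` controls, the sketch below estimates KV cache memory from the model shape. The helper `kv_cache_bytes` is hypothetical. The shape of 32 layers, 8 KV heads, and head dimension 128 assumes a Llama-3.1-8B-style model, so check the actual model's metadata before relying on the numbers.

```shell
# Rough KV cache size in bytes:
#   2 (keys and values) x layers x KV heads x head dim x context length x bytes per element
# Hypothetical helper for illustration; not part of Ollama.
kv_cache_bytes() {
  local layers=$1 kv_heads=$2 head_dim=$3 ctx=$4 bytes_per_elem=$5
  echo $(( 2 * layers * kv_heads * head_dim * ctx * bytes_per_elem ))
}

# Assumed shape: 32 layers, 8 KV heads, head dimension 128.
# f16 stores 2 bytes per element.
kv_cache_bytes 32 8 128 8192 2   # prints 1073741824 (1 GiB at an 8192-token context)
kv_cache_bytes 32 8 128 4096 2   # prints 536870912 (512 MiB at a 4096-token context)
```

According to the Ollama documentation, `f16` is the default cache type and `q8_0` uses roughly half the memory. On lower-spec hardware, that and a shorter context length are the two main ways to shrink the cache.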

Notes

About

Repository for optimising the Dolphin 3 LLM to run on lower-spec hardware with F16 KV Cache Type
