llmfinetuning

Here are 4 public repositories matching this topic...

Nihal108-bi / TextSummrizer

End-to-end text summarization project that fine-tunes PEGASUS (`google/pegasus-cnn_dailymail`) on the SAMSum dataset and serves predictions through a FastAPI API.

python aws fine-tuning textmodel llm llmfinetuning

Updated Feb 13, 2026
Jupyter Notebook

jeosol / llm-post-training

Star

LLM Post-Training, RLHF, PPO, DPO, etc

post-training ppo dpo post-training-quantization llm rlhf llmalignment post-training-learning llmfinetuning

Updated Apr 10, 2026
Jupyter Notebook

SivakrishnaManoj / Fine-Tuning-LLMs-using-DPO-for-Prompt-Robustness-in-Educational-Setting

Star

This paper studies prompt robustness and ambiguity handling for small instruction-tuned LLMs (Qwen2.5-1.5B/3B) in educational tutoring. It evaluates corruption-augmented supervised fine-tuning on GSM8K and DPO in two roles: i augmenting robustness for math reasoning under noisy prompts, ii inducing clarification-seeking behavior on ambiguous prompt

reinforcement-learning lora sft dpo llm gsm8k llm-training qlora llm-evaluation llmfinetuning promptbench

Updated Mar 27, 2026
Python

osamaaltaf-pk / LLMs-Unsloth

Star

LLM fine-tuning pipeline using Unsloth, LoRA and QLoRA for efficient domain-specific training

lora llms llmops unsloth q-lora llmfinetuning

Updated Feb 19, 2026
Jupyter Notebook

Improve this page

Add a description, image, and links to the llmfinetuning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llmfinetuning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llmfinetuning

Here are 4 public repositories matching this topic...

Nihal108-bi / TextSummrizer

jeosol / llm-post-training

SivakrishnaManoj / Fine-Tuning-LLMs-using-DPO-for-Prompt-Robustness-in-Educational-Setting

osamaaltaf-pk / LLMs-Unsloth

Improve this page

Add this topic to your repo