End-to-end text summarization project that fine-tunes PEGASUS (`google/pegasus-cnn_dailymail`) on the SAMSum dataset and serves predictions through a FastAPI API.
Updated Feb 13, 2026 - Jupyter Notebook
LLM post-training: RLHF, PPO, DPO, etc.
This paper studies prompt robustness and ambiguity handling for small instruction-tuned LLMs (Qwen2.5-1.5B/3B) in educational tutoring. It evaluates corruption-augmented supervised fine-tuning on GSM8K and DPO in two roles: (i) improving robustness of math reasoning under noisy prompts, (ii) inducing clarification-seeking behavior on ambiguous prompts