This repository contains notebook to fine-tune DeepSeek R1 8B model using Unsloth. Unsloth allows for faster and more memory-efficient fine-tuning of LLMs.
The notebooks/ directory contains the following workflows:
medical_finetuning_using_hf_unsloth.ipynb: Focuses on QLoRA (Quantized LoRA) techniques in 4-bit quantization for efficient fine-tuning.
- Python 3.10+
- CUDA-enabled GPU (recommended for Unsloth)
- Install Jupyter VS Code extension (if not installed):
- Launch Jupyter Lab or Notebook:
jupyter lab
- Open one of the notebooks in the
notebooks/directory and follow the steps.
-
Prepare Secrets for notebook:
-
HF_TOKEN– Required for HuggingFace login. -
WANDB_TOKEN– Required for Weights & Biases (W&B) logging. Optional if you do not want training charts. -
Execute all runs and wait until training is complete:
Contributions welcome — open an issue or submit a PR.
See the LICENSE file for details.

