Skip to content

surafiel-habib/Kidney-Stone-Detection-from-CT-Scan-Images-using-a-Hybrid-of-Vision-Transformer-and-YOLOv8

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 

Repository files navigation

Kidney-Stone-Detection-from-CT-Scan-Images-using-a-Hybrid-of-Vision-Transformer-and-YOLOv8

Hybrid deep learning repository for kidney stone detection from CT scan images using a Vision Transformer backbone with YOLOv8 style detection and ablation experiments for backbone size, augmentation, input resolution, and loss weights.

This repository contains the code and dataset structure for kidney stone detection from CT scan images using a hybrid Vision Transformer and YOLOv8 based approach. The method is built around ViTBackedYOLOv8, which replaces the standard YOLOv8 convolutional backbone with a pretrained Vision Transformer so that global context and local detection cues are combined in one framework. The manuscript reports that the dataset contains 1,300 annotated CT images, expanded to 5,486 samples through augmentation, and that the proposed model achieved strong detection results on the test set, including mAP@0.5 of 0.9878, precision of 0.9904, recall of 0.9883, F1-score of 0.983, and accuracy of 98.3%.

Project Structure

Kidney Stone Detection from CT Scan Images using a Hybrid of Vision Transformer and YOLOv8/
├── code/
│   ├── ViTBackedYOLOv8.ipynb
├── data/
│   ├── train/
│   │   ├── images/
│   │   └── labels/
│   ├── valid/
│   │   ├── images/
│   │   └── labels/
│   └── test/
│       ├── images/
│       └── labels/
└── README.md

## Dataset Source

The dataset used in this work is publicly available on Kaggle:

https://www.kaggle.com/datasets/safurahajiheidari/kidney-stone-images

The dataset contains CT scan images with bounding box annotations and is organized in train, valid, and test splits, each with `images` and `labels` folders.

About

Hybrid deep learning repository for kidney stone detection from CT scan images using a Vision Transformer backbone with YOLOv8 style detection and ablation experiments for backbone size, augmentation, input resolution, and loss weights.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors