Kidney-Stone-Detection-from-CT-Scan-Images-using-a-Hybrid-of-Vision-Transformer-and-YOLOv8

Hybrid deep learning repository for kidney stone detection from CT scan images using a Vision Transformer backbone with YOLOv8 style detection and ablation experiments for backbone size, augmentation, input resolution, and loss weights.

This repository contains the code and dataset structure for kidney stone detection from CT scan images using a hybrid Vision Transformer and YOLOv8 based approach. The method is built around ViTBackedYOLOv8, which replaces the standard YOLOv8 convolutional backbone with a pretrained Vision Transformer so that global context and local detection cues are combined in one framework. The manuscript reports that the dataset contains 1,300 annotated CT images, expanded to 5,486 samples through augmentation, and that the proposed model achieved strong detection results on the test set, including mAP@0.5 of 0.9878, precision of 0.9904, recall of 0.9883, F1-score of 0.983, and accuracy of 98.3%.

Project Structure

Kidney Stone Detection from CT Scan Images using a Hybrid of Vision Transformer and YOLOv8/
├── code/
│   ├── ViTBackedYOLOv8.ipynb
├── data/
│   ├── train/
│   │   ├── images/
│   │   └── labels/
│   ├── valid/
│   │   ├── images/
│   │   └── labels/
│   └── test/
│       ├── images/
│       └── labels/
└── README.md

## Dataset Source

The dataset used in this work is publicly available on Kaggle:

https://www.kaggle.com/datasets/safurahajiheidari/kidney-stone-images

The dataset contains CT scan images with bounding box annotations and is organized in train, valid, and test splits, each with `images` and `labels` folders.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
code		code
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kidney-Stone-Detection-from-CT-Scan-Images-using-a-Hybrid-of-Vision-Transformer-and-YOLOv8

Project Structure

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Kidney-Stone-Detection-from-CT-Scan-Images-using-a-Hybrid-of-Vision-Transformer-and-YOLOv8

Project Structure

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages