Chest X-Ray Classification

This project classifies chest X-ray images into two categories: Normal and Pneumonia. The project involves data augmentation, handling class imbalance, model training, evaluation, and inference.

Dataset

The dataset used for this project consists of chest X-ray images categorized into two classes: Normal and Pneumonia. The dataset can be downloaded from: Chest X-Ray Dataset.

Installation

Requirements To install the required libraries, use the requirements.txt file provided in the repository. Run the following command: pip install -r requirements.txt

Usage

Download and Extract Dataset: Download the dataset from the link provided above and extract it to your desired location. Make sure the directory structure looks like this:

dataset/
- train/
  - NORMAL/
  - PNEUMONIA/
- val/
  - NORMAL/
  - PNEUMONIA/
- test/
  - NORMAL/
  - PNEUMONIA/

In main you will find the necessery code. Download all the files in main to a specific directory on your local drive. Open config.yaml and specify the directory of your data. You can also modify other parameters in the config file and choose a specific model to train. After saving the configuration. run the run.py file python run run.py . Results will be saved in the same directory as the code.

Config File:

The config.yaml file contains the configuration for the training process. Below is an example configuration:

model_name: ResNet50Binary # Specify the model name (e.g., MobileNetV3Binary, VGG16Binary)
train_dir: path/to/train
val_dir: path/to/val
test_dir: path/to/test
batch_size: 32
num_epochs: 10
learning_rates:
- 0.001
- 0.0001
patience_values:
- 3
- 4
use_k_fold: false # Enable or disable k-fold cross-validation
k_folds: 3
use_random_seed: true # Specify whether to use a random seed
seed: 42 # This will be ignored if use_random_seed is true

Training:

To start the training process, run the run.py script from the main directory: python run.py This will train the model using the specified configuration and save the best model based on validation loss.

Evaluation:

After training, the script evaluates the best model on the test set and logs the results. The evaluation includes accuracy, precision, recall, F1-score, and AUC-ROC. It also plots the accuracy over epochs and the confusion matrix.

Code Structure

data_loader.py: Handles data loading, transformations, and class distribution plotting.
model.py: Contains the model architectures.
train.py: Handles the training process and early stopping.
test.py: Handles model evaluation on the test set.
plot_utils.py: Contains functions for plotting accuracies and confusion matrices.
logger.py: Logs the final results to a text file.
config.yaml: Configuration file for the training process.
run.py: Main script to run the training and evaluation pipeline.

Inference

The inference mode allows you to use a trained model to make predictions on new chest X-ray images. The script run_inference.py provides an interactive GUI for selecting the model and image files. Already trained models can be found on Google drive: https://drive.google.com/drive/folders/1Zi6nQfpwh61HBU8X8J_CKt8hH6irvJSm?usp=sharing

Inference Mode Instructions

Ensure you have a trained model saved in the .pth format.
Insure to select the right model in the config.yaml
Run the run_inference.py script
A file dialog will open to select the saved model file (*.pth).
Another file dialog will open to select one or more chest X-ray images for inference.
The script will display the predicted class (Normal or Pneumonia) along with the probability for each selected image.

Customization

Adding New Models

To add a new model, define the model architecture in model.py and specify the model name in the config.yaml file.

Data Augmentation

Data augmentation techniques such as random rotations, flips, and histogram equalization are applied in data_loader.py. You can customize these transformations in the get_transforms function.

Early Stopping

Early stopping is implemented in train.py with configurable patience values specified in the config.yaml file.

K-Fold Cross-Validation

K-fold cross-validation is implemented in data_loader.py. You can enable it by setting use_k_fold to true in config.yaml and specifying the number of folds with k_folds.

Results

The script logs the best hyperparameters and evaluation metrics. It also saves the best model with the filename format best_model_<model_name>.pth. Furthermore a confussion matrix and an accuracy plot will be created after each training. Example for a resulted confussion matrix:

Contributing

The members.txt contains the names of students contributed in the project. Feel free to open issues or submit pull requests if you want to contribute to this project.

Hardware and Environment

The training was done on the folloing hardware:

Platform: Windows-11-10.0.22631-SP0
Processor: AMD64 Family 25 Model 33 Stepping 0, AuthenticAMD
RAM: 31.93 GB
CUDA Available: True
CUDA Device Count: 1
CUDA Device Name: NVIDIA GeForce RTX 3070

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Chest X-Ray Classification

Dataset

Installation

Usage

Config File:

Training:

Evaluation:

Code Structure

Inference

Inference Mode Instructions

Customization

Adding New Models

Data Augmentation

Early Stopping

K-Fold Cross-Validation

Results

Contributing

Hardware and Environment

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
Dev		Dev
Main		Main
README.md		README.md
members.txt		members.txt
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Chest X-Ray Classification

Dataset

Installation

Usage

Config File:

Training:

Evaluation:

Code Structure

Inference

Inference Mode Instructions

Customization

Adding New Models

Data Augmentation

Early Stopping

K-Fold Cross-Validation

Results

Contributing

Hardware and Environment

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages