Skip to content

The CGI2Real_Multi-Class_Image_Classifier categorizes humans, horses, or both using transfer learning from Inception CNN. Trained on synthetic images, it can also classify real ones.

License

Notifications You must be signed in to change notification settings

razamehar/CGI2Real-Multi-Class-Image-Classifier

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 

Repository files navigation

CGI2Real Multi-Class Image Classifier

Project Overview

The CGI2Real_Multi-Class_Image_Classifier is an image classification model designed to classify images into three categories: humans, horses, and a combination of both. It utilizes transfer learning, a popular technique in deep learning, leveraging the architecture and pre-trained weights of the Inception Convolutional Neural Network model. Trained on synthetic images, it can also classify real ones.

Synthetic Images

Synthetic images of horses
Synthetic images of humans
Synthetic images of both horses and humans

Project Inspiration

The inspiration for this project stemmed from Laurence Moroney's binary classifier, which was trained on synthetic images of humans and horses. Building upon this concept, I aimed to utilize a different dataset of computer-generated images and transitioned the classifier from binary to multi-class. This adaptation enabled the classification of images containing both humans and horses.

Training Process

The training process involves fine-tuning the parameters of the Inception model on the three distinct CGI image datasets. These datasets are carefully curated and collected to represent scenarios featuring humans, horses, and both entities coexisting within the same image. The decision to use CGI images instead of real images stems from the practical challenges associated with acquiring large volumes of real data, particularly in scenarios involving specific subjects like horses. While real images offer authenticity, the synthetic nature of CGI images provides scalability and flexibility, enabling the creation of diverse training sets without constraints.

Model's Performance

The model's performance metrics are highly promising, with a training accuracy of 99% indicating its proficiency in learning from the training data. Furthermore, the validation accuracy of 92% underscores its ability to generalize well to unseen data, demonstrating robust classification capabilities. Despite the impressive results, there remains room for improvement, particularly in the exploration of integrating real images into the training pipeline.

Performance Evaluation: Base Model vs. Model fine-tuned with Inception Convolutional Neural Network

Base Model with Transfer Learning
base model
Model fine-tuned with Inception V3
model with transfer learning

Potential Improvements

Future iterations of the model could benefit from hybrid approaches that combine vast amount of both synthetic and real data, potentially enhancing its adaptability to real-world scenarios and further improving classification accuracy.

Data Sources:

Model and Weights for Usage

The finalized model and its weights are available for free download and unrestricted use.

License:

This project is licensed under the Raza Mehar License. See the LICENSE.md file for details.

Contact:

For any questions or clarifications, please contact Raza Mehar at [[email protected]].

About

The CGI2Real_Multi-Class_Image_Classifier categorizes humans, horses, or both using transfer learning from Inception CNN. Trained on synthetic images, it can also classify real ones.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published