Skip to content

powerful data preprocessing application that simplifies the task of preparing data for machine learning models.

Notifications You must be signed in to change notification settings

SoufiyaneOuali/Data_Quality_App

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Data_Quality_App

  • I have developed a comprehensive data preprocessing and manipulation application that streamlines the data preparation phase before applying machine learning models. This application provides a range of essential data preprocessing functionalities, including handling duplicate values and missing data.

  • One of the key features of the application is the utilization of advanced techniques like k-nearest neighbors (KNN) and rule-based methods (RL) to address missing values. These methods intelligently impute missing data, ensuring the integrity and quality of the dataset.

  • Furthermore, the application offers a user-friendly interface that allows users to interactively explore and manipulate their data. It provides intuitive options for data cleaning, feature selection, and transformation, empowering users to optimize their datasets for better model performance.

  • With this data preprocessing application, users can efficiently handle duplicate values, identify and handle missing data, and perform various data manipulation tasks. By automating these processes, it accelerates the data preprocessing phase and enables users to focus more on building robust machine learning models.

here are some image of the application

1- we can import and see our data

Capture22

2- we can also display duplicate value

3

3- we can also display missing value

4

4- we can simlpy use to delete a row or a varibble or also deleting just the duplicated rows

5

5- or we can choose to use the advanced model such as Random forest or KNN to handle our data from encoding to filling the missing value

6

6- here are the result of the trained model we can train many model and compare between them to choose the best one

7

the main object of this project is enables users to focus more on building robust machine learning models .

PS : just download the code and excute it , you will need an IDE such as spyder (i reccomend this one)

Support My Work

If this project inspired you, gave you ideas or helped you, please consider giving me a Star and share the project with others ❤️.

About

powerful data preprocessing application that simplifies the task of preparing data for machine learning models.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages