Skip to content

selvan-01/Encoders-in-Machine-Learning

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation


πŸš€ Categorical Data Encoder Tool (Machine Learning Preprocessing)

A powerful and interactive Streamlit web application that demonstrates how to perform Label Encoding and One-Hot Encoding on categorical data.

This project is designed to help beginners and developers understand data preprocessing techniques used in Machine Learning.


πŸ“Œ Features

  • πŸ”’ Label Encoding

    • Converts categorical values into numerical labels
  • πŸ”₯ One-Hot Encoding

    • Converts categories into binary vectors
  • πŸ“‚ Multiple Input Methods

    • Default sample data
    • Manual input (comma-separated values)
    • Upload CSV dataset
  • πŸ“Š Data Visualization

    • Bar chart for category distribution
  • 🎯 User-Friendly UI

    • Built with Streamlit for interactive experience

🧠 Why This Project?

Data preprocessing is a critical step in Machine Learning. This project helps you understand:

  • Difference between Label Encoding & One-Hot Encoding
  • When to use each encoding technique
  • How categorical data is transformed for ML models

πŸ› οΈ Tech Stack

  • Python 🐍
  • Streamlit 🎨
  • Pandas πŸ“Š
  • NumPy πŸ”’
  • Scikit-learn πŸ€–
  • Matplotlib πŸ“ˆ

πŸ“‚ Project Structure

πŸ“ Encoder-Tool/
│── app.py
│── requirements.txt
│── README.md

▢️ How to Run the Project

1️⃣ Clone the Repository

git clone https://github.com/selvan-01/Encoders-in-Machine-Learning.git
cd encoder-tool

2️⃣ Install Dependencies

pip install -r requirements.txt

3️⃣ Run the App

streamlit run app.py

πŸ“Έ Output Preview

  • Label Encoding Table
  • One-Hot Encoding Table
  • Category Distribution Graph

⚠️ Important Notes

  • Label Encoding assigns numerical values (may create ordinal relationship)

  • One-Hot Encoding avoids ranking between categories

  • New versions of Scikit-learn use:

    OneHotEncoder(sparse_output=False)
    

πŸš€ Future Improvements

  • πŸ“₯ Download encoded data as CSV
  • 🎨 Advanced UI design (themes & animations)
  • πŸ“Š Interactive charts using Plotly
  • πŸ€– Integration with ML models

πŸ‘¨β€πŸ’» Author

S. Senthamil Selvan (Sen) 🎯 Aspiring Ai Developer | AI & ML Enthusiast

πŸ”— Links


⭐ Support

If you found this project useful:

  • ⭐ Star this repository
  • πŸ” Share with others
  • πŸ’¬ Give feedback

πŸ’‘ Conclusion

This project provides a hands-on understanding of encoding techniques, making it a great addition to your Machine Learning portfolio.


About

πŸš€ Categorical Data Encoder Tool | Machine Learning Preprocessing Web App 🎯 An interactive Streamlit application for performing Label Encoding and One-Hot Encoding on categorical data. Supports CSV upload, manual input, and visualization to help understand essential data preprocessing techniques in Machine Learning.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages