A powerful and interactive Streamlit web application that demonstrates how to perform Label Encoding and One-Hot Encoding on categorical data.
This project is designed to help beginners and developers understand data preprocessing techniques used in Machine Learning.
-
π’ Label Encoding
- Converts categorical values into numerical labels
-
π₯ One-Hot Encoding
- Converts categories into binary vectors
-
π Multiple Input Methods
- Default sample data
- Manual input (comma-separated values)
- Upload CSV dataset
-
π Data Visualization
- Bar chart for category distribution
-
π― User-Friendly UI
- Built with Streamlit for interactive experience
Data preprocessing is a critical step in Machine Learning. This project helps you understand:
- Difference between Label Encoding & One-Hot Encoding
- When to use each encoding technique
- How categorical data is transformed for ML models
- Python π
- Streamlit π¨
- Pandas π
- NumPy π’
- Scikit-learn π€
- Matplotlib π
π Encoder-Tool/
βββ app.py
βββ requirements.txt
βββ README.md
git clone https://github.com/selvan-01/Encoders-in-Machine-Learning.git
cd encoder-tool
pip install -r requirements.txt
streamlit run app.py
- Label Encoding Table
- One-Hot Encoding Table
- Category Distribution Graph
-
Label Encoding assigns numerical values (may create ordinal relationship)
-
One-Hot Encoding avoids ranking between categories
-
New versions of Scikit-learn use:
OneHotEncoder(sparse_output=False)
- π₯ Download encoded data as CSV
- π¨ Advanced UI design (themes & animations)
- π Interactive charts using Plotly
- π€ Integration with ML models
S. Senthamil Selvan (Sen) π― Aspiring Ai Developer | AI & ML Enthusiast
If you found this project useful:
- β Star this repository
- π Share with others
- π¬ Give feedback
This project provides a hands-on understanding of encoding techniques, making it a great addition to your Machine Learning portfolio.