David Andrés 🤖📈🐍
David Andrés 🤖📈🐍

@daansan_ml

9 Tweets 16 reads Jul 08, 2023
What is data normalization, and how can it be achieved?
Let's find out more about this!
🧵 👇
Normalization means to bring the data to a common range.
There are different techniques 👇
1️⃣ Min-Max Scaler:
This scaler scales each feature to a given range, typically between 0 and 1.
It’s useful when you want to preserve the shape of the original distribution but adjust the scale of the data.
2️⃣ Standard Scaler:
This scaler scales the data to have zero mean and unit variance. It is equivalent to calculating the z-score of the data.
It’s useful when the distribution of the data is not normal and you want to normalize it to a standard Gaussian distribution.
3️⃣ Robust Scaler:
This scaler is robust to outliers in the data and scales the data to the IQR (interquartile range).
It’s useful when you have data with extreme values that could skew the scaling process.
4️⃣ Max-Abs Scaler:
This scaler scales each feature to the maximum absolute value of that feature.
It’s useful when the data is centered around zero, and you want to preserve the direction and sign of each feature.
You should also join our newsletter, DSBoost🚀
Every week we share:
🔹Interviews
🔹Podcast notes
🔹Learning resources
🔹Interesting collections of content
Subscribe for free👇👇
dsboost.dev

Loading suggestions...