Afiz ⚑️
Afiz ⚑️

@itsafiz

6 Tweets Feb 03, 2023
One of the common challenges of Data Analysts is missing values in the datasets.
In this thread, we will see how to deal with missing values in real-world datasets.
A thread πŸ§΅πŸ‘‡
There are mainly π˜π—΅π—Ώπ—²π—² approaches to deal with missing values in the datasets.
πŸ”Έ Drop Columns with Missing Values
πŸ”Έ Imputation
πŸ”Έ An Extension To Imputation
Details about each approach πŸ‘‡
2/6
🟑 Drop columns with missing values: This is the simplest solution to deal with missing values in datasets.
3/6
🟑 Imputation: This is a better approach than simple drop columns approach. In imputation, missing values will be filled with some number. For example, mean value will be filled for every missing one.
4/6
🟑 Extension to Imputation: Imputation works well, but imputated values may be above or below their actual values. To handle this case, we keep the track of missing values by adding extra column. πŸ‘‡
5/6
That's all about handling missing values in thee datasets. In the next thread 🧡, I will be sharing the source code for each approach. Staty tuned. If you like this content
πŸ”Έ Follow me @itsafiz
πŸ”Έ Retweet the first tweet.
6/6

Loading suggestions...