How do i fill the Null values of a categorical column?

Question

I'm working on a project using an E-commerce dataset. I'm facing an issue in the data cleaning stage. I have the customers dataset, which has approximately. 1.6 million rows. One of the feature, "Job Title" in the dataset has around 600K Null values.

Should I Impute the Null values with values like "NA" or "Ignored", Or should i drop them? I can't afford to drop NA rows as they're connected to other datasets. Is there any way to fill these Null values in correct way for this situation?

This is the first 5 rows of my Customer DataFrame.

This question is similar to: Handling categorical missing values ML. If you believe it’s different, please edit the question, make it clear how it’s different and/or how the answers on that question are not helpful for your problem. — Guna
– Guna, Commented Apr 21 at 10:31

Philipp · Accepted Answer · 2025-04-07 18:49:33Z

If dropping rows isn’t an option due to connections with other datasets, the best approach is to impute missing "Job Title" values with a neutral placeholder like "Unknown" or "Not Provided". This maintains relational integrity and avoids introducing bias from incorrect assumptions.

Additionally, consider creating a binary flag column (e.g., "Job Title Missing") to indicate which entries were originally null. This can be useful for downstream analysis or modeling, as the missingness itself might carry predictive value.

If "Job Title" plays a significant role in your analysis, and you have strong correlated features like age, income, industry, or transaction history, you could explore more advanced imputation using classification models or clustering techniques. However, simple placeholders are often sufficient if the field is not central to your objectives.

Jake Malis · Accepted Answer · 2025-04-19 00:49:39Z

And to add on to what @Philipp said, if you need the code to fill the null values it would be:

df['Job Title'].fillna('N/A', inplace=True)

Stack Exchange Network

How do i fill the Null values of a categorical column?

2 Answers 2

Linked

Hot Network Questions

How do i fill the Null values of a categorical column?

2 Answers 2

Linked

Related

Hot Network Questions