Data Science Project from Scratch - #3
- supriyamalla

- Jan 22, 2022
- 1 min read
Great, now that we have the data extracted - let's get our hands dirty on the data cleaning!
By looking at the data, I made a note of all the things that are wrong with the data.
1. Salary parsing - remove "Glassdoor est", K, hyphens, make it in dollars; remove rows with no salary parsing. Remember our idea is to predict salary!
2. remove ratings from Company Name, company name should be text only
3. age of the company
4. parsing of job description - is python required?
I tried Spyder IDE this time - and it was a GAME CHANGER! You can view the variables on the go (don't have to print every time) plus the results aren't displayed in the same console as your code.
and finally, have cleaned the data! this was very simple yet a fulfilling exercise.
and part 3 is DONEEE!



Comments