
Data Science Project from Scratch - #3

  • Writer: supriyamalla
  • Jan 22, 2022
  • 1 min read

Great, now that we have the data extracted, let's get our hands dirty with the data cleaning!


Looking through the data, I made a note of everything that needed fixing:



1. Salary parsing - strip the "Glassdoor est" text, the K and the hyphens, and convert the salary to dollars; drop rows with no salary to parse. Remember, our idea is to predict salary!

2. Company Name - remove the rating from the Company Name; the company name should be text only

3. Age of the company

4. Job description parsing - is Python required?
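Here is a minimal sketch of what these four steps could look like in plain Python. It is not the exact code from my script. The helper names (parse_salary, clean_company_name, company_age, mentions_python) are made up for illustration, and the sample formats are assumptions: a salary like "$53K-$91K (Glassdoor est.)", a company name with the rating tacked on after a line break, and a founded year where a non-positive value means "missing".

```python
import re
from typing import Optional, Tuple


def parse_salary(raw: str) -> Optional[Tuple[int, int]]:
    """Return (min, max) salary in dollars, or None if there is nothing to parse."""
    # Drop any parenthesised note such as "(Glassdoor est.)".
    text = re.sub(r"\(.*?\)", "", raw)
    # Pick up every number followed by K and convert thousands to dollars.
    amounts = [int(n) * 1000 for n in re.findall(r"(\d+)\s*[Kk]", text)]
    if not amounts:
        return None
    return min(amounts), max(amounts)


def clean_company_name(raw: str) -> str:
    """Strip a trailing rating such as "3.8" so only the name text remains."""
    # Assumption: the rating is a single digit, a dot and a single digit at the end.
    return re.sub(r"\s*\d\.\d\s*$", "", raw).strip()


def company_age(founded_year: int, current_year: int) -> Optional[int]:
    """Age in years, or None when the founded year is missing (non-positive)."""
    if founded_year <= 0:
        return None
    return current_year - founded_year


def mentions_python(description: str) -> bool:
    """True if the job description mentions Python as a whole word."""
    return re.search(r"\bpython\b", description, re.IGNORECASE) is not None


# Made-up sample rows, only to show the helpers working together.
rows = [
    {"salary": "$53K-$91K (Glassdoor est.)", "company": "Example Corp\n3.8",
     "founded": 1973, "description": "Experience with Python and SQL."},
    {"salary": "-1", "company": "No Salary Inc\n4.1",
     "founded": -1, "description": "Excel expert wanted."},
]

cleaned = []
for row in rows:
    salary = parse_salary(row["salary"])
    if salary is None:
        continue  # no salary to parse, so the row is dropped
    cleaned.append({
        "min_salary": salary[0],
        "max_salary": salary[1],
        "company": clean_company_name(row["company"]),
        "age": company_age(row["founded"], current_year=2022),
        "python": mentions_python(row["description"]),
    })
```

One thing to watch: the rating pattern would also clip a company whose name really does end in something like "2.0", so it is worth eyeballing the results. With pandas, each helper could be applied to its column via apply.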


I tried the Spyder IDE this time, and it was a GAME CHANGER! You can view the variables on the go (no need to print them every time), plus the results aren't displayed in the same console as your code.


And finally, I have cleaned the data! This was a very simple yet fulfilling exercise.


And part 3 is DONEEE!





©2020 by Learn Data Science with me. Proudly created with Wix.com
