A Guide On How To Become A Data Scientist - (Step By Step Approach)

ABSTRACT

In this article, we will see the skills that one must acquire to become a data scientist. You cannot learn data science within one year or six months — instead, it’s a lifetime process that you have to follow with proper dedication and hard work.

DATA SCIENCE ROADMAP

STEP 1: Choose A Programming Language (Python / R)

The first step while starting the Data Science Journey is to get familiar with the Programming Language. Between the two, Python is the most preferred coding language and is adopted by most Data scientists. It is easy to understand, versatile, and supports various in-built libraries such as Numpy, Pandas, MatplotLib, Seaborn, Scipy, and many more.

  1. Kaggle’s Python Course
  2. Krish Naik’s Python Tutorial (Recommended)
  3. Udemy’s Python for Data Science and Machine Learning Bootcamp
  4. Coursera Python Course

STEP 2. Statistics

For becoming a Data Scientist, having knowledge of Statistics and Probability is as essential as having salt in food. Knowing them will help the data scientists interpret large data sets, get insights from them, and analyze them better.

  1. Coursera Statistics Course
  2. Khan Academy Statistics And Probability Course
  3. FreeCodeCamp Statistics Course (Recommended)

STEP 3: Learning SQL

Structured Query Language (SQL) is used for extracting and communicating with large databases. One should focus on understanding the different types of normalization, writing nested queries, using co-related questions, group-by, performing join operations, etc., on the data and extract in Raw format. This Data will then further be cleaned either in Microsoft Excel or by using python libraries.

  1. Intro To SQL By Kaggle (Recommended)
  2. Advanced SQL By kaggle
  3. Edureka’s SQL Playlist

STEP 4. Data Cleaning

When a Data Scientists is given a project, his majority of time goes into cleaning the data set, removing unwanted values, handling missing values. It can be achieved by using some inbuilt python libraries like Pandas and Numpy.
One should also know how to manipulate data using MICROSOFT EXCEL.

  1. Edureka’s Microsoft Excel Course
  2. Learning Pandas By Kaggle (Recommended)

STEP 5: Exploratory Data Analysis

Exploratory data analysis is the essential part when talking about data science. The data scientists have many tasks, including finding data patterns, analyzing data, finding the appropriate trends in the data and obtaining valuable insights, etc., from them with the help of various graphical and statistic l methods.
It includes:
A) Data Analysis using Pandas and Numpy,
B) Data Manipulation
C) Data Visualization

  1. Blog — Performing EDA on Iris Data Set (Recommended)
    3.Coursera Course On EDA, Statistics, Probability
TYPES OF PLOT IN SEABORN LIBRARY

STEP 6: Learning Machine Learning Algorithms

According to Google, “Machine learning is a method of data analysis that automates analytical model building. It is a branch of artificial intelligence based on the idea that systems can learn from data, identify patterns and make decisions with minimal human intervention.”
It is the most crucial step in a life cycle of a data scientist where one has to build various models using machine learning algorithms and should be able to predict and come with the most optimum solution to solve any problem.

  1. Deep Learning By Krish Naik
  2. Intro To ML By Kaggle (Recommended)
  3. Machine Learning By Krish Naik (Recommended)
  4. Coursera Deep Learning Specialization
MACHINE LEARNING

Step 7: Practicing On Analytics Vidhya and Kaggle

After acquiring the basics of Data Science, now it’s time to get hands-on experience in its part; there are many online platforms like Kaggle and Analytics Vidhya, which can provide you with hands-on experience with both beginner and advanced level data sets. They can help you to understand various machine learning algorithms, different analyzing techniques, etc.

--

--

An aspiring data scientist.

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store