James Owusu-Appiah

- 5 min

QS World Ranked Universities Analysis

QS WORLD RANKED UNIVERSITIES ANALYSIS This is in fulfillment of my final project as part of Data Insight Data Scientist Program. My final...

James Owusu-Appiah

- 4 min

Linear Classifiers And Machine Learning With Tree Based Models In Python

LINEAR CLASSIFIERS Linear classifiers are supervised machine learning algorithms that classify data into labels based on linear...

James Owusu-Appiah

- 7 min

Supervised and Unsupervised Learning

What Is Supervised Learning? A function that maps an input to an output is learned through supervised learning using sample input-output...

James Owusu-Appiah

- 7 min

Understanding Fundamental Statistical Concepts In Data Science

What Is Statistics? Statistics is a form of mathematical analysis that uses quantified models and representations for a given set of...

James Owusu-Appiah

- 8 min

TIME SERIES ANALYSIS OF NAICS

NAICS The North American Industry Classification System (NAICS) is the standard used by Federal statistical agencies in classifying...

asma kirli

- 7 min

My roadmap into preprocessing data: Feature extraction from raw text using TF-IDF

“ Without big data, you are blind and deaf and in the middle of a freeway” - Geoffrey Moore. When we say big data, we mean a huge amount...

James Owusu-Appiah

- 4 min

IMPORTING AND CLEANING DATA IN PYTHON

Importing Data In Python We need datasets for data analysis. Python has various modules which help us in importing the external data in...

Tanushree Nepal

- 5 min

Exploratory Data Analysis: Importing, Cleaning, and Visualization of Titanic Dataset

Exploratory Data Analysis (EDA) is used by data scientists to analyze and investigate data sets and summarize their main characteristics,...

Importing cleaning and Visualizing data using python

In this instructional exercise, we’ll utilize Python’s Pandas and NumPy libraries to clean data. In the primary portion I will clarify...

Blessing Oluwadaara

- 15 min

Pandas Techniques in Python for Data Manipulation

Introduction Python is the language of choice for data scientists. It provides a programming language's larger ecosystem as well as the...