

bismark boateng
4 min read
Natural Language Processing And Model Validation
Most of a data scientist's role does not rest on beautifully structured data that lets you just start training and validating your...


Hamza kchok
5 min read
We learned a lot about data, let's make sure our models are working well.
In the last blog posts, we worked a lot on data preprocessing and came to recognize how important a task it is. Remembering the...

Omar Mohamed
6 min read
Image to Text transformation
Hello and welcome to this new article, which aims to explain an end-to-end use case that takes advantage of a wonderful AI...


Arpan Sapkota
7 min read
Introduction to NLP and Model Validation in Python
Natural language processing (NLP) is a field that focuses on making computer algorithms understand natural human language. Natural...

asma kirli
5 min read
Everything You Need To Know About Model Validation!
“Predicting the future isn’t magic, it’s artificial intelligence.” ~Dave Waters Have you ever been concerned about how well your...


asma kirli
5 min read
The Guide To Deal With Text Data: From Basic to Advanced!
“Torture the data, and it will confess to anything.” – Ronald Coase One of the biggest breakthroughs when it comes to preprocessing your...


bismark boateng
4 min read
A Guide To Feature Engineering And Machine Learning For Time Series Data
#Machine Learning Machine Learning has swept the world of data science; improvements in processing power, algorithms, and community...

Tanushree Nepal
3 min read
Linear Classifiers and Tree Based Model in Python
Linear classifiers are among the most commonly used classifiers and are a supervised machine learning technique. A linear classifier...

Omar Mohamed
5 min read
Feature Engineering using Python
Introduction Hello and welcome to my new article in which we try to discuss some crucial feature engineering methods using...


mrbenjaminowusu
3 min read
EXPLORING HIGH DIMENSIONAL DATA
Introduction High-dimensional data is data that has too many input variables or features. These features may be in the form of...

Blessing Oludele
5 min read
Where are people in Canada, Mexico and the United States getting employed?
After progressing in several data science courses, I might have overestimated my abilities lol. Let's just say this project gave me some...

TEMFACK DERICK
4 min read
Principal Component Analysis with Python
In a machine learning workflow, we generally have to deal with high-dimensional datasets. But most of the time, many features in those...

Hamza kchok
6 min read
ML Models are cool, but are they as cool as good preprocessed data?
AI Models sure are interesting and fairly fascinating. We're talking about machines "learning" new things! But, whether we like it or...

Arpan Sapkota
9 min read
Dimensionality Reduction and Preprocessing for Machine Learning in Python
Dimensionality Reduction in Python Dimensionality reduction is a method of unsupervised learning. It refers to techniques for reducing...

asma kirli
7 min read
My roadmap into preprocessing data: Feature extraction from raw text using TF-IDF
“Without big data, you are blind and deaf and in the middle of a freeway” - Geoffrey Moore. When we say big data, we mean a huge amount...

Omar Mohamed
7 min read
Noise minimization and Sampling different techniques using Python
Introduction When it comes to creating a Machine Learning pipeline, data preprocessing is the first step marking the initiation of the...


Arpan Sapkota
6 min read
Extreme Gradient Boosting with XGBoost and Cluster Analysis in Python
Extreme Gradient Boosting with XGBoost Extreme Gradient Boosting is a tree-based method that belongs to Machine Learning's supervised...


Hamza kchok
7 min read
Need more ML Models? We got you!
In recent blogs, we discovered a few things about machine learning algorithms: regression and classification models, be they linear,...

asma kirli
8 min read
Machine Learning Concepts: K-means Clustering.
“Everything that civilisation has to offer is a product of human intelligence; we cannot predict what we might achieve when this...


Omar Mohamed
6 min read
Cupcake and Muffin Recipes
Hello and welcome to my new article in the series of data science beginner guide projects. Here we discuss the delicious cupcake and...


mrbenjaminowusu
3 min read
How Regularization Affects Training and Test Accuracy Of A Logistic Regression Module.
In this blog, we will be discussing the logistic regression classifier and how regularization affects the accuracy of its performance module....

Arpan Sapkota
5 min read
Linear Classifiers and Machine Learning with Tree-Based Models in Python
Linear Classifiers This article examines common linear classification models, including descriptions of the methods as well as Python...

ayenadykyaw1
6 min read
Support Vector Machine for Classification Problems
Support vector machine (SVM) was first identified by Vladimir Vapnik and his colleagues in 1992. SVMs are also known as one of the most...


asma kirli
6 min read
Machine Learning Concepts: Classification and Regression Trees with Scikit-Learn
“Data is a precious thing and will last longer than the systems themselves.” – Tim Berners-Lee For banks to figure out if they should...