Tag: ,

Motivating Entity Resolution for Data Science

Why Entity Resolution? Data is the new oil. Thus, analytical models are the new combustion engines. A combustion engine functions efficiently with good fuel. Similarly,...

Log Loss as a performance metric

Introduction to Log Loss Whenever we talk about performance metrics of the classification Machine Learning algorithms, the following names come to our mind: Accuracy...

Python Dedupe Library : Machine Learning to De-Duplicate Data

In Information systems, the biggest challenge faced by organizations is the quality of data. Hence, unclean, messy, and missing data is a common headache across the...

Careers in Machine Learning and AI

There has been a growing fear that AI would eat away jobs from every sector, including Software Services. Read this article of ours to know more: Is Artificial...

Tutorial: Hierarchical Clustering in Spark with Bisecting K-Means

In the previous article, we covered the standard K-Means Clustering technique on Spark. Read that article here: Tutorial : K-Means Clustering on Spark. In this article,...

Tutorial : K-Means Clustering on Spark

Analytics is discovering insights using data. Traditionally, statistical and visual techniques dominated the field. But, with advances in Machine Learning and AI,...

AI in Analytics vs AI in Automation

I was talking to a business leader once. I asked him, “What’s the biggest challenge with the young professionals?” The answer was,...

A look at Differential Privacy

AI is a superpower. And with power comes responsibility. This takes us into the domain of Responsible AI. In 2016, Microsoft released a bot named Tay on Twitter....

Machine Learning Deployment Strategies

A survey said that only one of eight Machine Learning models gets to production. This usually happens because of bad problem definition. We have dealt with that topic in...

How to detect Data Drift in Azure Machine Learning

This is the age of AI. Hence, you want to automate a certain business process, let’s say, for instance, email classification.┬áSo, you have built a Machine...