big data

Decoding Data Pipelines for Startups

In this era of Big Data, an issue that has been escalating off- late relates to data fragmentation across organisations. This makes the process of analytics and reporting to become even more complex. This is where data pipeline tools come into play. To define it, a data pipeline denotes a set of actions carried out to extract data from different sources. For a startup, building a data pipeline is an important aspect of data science. They need to gather data points from all users and process it in real- time for developing data products.

(more…)

Read more · 11 minutes
Data Blogger Courses

Mastering Pandas

In this course, you will learn how to use the Python Pandas. After the course, you will be able to:

  • Load and transform your data
  • Visualizing data using line plots, scatter plots and histograms
  • Merging and storing data

The course also includes more advanced topics, such as data parallelization and aggregation.

You can see all course content under “Curriculum” on Data Blogger Courses and the first three lessons are free. The first free lesson can be found here.

(more…) Read more
6276688407 28fb0d1b04 o

Opinion mining on Dutch news articles

In this blog post, I will learn you how you can mine opinions about companies from news articles. I will share how I scraped thousands of news articles in a few minutes and how one could classify the opinion expressed in the titles of the news articles. This information could be used for example to help with watching competitors of a company or to predict global trends.

(more…)

Read more · 17 minutes