Data Blogger Courses

Mastering Pandas

In this course, you will learn how to use the Python Pandas. After the course, you will be able to:

  • Load and transform your data
  • Visualizing data using line plots, scatter plots and histograms
  • Merging and storing data

The course also includes more advanced topics, such as data parallelization and aggregation.

You can see all course content under “Curriculum” on Data Blogger Courses and the first three lessons are free. The first free lesson can be found here.

(more…) Read more
Circuit board.

Scale out your Pandas DataFrame operations using Dask

In Pandas, one can easily apply operations on all the data using the apply method (see also our course for learning Pandas quickly). However, this method is quite slow and is not useful when scaling up your methods. Is there a way to speed up these operations? And if so, how? Yes, there is! This blog post will explain how you can use Dask to maximize the power of parallelization and to scale out your DataFrame operations.

(more…)

Read more · 12 minutes
Web search.

Should you Start Learning Python in 2018 (Guide)

Starting to learn programming most of the times is overwhelming because of the number of programming languages available to learn. This causes most of us to search for generic terms like “what is the easiest programming language to learn”.

More than 90% of the websites on the internet claims that Python is the easiest programming language to learn. This lands us to another question which is “Should I Learn Python or Not?”. In fact, not just you, I too have faced the same problem when I started to learn programming.

But, over the years of my learning, I have figured out the exact answer to this question. So, today in this post I am going to share everything you need to know in order to finally decide that do you want to add Python to your learning curriculum or NOT?

(more…)

Read more · 13 minutes
Big data.

Decoding Data Pipelines for Startups

In this era of Big Data, an issue that has been escalating off- late relates to data fragmentation across organisations. This makes the process of analytics and reporting to become even more complex. This is where data pipeline tools come into play. To define it, a data pipeline denotes a set of actions carried out to extract data from different sources. For a startup, building a data pipeline is an important aspect of data science. They need to gather data points from all users and process it in real- time for developing data products.

(more…)

Read more · 11 minutes