Category: programming


PySpark: compute the radius of gyration

How to compute the radius of gyration for large datasets with PySpark.

Mar 24, 2021

PySpark: great-circle distance

How to compute the great-circle distance between two GPS points with PySpark without using UDFs.

Mar 23, 2021

Pandas: plot the values of a groupby on multiple columns

How to plot different categories in the same figure after a groupby, using the pandas function unstack().

Jul 15, 2017

Pandas & Matplotlib: personalize the date format in a bar chart

A simple way to plot a bar chart with formatted dates on the x-axis using Pandas and Matplotlib.

May 24, 2017