Blog

Exact percentiles in Spark

Combining the power of Scala and Python to make the calculation of percentiles in Spark easy and fast

avatar
Dr. Georg Heiler

Arrow 2.0.0 - structs in pandas

Finally, nested types in Arrow.

avatar
Dr. Georg Heiler

Sparkling SCD2

Data preparation using spark without ACID tables

avatar
Dr. Georg Heiler
Speed up conda and improve error messages featured image

Speed up conda and improve error messages

Efficient management of python packages

avatar
Dr. Georg Heiler
Time-series visualization in python featured image

Time-series visualization in python

Interactive and scalable plots for time-series and periodicities

avatar
Dr. Georg Heiler

Intersting links about Bayesian modeling

Useful links

avatar
Dr. Georg Heiler
Run the latest version of spark featured image

Run the latest version of spark

Execute the latest version of spark on HDP.

avatar
Dr. Georg Heiler

Intersting links about IoT

Useful links

avatar
Dr. Georg Heiler

Production grade pyspark jobs

Use additional python packages with pyspark

avatar
Dr. Georg Heiler
blazing-fast data science on GPUs featured image

blazing-fast data science on GPUs

Fast calculation of ego network using RAPIDS-AI.

avatar
Dr. Georg Heiler