How to optimise loading partitioned JSON data in Spark?

In this tutorial we will explore ways to optimise loading partitioned JSON data in Spark.

I have used the SF Bay Area Bike Share dataset, which you can find here. The original data (status.csv) has gone through a few transformations. The result looks like:

[Figure: partitioned JSON data]

Loading from partitioned JSON files


We will load the data, filtered by station and month:

val df1 = spark.read
	.json("file:///data/bike-data-big/partitioned_status.json") // scan the partitioned JSON dataset
	.filter("station_id = 10 and (month in ('2013-08', '2013-09'))") // keep one station, two months

Although the code above does not contain any action yet, Spark starts three jobs that take a few minutes to complete (on a local setup, with 8 cores and 32 GB of RAM). This up-front work is mostly partition discovery and JSON schema inference: without a user-supplied schema, Spark has to read through the JSON files just to figure out their structure.

[Figure: slow JSON loading]
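The schema-inference pass is the easiest one to eliminate: supply the schema yourself. Here is a minimal sketch, assuming the transformed status data keeps the original status.csv columns plus station_id and month as partition columns; the field names and types are assumptions, so adjust them to your actual layout:

import org.apache.spark.sql.types._

// Hypothetical schema for the transformed status data; field names
// and types are assumptions, adjust them to match your files.
val statusSchema = StructType(Seq(
	StructField("bikes_available", IntegerType),
	StructField("docks_available", IntegerType),
	StructField("time", StringType),
	StructField("station_id", IntegerType), // partition column
	StructField("month", StringType)        // partition column
))

val df2 = spark.read
	.schema(statusSchema) // no inference pass: Spark trusts this schema
	.json("file:///data/bike-data-big/partitioned_status.json")
	.filter("station_id = 10 and (month in ('2013-08', '2013-09'))")

With an explicit schema, Spark no longer has to scan the files before the first action, and the filter on the partition columns still lets it prune every directory except station 10 for 2013-08 and 2013-09 (df2.explain() should show those predicates as PartitionFilters in the FileScan node). If you prefer to keep inference, the JSON reader's samplingRatio option at least limits how much data the inference pass reads.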
