Broken Backend

Topic: Optimisation


How to optimise loading partitioned JSON data in Spark?

 Posted on August 30, 2020  |  7 minutes  |  1321 words  |  Wissem

In this tutorial we will explore ways to optimise loading partitioned JSON data in Spark.

I used the SF Bay Area Bike Share dataset, which you can find here. The original data (status.csv) has gone through a few transformations. The result looks like:

[Read More]
Tech: Spark  Topic: Optimisation  Format: Howto 

