Best Practice of Compression Decompression Codes in Apache Spark Sophia Sun (Intel) and Qi Xie (Intel) from pace codes of practice a h Watch Video
Preview(s): Play Video: (Note: The default playback of the video is HD VERSION. If your browser is buffering the video slowly, please play the REGULAR MP4 VERSION or Open The Video below for better experience. Thank you!)
⏲ Duration: 17 min 22 sec ✓ Published: 11-Jun-2018
Description: Nowadays, people are creating, sharing and storing data at a faster pace than ever before, effective data compression / decompression could significantly reduce the cost of data usage. Apache Spark is a general distributed computing engine for big data analytics, and it has large amount of data storing and shuffling across cluster in runtime, the data compression/decompression codecs can impact the end to end application performance in many ways.nnHowever, there’s a trade-off between the stora