Spark for the Impatient: Introduction

The Zen of Spark: What is it, exactly? Apache Spark is a general-purpose, distributed, memory-based, parallel computing engine. Man, that’s a mouthful. Although technically accurate, that description comes across as so much marketing-speak. But the fact is, we can learn a lot about Spark’s key characteristics by breaking it down. General Purpose: For quite …