Spark is a unified analytics engine for large-scale data processing. It provides the RDD (Resilient Distributed Dataset) abstraction.