Introduction to Spark for Fast In-memory Big Data Processing using Python

This workshop will teach you how to utilize Apache Spark and Python to perform large-scale, in-memory data analytics. Learning outcomes of this workshop include understanding the overall conceptual design of Spark and defining the advantages of using Spark over the traditional Hadoop MapReduce. Participants will also learn to develop Spark programs using Python and to leverage Spark’s specific capacities such as SQLContext and DataFrame to assist with data analytics.

December 08, 2017 from 9:00 am to 12:00 pm
Location: Barre Hall B106

Register Here