Intro to Spark for Data Science

Get More Info Register Now

Course Details

Take your data science skills to the next level with Spark. In this Galvanize weekend course you’ll learn to process big data and build data pipelines with Spark.

Register Now

Why Scale with Spark?

Apache Spark is a huge step forward in working with data at scale, enabling us to do faster machine learning algorithms on large data sets. Used by data professionals at Amazon, eBay, NASA, and 200+ other organizations, Spark’s community is one of the fastest growing in the world. You’ll want to know Spark if you want to keep up with the next evolutionary change in big data.

What You’ll Learn

In this two day, in-person, hands-on Spark training, you will learn how to:

  • Import, clean, and query data using Spark RDDs and Spark SQL.
  • Build a product recommendation system using Spark
  • Perform Natural Language Processing (NLP) using Spark.
  • Use Amazon Web Services (AWS) to deploy a Spark cluster.
  • Use a Spark cluster to process large datasets that cannot fit on your personal computer.

After completing this weekend workshop, you’ll be better prepared to use Spark for real-world projects and problems such as learning to build a product recommender, or mining consumer sentiment and brand perception from user comments.

Have More Questions?

Read our full FAQ or get in touch with someone on the Galvanize team.

Request Info Read our FAQ

Upcoming Dates