Introduction to Spark for Data Scientists

Dates: 10 - 11 January 2019

Location: EPCC, Edinburgh

Timetable

Unless otherwise indicated all material is Copyright © EPCC, The University of Edinburgh, and is only made available for private study.

Day 1

  • 9.30 - 10.15 Spark Introduction
  • 10.15 - 11.00 Spark Essentials
  • 11.00 - 11.30 Coffee/Tea break
  • 11.30 - 12.30 Workaround
  • 12.30 - 13.30 Lunch
  • 13.30 - 14.15 Spark Cluster
  • 14.15 - 15.00 Lab 1 Exercises
  • 15.00 - 15.30 Coffee/Tea break
  • 15.30 - 16.15 Additional Spark
  • 16.15 - 17.30 Lab 2 Exercises

Day 2

  • 9.30 - 10.15 Advanced Spark
  • 10.15 - 11.00 Lab 2 Exercises
  • 11.00 - 11.30 Coffee/Tea break
  • 11.30 - 12.30 Lab 3 Exercises - or create an Spark Application
  • 12.30 - 13.30 Lunch
  • 13.30 - 14.00 Submitting a Spark application to a Spark cluster
  • 14.00 - 15.00 Lab 3 Exercises - or create an Spark Application
  • 15.00 - 15.30 Wrap Up

Course Chat

https://paper.dropbox.com/doc/Introduction-to-Spark-for-Data-Scientists--AVBVy5PkRsk6EqX6Vr0tw~qlAg-NqDuY1Wiy3coGGsyWDW5m

The Chat page is a live collaborative online document which we will use to share links, information and comments. All course participants are encouraged to contribute.

Exercise Material

Unless otherwise indicated all material is Copyright © EPCC, The University of Edinburgh, and is only made available for private study.

Exercise material