ARCHER logo ARCHER banner

The ARCHER Service is now closed and has been superseded by ARCHER2.

  • ARCHER homepage
  • About ARCHER
    • About ARCHER
    • News & Events
    • Calendar
    • Blog Articles
    • Hardware
    • Software
    • Service Policies
    • Service Reports
    • Partners
    • People
    • Media Gallery
  • Get Access
    • Getting Access
    • TA Form and Notes
    • kAU Calculator
    • Cost of Access
  • User Support
    • User Support
    • Helpdesk
    • Frequently Asked Questions
    • ARCHER App
  • Documentation
    • User Guides & Documentation
    • Essential Skills
    • Quick Start Guide
    • ARCHER User Guide
    • ARCHER Best Practice Guide
    • Scientific Software Packages
    • UK Research Data Facility Guide
    • Knights Landing Guide
    • Data Management Guide
    • SAFE User Guide
    • ARCHER Troubleshooting Guide
    • ARCHER White Papers
    • Screencast Videos
  • Service Status
    • Detailed Service Status
    • Maintenance
  • Training
    • Upcoming Courses
    • Online Training
    • Driving Test
    • Course Registration
    • Course Descriptions
    • Virtual Tutorials and Webinars
    • Locations
    • Training personnel
    • Past Course Materials Repository
    • Feedback
  • Community
    • ARCHER Community
    • ARCHER Benchmarks
    • ARCHER KNL Performance Reports
    • Cray CoE for ARCHER
    • Embedded CSE
    • ARCHER Champions
    • ARCHER Scientific Consortia
    • HPC Scientific Advisory Committee
    • ARCHER for Early Career Researchers
  • Industry
    • Information for Industry
  • Outreach
    • Outreach (on EPCC Website)

You are here:

  • ARCHER
  • Upcoming Courses
  • Online Training
  • Driving Test
  • Course Registration
  • Course Descriptions
  • Virtual Tutorials and Webinars
  • Locations
  • Training personnel
  • Past Course Materials Repository
  • Feedback

Contact Us

support@archer.ac.uk

Twitter Feed

Tweets by @ARCHER_HPC

ISO 9001 Certified

ISO 27001 Certified

GPU with Cuda

Dates: 11-12 October 2016

Location: University College London

Timetable

Tuesday 11th October 2016

Unless otherwise indicated all material is Copyright © EPCC, The University of Edinburgh, and is only made available for private study.

  • 9.00 Arrival
  • 9.30 Introduction
  • 9.45 GPU Architecture
  • 10.15 Programming GPUs with CUDA
  • 11.00 Break
  • 11.30 Practical Session 1: Your first CUDA code
    • Click here (not available before session starts)
    • Enter the password and log in
    • Navigate to YourName->intro and open main_notebook.ipynb
  • 12.30 Lunch
  • 13.30 GPU Optimisation
  • 14:00 Practical Session 2: Optimising a CUDA Application
    • Click here
    • Enter the password and log in
    • Navigate to YourName->reconstruct and open main_notebook.ipynb
  • 15.00 Break
  • 15.30 Practical Session 2 (cont)
  • 16.00 Optional Lecture: Deep Learning with GPUs (you can see a movie of a previous verison of this here)
  • 16.45 Close

Wednesday 12th October 2016

10:00 Seminar: A Lightweight Approach to Performance Portability with targetDP

(followed by discussion on this and/or previous day's training)

Alan Gray, EPCC, The University of Edinburgh

See preprint

Leading HPC systems achieve their status through use of highly parallel devices such as NVIDIA GPUs or Intel Xeon Phi many-core CPUs. The concept of performance portability across such architectures, as well as traditional CPUs, is vital for the application programmer. In this paper we describe targetDP, a lightweight abstraction layer which allows grid-based applications to target data parallel hardware in a platform agnostic manner. We demonstrate the effectiveness of our pragmatic approach by presenting performance results for a complex fluid application (with which the model was co-designed), plus a separate lattice QCD particle physics code. For each application, a single source code base is seen to achieve portable performance, as assessed within the context of the Roofline model. TargetDP can be combined with MPI to allow use on systems containing multiple nodes: we demonstrate this through provision of scaling results on traditional and GPU-accelerated large scale supercomputers.

Copyright © Design and Content 2013-2019 EPCC. All rights reserved.

EPSRC NERC EPCC