ECE Seminar (ECE 2002A/ECE 8002A)

*********************************
There is now a CONTENT FREEZE for Mercury while we switch to a new platform. It began on Friday, March 10 at 6pm and will end on Wednesday, March 15 at noon. No new content can be created during this time, but all material in the system as of the beginning of the freeze will be migrated to the new platform, including users and groups. Functionally the new site is identical to the old one. webteam@gatech.edu
*********************************

Event Details
  • Date/Time:
    • Wednesday January 31, 2018
      11:15 am - 12:05 pm
  • Location: Klaus Advanced Computing Building (Room 1456)
  • Phone:
  • URL:
  • Email:
  • Fee(s):
    N/A
  • Extras:
Contact

Paul Steffes

School of Electrical and Computer Engineering

404-894-3128

paul.steffes@ece.gatech.edu

Summaries

Summary Sentence: Seminar with Dr. Patrick Widener -- Sandia National Labs

Full Summary: Seminar with Dr. Patrick Widener -- Sandia National Labs

Speaker: Dr. Patrick Widener -- Sandia National Labs

Speaker's Title: Principal Member of the Technical Staff

Seminar Title: Understanding the Performance Effects of Resilience Mechanisms in High-Performance Computing Applications

Abstract:
Fault-tolerance poses a major challenge for future large-scale high-performance computing (HPC) systems and the important applications running on them. Alarming projections of high failure rates driven by the increasing scale and complexity of HPC systems have, over the past few years, motivated significant research into methods and techniques for providing resiliency while maintaining scalability in such systems. Our group at Sandia National Laboratories has worked to develop insights into selection and tuning of these methods and techniques. In this talk, I will describe our simulation-based framework for analyzing the performance effects of resilience activity. I will also present some recent research results obtained using our framework and discuss how those results have contributed to our understanding of the performance implications of resilience strategies for HPC applications.   

Speaker Bio:
Patrick Widener is a Principal Member of Technical Staff in the Center for Computing Research at Sandia. Dr. Widener’s research interests include the design and development of system software to support large-scale data-centric computational science, tools for examining performance interference caused by in-situ analytics, and software architectures for describing and exchanging data in computational science workflows. He is also a Research Associate Professor in the Computer Science Department at the University of New Mexico, and prior to joining Sandia was research faculty in the Department of Biomedical Informatics at Emory University. He holds a Ph.D. in Computer Science from the Georgia Institute of Technology.

Additional Information

In Campus Calendar
No
Groups

School of Electrical and Computer Engineering

Invited Audience
Faculty/Staff, Public, Graduate students, Undergraduate students
Categories
Seminar/Lecture/Colloquium
Keywords
No keywords were submitted.
Status
  • Created By: Ashlee Gardner
  • Workflow Status: Published
  • Created On: Jan 25, 2018 - 12:38pm
  • Last Updated: Jan 26, 2018 - 1:58pm