posted by organizer: ndebard || 2886 views || tracked by 2 users: [display]

FTXS 2015 : The 5th Fault Tolerance for HPC at eXtreme Scale (FTXS) Workshop

FacebookTwitterLinkedInGoogle

Link: https://sites.google.com/site/ftxsworkshop/home/ftxs-2015
 
When Jun 15, 2015 - Jun 15, 2015
Where Portland, OR
Submission Deadline Feb 9, 2015
Notification Due Mar 9, 2015
Categories    resilience   fault tolerance   HPC   supercomputing
 

Call For Papers

CALL FOR PAPERS
5th Workshop on Fault-Tolerance for HPC at eXtreme Scale (FTXS 2015)

In conjunction with
The 24th International ACM Symposium on
High Performance Distributed Computing (HPDC 2015)
Portland, Oregon, USA on June 15 – 19, 2015

Authors are invited to submit original papers on the research and practice of
fault-tolerance in extreme scale (HPC) computing. Resilience and
fault-tolerance remain a major concern for supercomputing and advances in this
area are needed to allow applications to compute accurate (or within error
tolerance) answers in a timely and efficient manner in the presence of
degradations or failures of platform components (both hardware and software).

Topics include, but are not limited to:
* Failure data analysis and field studies
* Power, performance, resilience (PPR) assessments / tradeoffs
* Novel fault-tolerance techniques and implementations
* Emerging hardware and software technology for resilience
* Silent data corruption (SDC) detection / correction techniques
* Advances in reliability monitoring, analysis, and control of highly
complex systems
* Failure prediction, error preemption, and recovery techniques
* Fault-tolerant programming models
* Models for software and hardware reliability
* Metrics and standards for measuring, improving, and enforcing effective
fault-tolerance
* Scalable Byzantine fault-tolerance and security from single-fault and
fail-silent violations
* Atmospheric evaluations relevant to HPC systems (terrestrial neutrons,
temperature, voltage, etc.)
* Near-threshold-voltage implications and evaluations for reliability
* Benchmarks and experimental environments including fault injection
* Frameworks and APIs for fault-tolerance and fault management

See https://sites.google.com/site/ftxsworkshop/home/ftxs-2015 and
http://www.hpdc.org/2015/ for more information.

AMD will sponsor the FTXS 2015 best paper award! The award will be chosen by
the PC and awarded at the workshop.

PAPER SUBMISSIONS
Submissions are solicited in the following categories:
* Regular papers presenting innovative ideas improving the state of the
art.
* Experience papers discussing the issues seen on existing extreme-scale
systems, including some form of analysis and evaluation.
* Extended abstracts proposing disruptive ideas in the field, including
some form of preliminary results.

Submissions shall be sent electronically, must conform to ACM conference
proceedings style and should not exceed eight (8) pages including all text,
appendices, and figures. Position papers should not exceed six (6) pages.

IMPORTANT DATES
Submission of papers: February 9th, 2015
Author notification: March 9th, 2015
Camera-ready papers: April 2015
Workshop: June 15th, 2015

FTXS 2015 PROGRAM CHAIRS
Nathan DeBardeleben – Los Alamos National Laboratory
Franck Cappello – Argonne National Laboratory and UIUC
Robert Clay – Sandia National Laboratories

PROGRAM COMMITTEE
Leonardo Bautista Gomez – Argonne National Laboratory
Aurélien Bouteiller – University of Tennessee Knoxville
Greg Bronevetsky - Lawrence Livermore National Laboratory
John Daly - Department of Defense
Christian Engelmann – Oak Ridge National Laboratory
Kurt Ferreira – Sandia National Laboratories
Ana Gainaru – University of Illinois at Urbana-Champaign
Qiang Guan – Los Alamos National Laboratory
Saurabh Gupta – Oak Ridge National Laboratory
Saurabh Hukerikar – Information Sciences Institute/USC
Hideyuki Jitsumoto – Tokyo Institute of Technology
Zhiling Lan – Illinois Institute of Technology
Scot Levy – University of New Mexico
Naoya Maruyama – RIKEN AICS
Bogdan Nicolae – IBM Research – Ireland
Thomas Ropars - EPFL
Yves Robert - ENS Lyon
Anthony Skjellum - Auburn University
Vilas Sridharan – AMD, Inc.
Devesh Tiwari – Oak Ridge National Laboratory
Abhinav Vishnu - Pacific Northwest National Laboratory

https://sites.google.com/site/ftxsworkshop/home/ftxs-2015

Related Resources

FTXS 2024   Fault Tolerance for HPC at eXtreme Scales (FTXS) Workshop
OpenSuCo @ ISC HPC 2017   2017 International Workshop on Open Source Supercomputing
REX-IO 2024   4th Workshop on Re-envisioning Extreme-Scale I/O for Emerging Hybrid HPC Workloads @ IEEE Cluster 2024
SHiPS 2025   The 2nd International Workshop on the Environmental Sustainability of High-Performance Software
XLOOP 2024   XLOOP 2024 : The 6th Annual Workshop on Extreme-Scale Experiment-in-the-Loop Computing
HPCMS - PDP 2025   PDP 2025 Special Session on High Performance Computing in Modelling and Simulation (HPCMS)
AHPC3 2025   The 1st Workshop on Accelerated HPC in the Cloud-Edge Continuum
CFP&CFSP-DFT 2024   DFT 2024 | 37th IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems
EDCC 2025   20th European Dependable Computing Conference
SSS 2024   The 26th International Symposium on Stabilization, Safety, and Security of Distributed Systems