"Containment Domains: Programming and Execution Model Support for Resiliency," Mattan Erez, The University of Texas at Austin

From Modelado Foundation

Revision as of 10:52, January 10, 2014

Containment domains are a programming construct with weak transactional semantics, specifically designed to enable applications to express resiliency concerns. The goal of containment domains is to enable resilient-by-default applications that can be progressively and systematically optimized and tuned to improve their performance and efficiency. The key to containment domains is abandoning the prevailing one-size-fits-all approach to resiliency and embracing the diversity of application needs and resiliency mechanisms. The new capabilities include tuning and specializing error detection, state preservation and restoration, and recovery schemes. Containment domains are nested to take advantage of the machine hierarchy and to enable effective, low-complexity, uncoordinated, localized recovery. They also provide the means of expressing algorithm-specific detection and recovery. These characteristics are critical to achieving the power and resource efficiency needed for extreme-scale computing while guaranteeing correct results. The programming model is accompanied by an execution model that provides interfaces to runtime services. The containment domains runtime consolidates various resiliency techniques and optimizations and provides the interfaces for co-tuning application-level and system-level resiliency. Together, the programming and execution models are used to implement a default resiliency scheme, which is then amenable to progressive and aggressive optimization.
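The preserve, detect, and restore pattern with nesting described above can be illustrated with a minimal Python sketch. This is not the containment domains API or runtime: `run_domain`, `DetectedError`, the retry limit, and the fault-injection counter are all hypothetical names chosen for illustration, and state is preserved with a plain in-memory deep copy. The sketch only aims to show how a child domain can recover locally from a transient error without involving its parent, and how an unrecoverable error would escalate to the enclosing domain.

```python
import copy


class DetectedError(Exception):
    """Hypothetical signal that a domain's detection check failed."""


def run_domain(state, body, check, max_retries=2, name="cd"):
    """Run body(state) with preserve / detect / restore-and-retry semantics.

    Returns the number of attempts used. If retries are exhausted, the error
    is re-raised so that an enclosing (parent) domain can handle it.
    """
    snapshot = copy.deepcopy(state)              # preserve state on entry
    for attempt in range(1, max_retries + 2):
        try:
            body(state)                          # may itself run nested domains
            if not check(state):                 # domain-specific detection
                raise DetectedError(name)
            return attempt
        except DetectedError:
            state.clear()                        # restore preserved state
            state.update(copy.deepcopy(snapshot))
    raise DetectedError(f"{name}: retries exhausted, escalating")


# Assumed fault model for the demo: one transient corruption in block 1.
faults = {"remaining": 1}
attempts = {}


def double_block(block):
    block["v"] *= 2
    if block["id"] == 1 and faults["remaining"] > 0:
        faults["remaining"] -= 1
        block["v"] = -1                          # simulated transient error


def parent_body(state):
    # Each block runs in its own child domain, so recovery stays local.
    for block in state["blocks"]:
        attempts[block["id"]] = run_domain(
            block, double_block, lambda b: b["v"] >= 0,
            name=f"child{block['id']}",
        )


state = {"blocks": [{"id": i, "v": i + 1} for i in range(3)]}
parent_attempts = run_domain(
    state, parent_body,
    lambda s: all(b["v"] > 0 for b in s["blocks"]),
    name="parent",
)
result = [b["v"] for b in state["blocks"]]
```

In this run, only the child domain for block 1 re-executes; the parent domain completes on its first attempt. A real implementation would choose where and how to preserve state (for example, in local memory versus remote storage) according to the machine hierarchy, which this sketch does not model.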

This page is under construction.

For publications, posters, and other documents, please see [1].