Actions

Resilience: Difference between revisions

From Modelado Foundation

imported>Achien
No edit summary
imported>Achien
No edit summary
Line 16: Line 16:
| '''PI''' || Ron Brightwell || Shekhar Borkar || Katherine Yelick || Daniel Quinlan  || Guang Gao || Mary Hall || Andrew Chien || Koushik Sen || Milind Kulkarni || Martin Schulz
| '''PI''' || Ron Brightwell || Shekhar Borkar || Katherine Yelick || Daniel Quinlan  || Guang Gao || Mary Hall || Andrew Chien || Koushik Sen || Milind Kulkarni || Martin Schulz
|- style="vertical-align:top;"
|- style="vertical-align:top;"
|'''How does your project approach to resilience? (i.e. uses lower-level mechanisms from hardware or lower level software, depends on higher level management, creates new mechanisms) How will it ensure resilient and efficient program execution at 100K sockets and high transient error rates?"
|Describe how your approach to resilience and its dependence on other programming, runtime, or resilience technologies? (i.e. uses lower-level mechanisms from hardware or lower level software, depends on higher level management, creates new mechanisms)  
|(EXPRESS)
|(EXPRESS)
|(TG)
|(TG)
Line 28: Line 28:
|(PIPER)
|(PIPER)
|- style="vertical-align:top;"
|- style="vertical-align:top;"
|'''What opportunities are there to improve resilience or efficiency of resilience by exporting/exploiting runtime or application semantics information in your system?"
|'One challenging problem for Exascale systems is that projections of soft error rates and hardware lifetimes (wearout) span a wide range from a modest increase over current systems to as much as a 100-fold increase.  How does your system scale in resilience to ensure effective exascale capabilities on both the varied systems that are likely to exist and varied operating points (power, error rate)?
|(EXPRESS)
|(TG)
|(DEGAS)
|(D-TEC)
|(DynAX)
|(X-TUNE)
|(GVR)
|(CORVETTE)
|N/A
|(PIPER)
|- style="vertical-align:top;"
|''What opportunities are there to improve resilience or efficiency of resilience by exporting/exploiting runtime or application semantics information in your system?"
|(EXPRESS)
|(EXPRESS)
|(TG)
|(TG)

Revision as of 00:43, April 9, 2014

Sonia requested that Andrew Chien initiate this page. For comments, please contact Andrew Chien.

QUESTIONS XPRESS TG X-Stack DEGAS D-TEC DynAX X-TUNE GVR CORVETTE SLEEC PIPER
PI Ron Brightwell Shekhar Borkar Katherine Yelick Daniel Quinlan Guang Gao Mary Hall Andrew Chien Koushik Sen Milind Kulkarni Martin Schulz
Describe how your approach to resilience and its dependence on other programming, runtime, or resilience technologies? (i.e. uses lower-level mechanisms from hardware or lower level software, depends on higher level management, creates new mechanisms) (EXPRESS) (TG) (DEGAS) (D-TEC) (DynAX) (X-TUNE) (GVR) (CORVETTE) SLEEC (PIPER)
'One challenging problem for Exascale systems is that projections of soft error rates and hardware lifetimes (wearout) span a wide range from a modest increase over current systems to as much as a 100-fold increase. How does your system scale in resilience to ensure effective exascale capabilities on both the varied systems that are likely to exist and varied operating points (power, error rate)? (EXPRESS) (TG) (DEGAS) (D-TEC) (DynAX) (X-TUNE) (GVR) (CORVETTE) N/A (PIPER)
What opportunities are there to improve resilience or efficiency of resilience by exporting/exploiting runtime or application semantics information in your system?" (EXPRESS) (TG) (DEGAS) (D-TEC) (DynAX) (X-TUNE) (GVR) (CORVETTE) N/A (PIPER)
What capabilities of provided by resilience researchers (software or hardware) could have a significant impact on the capabilities or efficiency of resilience?" (EXPRESS) (TG) (DEGAS) (D-TEC) (DynAX) (X-TUNE) (GVR) (CORVETTE) N/A (PIPER)