Actions

Traleika Glacier: Difference between revisions

From Modelado Foundation

imported>Cdenny
(Created page with "{{Infobox project | title = Traleika Glacier | image = 180px | imagecaption = | team-members = List of team members | pi = Lead PI (Institute) | c...")
 
imported>Cdenny
No edit summary
Line 3: Line 3:
| image = [[File:Your-team-logo.png|180px]]
| image = [[File:Your-team-logo.png|180px]]
| imagecaption =  
| imagecaption =  
| team-members = List of team members
| team-members = [http://www.intel.com/ Intel], [https://www.reservoir.com/ Reservoir Labs], [http://www.etinternational.com/ ETI], [http://www.udel.edu/ UDEL], [http://www.ucsd.edu/ UC San Diego], [http://www.rice.edu/ Rice U.], [http://cs.illinois.edu/ UIUC], [http://www.pnnl.gov/ PNNL], [http://www.sandia.gov/ SNL]
| pi = Lead PI (Institute)
| pi = Shekhar Borkar (Intel)
| co-pi = Co-PIs (Institute)
| co-pi = Wilf Pinfold (Intel), Richard Lethin (Reservoir Labs), Rishi Khan (ETI), Guang Gao (UDEL), Laura Carrington (UC San Diego), Vivek Sarkar (Rice U.), David Padua (UIUC), Josep Torrellas (UIUC), John Feo (PNNL), Jackie Chen (SNL)
| website = team website
| website = [https://sites.google.com/site/traleikaglacierxstack/ https://sites.google.com/site/traleikaglacierxstack]
}}
}}


''Description about your project goes here.....''
 
'''Traleika Glacier'''
 


== Team Members ==
== Team Members ==
* [http://www.intel.com/ Intel:] Hardware guidance, HW/SW co-design, resiliency, technical management
* [https://www.reservoir.com/ Reservoir Labs:] Programming system, R-Stream, tools, optimization
* [http://www.etinternational.com/ ET International (ETI):] Simulators, execution model and runtime support
* [http://www.udel.edu/ University of Delaware (UDEL):] Execution model research
* [http://www.ucsd.edu/ University of California, San Diego (UC San Diego):] Applications
* [http://www.rice.edu/ Rice University:] Programming system, runtime system
* [http://cs.illinois.edu/ University of Illinois at Urbana-Champaign (UIUC):] Programming system, Hierarchical Tiles Arrays (HTA), architecture, system architecture evaluation
* [http://www.pnnl.gov/ Pacific Northwest National Laboratory (PNNL):] Kernels and proxy apps for evaluation
* [http://www.sandia.gov/ Sandia National Lab (SNL):] Co-design lead, combustion proxy app
== Goals and Objectives ==
'''Goal:'''
* Research and mature software technologies addressing major Exascale challenges and get ready to intercept by 2018-2020


'''Objectives:'''


== Objectives ==
* '''Energy efficiency:''' SW components interoperate, harmonize, exploit HW features, and optimize the system for energy efficiency
* '''Data locality:''' PGM system & system SW optimize to reduce data movement
* '''Scalability:''' SW components scalable, portable to O(109)—extreme parallelism
* '''Programmability:''' New (Codelet) & legacy (MPI), with gentle slope for productivity
* '''Execution model:''' Objective function based, dynamic, global system optimization
* '''Self-awareness:''' Dynamically respond to changing conditions and demands
* '''Resiliency:''' Asymptotically provide reliability of N-modular redundancy using HW/SW co-design; HW detection, SW correction
 
 
== Scope of the Project ==
[[File:TG-Scope.png|600px]]




== Roadmap ==
== Roadmap ==
[[File:TG-Roadmap.png]]
== Architecture ==
'''Straw-man System Architecture and Evaluation'''
[[File:TG-Strawman-System.png|600px]]
'''Data-locality and BW Tapering, Why So Important?'''
[[File:TG-Data-Locality.png|600px]]
== Programming and Execution Models ==
[[File:TG-Programming-Model.png]]
'''Programming model'''
* Separation of concerns: Domain specification & HW mapping
* Express data locality with hierarchical tiling
* Global, shared, non-coherent address space
* Optimization and auto generation of codelets (HW specific)
'''Execution model'''
* Dataflow inspired, tiny codelets (self contained)
* Dynamic, event-driven scheduling, non-blocking
* Dynamic decision to move computation to data
* Observation based adaption (self-awareness)
* Implemented in the runtime environment
'''Separation of concerns'''
* User application, control, and resource management
=== Programming System Components ===
[[File:TG-System-Components.png|600px]]
=== Runtime ===
* Different runtimes target different aspects
** IRR: targeted for Intel Straw-man architecture
** SWARM: runtime for a wide range of parallel machines
** DAR3TS: explore codelet PXM using portable C++
** Habanero-C: interfaces IRR, tie-in to CnC
* All explore related aspects of the codelet Program Exec Model (PXM)
* Goal: Converge towards Open Collaborative Runtime (OCR)
** Enabling technology development for codelet execution
** Model systems, foster novel runtime systems research
* Greater visibility through SW stack -> efficient computing
** Break OS/Runtime information firewall
'''Some Promising Results:'''
[[File:TG-Runtime-Results.png|600px]]
'''Runtime Research Agenda'''
* Locality aware scheduling—heuristics for locality/E-efficiency
** Extensions to standard Habanero-C runtime
* Adaptive boosting and idling of hardware
** Avoid energy expensive unsuccessful steals that perform no work
** Turbo mode for a core executing serial code
** Fine grain resource (including energy) management
* Dynamic data-block movement
** Co-locate codelets and data
** Move codelets to data
* Introspection and dynamic optimization
** Performance counters, sensors provide real time information
** Optimization of the system for user defined objective
** (Go beyond energy proportional computing)
=== Simulators and Tools ===
[[File:TG-Simulators-Tools.png|600px]]
'''Simulators—what to expect and not'''
* Evaluation of architecture features for PGM and EXE models
* Relative comparison of performance, energy
* Data movement patterns to memory and interconnect
* Relative evaluation of resource management techniques
[[File:TG-Simulator-Expect-Not.png|400px]]
'''Results Using Simulators'''
[[File:TG-Simulator-Results.png|600px]]
== Applications and HW-SW Codesign ==
[[File:TG-App-HW-Co-design.png|600px]]




== Impact ==
== X-Stack Components ==
[[File:TG-XStack-Components.png|600px]]




== Software Stack ==
== Metrics ==
[[File:TG-Metrics.png|600px]]

Revision as of 18:02, February 7, 2013

Traleika Glacier
File:Your-team-logo.png
Team Members Intel, Reservoir Labs, ETI, UDEL, UC San Diego, Rice U., UIUC, PNNL, SNL
PI Shekhar Borkar (Intel)
Co-PIs Wilf Pinfold (Intel), Richard Lethin (Reservoir Labs), Rishi Khan (ETI), Guang Gao (UDEL), Laura Carrington (UC San Diego), Vivek Sarkar (Rice U.), David Padua (UIUC), Josep Torrellas (UIUC), John Feo (PNNL), Jackie Chen (SNL)
Website https://sites.google.com/site/traleikaglacierxstack
Download {{{download}}}


Traleika Glacier


Team Members


Goals and Objectives

Goal:

  • Research and mature software technologies addressing major Exascale challenges and get ready to intercept by 2018-2020

Objectives:

  • Energy efficiency: SW components interoperate, harmonize, exploit HW features, and optimize the system for energy efficiency
  • Data locality: PGM system & system SW optimize to reduce data movement
  • Scalability: SW components scalable, portable to O(109)—extreme parallelism
  • Programmability: New (Codelet) & legacy (MPI), with gentle slope for productivity
  • Execution model: Objective function based, dynamic, global system optimization
  • Self-awareness: Dynamically respond to changing conditions and demands
  • Resiliency: Asymptotically provide reliability of N-modular redundancy using HW/SW co-design; HW detection, SW correction


Scope of the Project

TG-Scope.png


Roadmap

TG-Roadmap.png


Architecture

Straw-man System Architecture and Evaluation

TG-Strawman-System.png


Data-locality and BW Tapering, Why So Important?

TG-Data-Locality.png


Programming and Execution Models

TG-Programming-Model.png

Programming model

  • Separation of concerns: Domain specification & HW mapping
  • Express data locality with hierarchical tiling
  • Global, shared, non-coherent address space
  • Optimization and auto generation of codelets (HW specific)

Execution model

  • Dataflow inspired, tiny codelets (self contained)
  • Dynamic, event-driven scheduling, non-blocking
  • Dynamic decision to move computation to data
  • Observation based adaption (self-awareness)
  • Implemented in the runtime environment

Separation of concerns

  • User application, control, and resource management


Programming System Components

TG-System-Components.png

Runtime

  • Different runtimes target different aspects
    • IRR: targeted for Intel Straw-man architecture
    • SWARM: runtime for a wide range of parallel machines
    • DAR3TS: explore codelet PXM using portable C++
    • Habanero-C: interfaces IRR, tie-in to CnC
  • All explore related aspects of the codelet Program Exec Model (PXM)
  • Goal: Converge towards Open Collaborative Runtime (OCR)
    • Enabling technology development for codelet execution
    • Model systems, foster novel runtime systems research
  • Greater visibility through SW stack -> efficient computing
    • Break OS/Runtime information firewall


Some Promising Results:

TG-Runtime-Results.png

Runtime Research Agenda

  • Locality aware scheduling—heuristics for locality/E-efficiency
    • Extensions to standard Habanero-C runtime
  • Adaptive boosting and idling of hardware
    • Avoid energy expensive unsuccessful steals that perform no work
    • Turbo mode for a core executing serial code
    • Fine grain resource (including energy) management
  • Dynamic data-block movement
    • Co-locate codelets and data
    • Move codelets to data
  • Introspection and dynamic optimization
    • Performance counters, sensors provide real time information
    • Optimization of the system for user defined objective
    • (Go beyond energy proportional computing)


Simulators and Tools

TG-Simulators-Tools.png


Simulators—what to expect and not

  • Evaluation of architecture features for PGM and EXE models
  • Relative comparison of performance, energy
  • Data movement patterns to memory and interconnect
  • Relative evaluation of resource management techniques

TG-Simulator-Expect-Not.png


Results Using Simulators

TG-Simulator-Results.png


Applications and HW-SW Codesign

TG-App-HW-Co-design.png


X-Stack Components

TG-XStack-Components.png


Metrics

TG-Metrics.png