<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://modelado.org/index.php?action=history&amp;feed=atom&amp;title=X-Stack_Project_Publications</id>
	<title>X-Stack Project Publications - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://modelado.org/index.php?action=history&amp;feed=atom&amp;title=X-Stack_Project_Publications"/>
	<link rel="alternate" type="text/html" href="https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;action=history"/>
	<updated>2026-04-08T01:27:03Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.43.1</generator>
	<entry>
		<id>https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4255&amp;oldid=prev</id>
		<title>imported&gt;ChunhuaLiao: /* D-­TEC: DSL Technology for Exascale Computing */</title>
		<link rel="alternate" type="text/html" href="https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4255&amp;oldid=prev"/>
		<updated>2015-06-18T20:49:41Z</updated>

		<summary type="html">&lt;p&gt;&lt;span class=&quot;autocomment&quot;&gt;D-­TEC: DSL Technology for Exascale Computing&lt;/span&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 20:49, June 18, 2015&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l36&quot;&gt;Line 36:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 36:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# Kevin Alan Stock. Vectorization and Register Reuse in High Performance Computing. PhD thesis, The Ohio State University, 2014.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# Kevin Alan Stock. Vectorization and Register Reuse in High Performance Computing. PhD thesis, The Ohio State University, 2014.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noel Pouchet, J. Ramanujam, and P. Sadayappan. A stencil compiler for short-vector simd architectures. In Proceedings of the 27th International ACM Conference on International Conference on Supercomputing, ICS &amp;#039;13, pages 13{24, New York, NY, USA, 2013. ACM.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noel Pouchet, J. Ramanujam, and P. Sadayappan. A stencil compiler for short-vector simd architectures. In Proceedings of the 27th International ACM Conference on International Conference on Supercomputing, ICS &amp;#039;13, pages 13{24, New York, NY, USA, 2013. ACM.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# Justin Holewinski, Louis-Noel Pouchet, and P. Sadayappan. High-performance code generation&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# Justin Holewinski, Louis-Noel Pouchet, and P. Sadayappan. High-performance code generation for stencil computations on gpu architectures. In Proceedings of the 26th ACM International Conference on Supercomputing, ICS &#039;12, pages 311{320, New York, NY, USA, 2012. ACM.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;for stencil computations on gpu architectures. In Proceedings of the 26th ACM International&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-added&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Conference on Supercomputing, ICS &#039;12, pages 311{320, New York, NY, USA, 2012. ACM.&lt;/div&gt;&lt;/td&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-side-added&quot;&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# Louis-Noel Pouchet, Peng Zhang, P. Sadayappan, and Jason Cong. Polyhedral-based data reuse optimization for configurable computing. In Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays, FPGA &amp;#039;13, pages 29{38, New York, NY, USA, 2013. ACM.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# Louis-Noel Pouchet, Peng Zhang, P. Sadayappan, and Jason Cong. Polyhedral-based data reuse optimization for configurable computing. In Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays, FPGA &amp;#039;13, pages 29{38, New York, NY, USA, 2013. ACM.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# S. Rajbhandari, A. Nikam, Pai-Wei Lai, K. Stock, S. Krishnamoorthy, and P. Sadayappan. Cast: Contraction algorithm for symmetric tensors. In Parallel Processing (ICPP), 2014 43rd International Conference on, pages 261{272, Sept 2014.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;# S. Rajbhandari, A. Nikam, Pai-Wei Lai, K. Stock, S. Krishnamoorthy, and P. Sadayappan. Cast: Contraction algorithm for symmetric tensors. In Parallel Processing (ICPP), 2014 43rd International Conference on, pages 261{272, Sept 2014.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key modwiki-mod:diff:1.41:old-4254:rev-4255:php=table --&gt;
&lt;/table&gt;</summary>
		<author><name>imported&gt;ChunhuaLiao</name></author>
	</entry>
	<entry>
		<id>https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4254&amp;oldid=prev</id>
		<title>imported&gt;ChunhuaLiao: /* D-­TEC: DSL Technology for Exascale Computing */</title>
		<link rel="alternate" type="text/html" href="https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4254&amp;oldid=prev"/>
		<updated>2015-06-18T20:49:16Z</updated>

		<summary type="html">&lt;p&gt;&lt;span class=&quot;autocomment&quot;&gt;D-­TEC: DSL Technology for Exascale Computing&lt;/span&gt;&lt;/p&gt;
&lt;a href=&quot;https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;amp;diff=4254&amp;amp;oldid=4253&quot;&gt;Show changes&lt;/a&gt;</summary>
		<author><name>imported&gt;ChunhuaLiao</name></author>
	</entry>
	<entry>
		<id>https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4253&amp;oldid=prev</id>
		<title>imported&gt;ChunhuaLiao: /* D-­TEC: DSL Technology for Exascale Computing */</title>
		<link rel="alternate" type="text/html" href="https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4253&amp;oldid=prev"/>
		<updated>2015-06-18T20:46:25Z</updated>

		<summary type="html">&lt;p&gt;&lt;span class=&quot;autocomment&quot;&gt;D-­TEC: DSL Technology for Exascale Computing&lt;/span&gt;&lt;/p&gt;
&lt;a href=&quot;https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;amp;diff=4253&amp;amp;oldid=4252&quot;&gt;Show changes&lt;/a&gt;</summary>
		<author><name>imported&gt;ChunhuaLiao</name></author>
	</entry>
	<entry>
		<id>https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4252&amp;oldid=prev</id>
		<title>imported&gt;ChunhuaLiao: /* D-­TEC: DSL Technology for Exascale Computing */</title>
		<link rel="alternate" type="text/html" href="https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4252&amp;oldid=prev"/>
		<updated>2015-06-18T20:42:42Z</updated>

		<summary type="html">&lt;p&gt;&lt;span class=&quot;autocomment&quot;&gt;D-­TEC: DSL Technology for Exascale Computing&lt;/span&gt;&lt;/p&gt;
&lt;a href=&quot;https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;amp;diff=4252&amp;amp;oldid=4251&quot;&gt;Show changes&lt;/a&gt;</summary>
		<author><name>imported&gt;ChunhuaLiao</name></author>
	</entry>
	<entry>
		<id>https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4251&amp;oldid=prev</id>
		<title>imported&gt;ChunhuaLiao: Created page with &quot;==D-­TEC: DSL Technology for Exascale Computing== List * Markus Schordan, Pei-Hung Lin, Dan Quinlan, and Louis-Nol Pouchet. Veri�cation of polyhedral optimizations with con...&quot;</title>
		<link rel="alternate" type="text/html" href="https://modelado.org/index.php?title=X-Stack_Project_Publications&amp;diff=4251&amp;oldid=prev"/>
		<updated>2015-06-18T20:39:36Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;==D-­TEC: DSL Technology for Exascale Computing== List * Markus Schordan, Pei-Hung Lin, Dan Quinlan, and Louis-Nol Pouchet. Veri�cation of polyhedral optimizations with con...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;==D-­TEC: DSL Technology for Exascale Computing==&lt;br /&gt;
List&lt;br /&gt;
* Markus Schordan, Pei-Hung Lin, Dan Quinlan, and Louis-Nol Pouchet. Veri�cation of polyhedral&lt;br /&gt;
optimizations with constant loop bounds in �nite state space computations. In Tiziana&lt;br /&gt;
Margaria and Bernhard Ste�en, editors, Leveraging Applications of Formal Methods, Veri�ca-&lt;br /&gt;
tion and Validation. Specialized Techniques and Applications, volume 8803 of Lecture Notes in&lt;br /&gt;
Computer Science, pages 493{508. Springer Berlin Heidelberg, 2014.&lt;br /&gt;
* Chunhua Liao, Daniel J. Quinlan, Thomas Panas, and Bronis R. de Supinski. A rose-based&lt;br /&gt;
openmp 3.0 research compiler supporting multiple runtime libraries. In Mitsuhisa Sato, Toshihiro&lt;br /&gt;
Hanawa, Matthias S. Muller, Barbara M. Chapman, and Bronis R. de Supinski, editors,&lt;br /&gt;
IWOMP, volume 6132 of Lecture Notes in Computer Science, pages 15{28. Springer, 2010.&lt;br /&gt;
* Chunhua Liao, Yonghong Yan, Bronis R de Supinski, Daniel J Quinlan, and Barbara Chapman.&lt;br /&gt;
Early experiences with the openmp accelerator model. In OpenMP in the Era of Low Power&lt;br /&gt;
Devices and Accelerators, pages 84{98. Springer, 2013.&lt;br /&gt;
*  Dan Quinlan and Chunhua Liao. The ROSE source-to-source compiler infrastructure. In Cetus&lt;br /&gt;
Users and Compiler Infrastructure Workshop, Galveston Island, TX, USA, October 2011.&lt;br /&gt;
* Yonghong Yan, Pei-Hung Lin, Chunhua Liao, Bronis R. de Supinski, and Daniel J. Quinlan.&lt;br /&gt;
Supporting multiple accelerators in high-level programming models. In Proceedings of the Sixth&lt;br /&gt;
International Workshop on Programming Models and Applications for Multicores and Many-&lt;br /&gt;
cores, PMAM &amp;#039;15, pages 170{180, New York, NY, USA, 2015. ACM.&lt;br /&gt;
Pei-Hung Lin, Chunhua Liao, Daniel J. Quinlan, and Stephen Guzik. Experiences of using&lt;br /&gt;
the openmp accelerator model to port doe stencil applications, 2014. Poster presented at the&lt;br /&gt;
Workshop on accelerator programming using directives, Nov. 17, 2014, New Orleans, LA.&lt;br /&gt;
* Markus Schordan, Pei-Hung Lin, Dan Quinlan, and Louis-Nol Pouchet. Veri�cation of parallel&lt;br /&gt;
polyhedral transformations with arbitrary constant loop bounds, 2015. In review process of&lt;br /&gt;
EuroPar2015.&lt;br /&gt;
* Jonathan Ragan-Kelley. Decoupling Algorithms from the Organization of Computation for High&lt;br /&gt;
Performance Image Processing. Ph.d. thesis, Massachusetts Institute of Technology, Cambridge,&lt;br /&gt;
MA, June 2014.&lt;br /&gt;
* Jason Ansel. Autotuning Programs with Algorithmic Choice. Ph.d. thesis, Massachusetts Institute&lt;br /&gt;
of Technology, Cambridge, MA, February 2014.&lt;br /&gt;
* Je�rey Bosboom. Streamjit: A commensal compiler for high-performance stream programming.&lt;br /&gt;
S.m. thesis, Massachusetts Institute of Technology, Cambridge, MA, June 2014.&lt;br /&gt;
* Eric Wong. Optimizations in stream programming for multimedia applications. M.eng. thesis,&lt;br /&gt;
Massachusetts Institute of Technology, Cambridge, MA, Aug 2012.&lt;br /&gt;
* Phumpong Watanaprakornkul. Distributed data as a choice in petabricks. M.eng. thesis,&lt;br /&gt;
Massachusetts Institute of Technology, Cambridge, MA, Jun 2012.&lt;br /&gt;
* Charith Mendis, Je�rey Bosboom, Kevin Wu, Shoaib Kamil, Jonathan Ragan-Kelley, Sylvain&lt;br /&gt;
Paris, Qin Zhao, and Saman Amarasinghe. Helium: Lifting high-performance stencil kernels&lt;br /&gt;
from stripped x86 binaries to halide dsl code. In ACM SIGPLAN Conference on Programming&lt;br /&gt;
Language Design and Implementation, June 2015.&lt;br /&gt;
* Jason Ansel, Shoaib Kamil, Kalyan Veeramachaneni, Jonathan Ragan-Kelley, Je�rey Bosboom,&lt;br /&gt;
Una-May O&amp;#039;Reilly, and Saman Amarasinghe. Opentuner: An extensible framework for&lt;br /&gt;
program autotuning. In International Conference on Parallel Architectures and Compilation&lt;br /&gt;
Techniques, Edmonton, Canada, August 2014.&lt;br /&gt;
* Je�rey Bosboom, Sumanaruban Rajadurai, Weng-Fai Wong, and Saman Amarasinghe.&lt;br /&gt;
Streamjit: A commensal compiler for high-performance stream programming. In ACM SIG-&lt;br /&gt;
PLAN Conference on Object-Oriented Programming Systems and Applications, Portland, OR,&lt;br /&gt;
October 2014.&lt;br /&gt;
* Jonathan Ragan-Kelley, Connelly Barnes, Andrew Adams, Sylvain Paris, Fr�edo Durand, and&lt;br /&gt;
Saman Amarasinghe. Halide: A language and compiler for optimizing parallelism, locality, and&lt;br /&gt;
recomputation in image processing pipelines. In ACM SIGPLAN Conference on Programming&lt;br /&gt;
Language Design and Implementation, Seattle, WA, June 2013.&lt;br /&gt;
* Phitchaya Mangpo Phothilimthana, Jason Ansel, Jonathan Ragan-Kelley, and Saman Amarasinghe.&lt;br /&gt;
Portable performance on heterogeneous architectures. In The International Conference&lt;br /&gt;
on Architectural Support for Programming Languages and Operating Systems, Houston, TX,&lt;br /&gt;
March 2013.&lt;br /&gt;
[18] Maciej Pacula, Jason Ansel, Saman Amarasinghe, and Una-May O&amp;#039;Reilly. Hyperparameter&lt;br /&gt;
tuning in bandit-based adaptive operator selection. In European Conference on the Applications&lt;br /&gt;
of Evolutionary Computation, Malaga, Spain, Apr 2012.&lt;br /&gt;
[19] Jason Ansel, Maciej Pacula, Yee Lok Wong, Cy Chan, Marek Olszewski, Una-May O&amp;#039;Reilly,&lt;br /&gt;
and Saman Amarasinghe. Siblingrivalry: Online autotuning through local competitions. In&lt;br /&gt;
International Conference on Compilers Architecture and Synthesis for Embedded Systems, Tampere,&lt;br /&gt;
Finland, Oct 2012.&lt;br /&gt;
[20] Jonathan Ragan-Kelley, Andrew Adams, Sylvain Paris, Marc Levoy, Saman Amarasinghe, and&lt;br /&gt;
Fr�edo Durand. Decoupling algorithms from schedules for easy optimization of image processing&lt;br /&gt;
pipelines. ACM Transactions on Graphics, 31(4), July 2012.&lt;br /&gt;
[21] Dan Alistarh, Patrick Eugster, Maurice Herlihy, Alexander Matveev, and Nir Shavit. Stacktrack:&lt;br /&gt;
An automated transactional approach to concurrent memory reclamation. In Proceedings&lt;br /&gt;
of the Ninth European Conference on Computer Systems, EuroSys &amp;#039;14, pages 25:1{25:14, New&lt;br /&gt;
York, NY, USA, 2014. ACM.&lt;br /&gt;
[22] Jason Ansel, Shoaib Kamil, Kalyan Veeramachaneni, Una-May O&amp;#039;Reilly, and Saman Amarasinghe.&lt;br /&gt;
Opentuner: An extensible framework for program autotuning. Technical Report&lt;br /&gt;
MIT/CSAIL Technical Report MIT-CSAIL-TR-2013-026, Massachusetts Institute of Technology,&lt;br /&gt;
Cambridge, MA, Nov 2013.&lt;br /&gt;
[23] Alexander Matveev and Nir Shavit. Reduced hardware norec: A safe and scalable hybrid transactional&lt;br /&gt;
memory. In 20th International Conference on Architectural Support for Programming&lt;br /&gt;
Languages and Operating Systems, ASPLOS 2015, Istanbul, Turkey, 2015. ACM.&lt;br /&gt;
[24] Sasa Misailovic, Michael Carbin, Sara Achour, Zichao Qi, and Martin C. Rinard. Chisel:&lt;br /&gt;
Reliability- and accuracy-aware optimization of approximate computational kernels. In Pro-&lt;br /&gt;
ceedings of the 2014 ACM International Conference on Object Oriented Programming Systems&lt;br /&gt;
Languages &amp;amp;#38; Applications, OOPSLA &amp;#039;14, pages 309{328, New York, NY, USA, 2014. ACM.&lt;br /&gt;
[25] Zhilei Xu, Shoaib Kamil, and Armando Solar-Lezama. Msl: A synthesis enabled language for&lt;br /&gt;
distributed implementations. In Proceedings of the International Conference for High Perfor-&lt;br /&gt;
mance Computing, Networking, Storage and Analysis, SC &amp;#039;14, pages 311{322, Piscataway, NJ,&lt;br /&gt;
USA, 2014. IEEE Press.&lt;br /&gt;
[26] F. Augustin and Y. ~ M. Marzouk. Uncertainty quanti�cation in high performance computing&lt;br /&gt;
(invited position paper). SIGPLAN Workshop on Probabilistic and Approximate Computing&lt;br /&gt;
(APPROX), 2014.&lt;br /&gt;
[27] David Grove, Josh Milthorpe, and Olivier Tardieu. Supporting array programming in X10. In&lt;br /&gt;
Proceedings of ACM SIGPLAN International Workshop on Libraries, Languages, and Compilers&lt;br /&gt;
for Array Programming, ARRAY&amp;#039;14, pages 38:38{38:43, New York, NY, USA, 2014. ACM.&lt;br /&gt;
[28] Wei Zhang, Olivier Tardieu, David Grove, Benjamin Herta, Tomio Kamada, Vijay Saraswat,&lt;br /&gt;
and Mikio Takeuchi. GLB: Lifeline-based global load balancing library in X10. In Proceedings&lt;br /&gt;
of the First Workshop on Parallel Programming for Analytics Applications, PPAA &amp;#039;14, pages&lt;br /&gt;
31{40, New York, NY, USA, 2014. ACM.&lt;br /&gt;
[29] Olivier Tardieu, David Grove, Benjamin Herta, Tomio Kamada, Vijay Saraswat, Mikio&lt;br /&gt;
Takeuchi, and Wei Zhang. X10 for Productivity and Performance at Scale: A Submission&lt;br /&gt;
to the 2013 HPC Class II Challenge, October 2013.&lt;br /&gt;
[30] Craig Rasmussen, Matthew Sottile, Daniel Nagle, and Soren Rasmussen. Locally-oriented&lt;br /&gt;
programming: A simple programming model for stencil-based computations on multi-level distributed&lt;br /&gt;
memory architectures. In Proceedings of Euro-Par 2015 Parallel Processing, Lecture&lt;br /&gt;
Notes in Computer Science. Springer International Publishing, 2015. Submitted, February 2015.&lt;br /&gt;
[31] Thomas Steel Henretty. Performance Optimization of Stencil Computations on Modern SIMD&lt;br /&gt;
Architectures. PhD thesis, The Ohio State University, 2014.&lt;br /&gt;
[32] Justin Andrew Holewinski. Automatic Code Generation for Stencil Computations on GPU&lt;br /&gt;
Architectures. PhD thesis, The Ohio State University, 2012.&lt;br /&gt;
[33] Mahesh Ravishankar. Automatic parallelization of loops with data dependent control &lt;br /&gt;
ow and&lt;br /&gt;
array access patterns. PhD thesis, The Ohio State University, 2014.&lt;br /&gt;
[34] Kevin Alan Stock. Vectorization and Register Reuse in High Performance Computing. PhD&lt;br /&gt;
thesis, The Ohio State University, 2014.&lt;br /&gt;
[35] Tom Henretty, Richard Veras, Franz Franchetti, Louis-Noel Pouchet, J. Ramanujam, and&lt;br /&gt;
P. Sadayappan. A stencil compiler for short-vector simd architectures. In Proceedings of the&lt;br /&gt;
27th International ACM Conference on International Conference on Supercomputing, ICS &amp;#039;13,&lt;br /&gt;
pages 13{24, New York, NY, USA, 2013. ACM.&lt;br /&gt;
[36] Justin Holewinski, Louis-Noel Pouchet, and P. Sadayappan. High-performance code generation&lt;br /&gt;
for stencil computations on gpu architectures. In Proceedings of the 26th ACM International&lt;br /&gt;
Conference on Supercomputing, ICS &amp;#039;12, pages 311{320, New York, NY, USA, 2012. ACM.&lt;br /&gt;
[37] Louis-Noel Pouchet, Peng Zhang, P. Sadayappan, and Jason Cong. Polyhedral-based data&lt;br /&gt;
reuse optimization for con�gurable computing. In Proceedings of the ACM/SIGDA International&lt;br /&gt;
Symposium on Field Programmable Gate Arrays, FPGA &amp;#039;13, pages 29{38, New York, NY, USA,&lt;br /&gt;
2013. ACM.&lt;br /&gt;
[38] S. Rajbhandari, A. Nikam, Pai-Wei Lai, K. Stock, S. Krishnamoorthy, and P. Sadayappan.&lt;br /&gt;
Cast: Contraction algorithm for symmetric tensors. In Parallel Processing (ICPP), 2014 43rd&lt;br /&gt;
International Conference on, pages 261{272, Sept 2014.&lt;br /&gt;
[39] Samyam Rajbhandari, Akshay Nikam, Pai-Wei Lai, Kevin Stock, Sriram Krishnamoorthy,&lt;br /&gt;
and P. Sadayappan. A communication-optimal framework for contracting distributed tensors.&lt;br /&gt;
In Proceedings of the International Conference for High Performance Computing, Networking,&lt;br /&gt;
Storage and Analysis, SC &amp;#039;14, pages 375{386, Piscataway, NJ, USA, 2014. IEEE Press.&lt;br /&gt;
[40] Mahesh Ravishankar, John Eisenlohr, Louis-Noel Pouchet, J. Ramanujam, Atanas Rountev,&lt;br /&gt;
and P. Sadayappan. Code generation for parallel execution of a class of irregular loops on distributed&lt;br /&gt;
memory systems. In Proceedings of the International Conference on High Performance&lt;br /&gt;
Computing, Networking, Storage and Analysis, SC &amp;#039;12, pages 72:1{72:11, Los Alamitos, CA,&lt;br /&gt;
USA, 2012. IEEE Computer Society Press.&lt;br /&gt;
[41] Mahesh Ravishankar, John Eisenlohr, Louis-Noel Pouchet, J. Ramanujam, Atanas Rountev,&lt;br /&gt;
and P. Sadayappan. Automatic parallelization of a class of irregular loops for distributed&lt;br /&gt;
memory systems. ACM Transactions on Parallel Computing, 1(1):7:1{7:37, September 2014.&lt;br /&gt;
[42] Mahesh Ravishankar, Roshan Dathathri, Venmugil Elango, Louis-Noel Pouchet, J Ramanujam,&lt;br /&gt;
Atanas Rountev, and P Sadayappan. Distributed memory code generation for mixed&lt;br /&gt;
irregular/regular computations. In Proceedings of the 20th ACM SIGPLAN Symposium on&lt;br /&gt;
Principles and Practice of Parallel Programming, pages 65{75. ACM, 2015.&lt;br /&gt;
[43] Kevin Stock, Martin Kong, Tobias Grosser, Louis-Noel Pouchet, Fabrice Rastello, J. Ramanujam,&lt;br /&gt;
and P. Sadayappan. A framework for enhancing data reuse via associative reordering.&lt;br /&gt;
In Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and&lt;br /&gt;
Implementation, PLDI &amp;#039;14, pages 65{76, New York, NY, USA, 2014. ACM.&lt;br /&gt;
[44] Venmugil Elango, Fabrice Rastello, Louis-Noel Pouchet, J. Ramanujam, and P. Sadayappan.&lt;br /&gt;
On characterizing the data movement complexity of computational dags for parallel execution.&lt;br /&gt;
In Proceedings of the 26th ACM Symposium on Parallelism in Algorithms and Architectures,&lt;br /&gt;
SPAA &amp;#039;14, pages 296{306, New York, NY, USA, 2014. ACM.&lt;br /&gt;
[45] Naznin Fauzia, Venmugil Elango, Mahesh Ravishankar, J. Ramanujam, Fabrice Rastello,&lt;br /&gt;
Atanas Rountev, Louis-Noel Pouchet, and P. Sadayappan. Beyond reuse distance analysis:&lt;br /&gt;
Dynamic analysis for characterization of data locality potential. ACM Trans. Archit. Code&lt;br /&gt;
Optim., 10(4):53:1{53:29, Dec. 2013.&lt;br /&gt;
[46] Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noel Pouchet, and P. Sadayappan.&lt;br /&gt;
When polyhedral transformations meet simd code generation. In Proceedings of&lt;br /&gt;
the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation,&lt;br /&gt;
PLDI &amp;#039;13, pages 127{138, New York, NY, USA, 2013. ACM.&lt;br /&gt;
[47] Lai Wei and John Mellor-Crummey. Autotuning tensor transposition. In Proceedings of the&lt;br /&gt;
19th International Workshop on High-level Parallel Programming Models and Supportive Envi-&lt;br /&gt;
ronments, May 2014.&lt;br /&gt;
[48] Vivek Sarkar. Jun Shirako, Louis-Noel Pouchet. Oil and water can mix: An integration of polyhedral&lt;br /&gt;
and ast-based transformations. In IEEE Conference on High Performance Computing,&lt;br /&gt;
Networking, Storage and Analysis (SC&amp;#039;14). IEEE, 2014.&lt;br /&gt;
[49] Vivek Sarkar. Prasanth Chatarasi, Jun Shirako. Polyhedral transformations of explicitly parallel&lt;br /&gt;
programs. In 5th International Workshop on Polyhedral Compilation Techniques (IMPACT&lt;br /&gt;
2015). IEEE, 2015.&lt;br /&gt;
[50] Kamal Sharma. Locality Transformations of Computation and Data for Portable Performance.&lt;br /&gt;
PhD thesis, Rice University, August 2014.&lt;br /&gt;
[51] Jun Shirako and Vivek Sarkar. Oil and water can mix! Experiences with integrating polyhedral&lt;br /&gt;
and AST-based Transformations. In 17th Workshop on Compilers for Parallel Programming,&lt;br /&gt;
July 2013.&lt;br /&gt;
[52] Jisheng Zhao, Michael Burke, and Vivek Sarkar. Rice ROSE Compositional Analysis and&lt;br /&gt;
Transformation Framework (R2CAT). Technical report, LLNL Technical Report 590233, October&lt;br /&gt;
2012.&lt;br /&gt;
[53] Phitchaya Mangpo Phothilimthana, Tikhon Jelvis, Rohin Shah, Nishant Totla, Sarah Chasins,&lt;br /&gt;
and Rastislav Bod��k. Chlorophyll: synthesis-aided compiler for low-power spatial architectures.&lt;br /&gt;
In O&amp;#039;Boyle and Pingali [58], page 42.&lt;br /&gt;
[54] Emina Torlak and Rastislav Bod��k. A lightweight symbolic virtual machine for solver-aided&lt;br /&gt;
host languages. In O&amp;#039;Boyle and Pingali [58], page 54.&lt;br /&gt;
[55] Rajeev Alur, Rastislav Bod��k, Garvit Juniwal, Milo M. K. Martin, Mukund Raghothaman,&lt;br /&gt;
Sanjit A. Seshia, Rishabh Singh, Armando Solar-Lezama, Emina Torlak, and Abhishek Udupa.&lt;br /&gt;
Syntax-guided synthesis. In Formal Methods in Computer-Aided Design, FMCAD 2013, Port-&lt;br /&gt;
land, OR, USA, October 20-23, 2013, pages 1{8. IEEE, 2013.&lt;br /&gt;
[56] Emina Torlak and Rastislav Bod��k. Growing solver-aided languages with rosette. In Antony L.&lt;br /&gt;
Hosking, Patrick Th. Eugster, and Robert Hirschfeld, editors, ACM Symposium on New Ideas&lt;br /&gt;
in Programming and Re&lt;br /&gt;
ections on Software, Onward! 2013, part of SPLASH &amp;#039;13, Indianapolis,&lt;br /&gt;
IN, USA, October 26-31, 2013, pages 135{152. ACM, 2013.&lt;br /&gt;
[57] Leo A. Meyerovich, Matthew E. Torok, Eric Atkinson, and Rastislav Bod��k. Parallel schedule&lt;br /&gt;
synthesis for attribute grammars. In Alex Nicolau, Xiaowei Shen, Saman P. Amarasinghe, and&lt;br /&gt;
Richard W. Vuduc, editors, ACM SIGPLAN Symposium on Principles and Practice of Parallel&lt;br /&gt;
Programming, PPoPP &amp;#039;13, Shenzhen, China, February 23-27, 2013, pages 187{196. ACM, 2013.&lt;br /&gt;
[58] Michael F. P. O&amp;#039;Boyle and Keshav Pingali, editors. ACM SIGPLAN Conference on Program-&lt;br /&gt;
ming Language Design and Implementation, PLDI &amp;#039;14, Edinburgh, United Kingdom - June 09&lt;br /&gt;
- 11, 2014. ACM, 2014.&lt;br /&gt;
[59] Tan Nguyen, Pietro Cicotti, Eric Bylaska, Dan Quinlan, and Scott B. Baden. Bamboo: translating&lt;br /&gt;
mpi applications to a latency-tolerant, data-driven form. In Proceedings of the Interna-&lt;br /&gt;
tional Conference on High Performance Computing, Networking, Storage and Analysis, SC &amp;#039;12,&lt;br /&gt;
pages 39:1{39:11, Los Alamitos, CA, USA, 2012. IEEE Computer Society Press.&lt;br /&gt;
[60] Tan Nguyen and Scott B. Baden. Bamboo-preliminary scaling results on multiple hybrid nodes&lt;br /&gt;
of knights corner and sandy bridge processors. In Proc. WOLFHPC: Workshop on Domain-&lt;br /&gt;
Speci�c Languages and High-Level Frameworks for HPC, SC13, The International Conference&lt;br /&gt;
for High Performance Computing, Networking, Storage and Analysis, Denver CO, 2013.&lt;/div&gt;</summary>
		<author><name>imported&gt;ChunhuaLiao</name></author>
	</entry>
</feed>