Main Program

Sunday, June 15, 2014

PM
6:00-8:00
Reception (Salon A/B)

Monday, June 16, 2014

AM
8:30-8:45
Opening Remarks (Marquette I/IV)
 
8:45-9:45
Keynote I (Marquette I/IV): Insight into the MICROSOFT XBOX ONE Technology
Dr. Ilan Spillinger, Corporate Vice President, Technology and Silicon, Microsoft
 
9:45-10:15
Fast Forward Session I (Marquette I/IV)
 
10:15-10:45
Break (Marquette I/IV)
 
10:45-12:00
Session 1 (Marquette I/IV): Machines and Prototypes
PM
12:00-1:15
Lunch (Symphony I - III)
 
1:15-2:55
Session 2A (Marquette I): Resilience
Session 2B (Marquette IV): Design Space Exploration
 
2:55-3:25
Break (Marquette I/IV)
 
3:25-5:05
Session 3A (Marquette I): Caches
Session 3B (Marquette IV): GPUs and Parallelism
 
6:00-onwards
Business Meeting (Marquette I/IV)

Tuesday, June 17, 2014

AM
8:30-9:30
Keynote II (Marquette I/IV): Should Computer Architects Take a Closer Look At Today's Most Pervasive Computer System – The Mobile Phone?
Prof. Trevor Mudge, Department of Computer Science and Engineering, University of Michigan
 
9:30-10:15
Fast Forward Session II (Marquette I/IV)
 
10:15-10:45
Break (Marquette I/IV)
 
10:45-12:00
Session 4 (Marquette I/IV): Emerging Technologies
PM
12:00-2:00
Awards Luncheon (Salon A - C)
 
2:00-3:15
Session 5A (Marquette I): NVRAM
Session 5B (Marquette IV): Datacenters and Cloud
 
3:15-3:45
Break (Marquette I/IV)
 
3:45-5:00
Session 6A (Marquette I): DRAM
Session 6B (Marquette IV): Circuits and Architecture
 
6:00-onwards
Excursion: Mill City Museum, Minneapolis

Wednesday, June 18, 2014

AM
8:30-10:10
Session 7A (Marquette I): Coherence and Replay
Session 7B (Marquette IV): Security/OOO Processors
 
10:10-10:40
Break (Marquette I/IV)
 
10:40-12:20
Session 8 (Marquette I/IV): Accelerators
PM
12:20-12:35
Closing Remarks (Marquette I/IV)
 
Monday, June 16, 2014
 
Monday, 8:45am-9:45am
Keynote I: Insight into the MICROSOFT XBOX ONE Technology
Chair: Doug Burger, Microsoft
  • Bio: Dr. Ilan Spillinger was instrumental in bringing Kinect for Xbox 360 to market, which continues to proliferate additional innovative programs and products within the Natural User Interface industry. His team's recent efforts include developing the new architecture and silicon design for Xbox One and the new Kinect, which launched in the fall of 2013. Previously, during a six-year tenure with IBM, Dr. Spillinger served as a distinguished engineer and vice president for advanced processor design. In that role he was responsible for development of all Power Architecture-based processors at IBM: server processors, embedded processors, and client-driven solutions. Prior to that, Dr. Spillinger was a principal engineer and manager of the architecture team in Intel Israel, responsible for the definition of x86-based low-cost and low-power microprocessors, specifically the first Intel mobile processor in the Intel Centrino roadmap. Spillinger holds a D.Sc. and M.Sc. in electrical engineering from the Technion Israel Institute of Technology in Haifa, Israel, and joined Microsoft in 2007.
 
Monday, 10:45am-12:00pm
Session 1: Machines and Prototypes
Chair: Mark D. Hill, University of Wisconsin-Madison
 
  • Unifying on-chip and inter-node switching within the Anton 2 network
  • Brian Towles, J.P. Grossman, Brian Greskamp, and David E. Shaw (D. E. Shaw Research)
     
  • A Reconfigurable Fabric for Accelerating Large-Scale Datacenter Services
  • Andrew Putnam (Microsoft), Adrian M. Caulfield (Microsoft), Eric S. Chung (Microsoft), Derek Chiou (Microsoft and University of Texas at Austin), Kypros Constantinides (Amazon), John Demme (Columbia University), Hadi Esmaeilzadeh (Georgia Institute of Technology), Jeremy Fowers (Microsoft), Gopi Prashanth Gopal (Microsoft), Jan Gray (Microsoft), Michael Haselman (Microsoft), Scott Hauck (Microsoft and University of Washington), Stephen Heil (Microsoft), Amir Hormati (Google), Joo-Young Kim (Microsoft), Sitaram Lanka (Microsoft), James Larus (EPFL), Eric Peterson (Microsoft), Simon Pope (Microsoft), Aaron Smith (Microsoft), Jason Thong (Microsoft), Phillip Yi Xiao (Microsoft), Doug Burger (Microsoft)
     
  • SCORPIO: A 36-Core Research Chip Demonstrating Snoopy Coherence on a Scalable Mesh NoC with In-Network Ordering
  • Bhavya K. Daya, Chia-Hsin Owen Chen, Suvinay Subramanian, Woo-Cheol Kwon, Sunghyun Park, Tushar Krishna, Jim Holt, Anantha P. Chandrakasan, Li-Shiuan Peh (Massachusetts Institute of Technology)
     
Monday, 1:15pm-2:55pm
Section 2A: Resilience
Chair: Yuan Xie, Pennsylvania State University
 
  • Avoiding Core's DUE & SDC via Acoustic Wave Detectors and Tailored Error Containment and Recovery
  • Gaurang Upasani (Universitat Politècnica de Catalunya), Xavier Vera (Intel Barcelona Research Center), Antonio González (Universitat Politècnica de Catalunya / Intel Barcelona Research Center)
     
  • MemGuard: A Low Cost and Energy Efficient Design to Support and Enhance Memory System Reliability
  • Long Chen, Zhao Zhang (Iowa State University)
     
  • GangES: Gang Error Simulation for Hardware Resiliency Evaluation
  • Siva Kumar Sastry Hari (NVIDIA), Radha Venkatagiri (University of Illinois at Urbana-Champaign), Sarita V. Adve (University of Illinois at Urbana-Champaign), Helia Naeimi (Intel Labs)
     
  • Real-World Design and Evaluation of Compiler-Managed GPU Redundant Multithreading
  • Jack Wadden (University of Virginia), Alexander Lyashevsky (AMD Research), Sudhanva Gurumurthi (AMD Research), Vilas Sridharan (RAS Architecture, AMD), Kevin Skadron (University of Virginia)
     
Monday, 1:15pm-2:55pm
Section 2B: Design Space Exploration
Chair: Martha Kim, Columbia University
 
  • ArchRanker: A Ranking Approach to Design Space Exploration
  • Tianshi Chen (Chinese Academy of Sciences), Qi Guo (Carnegie Mellon University), Ke Tang (University of Science and Technology of China), Olivier Temam (Inria), Zhiwei Xu (Chinese Academy of Sciences), Zhi-Hua Zhou (Nanjing University), Yunji Chen (Chinese Academy of Sciences)
     
  • Aladdin: A Pre-RTL, Power-Performance Accelerator Simulator Enabling Large Design Space Exploration of Customized Architectures
  • Yakun Sophia Shao, Brandon Reagen, Gu-Yeon Wei, David Brooks (Harvard University)
     
  • SynFull: Synthetic Traffic Models Capturing Cache Coherent Behaviour
  • Mario Badr, Natalie Enright Jerger (University of Toronto)
     
  • Harnessing ISA Diversity: Design of a Heterogeneous-ISA Chip Multiprocessor
  • Ashish Venkat, Dean M. Tullsen (University of California, San Diego)
     
Monday, 3:25pm-5:05pm
Section 3A: Caches
Chair: Sandhya Dwarkadas, University of Rochester
 
  • The Direct-to-Data (D2D) Cache: Navigating the Cache Hierarchy with a Single Lookup
  • Andreas Sembrant, Erik Hagersten, David Black-Schaffer (Uppsala University)
     
  • SC2: A Statistical Compression Cache Scheme
  • Angelos Arelakis, Per Stenstrom (Chalmers University of Technology)
     
  • The Dirty-Block Index
  • Vivek Seshadri (Carnegie Mellon University), Abhishek Bhowmick (Carnegie Mellon University), Onur Mutlu (Carnegie Mellon University), Phillip B. Gibbons (Intel Pittsburgh), Michael A. Kozuch (Intel Pittsburgh), Todd C. Mowry (Carnegie Mellon University)
     
  • Going Vertical in Memory Management: Handling Multiplicity by Multi-policy
  • Lei Liu (Chinese Academy of Sciences), Yong Li (University of Pittsburgh), Zehan Cui (Chinese Academy of Sciences), Yungang Bao (Chinese Academy of Sciences), Mingyu Chen (Chinese Academy of Sciences), Chengyong Wu (Chinese Academy of Sciences)
     
Monday, 3:25pm-5:05pm
Section 3B: GPUs and Parallelism
Chair: Pradeep Dubey, Intel
 
  • Fine-grain Task Aggregation and Coordination on GPUs
  • Marc S. Orr (University of Wisconsin-Madison / AMD Research), Bradford M. Beckmann (AMD Research), Steven K. Reinhardt (AMD Research), David A. Wood (University of Wisconsin-Madison / AMD Research)
     
  • Enabling Preemptive Multiprogramming on GPUs
  • Ivan Tanasic (Barcelona Supercomputing Center / Universitat Politecnica de Catalunya), Isaac Gelado (NVIDIA Research), Javier Cabezas (Barcelona Supercomputing Center / Universitat Politecnica de Catalunya), Alex Ramirez (Barcelona Supercomputing Center / Universitat Politecnica de Catalunya), Nacho Navarro (Barcelona Supercomputing Center / Universitat Politecnica de Catalunya), Mateo Valero (Barcelona Supercomputing Center / Universitat Politecnica de Catalunya)
     
  • Single-Graph Multiple Flows: Energy Efficient Design Alternative for GPGPUs
  • Dani Voitsechov, Yoav Etsion (Technion – Israel Institute of Technology)
     
  • HELIX-RC: An Architecture-Compiler Co-Design for Automatic Parallelization of Irregular Programs
  • Simone Campanoni (Harvard University), Kevin Brownell (Harvard University), Svilen Kanev (Harvard University), Timothy M. Jones (University of Cambridge), Gu-Yeon Wei (Harvard University), David Brooks (Harvard University)
     
 
Tuesday, June 17, 2014
 
Tuesday, 8:30am-9:30am
Keynote II: Should Computer Architects Take a Closer Look At Today's Most Pervasive Computer System – The Mobile Phone?
Chair: Steve Keckler, NVIDIA/University of Texas at Austin
  • Bio: Dr. Trevor Mudge received his Ph.D. in Computer Science from the University of Illinois. He is now in the Computer Science and Engineering Department at The University of Michigan, where he served a ten-year term as Director of the Advanced Computer Architecture Laboratory. In 2003 Dr. Mudge was named the Bredt Professor of Computer Engineering. He is the author of numerous papers on computer architecture, programming languages, VLSI design, and computer vision. He has supervised 49 theses in these areas. Dr. Mudge is a Fellow of the IEEE, a member of the ACM, the IET, and the British Computer Society. In addition to his position as a faculty member, he runs Idiot Savants, a chip design consultancy.
 
Tuesday, 10:45am-12:00pm
Section 4: Emerging Technologies
Chair: Alvin Lebeck, Duke University
 
  • Efficient Digital Neurons for Large Scale Cortical Architectures
  • James E. Smith (University of Wisconsin-Madison)
     
  • An Examination of the Architecture and System-level Tradeoffs of Employing Steep Slope Devices in 3D CMPs
  • Karthik Swaminathan, Huichu Liu, Jack Sampson, Vijaykrishnan Narayanan (Pennsylvania State University)
     
  • STAG: Spintronic-Tape Architecture for GPGPU Cache Hierarchies
  • Rangharajan Venkatesan, Shankar Ganesh Ramasubramanian, Swagath Venkataramani, Kaushik Roy, Anand Raghunathan (Purdue University)
     
Tuesday, 2:00pm-3:15pm
Section 5A: NVRAM
Chair: Yoav Etsion, Technion
 
  • Memory Persistency
  • Steven Pelley, Peter M. Chen, Thomas F. Wenisch (University of Michigan)
     
  • Reducing Access Latency of MLC PCMs through Line Striping
  • Morteza Hoseinzadeh (Sharif University of Technology), Mohammad Arjomand (Sharif University of Technology), Hamid Sarbazi-Azad (Sharif University of Technology / Institute for Research in Fundamental Sciences)
     
  • HIOS: A Host Interface I/O Scheduler for Solid State Disks
  • Myoungsoo Jung (University of Texas at Dallas), Wonil Choi (University of Texas at Dallas), Shekhar Srikantaiah (Qualcomm), Joonhyuk Yoo (Daegu University), Mahmut T. Kandemir (Pennsylvania State University)
     
Tuesday, 2:00pm-3:15pm
Section 5B: Datacenters and Cloud
Chair: Nacho Navarro, U. Politecnica de Catalunya
 
  • Towards Energy Proportionality for Large-Scale Latency-Critical Workloads
  • David Lo (Stanford University), Liqun Cheng (Google), Rama Govindaraju (Google), Luiz André Barroso (Google), Christos Kozyrakis (Stanford University)
     
  • SleepScale: Runtime Joint Speed Scaling and Sleep States Management for Power Efficient Data Centers
  • Yanpei Liu (University of Wisconsin-Madison), Stark C. Draper (University of Toronto), Nam Sung Kim (University of Wisconsin-Madison)
     
  • Optimizing Virtual Machine Consolidation Performance on NUMA Server Architecture for Cloud Workloads
  • Ming Liu, Tao Li (University of Florida)
     
Tuesday, 3:45pm-5:00pm
Section 6A: DRAM
Chair: Fred Chong, University of California at Santa Barbara
 
  • Row-Buffer Decoupling: A Case for Low-Latency DRAM Microarchitecture
  • Seongil O (Seoul National University), Young Hoon Son (Seoul National University), Nam Sung Kim (University of Wisconsin-Madison), Jung Ho Ahn (Seoul National University)
     
  • Half-DRAM: a High-bandwidth and Low-power DRAM Architecture from the Rethinking of Fine-grained Activation
  • Tao Zhang (Pennsylvania State University / NVIDIA), Ke Chen (Oracle), Cong Xu (Pennsylvania State University), Guangyu Sun (Peking University), Tao Wang (Peking University), Yuan Xie (Pennsylvania State University)
     
  • Flipping Bits in Memory Without Accessing Them: An Experimental Study of DRAM Disturbance Errors
  • Yoongu Kim (Carnegie Mellon University), Ross Daly, Jeremie Kim (Carnegie Mellon University), Chris Fallin, Ji Hye Lee (Carnegie Mellon University), Donghyuk Lee (Carnegie Mellon University), Chris Wilkerson (Intel Labs), Konrad Lai, Onur Mutlu (Carnegie Mellon University)
     
Tuesday, 3:45pm-5:00pm
Section 6B: Circuits and Architecture
Chair: Carole-Jean Wu, Arizona State University
 
  • Architecture Implications of Pads as a Scarce Resource
  • Runjie Zhang (University of Virginia), Ke Wang (University of Virginia), Brett H. Meyer (McGill University), Mircea R. Stan (University of Virginia), Kevin Skadron (University of Virginia)
     
  • Increasing Off-Chip Bandwidth in Multi-Core Processors with Switchable Pins
  • Shaoming Chen, Yue Hu, Ying Zhang, Lu Peng, Jesse Ardonne, Samuel Irving, Ashok Srivastava (Louisiana State University)
     
  • A Low Power and Reliable Charge Pump Design for Phase Change Memories
  • Lei Jiang, Bo Zhao, Jun Yang, Youtao Zhang (University of Pittsburgh)
     
 
Wednesday, June 18, 2014
 
Wednesday, 8:30am-10:10am
Section 7A: Coherence and Replay
Chair: James Laudon, Google
 
  • Fractal++: Closing the Performance Gap between Fractal and Conventional Coherence
  • Gwendolyn Voskuilen, T. N. Vijaykumar (Purdue University)
     
  • OmniOrder: Directory-Based Conflict Serialization of Transactions
  • Xuehai Qian (University of California, Berkeley), Benjamin Sahelices (Universidad de Valladolid), Josep Torrellas (University of Illinois at Urbana-Champaign)
     
  • Pacifier: Record and Replay for Relaxed-Consistency Multiprocessors with Distributed Directory Protocol
  • Xuehai Qian (University of California, Berkeley), Benjamin Sahelices (Universidad de Valladolid), Depei Qian (Beihang University)
     
  • Replay Debugging: Leveraging Record and Replay for Program Debugging
  • Nima Honarmand, Josep Torrellas (University of Illinois at Urbana-Champaign)
     
Wednesday, 8:30am-10:10am
Section 7B: Security/OOO Processors
Chair: Mohit Tiwari, University of Texas at Austin
 
  • The CHERI capability model: Revisiting RISC in an age of risk
  • Jonathan Woodruff (University of Cambridge), Robert N. M. Watson (University of Cambridge), David Chisnall (University of Cambridge), Simon W. Moore (University of Cambridge), Jonathan Anderson (University of Cambridge), Brooks Davis (SRI International), Ben Laurie (Google UK Ltd), Peter G. Neumann (SRI International), Robert Norton (University of Cambridge), Michael Roe (University of Cambridge)
     
  • CODOMs: Protecting Software with Code-centric Memory Domains
  • Lluís Vilanova (Barcelona Supercomputing Center / Universitat Politècnica de Catalunya / Technion – Israel Institute of Technology), Muli Ben-Yehuda (Technion - Israel Institute of Technology), Nacho Navarro (Barcelona Supercomputing Center / Universitat Politècnica de Catalunya), Yoav Etsion (Technion – Israel Institute of Technology), Mateo Valero (Barcelona Supercomputing Center / Universitat Politècnica de Catalunya)
     
  • EOLE: Paving the Way for an Effective Implementation of Value Prediction
  • Arthur Perais, André Seznec (IRISA/INRIA)
     
  • Improving the Energy Efficiency of Big Cores
  • Kenneth Czechowski (Georgia Institute of Technology), Victor W. Lee (Intel), Ed Grochowski (Intel), Ronny Ronen (Intel), Ronak Singhal (Intel), Richard Vuduc (Georgia Institute of Technology), Pradeep Dubey (Intel)
     
Wednesday, 10:40am-12:20pm
Section 8: Accelerators
Chair: Boris Grot, University of Edinburgh
 
  • General-Purpose Code Acceleration with Limited-Precision Analog Computation
  • Renée St. Amant (University of Texas at Austin), Amir Yazdanbakhsh (Georgia Institute of Technology), Jongse Park (Georgia Institute of Technology), Bradley Thwaites (Georgia Institute of Technology), Hadi Esmaeilzadeh (Georgia Institute of Technology), Arjang Hassibi (University of Texas at Austin), Luis Ceze (University of Washington), Doug Burger (Microsoft Research)
     
  • Race Logic: A Hardware Acceleration for Dynamic Programming Algorithms
  • Advait Madhavan, Timothy Sherwood, Dmitri Strukov (University of California, Santa Barbara)
     
  • Eliminating Redundant Fragment Shader Executions on a Mobile GPU via Hardware Memoization
  • Jose-Maria Arnau (Universitat Politecnica de Catalunya), Joan-Manuel Parcerisa (Universitat Politecnica de Catalunya), Polychronis Xekalakis (Intel)
     
  • WebCore: Architectural Support for Mobile Web Browsing
  • Yuhao Zhu, Vijay Janapa Reddi (The University of Texas at Austin)