2024

MoQ: Mixture-of-format Activation Quantization for Communication-efficient AI Inference System

Haonan Wang, Zeli Liu, Chao Fang, John Paul Walters, Stephen P Crago
NeurIPS 2024 Workshop Machine Learning with new Compute Paradigms, 2024

2020

Scalable parallel file write from a large numa system

Dong In D Kang, John Paul Walters, Stephen P Crago
HPEC, 2020

2019

Increased Fault-Tolerance and Real-Time Performance Resiliency for Stream Processing Workloads through Redundancy

Geoffrey Phi Tran, John Paul Walters, Stephen Crago
2019 IEEE International Conference on Services Computing (SCC), 51-55, 2019

Computational requirements for real-time ptychographic image reconstruction

Kaushik Datta, Andrew Rittenbach, Dong-In Kang, John Paul Walters, Stephen P Crago, John Damoulakis
Applied Optics 58 (7), B19-B27, 2019

2018

Reducing Tail Latencies While Improving Resiliency to Timing Errors for Stream Processing Workloads

G.P.C. Tran, J.P. Walters, S. P. Crago
2018 IEEE International Conference on Services Computing (SCC), 2018

Pacer: Automated Feedback-Based Vertical Elasticity for Heterogeneous Soft Real-Time Workloads

Yu-An Chen, Geoffrey Tran, Andrew Rittenbach, John Walters, Stephen Crago
2018 IEEE/ACM 11th International Conference on Utility and Cloud Computing …, 2018

2017

A Comparison of System Performance on a Private OpenStack Cloud and Amazon EC2

Mikyung Kang, Dong-In Kang, John Paul Walters, Stephen P. Crago
10th IEEE International Conference on Cloud Computing (IEEE Cloud), 2017

Dynamically Improving Resiliency to Timing Errors for Stream Processing Workloads

G.P.C Tran, J.P. Walters, S.P. Crago
The 18th International Conference on Parallel and Distributed Computing …, 2017

Computer system and network security

Wayne Wolf, Gregory B White, Eric A Fisch, Stephen P Crago, Udo W Pooch, Janice Onanian McMahon, Donald Yeung, Huy Nguyen, Masahiro Arakawa, Tom MacDonald, Bilge ES Akgul, Pinar Korkmaz, Lakshmi Chakrapani, Hahn Kim, Glenn E Schrad, James M Lebak
CRC press, 2017

Load Balancing for Minimizing Deadline Misses and Total Runtime for Connected Car System in Fog Computing

Y.-A. Chen, J.P. Walters, S.P. Crago
15th International Symposium on Parallel and Distributed Processing with …, 2017

2016

Hypervisor Performance Analysis for Real-Time Workloads

Geoffrey Phi C. Tran, Yu-An Chen, Dong-In Kang, John Paul Walters, Stephen P. Crago
High Performance Extreme Computing Conference (HPEC) IEEE, 2016

Automated Demand-Based Vertical Elasticity for Heterogeneous Real-Time Workloads

Geoffrey Phi Tran, Yu-An Chen, Dong-In Kang, John Paul Walters, Stephen Crago
9th Annual IEEE International Conference on Cloud Computing, 2016

Reducing Data Movement with Approximate Computing Techniques

Stephen Crago, Donald Yeung
IEEE International Conference on Rebooting Computing, 2016

2015

Supporting high performance molecular dynamics in virtualized clusters using IOMMU, SR-IOV, and GPUDirect

Andrew J Younge, John Paul Walters, Stephen P Crago, Geoffrey C Fox
ACM SIGPLAN Notices 50 (7), 31-38, 2015

Heterogeneous Cloud Computing: The Way Forward

Stephen P. Crago, John Paul Walters
IEEE Computer, 59-61, 2015

2014

Energy performance of fpgas on perfect suite kernels

Sanmukh R Kuppannagari, Ren Chen, Andrea Sanny, Shreyas G Singapura, Geoffrey Phi C Tran, Shijie Zhou, Yusong Hu, Stephen P Crago, Viktor K Prasanna
2014 IEEE High Performance Extreme Computing Conference (HPEC), 1-6, 2014

Dynamic runtime optimizations for systems of heterogeneous architectures

Geoffrey Phi C Tran, Dong-In Kang, Stephen Crago
2014 IEEE High Performance Extreme Computing Conference (HPEC), 1-6, 2014

Bridging the Virtualization Performance Gap for HPC using SR-IOV for InfiniBand

Malek Musleh, Vijay Pai, Andrew J. Younge, John Paul Walters, Stephen P. Crago
7th IEEE International Conference on Cloud Computing, 2014

GPU-Passthrough Performance: A Comparison of KVM, Xen, VMWare ESXi, and LXC for CUDA and OpenCL Applications

John Paul Walters, Andrew Younge, Dong-In Kang, Ke-Thia Yao, Mikyung Kang, Stephen P. Crago, Geoffrey Fox
7th IEEE International Conference on Cloud Computing, 2014

Evaluating GPU passthrough in Xen for high performance cloud computing

Andrew J Younge, John Paul Walters, Stephen Crago, Geoffrey C Fox
2014 IEEE international parallel & distributed processing symposium …, 2014

2013

Implementation of kernels on the Maestro processor

Jinwoo Suh, Dong In D Kang, Stephen P Crago
2013 IEEE Aerospace Conference, 1-6, 2013

Enabling high performance computing in cloud infrastructure using virtualized GPUs

Andrew J Younge, John Paul Walters, Steve Crago, Geoffrey C Fox
2013

2012

Implementation of fft and crblaster on the maestro processor

Jinwoo Suh, Kenneth J Mighell, Dong-In Kang, Stephen P Crago
2012 IEEE Aerospace Conference, 1-6, 2012

Integrating high performance file systems in a cloud computing environment

Abhisek Pan, John Paul Walters, Vijay S Pai, Dong-In D Kang, Stephen P Crago
2012 SC Companion: High Performance Computing, Networking Storage and …, 2012

2011

Heterogeneous cloud computing

Steve Crago, Kyle Dunn, Patrick Eads, Lorin Hochstein, Dong-In Kang, Mikyung Kang, Devendra Modium, Karandeep Singh, Jinwoo Suh, John Paul Walters
Proceedings of the Workshop on Parallel Programming on Accelerator Clusters …, 2011

Programming models and development software for a space-based many-core processor

Stephen P Crago, Dong-In Kang, Mikyung Kang, Robert Kost, Karandeep Singh, Joseph Suh, John Paul Walters
2011 IEEE Fourth International Conference on Space Mission Challenges for …, 2011

Fftw and complex ambiguity function performance on the maestro processor

Karandeep Singh, John Paul Walters, Joel Hestness, Jinwoo Suh, Craig M Rogers, Stephen P Crago
2011 Aerospace Conference, 1-8, 2011

Software-based fault tolerance for the Maestro many-core processor

John Paul Walters, Robert Kost, Karandeep Singh, Jinwoo Suh, Stephen P Crago
2011 Aerospace Conference, 1-12, 2011

Design and development of a run-time monitor for multi-core architectures in cloud computing

Mikyung Kang, Dong-In Kang, Stephen P Crago, Gyung-Leen Park, Junghoon Lee
Sensors 11 (4), 3595-3610, 2011

P. et al Eads

S Crago, K Dunn
Heterogeneous cloud computing. In, 378-385, 2011

2010

Opportunities for concurrent dynamic analysis with explicit inter-core communication

Jungwoo Ha, Stephen P Crago
Proceedings of the 9th ACM SIGPLAN-SIGSOFT workshop on Program analysis for …, 2010

Algorithm Classes for Architecture Research (ACAR)

Jinwoo Suh, Stephen P Crago, Karandeep Singh, Janice O McMahon, UNIVERSITY OF SOUTHERN CALIFORNIA ARLINGTON VA INFORMATION SCIENCES INST
Final Report, 2010

2009

MPI performance analysis and optimization on Tile64/Maestro

Mikyung Kang, Eunhui Park, Minkyoung Cho, Jinwoo Suh, Dong-In Kang, Stephen P Crago
Proceedings of Workshop on Multi-core Processors for Space—Opportunities …, 2009

2008

Tera-Op Reliable Intelligently Adaptive Processing System (TRIPS) Implementation

Stephen Keckler, Doug Buger, Kathryn McKinley, Steve Crago, Richard Lethin
AFRL-RI-WPTR-2008-1529, The University of Texas at Austin, 2008

Advanced Microprocessor Architectures

Janice McMahon, Stephen Crago, Donald Yeung
High Performance Embedded Computing Handbook, 499-521, 2008

2007

Preliminary study toward intelligent run-time resource management techniques for large multi-core architectures

Dong-In Kang, Jinwoo Suh, Janice O McMahon, Stephen P Crago
Proceedings of the 2007 Workshop on High Performance Embedded Computing (HPEC07), 2007

A voltage and resource synthesis technique for energy-aware real-time systems

Dong-In Kang, Stephen P Crago, Jinwoo Suh, Janice McMahon
13th IEEE International Conference on Embedded and Real-Time Computing …, 2007

Evaluation of Stream Virtual Machine on Raw Processor

Jinwoo Suh, Richard Lethin, Stephen P Crago, Janice O McMahon, Dong-In Kang
2007 IEEE International Parallel and Distributed Processing Symposium, 1-8, 2007

Abstract Machines for Polymorphous Computing

Stephen Crago, UNIVERSITY OF SOUTHERN CALIFORNIA LOS ANGELES
2007

2006

CEARCH: Cognition enabled architecture

SP Crago, JO McMahon, C Archer, K Asanovic, R Chaung, K Goolsbey, M Hall, C Kozyrakis, K Olukotun, UM OReilly, R Pancoast, V Prasanna, R Rabbah, S Ward, D Yeung
Proceedings of the Tenth Annual High Performance Embedded Computing Workshop …, 2006

Design and evaluation of a hierarchical decoupled architecture

Won W Ro, Stephen P Crago, Alvin M Despain, Jean-Luc Gaudiot
The Journal of Supercomputing 38, 237-259, 2006

CEARCH: Cognitive Enabled ARCHitectures

Stephen P Crago, Janice Onanian McMahon
Proceedings of the 10th Annual High Performance Embedded Computing Workshop …, 2006

2005

2004

RATS: Reactive Architectures

Marc Christensen, Fouad Kiamelev, Michael Haney, Charlie Kuznia, Stephen Crago, UNIVERSITY OF SOUTHERN CALIFORNIA MARINA DEL REY INFORMATION SCIENCES INST
2004

SLIIC: System-Level Intelligent Intensive Computing

Stephen Crago, Jinwoo Suh, UNIVERSITY OF SOUTHERN CALIFORNIA MARINA DEL REY INFORMATION SCIENCES INST
2004

2003

Dynamic power management of heterogeneous systems

Jinwoo Suh, Dong-In Kang, Stephen P Crago
Proceedings International Parallel and Distributed Processing Symposium, 8 pp., 2003

HiDISC: A decoupled architecture for data-intensive applications

Won Woo Ro, J-L Gaudiot, Stephen P Crago, Alvin M Despain
Proceedings International Parallel and Distributed Processing Symposium, 8 pp., 2003

Robust Highly-Connected Direct Interconnection Network Topologies.

Ilya Sharapov, Gary Lauterbach, Stephen P Crago
PDPTA, 995-1000, 2003

2002

A stream processor development platform

Ben Serebrin, John D Owens, Chen H Chen, Stephen P Crago, Ujval J Kapasi, Brucek Khailany, Peter Mattson, Jinyung Namkoong, Scott Rixner, William J Dally
Proceedings. IEEE International Conference on Computer Design: VLSI in …, 2002

A power-aware, satellite-based parallel signal processing scheme

Patrick M Shriver, Maya B Gokhale, Scott D Briles, Dong-In Kang, Michael Cai, Kevin McCabe, Stephen P Crago, Jinwoo Suh
Power Aware Computing, 243-259, 2002

An optimal voltage synthesis technique for a power-efficient satellite application

Dong-In Kang, Jinwoo Suh, Stephen P Crago
Proceedings of the 39th annual Design Automation Conference, 492-497, 2002

Dynamic power management of multiprocessor systems

Jinwoo Suh, Dong-In Kang, Stephen P Crago
Proceedings 16th International Parallel and Distributed Processing Symposium …, 2002

A fast resource synthesis technique for energy-efficient real-time systems

Dong-In Kang, Stephen P Crago, Jinwoo Suh
23rd IEEE Real-Time Systems Symposium. RTSS 2002., 225-234, 2002

2001

Power-aware design synthesis techniques for distributed real-time systems

Dong-In Kang, Stephen Crago, Jinwoo Suh
Proceedings of the ACM SIGPLAN workshop on Languages, compilers and tools …, 2001

Implementations of Real-time Data Intensive Applications on PIM-based Multiprocessor Systems.

Jinwoo Suh, Ming Zhu, Changping Li, Stephen P Crago, Stephen F Shank, Richard H Chau, Walter J Mazur, Rick Pancoast
IPDPS, 99, 2001

A PIM-based multiprocessor system

Jinwoo Suh, Changping Li, Stephen P Crago, Robert Parker
Proceedings 15th International Parallel and Distributed Processing Symposium …, 2001

PIM-and stream processor-based processing for radar signal applications

Jinwoo Suh, Stephen P Crago
Proceedings of the 3rd Workshop on Media and Streaming Processors, 77-85, 2001

Efficient Algorithms for Fixed-Point Arithmetic Operations In An Embedded PIM

Jinwoo Suh, Dong-In Kang, Stephen P Crago
University of Southern California/Information Sciences Institute, 2001

2000

A communication scheduling algorithm for multi-FPGA systems

Jinwoo Suh, Dong-In Kang, Stephen P Crago
Proceedings 2000 IEEE Symposium on Field-Programmable Custom Computing …, 2000

Programming and development environments for configurable computing systems

S Kumar, D Bhatt, S Vestal, B Wren, J Shackleton, H Shirley, R Bhatt, J Golusky, M Vojta, C Nanavati, P Zumsteg, P Symosek, S Crago, B Schott, R Parker, G Gardner
2000 IEEE Aerospace Conference. Proceedings (Cat. No. 00TH8484) 5, 487-497, 2000

A high-performance, hierarchical decoupled architecture

Stephen P Crago, Alvin Despain, Jean-Luc Gaudiot, Manil Makhija, Wonwoo Ro, Apoorv Srivastava
Proceedings of MEDEA Workshop, 2000

1998

SLAAC: a distributed architecture for adaptive computing

Stephen P Crago, Brian Schott, Robert Parker
Proceedings. IEEE Symposium on FPGAs for Custom Computing Machines (Cat. No …, 1998

1997

HiDisc: A Decoupled Architecture for the future

Stephen P Crago, Alvin M Despain
University of Southern California, 1997

HiDISC: A High-Performance Hierarchical, Decoupled Architecture

Stephen P Crago
Ph. D. Thesis, University of Southern California, 1997

HiDISC: A high-performance hierarchical decoupled computer architecture

Stephen Philip Crago
University of Southern California, 1997

1996

A Hierarchical Decoupled Architecture: Preliminary Results

Stephen P Crago, Apoorv Srivastava, Kevin Obenland, Alvin M Despain
1996

Improving the performance of loop-based programs using a prefetch processor

Stephen P Crago, Alvin M Despain
Submitted to the 24th Annual International Symposium on Computer Architecture, 1996

Application-Architecture Interaction on Shared-Memory Multiprocessors

Michael Carlton, Stephen P Crago, Alvin M Despain
1996