Publications

DBLP   Google Scholar

Underlined are my students!

SOCC'20 Wukong: A Scalable and Locality-Enhanced Framework for Serverless Parallel Computing
Benjamin Carver, Ao Wang, Ali Anwar, Panruo Wu, Yue Cheng
The 11th ACM Symposium on Cloud Computing
October 19-21, 2020, (virtual event due to COVID-19)
ICS'20 TensorSVM: Accelerating Kernel Machines with Tensor Engine
Shaoshuai Zhang, Ruchi Shah, and Panruo Wu
The 34th ACM International Conference on Supercomputing
June 29 - July 2, 2020. Barcelona, Spain. Acceptance Rate: 30% (40/132)
PDF from ACM
HPDC'20
Best Paper Nominee
High Accuracy Matrix Computations on Neural Engines: a Study of QR Factorization and its Applications
Shaoshuai Zhang, Elaheh Baharlouei, and Panruo Wu
The 20th ACM International Symposium on High-Performance Parallel and Distributed Computing
Stockholm, Sweden, June 23-26, 2020. Acceptance Rate: 22% (16/71)
PDF/Web/Video from ACM DL
BigData'19 xSVM: Scalable Distributed Kernel Support Vector Machine Training
Ruchi Shah, Shaoshuai Zhang, Ying Lin, and Panruo Wu
IEEE International Conference on Big Data in 2019
Los Angeles, CA, USA, Dec 9 - 12, 2019. Acceptance Rate: 19.3% (106/550)
ACM TOMS'19 PLASMA: Parallel Linear Algebra Software for Multicore Using OpenMP
Jack J. Dongarra, Mark Gates, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Panruo Wu, Ichitaro Yamazaki, Asim YarKhan, Maksims Abalenkovs, Negin Bagherpour, Sven Hammarling, Jakub SĂ­stek, David Stevens, Mawussi Zounon, Samuel D. Relton
ACM Transactions on Mathematical Software, Volume 46(2): 16:1-16:35 (2019)
PDF from ACM
RTSS'18 Work-in-Progress: Incorporating Deadline-Based Scheduling in Tasking Programming Model for Extreme-Scale Parallel Computing
Albert Mo Kim Cheng, Panruo Wu
2018 IEEE Real-Time Systems Symposium,
Nashville, TN, USA, December 11-14, 2018
SC'18 Fault Tolerant One-sided Matrix Decompositions on Heterogeneous Systems with GPUs
Jieyang Chen, Hongbo Li, Sihuan Li, Xin Liang, Panruo Wu, Dingwen Tao, Kaiming Ouyang, Yuanlai Liu, Kai Zhao, Qiang Guan, and Zizhong Chen
Proceedings of the 30th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis,
Dallas, Texas, USA, Nov 11 - 16, 2018. Acceptance Rate: 19.1% (55/288).
ICCS'18 The Design of Fast and Energy-Efficient Linear Solvers: On the Potential of Half-Precision Arithmetic and Iterative Refinement Techniques
Azzam Haidar, Ahmad Abdelfattah, Mawussi Zounon, Panruo Wu, Srikara Pranesh, Stanimire Tomov, Jack Dongarra
International Conference on Computational Science, ICCS 2018
Lecture Notes in Computer Science, vol 10860. Springer, Cham
TPDS'18 Symmetric Indefinite Linear Solver using OpenMP Task on Multicore Architecture
Ichitaro Yamazaki, Jakub Kurzak, Panruo Wu, Mawussi Zounon, Jack Dongarra
IEEE Transactions on Parallel and Distributed Systems
ScalA'17 Investigating half precision arithmetic to accelerate dense linear system solvers
Azzam Haidar, Panruo Wu, Stanimire Tomov, and Jack Dongarra
ScalA '17 Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, Arcticle No. 10
Denver, Colorado, November 12 - 17, 2017
PDF from ACM
SC'17 Correcting Soft Errors Online in Fast Fourier Transform,
Xin Liang, Jieyang Chen, Dingwen Tao, Sihuan Li, Panruo Wu, Hongbo Li, Kaiming Ouyang, Yuanlai Liu, Fengguang Song, and Zizhong Chen
Proceedings of the 29th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis,
Denver, Colorado, USA, Nov 12 - 17, 2017. Acceptance Rate: 18.6% (61/327).
PDF from ACM
TSP'17 Fast Discrete Distribution Clustering Using Wasserstein Barycenter with Sparse Support,
Jianbo Ye, Panruo Wu, James Z. Wang and Jia Li,
IEEE Transactions on Signal Processing, vol. 65, no. 9, 2317-2332, 2017.
arXiv preprint
PPoPP'17Silent Data Corruption Resilient Two-sided Matrix Factorizations,
Panruo Wu, Nathan DeBardeleben, Qiang Guan, Sean Blanchard, Jieyang Chen, Dingwen Tao, Xin Liang, Ouyang Kaiming, Sihuan Li, and Zizhong Chen
Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming,
Austin, Texas, USA, February 4-8 2017. Acceptance Rate: 21.9% (29/132).
PDF from ACM
SC'16GreenLA: Green Linear Algebra Software for GPU-Accelerated Heterogeneous Computing,
Jieyang Chen, Li Tan, Panruo Wu, Dingwen Tao, Hongbo Li, Xin Liang, Sihuan Li, Rong Ge, Laxmi Bhuyan, and Zizhong Chen
Proceedings of the 28th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, Salt Lake City, Utah, USA, Nov 13- 18, 2016. Acceptance Rate: 18.4% (82/446)
Download accepted version
HPDC'16Towards Practical Algorithm Based Fault Tolerance in Dense Linear Algebra,
Panruo Wu, Qiang Guan, Nathan DeBardeleben, Sean Blanchard, Dingwen Tao, Xin Liang, Jieyang Chen, Zizhong Chen
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing_,
Kyoto, Japan, May 31- June 4, 2016. Acceptance Rate: 15.5% (20/129).
Download from ACM
HPDC'16 Algorithm-Directed Data Placement in Explicitly Managed Non-Volatile Memory,
Panruo Wu, Dong Li, Zizhong Chen, Jeffrey S. Vetter, Sparsh Mittal
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing_,
Kyoto, Japan, May 31- June 4, 2016. Acceptance Rate: 15.5% (20/129).
Download from ACM
HPDC'16New-Sum: A Novel Online ABFT Scheme For General Iterative Methods,
Dingwen Tao, Shuaiwen Leon Song, Sriram Krishnamoorthy, Panruo Wu, Xin Liang, Eddy Z. Zhang, Darren J. Kerbyson, Zizhong Chen
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing_,
Kyoto, Japan, May 31- June 4, 2016. Acceptance Rate: 15.5% (20/129).
Download from ACM
TPDS'15Fail-Stop Failure Algorithm-Based Fault Tolerance for Cholesky Decomposition,
Doug Hakkarinen, Panruo Wu, and Zizhong Chen
IEEE Transactions on Parallel and Distributed Systems,
Volume: 26, Issue: 5, Page 1323-1335,May, 2015.
Download
HPDC'14FT-ScaLAPACK: Correcting Soft Errors On-Line for ScaLAPACK Cholesky, QR, and LU Factorization Routines,
Panruo Wu and Zizhong Chen,
Proceedings of the 23rd ACM International Symposium on High-Performance Parallel and Distributed Computing,
Vancouver, Canada, June 23-27, 2014. Acceptance Rate: 16.2% (21/130).
Download from ACM
SC'13Rethinking Algorithm-Based Fault Tolerance with a Cooperative Software-Hardware Approach,
Dong Li, Zizhong Chen, Panruo Wu, and Jeffrey Vetter,
Proceedings of the 25th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis,
Denver, CO, November 17-22, 2013. Acceptance Rate: 19.7% (90/457).
Download from ACM