Underlined are my students!
SOCC'20 | Wukong: A Scalable and Locality-Enhanced Framework for Serverless Parallel Computing Benjamin Carver, Ao Wang, Ali Anwar, Panruo Wu, Yue Cheng The 11th ACM Symposium on Cloud Computing October 19-21, 2020, (virtual event due to COVID-19) |
ICS'20 | TensorSVM: Accelerating Kernel Machines with Tensor Engine Shaoshuai Zhang, Ruchi Shah, and Panruo Wu The 34th ACM International Conference on Supercomputing June 29 - July 2, 2020. Barcelona, Spain. Acceptance Rate: 30% (40/132) PDF from ACM |
HPDC'20 Best Paper Nominee |
High Accuracy Matrix Computations on Neural Engines: a Study of QR Factorization and its Applications Shaoshuai Zhang, Elaheh Baharlouei, and Panruo Wu The 20th ACM International Symposium on High-Performance Parallel and Distributed Computing Stockholm, Sweden, June 23-26, 2020. Acceptance Rate: 22% (16/71) PDF/Web/Video from ACM DL |
BigData'19 | xSVM: Scalable Distributed Kernel Support Vector Machine Training Ruchi Shah, Shaoshuai Zhang, Ying Lin, and Panruo Wu IEEE International Conference on Big Data in 2019 Los Angeles, CA, USA, Dec 9 - 12, 2019. Acceptance Rate: 19.3% (106/550) |
ACM TOMS'19 | PLASMA: Parallel Linear Algebra Software for Multicore Using OpenMP Jack J. Dongarra, Mark Gates, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Panruo Wu, Ichitaro Yamazaki, Asim YarKhan, Maksims Abalenkovs, Negin Bagherpour, Sven Hammarling, Jakub SĂstek, David Stevens, Mawussi Zounon, Samuel D. Relton ACM Transactions on Mathematical Software, Volume 46(2): 16:1-16:35 (2019) PDF from ACM |
RTSS'18 | Work-in-Progress: Incorporating Deadline-Based Scheduling in Tasking Programming Model for Extreme-Scale Parallel Computing Albert Mo Kim Cheng, Panruo Wu 2018 IEEE Real-Time Systems Symposium, Nashville, TN, USA, December 11-14, 2018 |
SC'18 | Fault Tolerant One-sided Matrix Decompositions on Heterogeneous Systems with GPUs Jieyang Chen, Hongbo Li, Sihuan Li, Xin Liang, Panruo Wu, Dingwen Tao, Kaiming Ouyang, Yuanlai Liu, Kai Zhao, Qiang Guan, and Zizhong Chen Proceedings of the 30th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, Dallas, Texas, USA, Nov 11 - 16, 2018. Acceptance Rate: 19.1% (55/288). |
ICCS'18 | The Design of Fast and Energy-Efficient Linear Solvers: On the Potential of Half-Precision Arithmetic and Iterative Refinement Techniques Azzam Haidar, Ahmad Abdelfattah, Mawussi Zounon, Panruo Wu, Srikara Pranesh, Stanimire Tomov, Jack Dongarra International Conference on Computational Science, ICCS 2018 Lecture Notes in Computer Science, vol 10860. Springer, Cham |
TPDS'18 | Symmetric Indefinite Linear Solver using OpenMP Task on Multicore Architecture Ichitaro Yamazaki, Jakub Kurzak, Panruo Wu, Mawussi Zounon, Jack Dongarra IEEE Transactions on Parallel and Distributed Systems |
ScalA'17 | Investigating half precision arithmetic to accelerate dense linear system solvers Azzam Haidar, Panruo Wu, Stanimire Tomov, and Jack Dongarra ScalA '17 Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, Arcticle No. 10 Denver, Colorado, November 12 - 17, 2017 PDF from ACM |
SC'17 | Correcting Soft Errors Online in Fast Fourier Transform, Xin Liang, Jieyang Chen, Dingwen Tao, Sihuan Li, Panruo Wu, Hongbo Li, Kaiming Ouyang, Yuanlai Liu, Fengguang Song, and Zizhong Chen Proceedings of the 29th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, Denver, Colorado, USA, Nov 12 - 17, 2017. Acceptance Rate: 18.6% (61/327). PDF from ACM |
TSP'17 |
Fast Discrete Distribution Clustering Using Wasserstein
Barycenter with Sparse Support, Jianbo Ye, Panruo Wu, James Z. Wang and Jia Li, IEEE Transactions on Signal Processing, vol. 65, no. 9, 2317-2332, 2017. arXiv preprint |
PPoPP'17 | Silent Data Corruption Resilient Two-sided Matrix Factorizations, Panruo Wu, Nathan DeBardeleben, Qiang Guan, Sean Blanchard, Jieyang Chen, Dingwen Tao, Xin Liang, Ouyang Kaiming, Sihuan Li, and Zizhong Chen Proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Austin, Texas, USA, February 4-8 2017. Acceptance Rate: 21.9% (29/132). PDF from ACM |
SC'16 | GreenLA: Green Linear Algebra Software for GPU-Accelerated Heterogeneous Computing, Jieyang Chen, Li Tan, Panruo Wu, Dingwen Tao, Hongbo Li, Xin Liang, Sihuan Li, Rong Ge, Laxmi Bhuyan, and Zizhong Chen Proceedings of the 28th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, Salt Lake City, Utah, USA, Nov 13- 18, 2016. Acceptance Rate: 18.4% (82/446) Download accepted version |
HPDC'16 | Towards Practical Algorithm Based Fault Tolerance in Dense Linear Algebra, Panruo Wu, Qiang Guan, Nathan DeBardeleben, Sean Blanchard, Dingwen Tao, Xin Liang, Jieyang Chen, Zizhong Chen Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing_, Kyoto, Japan, May 31- June 4, 2016. Acceptance Rate: 15.5% (20/129). Download from ACM |
HPDC'16 | Algorithm-Directed Data Placement in Explicitly Managed Non-Volatile Memory, Panruo Wu, Dong Li, Zizhong Chen, Jeffrey S. Vetter, Sparsh Mittal Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing_, Kyoto, Japan, May 31- June 4, 2016. Acceptance Rate: 15.5% (20/129). Download from ACM |
HPDC'16 | New-Sum: A Novel Online ABFT Scheme For General Iterative Methods, Dingwen Tao, Shuaiwen Leon Song, Sriram Krishnamoorthy, Panruo Wu, Xin Liang, Eddy Z. Zhang, Darren J. Kerbyson, Zizhong Chen Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing_, Kyoto, Japan, May 31- June 4, 2016. Acceptance Rate: 15.5% (20/129). Download from ACM |
TPDS'15 | Fail-Stop Failure Algorithm-Based Fault Tolerance for Cholesky Decomposition, Doug Hakkarinen, Panruo Wu, and Zizhong Chen IEEE Transactions on Parallel and Distributed Systems, Volume: 26, Issue: 5, Page 1323-1335,May, 2015. Download |
HPDC'14 | FT-ScaLAPACK: Correcting Soft Errors On-Line for ScaLAPACK Cholesky, QR, and LU Factorization Routines, Panruo Wu and Zizhong Chen, Proceedings of the 23rd ACM International Symposium on High-Performance Parallel and Distributed Computing, Vancouver, Canada, June 23-27, 2014. Acceptance Rate: 16.2% (21/130). Download from ACM |
SC'13 | Rethinking Algorithm-Based Fault Tolerance with a Cooperative Software-Hardware Approach, Dong Li, Zizhong Chen, Panruo Wu, and Jeffrey Vetter, Proceedings of the 25th ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, Denver, CO, November 17-22, 2013. Acceptance Rate: 19.7% (90/457). Download from ACM |