E. Chuah, A. Jhumka, S. Alt, D.B-Thomert, J.C. Browne, M. Parashar, "Towards Comprehensive Dependability-Driven Resource Use and Message Log-Analysis for HPC Systems Diagnosis", Journal of Parallel and Distributed Computing (JPDC), Elsevier, 2019.
E. Chuah, A. Jhumka, J. C. Browne, N. Gurumdimma, S. Narasimhamurthy and B. Barth, "Using Message Logs and Resource Use Data for Cluster Failure Diagnosis," 2016 IEEE 23rd International Conference on High Performance Computing (HiPC), Hyderabad, 2016, pp. 232-241. doi: 10.1109/HiPC.2016.035
Aji, A & Heafield, K 2017, Sparse Communication for Distributed Gradient Descent. in Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017). Association for Computational Linguistics (ACL), pp. 440-445, EMNLP 2017: Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark, 7-11 September.