Publications (4)0 Total impact
[show abstract] [hide abstract]
ABSTRACT: A judiciously chosen symmetric permutation can significantly reduce the amount of storage and computation for the Cholesky factorization of sparse matrices. On distributed memory machines, the issue of mapping data and computation on processors is also important. Previous research on ordering for parallelism has focussed on idealized measures like execution time on an unbounded number of processors, with zero communication costs. In this paper, we propose an ordering and mapping algorithm that attempts to minimize communication and performs load-balancing of work among the processors. Performance results on an Intel iPSC/860 hypercube are presented to demonstrate its effectiveness. 1 Introduction We consider the solution of linear systems of the form Ax = b where A is large, sparse, symmetric and positive definite. With such systems, Cholesky decomposition can be applied to factorize A as A = LL T , where L is lower triangular, followed by forward and backward triangular solves. Du...01/1996;
Article: A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction.Scientific Programming. 01/1995; 4:275-289.
Parallel Processing Letters. 01/1995; 5:685-696.
Conference Proceeding: On sparse matrix reordering for parallel factorization.01/1994