{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,9,25]],"date-time":"2025-09-25T18:04:50Z","timestamp":1758823490337},"reference-count":213,"publisher":"Elsevier BV","issue":"13-14","license":[{"start":{"date-parts":[[1999,12,1]],"date-time":"1999-12-01T00:00:00Z","timestamp":944006400000},"content-version":"tdm","delay-in-days":0,"URL":"https:\/\/www.elsevier.com\/tdm\/userlicense\/1.0\/"}],"content-domain":{"domain":[],"crossmark-restriction":false},"short-container-title":["Parallel Computing"],"published-print":{"date-parts":[[1999,12]]},"DOI":"10.1016\/s0167-8191(99)00077-0","type":"journal-article","created":{"date-parts":[[2003,4,25]],"date-time":"2003-04-25T08:06:40Z","timestamp":1051258000000},"page":"1931-1970","source":"Crossref","is-referenced-by-count":70,"title":["Developments and trends in the parallel solution of linear systems"],"prefix":"10.1016","volume":"25","author":[{"given":"Iain S.","family":"Duff","sequence":"first","affiliation":[]},{"given":"Henk A.","family":"van der Vorst","sequence":"additional","affiliation":[]}],"member":"78","reference":[{"issue":"9","key":"10.1016\/S0167-8191(99)00077-0_BIB1","doi-asserted-by":"crossref","first-page":"1407","DOI":"10.1016\/0167-8191(95)00029-N","article-title":"Parallel sparse matrix solution and performance","volume":"21","author":"Alaghband","year":"1995","journal-title":"Parallel Comput."},{"issue":"1","key":"10.1016\/S0167-8191(99)00077-0_BIB2","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1023\/A:1019170609950","article-title":"Sparse approximate inverse preconditioning for dense linear systems arising in computational electromagnetics","volume":"16","author":"All\u00e9on","year":"1997","journal-title":"Numer. Algorithms"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB3","unstructured":"F.L. Alvarado, H. 
Dag, Incomplete partitioned inverse preconditioners, Technical report, Department of Electrical and Computer Engineering, University of Wisconsin, Madison, 1994"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB4","doi-asserted-by":"crossref","unstructured":"F.L. Alvarado, A. Pothen, R. Schreiber, Highly parallel sparse triangular solution, in: Alan George, J.R. Gilbert, J.W.H. Liu (Eds.), Graph Theory and Sparse Matrix Computation, Springer, Berlin, 1993","DOI":"10.1007\/978-1-4613-8369-7_7"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB5","doi-asserted-by":"crossref","first-page":"446","DOI":"10.1137\/0914027","article-title":"Optimal parallel solution of sparse triangular systems","volume":"14","author":"Alvarado","year":"1993","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB6","doi-asserted-by":"crossref","first-page":"452","DOI":"10.1109\/59.54552","article-title":"Partitioned sparse A\u22121 methods","volume":"3","author":"Alvarado","year":"1990","journal-title":"IEEE Trans. Power Syst."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB7","unstructured":"P.R. Amestoy, Factorization of large sparse matrices based on a multifrontal approach in a multiprocessor environment, INPT Ph.D. thesis TH\/PA\/91\/2, CERFACS, Toulouse, France, 1991"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB8","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1177\/109434208900300303","article-title":"Vectorization of a multiprocessor multifrontal code","volume":"3","author":"Amestoy","year":"1989","journal-title":"Inter. J. Supercomputer Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB9","doi-asserted-by":"crossref","unstructured":"P.R. Amestoy, I.S. Duff, J.Y. L'Excellent, Multifrontal solvers within the PARASOL environment, in: B. K\u00e5gstr\u00f6m, J. Dongarra, E. Elmroth, J. Wa\u015bniewski (Eds.), Applied Parallel Computing, PARA'98, Lecture Notes in Computer Science, No. 1541, Springer, Berlin, 1998, pp. 
7\u201311","DOI":"10.1007\/BFb0095312"},{"issue":"4","key":"10.1016\/S0167-8191(99)00077-0_BIB10","doi-asserted-by":"crossref","first-page":"275","DOI":"10.1002\/(SICI)1099-1506(199607\/08)3:4<275::AID-NLA83>3.0.CO;2-7","article-title":"Multifrontal QR factorization in a multiprocessor environment","volume":"3","author":"Amestoy","year":"1996","journal-title":"Numer. Linear Algebra with Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB11","unstructured":"E. Anderson, Z. Bai, C. Bischof, J. Demmel, J. Dongarra, J. DuCroz, A. Greenbaum, S. Hammarling, A. McKenney, S. Ostrouchov, D. Sorensen, LAPACK Users' Guide, Second ed., SIAM, Philadelphia, PA, 1995"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB12","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1142\/S0129053389000056","article-title":"Solving sparse triangular systems on parallel computers","volume":"1","author":"Anderson","year":"1989","journal-title":"Inter. J. High Speed Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB13","doi-asserted-by":"crossref","first-page":"47","DOI":"10.1137\/0913003","article-title":"A block projection method for sparse matrices","volume":"13","author":"Arioli","year":"1992","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB14","doi-asserted-by":"crossref","first-page":"766","DOI":"10.1137\/0612059","article-title":"Minimax polynomial preconditioning for Hermitian linear systems","volume":"12","author":"Ashby","year":"1991","journal-title":"SIAM J. Matrix Anal. Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB15","doi-asserted-by":"crossref","first-page":"593","DOI":"10.1137\/0911033","article-title":"A fan-in algorithm for distributed sparse numerical factorization","volume":"11","author":"Ashcraft","year":"1990","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB16","doi-asserted-by":"crossref","unstructured":"C. Ashcraft, S.C. Eisenstat, J.W.H. Liu, A.H. 
Sherman, A comparison of three column-based distributed sparse factorization schemes, Technical Report CS-90-09, Department of Computer Science, York University, York, Ontario, Canada, 1990","DOI":"10.21236\/ADA228143"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB17","unstructured":"C. Ashcraft, J.W.H. Liu, Robust ordering of sparse matrices using multisection, Technical Report ISSTECH-96-002, Boeing Information and Support Services, Seattle, 1996, also Report CS-96-01, Department of Computer Science, York University, Ontario, Canada"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB18","doi-asserted-by":"crossref","unstructured":"O. Axelsson, Iterative Solution Methods, Cambridge University Press, Cambridge, 1994","DOI":"10.1017\/CBO9780511624100"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB19","doi-asserted-by":"crossref","first-page":"53","DOI":"10.1137\/0709008","article-title":"Numerical stability in problems of linear algebra","volume":"9","author":"Babuska","year":"1972","journal-title":"SIAM J. Numer. Anal."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB20","doi-asserted-by":"crossref","first-page":"563","DOI":"10.1093\/imanum\/14.4.563","article-title":"A Newton basis GMRES implementation","volume":"14","author":"Bai","year":"1991","journal-title":"IMA J. Numer. Anal."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB21","unstructured":"S.T. Barnard, L.M. Bernardo, H.D. Simon, An MPI implementation of the SPAI preconditioner on the T3E, Technical Report LBNL-40794 UC405, Lawrence Berkeley National Laboratory, 1997"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB22","unstructured":"S.T. Barnard, R.L. Clay, A portable MPI implementation of the SPAI preconditioner in ISIS++, in: Michael Heath, Virginia Torczon, Greg Astfalk, Petter E. Bj\u00f6rstad, Alan H. Karp, Charles H. Koebel, V. Kumar, R.F. Lucas, Layne T. Watson, David E. Womble (Eds.), Proceedings of the Eighth SIAM Conference on Parallel Processing for Scientific Computing, 1997, pp. 
xxx\u2013yyy"},{"issue":"4","key":"10.1016\/S0167-8191(99)00077-0_BIB23","doi-asserted-by":"crossref","first-page":"317","DOI":"10.1002\/nla.1680020402","article-title":"A spectral algorithm for envelope reduction of sparse matrices","volume":"2","author":"Barnard","year":"1995","journal-title":"Numer. Linear Algebra Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB24","doi-asserted-by":"crossref","first-page":"1457","DOI":"10.1137\/0912079","article-title":"Parallelization of robust multigrid methods: ILU factorization and frequency decomposition method","volume":"6","author":"Bastian","year":"1991","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB25","doi-asserted-by":"crossref","first-page":"1135","DOI":"10.1137\/S1064827594271421","article-title":"A sparse approximate inverse preconditioner for the Conjugate Gradient method","volume":"17","author":"Benzi","year":"1996","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB26","unstructured":"M. Benzi, J. Mar\u0131\u0301n, M. T\u016fma, A two-level parallel preconditioner based on sparse approximate inverses, in: D.R. Kincaid, A.C. Elster (Eds.), Iterative Methods in Scientific Computation, II, IMACS, 1999, pp. 1\u201311"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB27","doi-asserted-by":"crossref","first-page":"234","DOI":"10.1007\/BF02512364","article-title":"Numerical experiments with two sparse approximate inverse preconditioners","volume":"38","author":"Benzi","year":"1998","journal-title":"BIT"},{"issue":"3","key":"10.1016\/S0167-8191(99)00077-0_BIB28","doi-asserted-by":"crossref","first-page":"968","DOI":"10.1137\/S1064827595294691","article-title":"A sparse approximate inverse preconditioner for non-symmetric linear systems","volume":"19","author":"Benzi","year":"1998","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB29","unstructured":"A. Berger, J. Mulvey, E. Rothberg, R. 
Vanderbei, Solving multistage stochastic programs using tree dissection, Technical Report SOR-97-07, Programs in Statistics and Operations Research, Princeton University, Princeton, New Jersey, 1995"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB30","doi-asserted-by":"crossref","first-page":"186","DOI":"10.1016\/0743-7315(90)90093-5","article-title":"Krylov methods preconditioned with incompletely factored matrices on the CM-2","volume":"8","author":"Berryman","year":"1990","journal-title":"J. Par. Dist. Comp."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB31","first-page":"51","article-title":"A parallel Interior Point algorithm for linear programming on a network of transputers","volume":"43","author":"Bisseling","year":"1993","journal-title":"Annal. Oper. Res."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB32","unstructured":"R.H. Bisseling, W.F. McColl, Scientific computing on bulk synchronous parallel architectures, in: B. Pehrson, I. Simon (Eds.), Technology and Foundations: Information Processing '94, vol. I, IFIP Trans. A, Elsevier, Amsterdam, 51, 1994, pp. 509\u2013514"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB33","doi-asserted-by":"crossref","unstructured":"L.S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R.C. Whaley, ScaLAPACK Users' Guide, SIAM, Philadelphia, PA, 1997","DOI":"10.1137\/1.9780898719642"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB34","unstructured":"S. Bondeli, Divide and Conquer: parallele Algorithmen zur L\u00f6sung tridiagonaler Gleichungssysteme, Ph.D. thesis, ETH Z\u00fcrich, Z\u00fcrich, 1991"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB35","doi-asserted-by":"crossref","first-page":"168","DOI":"10.1137\/0913010","article-title":"Row projection methods for large non-symmetric linear systems","volume":"13","author":"Bramley","year":"1992","journal-title":"SIAM J. Sci. Statist. 
Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB36","unstructured":"H.M. B\u00fccker, M. Sauren, A parallel version of the unsymmetric Lanczos algorithm and its application to QMR, Technical Report KFA-ZAM-IB-9605, Forschungszentrum J\u00fclich Gmbh, J\u00fclich, Germany, 1996"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB37","doi-asserted-by":"crossref","first-page":"627","DOI":"10.1137\/0707049","article-title":"On direct methods for solving Poisson's equations","volume":"7","author":"Buzbee","year":"1970","journal-title":"SIAM J. Numer. Anal."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB38","unstructured":"D.A. Calahan, Parallel solution of sparse simultaneous linear equations, in: Proceedings of the 11th Annual Allerton Conference on Circuits and System Theory, University of Illinois, 1973, pp. 729\u2013735"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB39","doi-asserted-by":"crossref","first-page":"338","DOI":"10.1137\/0915023","article-title":"A quasi-minimal residual variant of the Bi-CGSTAB algorithm for non-symmetric systems","volume":"15","author":"Chan","year":"1994","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB40","doi-asserted-by":"crossref","first-page":"794","DOI":"10.1137\/0911046","article-title":"A note on the efficiency of domain decomposed incomplete factorizations","volume":"11","author":"Chan","year":"1990","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB41","doi-asserted-by":"crossref","unstructured":"T.F. Chan, H.A. van der Vorst, Approximate and incomplete factorizations, in: D.E. Keyes, A. Sameh, V. Venkatakrishnan (Eds.), Parallel Numerical Algorithms, ICASE\/LaRC Interdisciplinary Series in Science and Engineering, Kluwer, Dordrecht, 1997, pp. 
167\u2013202","DOI":"10.1007\/978-94-011-5412-3_6"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB42","doi-asserted-by":"crossref","first-page":"270","DOI":"10.1145\/355791.355797","article-title":"Practical parallel band triangular system solvers","volume":"4","author":"Chen","year":"1978","journal-title":"ACM Trans. Math. Softw."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB43","unstructured":"H. Choi, D.B. Szyld, Threshold ordering for preconditioning non-symmetric problems with highly varying coefficients, Technical Report 96-51, Department of Mathematics, Temple University, Philadelphia, 1996"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB44","doi-asserted-by":"crossref","first-page":"995","DOI":"10.1137\/S1064827594270415","article-title":"Approximate inverse preconditioners via sparse-sparse iterations","volume":"19","author":"Chow","year":"1998","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB45","doi-asserted-by":"crossref","unstructured":"A.T. Chronopoulos, Towards efficient parallel implementation of the CG method applied to a class of block tridiagonal linear systems, in: Supercomputing '91, IEEE Computer Society Press, Los Alamitos, CA, 1991, pp. 578\u2013587","DOI":"10.1145\/125826.126134"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB46","doi-asserted-by":"crossref","first-page":"153","DOI":"10.1016\/0377-0427(89)90045-9","article-title":"s-Step iterative methods for symmetric linear systems","volume":"25","author":"Chronopoulos","year":"1989","journal-title":"J. Comput. Appl. Math."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB47","unstructured":"A.T. Chronopoulos, S.K. 
Kim, s-Step Orthomin and GMRES implemented on parallel computers, Technical Report 90\/43R, UMSI, Minneapolis, 1990"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB48","first-page":"4","article-title":"Towards efficient parallel implementation of Krylov subspace iterative methods","volume":"47","author":"Chronopoulos","year":"1992","journal-title":"Supercomputer"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB49","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1016\/0167-8191(96)00022-1","article-title":"Parallel iterative s-step methods for unsymmetric linear systems","volume":"22","author":"Chronopoulos","year":"1996","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB50","doi-asserted-by":"crossref","first-page":"220","DOI":"10.1137\/0906018","article-title":"Block preconditioning for the Conjugate Gradient method","volume":"6","author":"Concus","year":"1985","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB51","doi-asserted-by":"crossref","unstructured":"P. Concus, G. Meurant, On computing INV block preconditionings for the Conjugate Gradient method, BIT, 1986, pp. 493\u2013504","DOI":"10.1007\/BF01935055"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB52","unstructured":"J.M. Conroy, S.G. Kratzer, R.F. Lucas, Data-parallel sparse matrix factorization, in: J.G. Lewis (Ed.), Proceedings of the Fifth SIAM Conference on Linear Algebra, SIAM, Philadelphia, PA, 1994, pp. 377\u2013381"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB53","doi-asserted-by":"crossref","first-page":"91","DOI":"10.1080\/00207169208804097","article-title":"Approximate inverse preconditionings for sparse linear systems","volume":"44","author":"Cosgrove","year":"1992","journal-title":"Inter. J. Comput. Math."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB54","unstructured":"L. Crone, H. 
van der Vorst, Communication aspects of the Conjugate Gradient method on distributed-memory machines, Supercomputer X(6) (1993) 4\u20139"},{"issue":"1","key":"10.1016\/S0167-8191(99)00077-0_BIB55","doi-asserted-by":"crossref","first-page":"140","DOI":"10.1137\/S0895479894246905","article-title":"An unsymmetric-pattern multifrontal method for sparse LU factorization","volume":"18","author":"Davis","year":"1997","journal-title":"SIAM J. Matrix Anal. Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB56","doi-asserted-by":"crossref","first-page":"383","DOI":"10.1137\/0611028","article-title":"A non-deterministic parallel algorithm for general unsymmetric sparse LU factorization","volume":"11","author":"Davis","year":"1990","journal-title":"SIAM J. Matrix Anal. Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB57","doi-asserted-by":"crossref","unstructured":"E.F. D'Azevedo, V. Eijkhout, C. Romine, LAPACK working note 56: Reducing communication costs in the Conjugate Gradient algorithm on distributed memory multiprocessor, Technical report, Computer Science Department, University of Knoxville, Knoxville, TN, 1993","DOI":"10.2172\/10176473"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB58","doi-asserted-by":"crossref","unstructured":"E.F. D'Azevedo, C. Romine, Reducing communication costs in the Conjugate Gradient algorithm on distributed memory multiprocessors, Technical Report ORNL\/TM-12192, Oak Ridge National Lab, Oak Ridge, TN, 1992","DOI":"10.2172\/7172467"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB59","doi-asserted-by":"crossref","first-page":"117","DOI":"10.1016\/0168-9274(91)90046-3","article-title":"Base p-cyclic reduction for tridiagonal systems of equations","volume":"8","author":"de Groen","year":"1991","journal-title":"Appl. Numer. Math."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB60","doi-asserted-by":"crossref","unstructured":"J. De Keyser, D. Roose, Distributed mapping of SPMD programs with a generalized Kernighan-Lin heuristic, in: W. Gentzsch, U. 
Harms (Eds.), High-Performance Computing and Networking, Lecture Notes in Computer Science, Springer, Berlin, 797, 1994, pp. 227\u2013232","DOI":"10.1007\/3-540-57981-8_123"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB61","unstructured":"E. de Sturler, A parallel restructured version of GMRES(m), Technical Report 91-85, Delft University of Technology, Delft, 1991"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB62","unstructured":"E. de Sturler, Iterative methods on distributed memory computers, Ph.D. thesis, Delft University of Technology, Delft, The Netherlands, 1994"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB63","doi-asserted-by":"crossref","first-page":"57","DOI":"10.1016\/0167-8191(95)00057-7","article-title":"A performance model for Krylov subspace methods on mesh-based parallel computers","volume":"22","author":"de Sturler","year":"1996","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB64","doi-asserted-by":"crossref","first-page":"441","DOI":"10.1016\/0168-9274(95)00079-A","article-title":"Reducing the effect of global communication in GMRES(m) and CG on parallel distributed memory computers","volume":"18","author":"de Sturler","year":"1995","journal-title":"Appl. Numer. Math."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB65","doi-asserted-by":"crossref","unstructured":"J.W. Demmel, M.T. Heath, H.A. van der Vorst, Parallel numerical linear algebra, in: Acta Numerica 1993 Cambridge University Press, Cambridge, 1993","DOI":"10.1017\/S096249290000235X"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB66","doi-asserted-by":"crossref","first-page":"720","DOI":"10.1137\/S0895479895291765","article-title":"A supernodal approach to sparse partial pivoting","volume":"20","author":"Demmel","year":"1999","journal-title":"SIAM J. Matrix Anal. Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB67","unstructured":"J.W. Demmel, J.R. Gilbert, X.S. 
Li, SuperLU users' guide, Technical report, Computer Science Division, UC Berkeley, Berkeley, California, February 1995 (available from netlib)"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB68","doi-asserted-by":"crossref","first-page":"417","DOI":"10.1016\/0168-9274(91)90011-N","article-title":"On parallelism and convergence of incomplete LU factorizations","volume":"7","author":"Doi","year":"1991","journal-title":"Appl. Numer. Math."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB69","doi-asserted-by":"crossref","first-page":"143","DOI":"10.1080\/00207169208804100","article-title":"Large numbered multicolor MILU preconditioning on SX-3\/14","volume":"44","author":"Doi","year":"1992","journal-title":"Inter. J. Comput. Math."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB70","unstructured":"J.J. Dongarra, Performance of various computers using standard linear algebra software, Technical Report CS-89-85, University of Tennessee, Knoxville, Tennessee, 1999. Updated version at Web address http:\/\/www.netlib.org\/benchmark\/performance.ps"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB71","doi-asserted-by":"crossref","unstructured":"J.J. Dongarra, J.R. Bunch, C.B. Moler, G.W. Stewart, LINPACK User's Guide, SIAM, Philadelphia, 1979","DOI":"10.1137\/1.9781611971811"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB72","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/42288.42291","article-title":"An extended set of Fortran basic linear algebra subprograms","volume":"14","author":"Dongarra","year":"1988","journal-title":"ACM Trans. Math. Softw."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB73","doi-asserted-by":"crossref","first-page":"1","DOI":"10.1145\/77626.79170","article-title":"A set of Level 3 basic linear algebra subprograms","volume":"16","author":"Dongarra","year":"1990","journal-title":"ACM Trans. Math. Softw."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB74","doi-asserted-by":"crossref","unstructured":"J.J. Dongarra, I.S. Duff, D.C. Sorensen, H.A. 
van der Vorst, Numerical Linear Algebra for High-Performance Computers, SIAM Press, Philadelphia, 1998","DOI":"10.1137\/1.9780898719611"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB75","unstructured":"P. Dubois, G. Rodrigue, An analysis of the recursive doubling algorithm, in: D.J. Kuck, A.H. Sameh (Eds.), High Speed Computer and Algorithm Organization, Academic Press, New York, 1977"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB76","doi-asserted-by":"crossref","first-page":"257","DOI":"10.1007\/BF02243566","article-title":"Approximating the inverse of a matrix for use in iterative algorithms on vector processors","volume":"22","author":"Dubois","year":"1979","journal-title":"Computing"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB77","doi-asserted-by":"crossref","unstructured":"I.S. Duff, The use of vector and parallel computers in the solution of large sparse linear equations, in: P. Deuflhard, B. Engquist (Eds.), Large Scale Scientific Computing, Progress in Scientific Computing, vol. 7, Boston, Birkh\u00e4user, 1986, pp. 331\u2013348","DOI":"10.1007\/978-1-4684-6754-3_20"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB78","doi-asserted-by":"crossref","unstructured":"I.S. Duff, The influence of vector and parallel computers in the solution of large sparse linear equations, in: M.J.D. Powell, A. Iserles (Eds.), The State of the Art in Numerical Analysis, Oxford University Press, Oxford, 1987, pp. 359\u2013407","DOI":"10.1007\/978-1-4684-6754-3_20"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB79","doi-asserted-by":"crossref","unstructured":"I.S. Duff, A.M. Erisman, C.W. Gear, J.K. Reid, Sparsity structure and Gaussian elimination, SIGNUM Newsletter 23 (2) (1988) 2\u20138","DOI":"10.1145\/47917.47918"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB80","unstructured":"I.S. Duff, A.M. Erisman, J.K. 
Reid, Direct Methods for Sparse Matrices, Oxford University Press, Oxford, England, 1986"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB81","doi-asserted-by":"crossref","first-page":"889","DOI":"10.1137\/S0895479897317661","article-title":"The design and use of algorithms for permuting large entries to the diagonal of sparse matrices","volume":"20","author":"Duff","year":"1999","journal-title":"SIAM J. Matrix Anal. Appl."},{"issue":"3","key":"10.1016\/S0167-8191(99)00077-0_BIB82","doi-asserted-by":"crossref","first-page":"379","DOI":"10.1145\/275323.275327","article-title":"Level 3 basic linear algebra subprograms for sparse matrices: a user level interface","volume":"23","author":"Duff","year":"1997","journal-title":"ACM Trans. Math. Softw."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB83","doi-asserted-by":"crossref","first-page":"635","DOI":"10.1007\/BF01932738","article-title":"The effect of ordering on preconditioned Conjugate Gradient","volume":"29","author":"Duff","year":"1989","journal-title":"BIT"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB84","doi-asserted-by":"crossref","first-page":"302","DOI":"10.1145\/356044.356047","article-title":"The multifrontal solution of indefinite sparse symmetric linear systems","volume":"9","author":"Duff","year":"1983","journal-title":"ACM Trans. Math. Softw."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB85","doi-asserted-by":"crossref","first-page":"17","DOI":"10.1023\/A:1019122726788","article-title":"Two-dimensional block partitioning for the parallel sparse Cholesky factorization","volume":"16","author":"Dumitrescu","year":"1997","journal-title":"Numer. Algorithms"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB86","unstructured":"V. Eijkhout, Beware of unperturbed modified incomplete point factorizations, in: R. Beauwens, P. de Groen (Eds.), Iterative Methods in Linear Algebra, Amsterdam, 1992, pp. 583\u2013591; IMACS Int. Symp., Brussels, Belgium, North-Holland, Amsterdam, 1991, pp. 
2\u20134"},{"issue":"2","key":"10.1016\/S0167-8191(99)00077-0_BIB87","doi-asserted-by":"crossref","first-page":"352","DOI":"10.1137\/0721026","article-title":"Necessary and sufficient conditions for the existence of a Conjugate Gradient method","volume":"21","author":"Faber","year":"1984","journal-title":"SIAM J. Numer. Anal."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB88","doi-asserted-by":"crossref","unstructured":"R. Fletcher, Conjugate Gradient methods for indefinite systems, Lecture Notes Math., vol. 506, Springer, Berlin, New York, 1976, pp. 73\u201389","DOI":"10.1007\/BFb0080116"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB89","doi-asserted-by":"crossref","first-page":"315","DOI":"10.1007\/BF01385726","article-title":"QMR: A quasi-minimal residual method for non-Hermitian linear systems","volume":"60","author":"Freund","year":"1991","journal-title":"Numer. Math."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB90","doi-asserted-by":"crossref","first-page":"470","DOI":"10.1137\/0914029","article-title":"A transpose-free quasi-minimal residual algorithm for non-Hermitian linear systems","volume":"14","author":"Freund","year":"1993","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB91","doi-asserted-by":"crossref","first-page":"1291","DOI":"10.1016\/S0167-8191(96)00047-6","article-title":"Solving large non-symmetric sparse linear systems using MCSPARSE","volume":"22","author":"Gallivan","year":"1996","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB92","doi-asserted-by":"crossref","first-page":"291","DOI":"10.1007\/BF01407861","article-title":"Task scheduling for parallel sparse Cholesky factorization","volume":"18","author":"Geist","year":"1989","journal-title":"Inter. J. 
Parallel Programming"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB93","doi-asserted-by":"crossref","first-page":"345","DOI":"10.1137\/0710032","article-title":"Nested dissection of a regular finite element mesh","volume":"10","author":"George","year":"1973","journal-title":"SIAM J. Numer. Anal."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB94","doi-asserted-by":"crossref","first-page":"309","DOI":"10.1007\/BF01407878","article-title":"Solution of sparse positive-definite systems on a shared memory multiprocessor","volume":"15","author":"George","year":"1986","journal-title":"Inter. J. Parallel Programming"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB95","doi-asserted-by":"crossref","first-page":"327","DOI":"10.1137\/0909021","article-title":"Sparse Cholesky factorization on a local-memory multiprocessor","volume":"9","author":"George","year":"1988","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB96","doi-asserted-by":"crossref","first-page":"129","DOI":"10.1016\/0377-0427(89)90364-6","article-title":"Solution of sparse positive definite systems on a hypercube","volume":"27","author":"George","year":"1989","journal-title":"J. Comput. Appl. Math."},{"issue":"2","key":"10.1016\/S0167-8191(99)00077-0_BIB97","doi-asserted-by":"crossref","first-page":"139","DOI":"10.1145\/355826.355829","article-title":"The design of a user interface for a sparse matrix package","volume":"5","author":"George","year":"1979","journal-title":"ACM Trans. Math. Softw."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB98","unstructured":"A. George, J.W.H. Liu, E.G. 
Ng, User's guide for SPARSPAK: Waterloo sparse linear equations package, Technical Report CS-78-30 (Revised), University of Waterloo, Canada, 1980"},{"issue":"2","key":"10.1016\/S0167-8191(99)00077-0_BIB99","doi-asserted-by":"crossref","first-page":"9","DOI":"10.1145\/47917.47919","article-title":"Shared versus local memory in parallel sparse matrix computations","volume":"23","author":"George","year":"1988","journal-title":"SIGNUM Newsletter"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB100","doi-asserted-by":"crossref","first-page":"663","DOI":"10.1137\/0715044","article-title":"Incomplete nested dissection for solving n-by-n grid problems","volume":"15","author":"George","year":"1978","journal-title":"SIAM J. Numer. Anal."},{"issue":"8","key":"10.1016\/S0167-8191(99)00077-0_BIB101","doi-asserted-by":"crossref","first-page":"1339","DOI":"10.1016\/0167-8191(95)00024-I","article-title":"Exploiting large grain parallelism in a sparse direct linear system solver","volume":"21","author":"Geschiere","year":"1995","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB102","doi-asserted-by":"crossref","first-page":"334","DOI":"10.1137\/0614024","article-title":"Elimination structures for unsymmetric sparse LU factors","volume":"14","author":"Gilbert","year":"1993","journal-title":"SIAM J. Matrix Anal. Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB103","doi-asserted-by":"crossref","first-page":"1151","DOI":"10.1137\/0913067","article-title":"Highly parallel sparse Cholesky factorization","volume":"13","author":"Gilbert","year":"1992","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB104","doi-asserted-by":"crossref","first-page":"427","DOI":"10.1007\/BF01388998","article-title":"A parallel graph partitioning algorithm for a message-passing multiprocessor","volume":"16","author":"Gilbert","year":"1987","journal-title":"Inter. J. 
Parallel Programming"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB105","unstructured":"G.H. Golub, C.F. Van Loan, Matrix Computations, Third ed. The Johns Hopkins University Press, Baltimore, 1996"},{"issue":"2","key":"10.1016\/S0167-8191(99)00077-0_BIB106","doi-asserted-by":"crossref","first-page":"605","DOI":"10.1137\/S1064827595288425","article-title":"Sparse approximate-inverse preconditioners using norm-minimization techniques","volume":"19","author":"Gould","year":"1998","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB107","doi-asserted-by":"crossref","unstructured":"A. Greenbaum, Iterative Methods for Solving Linear Systems, SIAM, Philadelphia, 1997","DOI":"10.1137\/1.9781611970937"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB108","unstructured":"M. Grote, H. Simon, Parallel preconditioning and approximate inverses on the connection machine, in: R.F. Sincovec, D.E. Keyes, M.R. Leuze, L.R. Petzold, D.A. Reed (Eds.), Proceedings of the Sixth SIAM Conference on Parallel Processing for Scientific Computing, SIAM, Philadelphia, 1993, pp. 519\u2013523"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB109","doi-asserted-by":"crossref","first-page":"838","DOI":"10.1137\/S1064827594276552","article-title":"Parallel preconditionings with sparse approximate inverses","volume":"18","author":"Grote","year":"1997","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB110","unstructured":"A. Gupta, M. Joshi, V. Kumar, WSSMP: Watson Symmetric Sparse Matrix Package, Users Manual: Version 2.0\u03b2, Technical Report RC 20923 (92669), IBM T.J. Watson Research Centre, P.O. Box 218, Yorktown Heights, NY 10598, July 1997"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB111","doi-asserted-by":"crossref","first-page":"502","DOI":"10.1109\/71.598277","article-title":"Highly scalable parallel algorithms for sparse matrix factorization","volume":"8","author":"Gupta","year":"1997","journal-title":"IEEE Trans. 
Parallel and Distributed Systems"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB112","doi-asserted-by":"crossref","first-page":"201","DOI":"10.1016\/0045-7825(86)90053-8","article-title":"A preconditioning technique based on element matrix factorizations","volume":"55","author":"Gustafsson","year":"1986","journal-title":"Comput. Meth. Appl. Mech. Engrg."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB113","doi-asserted-by":"crossref","first-page":"447","DOI":"10.1002\/nla.1680020506","article-title":"Completely parallelizable preconditioning methods","volume":"2","author":"Gustafsson","year":"1995","journal-title":"Numer. Linear Algebra Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB114","doi-asserted-by":"crossref","first-page":"1020","DOI":"10.1137\/0914062","article-title":"Variants of BICGSTAB for matrices with complex spectrum","volume":"14","author":"Gutknecht","year":"1993","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB115","doi-asserted-by":"crossref","unstructured":"M.H. Gutknecht, Lanczos-type solvers for non-symmetric linear systems of equations, in: Acta Numerica 1997, Cambridge University Press, Cambridge, 1997, pp. 271\u2013397","DOI":"10.1017\/S0962492900002737"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB116","doi-asserted-by":"crossref","first-page":"1685","DOI":"10.1016\/S0167-8191(98)00046-5","article-title":"Parallel incomplete Cholesky preconditioners based on the non-overlapping data distribution","volume":"24","author":"Haase","year":"1998","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB117","doi-asserted-by":"crossref","first-page":"420","DOI":"10.1137\/1033099","article-title":"Parallel algorithms for sparse linear systems","volume":"33","author":"Heath","year":"1991","journal-title":"SIAM Review"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB118","doi-asserted-by":"crossref","unstructured":"M.T. Heath, P. 
Raghavan (Eds.), Performance of a fully parallel sparse solver, in: IEEE, Proceedings of SHPCC '94, Scalable High-Performance Computing Conference, May 23\u201325, 1994; IEEE Computer Society Press, Knoxville, Tennessee, Los Alamitos, California, 1994, pp. 334\u2013341","DOI":"10.1109\/SHPCC.1994.296662"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB119","doi-asserted-by":"crossref","first-page":"1527","DOI":"10.1016\/S0167-8191(96)00055-5","article-title":"Parallel solvers for non-linear elliptic problems based on domain decomposition ideas","volume":"22","author":"Heisse","year":"1997","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB120","doi-asserted-by":"crossref","first-page":"484","DOI":"10.1137\/0713042","article-title":"Some aspects of the cyclic reduction algorithm for block tridiagonal linear systems","volume":"13","author":"Heller","year":"1978","journal-title":"SIAM J. Numer. Anal."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB121","doi-asserted-by":"crossref","unstructured":"B. Hendrickson, R. Leland, The CHACO User's Guide, Version 2.0. Technical Report SAND94-2692, Sandia National Laboratories, Albuquerque, October 1994","DOI":"10.2172\/10106339"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB122","doi-asserted-by":"crossref","first-page":"468","DOI":"10.1137\/S1064827596300656","article-title":"Improving the runtime and quality of nested dissection ordering","volume":"20","author":"Hendrickson","year":"1998","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB123","doi-asserted-by":"crossref","first-page":"409","DOI":"10.6028\/jres.049.044","article-title":"Methods of Conjugate Gradients for solving linear systems","volume":"49","author":"Hestenes","year":"1954","journal-title":"J. Res. Natl. Bur. 
Stand."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB124","doi-asserted-by":"crossref","first-page":"241","DOI":"10.1016\/0045-7825(83)90115-9","article-title":"An element-by-element solution algorithm for problems of structural and solid mechanics","volume":"36","author":"Hughes","year":"1983","journal-title":"J. Comput. Meth. Appl. Mech. Engrg."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB125","unstructured":"Z. Johan, Data Parallel Finite Element Techniques for Large-Scale Computational Fluid Dynamics, Ph.D. thesis, Stanford University, Stanford, CA, 1992"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB126","doi-asserted-by":"crossref","unstructured":"Z. Johan, K.K. Mathur, S.L. Johnsson, T.J.R. Hughes, Mesh decomposition and communication procedures for finite element applications on the connection machine CM-5 system, in: W. Gentzsch, U. Harms (Eds.), High-Performance Computing and Networking, Lecture Notes in Computer Science, Springer, Berlin, 797, 1994, pp. 233\u2013240","DOI":"10.1007\/3-540-57981-8_124"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB127","doi-asserted-by":"crossref","first-page":"363","DOI":"10.1137\/0720025","article-title":"Polynomial preconditioning for Conjugate Gradient calculations","volume":"20","author":"Johnson","year":"1983","journal-title":"SIAM J. Numer. Anal."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB128","doi-asserted-by":"crossref","unstructured":"M.T. Jones, P.E. Plassmann, The efficient parallel iterative solution of large sparse linear systems, in: A. George, J.R. Gilbert, J.W.H. Liu (Eds.), Graph Theory and Sparse Matrix Computations, IMA vol 56, Springer, Berlin, 1994","DOI":"10.1007\/978-1-4613-8369-7_11"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB129","unstructured":"G. Karypis, V. Kumar, METIS: unstructured graph partitioning and sparse matrix ordering system, Technical report, Department of Computer Science, University of Minnesota, 1995"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB130","unstructured":"G. Karypis, V. 
Kumar, Parallel multilevel graph partitioning, Technical Report TR-95-036, Department of Computer Science, University of Minnesota, May 1995"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB131","doi-asserted-by":"crossref","unstructured":"D.R. Kincaid, T.C. Oppe, Recent vectorization and parallelization of ITPACKV, in: O. Axelsson, L.Yu. Kolotilina (Eds.), Preconditioned Conjugate Gradient Methods, Berlin, 1990, pp. 58\u201374; Lecture Notes in Mathematics 1457, Nijmegen 1989, Springer, Berlin","DOI":"10.1007\/BFb0090902"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB132","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1137\/0614004","article-title":"Factorized sparse approximate inverse preconditionings","volume":"14","author":"Kolotilina","year":"1993","journal-title":"SIAM J. Matrix Anal. Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB133","unstructured":"J. Koster, R.H. Bisseling, Parallel sparse LU decomposition on a distributed-memory multiprocessor, 1994, submitted to SIAM J. Scientific Computing"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB134","doi-asserted-by":"crossref","first-page":"767","DOI":"10.1137\/0911045","article-title":"Two-color Fourier analysis of iterative algorithms for elliptic problems with red\/black ordering","volume":"11","author":"Kuo","year":"1990","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB135","doi-asserted-by":"crossref","unstructured":"J.J. Lambiotte, R.G. Voigt, The solution of tridiagonal linear systems on the CDC-STAR-100 computer, Technical report, ICASE-NASA Langley Research Center, Hampton, VA, 1974","DOI":"10.1145\/355656.355658"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB136","doi-asserted-by":"crossref","first-page":"33","DOI":"10.6028\/jres.049.006","article-title":"Solution of systems of linear equations by minimized iterations","volume":"49","author":"Lanczos","year":"1952","journal-title":"J. Res. Natl. Bur. 
Stand"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB137","doi-asserted-by":"crossref","first-page":"308","DOI":"10.1145\/355841.355847","article-title":"Basic linear algebra subprograms for Fortran usage","volume":"5","author":"Lawson","year":"1979","journal-title":"ACM Trans. Math. Softw."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB138","doi-asserted-by":"crossref","first-page":"1005","DOI":"10.1016\/S0167-8191(97)00004-5","article-title":"A block variant of the GMRES method on massively parallel processors","volume":"23","author":"Li","year":"1997","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB139","unstructured":"X.S. Li, J.W. Demmel, Making sparse Gaussian elimination scalable by static pivoting, in: Proceedings of Supercomputing Orlando, Florida, November 1998"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB140","doi-asserted-by":"crossref","first-page":"249","DOI":"10.1145\/7921.11325","article-title":"On the storage requirement in the out-of-core multifrontal method for sparse factorization","volume":"12","author":"Liu","year":"1987","journal-title":"ACM Trans. Math. Softw."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB141","doi-asserted-by":"crossref","first-page":"73","DOI":"10.1016\/0167-8191(89)90064-1","article-title":"Reordering sparse matrices for parallel elimination","volume":"11","author":"Liu","year":"1989","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB142","doi-asserted-by":"crossref","first-page":"41","DOI":"10.1016\/0020-0190(76)90077-6","article-title":"Matrix multiplication by diagonals on a vector\/parallel processor","volume":"5","author":"Madsen","year":"1976","journal-title":"Inform. Process. Lett."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB143","doi-asserted-by":"crossref","unstructured":"F. Manne, H. Hafsteinsson, Efficient sparse Cholesky factorization on a massively parallel SIMD computer, SIAM J. Sci. Comput. 
16(4) (1995) 934\u2013950","DOI":"10.1137\/0916054"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB144","doi-asserted-by":"crossref","unstructured":"H.M. Markowitz, The elimination form of the inverse and its application to linear programming, Management Science, 3, 1957, pp. 255\u2013269","DOI":"10.1287\/mnsc.3.3.255"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB145","doi-asserted-by":"crossref","first-page":"227","DOI":"10.1137\/0908023","article-title":"Hypercube algorithms and implementations","volume":"8","author":"McBryan","year":"1987","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB146","unstructured":"V. Mehrmann, Divide and conquer methods for block tridiagonal systems, Technical Report Bericht Nr. 68, Inst. f\u00fcr Geometrie und Prakt. Math., RWTH, Aachen, 1991"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB147","doi-asserted-by":"crossref","first-page":"33","DOI":"10.1016\/0167-8191(85)90016-X","article-title":"A parallel partition method for solving banded systems of linear equations","volume":"2","author":"Meier","year":"1985","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB148","unstructured":"U. Meier, A. Sameh, The behavior of Conjugate Gradient algorithms on a multivector processor with a hierarchical memory, Technical Report CSRD 758, University of Illinois, Urbana, IL, 1988"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB149","first-page":"148","article-title":"An iterative solution method for linear systems of which the coefficient matrix is a symmetric M-matrix","volume":"31","author":"Meijerink","year":"1977","journal-title":"Math. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB150","doi-asserted-by":"crossref","first-page":"623","DOI":"10.1007\/BF01934919","article-title":"The block preconditioned Conjugate Gradient method on vector computers","volume":"24","author":"Meurant","year":"1984","journal-title":"BIT"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB151","unstructured":"G. 
Meurant, Numerical experiments for the preconditioned Conjugate Gradient method on the CRAY X-MP\/2, Technical Report LBL-18023, University of California, Berkeley, CA, 1984"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB152","doi-asserted-by":"crossref","unstructured":"G. Meurant, The Conjugate Gradient method on vector and parallel supercomputers, Technical Report CTAC-89, University of Brisbane, July 1989","DOI":"10.1016\/0010-4655(89)90179-3"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB153","unstructured":"P.H. Michielse, Parallelism in Adaptive Multigrid Solvers, Ph.D. thesis, Delft University of Technology, Delft, 1990"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB154","doi-asserted-by":"crossref","first-page":"87","DOI":"10.1016\/0167-8191(88)90099-3","article-title":"Data transport in Wang's partition method","volume":"7","author":"Michielse","year":"1988","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB155","doi-asserted-by":"crossref","first-page":"1034","DOI":"10.1137\/0914063","article-title":"Block sparse Cholesky algorithms on advanced uniprocessor computers","volume":"14","author":"Ng","year":"1993","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB156","doi-asserted-by":"crossref","first-page":"761","DOI":"10.1137\/0914048","article-title":"A supernodal Cholesky factorization algorithm for shared-memory multiprocessors","volume":"14","author":"Ng","year":"1993","journal-title":"SIAM J. Sci. 
Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB157","doi-asserted-by":"crossref","first-page":"293","DOI":"10.1016\/0024-3795(80)90247-5","article-title":"The block Conjugate Gradient algorithm and related methods","volume":"29","author":"O'Leary","year":"1980","journal-title":"Linear Algebra Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB158","doi-asserted-by":"crossref","first-page":"127","DOI":"10.1016\/0167-8191(87)90013-5","article-title":"Parallel implementation of the Block Conjugate Gradient Algorithm","volume":"5","author":"O'Leary","year":"1987","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB159","doi-asserted-by":"crossref","unstructured":"J.M. Ortega, Introduction to Parallel and Vector Solution of Linear Systems, Plenum Press, New York, London, 1988","DOI":"10.1007\/978-1-4899-2112-3"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB160","doi-asserted-by":"crossref","first-page":"617","DOI":"10.1137\/0712047","article-title":"Solution of sparse indefinite systems of linear equations","volume":"12","author":"Paige","year":"1975","journal-title":"SIAM J. Numer. Anal."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB161","doi-asserted-by":"crossref","unstructured":"B.W. Peyton, A. Pothen, X. Yuan, Partitioning a chordal graph into transitive subgraphs for parallel sparse triangular solution, Technical Report ORNL\/TM-12270, Engineering Physics and Mathematics Division, Oak Ridge National Laboratory, Tennessee, December 1992","DOI":"10.2172\/10121667"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB162","unstructured":"C. Pommerell, Solution of large unsymmetric systems of linear equations, Ph.D. 
thesis, Swiss Federal Institute of Technology, Z\u00fcrich, 1992"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB163","doi-asserted-by":"crossref","first-page":"194","DOI":"10.1137\/0913011","article-title":"A set of new mapping and coloring heuristics for distributed-memory parallel computers","volume":"13","author":"Pommerell","year":"1992","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB164","doi-asserted-by":"crossref","unstructured":"A. Pothen, C. Sun, A mapping algorithm for parallel sparse Cholesky factorization, SIAM J. Sci. Comput. 14(5) (1993) 1253\u20131257, timely communication","DOI":"10.1137\/0914074"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB165","doi-asserted-by":"crossref","first-page":"223","DOI":"10.1016\/0167-8191(89)90030-6","article-title":"Parallel Conjugate Gradient-like algorithms for solving sparse non-symmetric systems on a vector multiprocessor","volume":"11","author":"Radicati di Brozolo","year":"1989","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB166","unstructured":"G. Radicati di Brozolo, M. Vitaletti, Sparse matrix\u2013vector product and storage representations on the IBM 3090 with Vector Facility, Technical Report, IBM-ECSEC, Rome, July 1986, pp. 513\u20134098"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB167","unstructured":"P. Raghavan, Efficient parallel sparse triangular solution with selective inversion, Technical Report CS-95-314, Department of Computer Science, University of Tennessee, Knoxville, Tennessee, 1995"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB168","doi-asserted-by":"crossref","first-page":"1045","DOI":"10.1016\/S0167-8191(97)00018-5","article-title":"Parallel ordering using edge contraction","volume":"23","author":"Raghavan","year":"1997","journal-title":"Parallel Computing"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB169","unstructured":"E. 
Rothberg, Exploring the tradeoff between imbalance and separator size in nested dissection ordering, Technical Report Unnumbered, Silicon Graphics Inc, 1996"},{"issue":"3","key":"10.1016\/S0167-8191(99)00077-0_BIB170","doi-asserted-by":"crossref","first-page":"699","DOI":"10.1137\/S106482759426715X","article-title":"Performance of panel and block approaches to sparse Cholesky factorization on the iPSC\/860 and Paragon multicomputers","volume":"17","author":"Rothberg","year":"1996","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB171","doi-asserted-by":"crossref","unstructured":"E. Rothberg, A. Gupta, An evaluation of left-looking, right-looking and multifrontal approaches to sparse Cholesky factorization on hierarchical-memory machines, Technical Report STAN-CS-91-1377, Department of Computer Science, Stanford University, 1991","DOI":"10.21236\/ADA326874"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB172","doi-asserted-by":"crossref","unstructured":"E. Rothberg, A. Gupta, An efficient block-oriented approach to parallel sparse Cholesky factorization, SIAM J. Sci. Comput. 15(6) (1994) 1413\u20131439","DOI":"10.1137\/0915085"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB173","doi-asserted-by":"crossref","unstructured":"E. Rothberg, R. Schreiber, Improved load distribution in parallel sparse Cholesky factorization, Technical Report 94-13, Research Institute for Advanced Computer Science, 1994","DOI":"10.1109\/SUPERC.1994.344344"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB174","doi-asserted-by":"crossref","first-page":"865","DOI":"10.1137\/0906059","article-title":"Practical use of polynomial preconditionings for the Conjugate Gradient method","volume":"6","author":"Saad","year":"1985","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB175","unstructured":"Y. 
Saad, Krylov subspace methods on supercomputers, Technical report, RIACS, Moffett Field, CA, September 1988"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB176","unstructured":"Y. Saad, Iterative methods for sparse linear systems, PWS Publishing Company, Boston, 1996"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB177","doi-asserted-by":"crossref","first-page":"856","DOI":"10.1137\/0907058","article-title":"GMRES: a generalized minimal residual algorithm for solving non-symmetric linear systems","volume":"7","author":"Saad","year":"1986","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB178","unstructured":"J.J.F.M. Schlichting, H.A. van der Vorst, Solving bidiagonal systems of linear equations on the CDC CYBER 205, Technical Report NM-R8725, CWI, Amsterdam, The Netherlands, 1987"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB179","doi-asserted-by":"crossref","first-page":"323","DOI":"10.1016\/0377-0427(89)90373-7","article-title":"Solving 3-D block bidiagonal linear systems on vector computers","volume":"27","author":"Schlichting","year":"1989","journal-title":"J. Comput. Appl. Math."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB180","doi-asserted-by":"crossref","unstructured":"R. Schreiber, Scalability of sparse direct solvers, in: A. George, J.R. Gilbert, J.W.H. Liu (Eds.), Graph Theory and Sparse Matrix Computation, The IMA Volumes in Mathematics and its Applications, Springer, Berlin, New York, 56, 1993, pp. 191\u2013209","DOI":"10.1007\/978-1-4613-8369-7_9"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB181","doi-asserted-by":"crossref","unstructured":"H.D. 
Simon, Partitioning of unstructured problems for parallel processing, Technical Report RNR-91-008, NASA Ames Research Center, Moffett Field, CA, 1991","DOI":"10.1016\/0956-0521(91)90014-V"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB182","doi-asserted-by":"crossref","first-page":"917","DOI":"10.1137\/0916053","article-title":"An iterative method for non-symmetric systems with multiple right-hand sides","volume":"16","author":"Simoncini","year":"1995","journal-title":"SIAM J. Sci. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB183","first-page":"11","article-title":"BICGSTAB (\u2113) for linear equations involving unsymmetric matrices with complex spectrum","volume":"1","author":"Sleijpen","year":"1993","journal-title":"ETNA"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB184","doi-asserted-by":"crossref","first-page":"141","DOI":"10.1007\/BF02309342","article-title":"Reliable updated residuals in hybrid Bi\u2013CG methods","volume":"56","author":"Sleijpen","year":"1996","journal-title":"Computing"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB185","doi-asserted-by":"crossref","first-page":"75","DOI":"10.1007\/BF02141261","article-title":"Bi-CGSTAB (\u2113) other hybrid Bi\u2013CG methods","volume":"7","author":"Sleijpen","year":"1994","journal-title":"Numer. Algorithms"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB186","doi-asserted-by":"crossref","first-page":"36","DOI":"10.1137\/0910004","article-title":"CGS: a fast Lanczos-type solver for non-symmetric linear systems","volume":"10","author":"Sonneveld","year":"1989","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB187","doi-asserted-by":"crossref","first-page":"27","DOI":"10.1145\/321738.321741","article-title":"An efficient parallel algorithm for the solution of a tridiagonal linear system of equations","volume":"20","author":"Stone","year":"1973","journal-title":"J. Assoc. Comput. 
Mach."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB188","doi-asserted-by":"crossref","first-page":"287","DOI":"10.1016\/S0167-8191(98)00007-6","article-title":"Performance of parallel solution of a block-tridiagonal linear system on a Fujitsu VPP 500","volume":"24","author":"Sumiyoshin","year":"1998","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB189","unstructured":"C. Sun, Efficient parallel solutions of large sparse SPD systems on distributed-memory multiprocessors, Technical Report CTC92TR102, Advanced Computing Research Institute, Cornell University, Ithaca, NY, 1992"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB190","unstructured":"C. Sun, A package for solving sparse symmetric positive definite systems on distributed-memory multiprocessors, Technical Report CTC92TR114, Advanced Computing Research Institute, Cornell University, Ithaca, NY, November 1992"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB191","first-page":"18","article-title":"A parallel and vector variant of the cyclic reduction algorithm","volume":"22","author":"Sweet","year":"1987","journal-title":"Supercomputer"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB192","doi-asserted-by":"crossref","unstructured":"X.-H. Sun, H.-Z. Sun, L. Ni, Parallel algorithms for solution of tridiagonal systems on multicomputers, Technical report, Dept. of Computer Science, Michigan State University, 1989","DOI":"10.1145\/318789.318822"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB193","unstructured":"K.H. Tan, Local coupling in domain decomposition, Ph.D. thesis, Utrecht University, Utrecht, The Netherlands, 1995"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB194","doi-asserted-by":"crossref","first-page":"1801","DOI":"10.1109\/PROC.1967.6011","article-title":"Direct solutions of sparse network equations by optimally ordered triangular factorization","volume":"55","author":"Tinney","year":"1967","journal-title":"Proc. 
IEEE"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB195","doi-asserted-by":"crossref","first-page":"853","DOI":"10.1137\/0614059","article-title":"Parallel sparse LU decomposition on a mesh network of transputers","volume":"14","author":"van der Stappen","year":"1993","journal-title":"SIAM J. Matrix Anal. Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB196","doi-asserted-by":"crossref","first-page":"86","DOI":"10.1137\/0903021","article-title":"A vectorizable variant of some ICCG methods","volume":"3","author":"van der Vorst","year":"1982","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB197","doi-asserted-by":"crossref","first-page":"49","DOI":"10.1016\/0167-8191(86)90006-2","article-title":"The performance of Fortran implementations for preconditioned Conjugate Gradients on vector computers","volume":"3","author":"van der Vorst","year":"1986","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB198","doi-asserted-by":"crossref","first-page":"45","DOI":"10.1016\/0167-8191(87)90005-6","article-title":"Large tridiagonal and block tridiagonal linear systems on vector and parallel computers","volume":"5","author":"van der Vorst","year":"1987","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB199","doi-asserted-by":"crossref","first-page":"1174","DOI":"10.1137\/0910071","article-title":"High performance preconditioning","volume":"10","author":"van der Vorst","year":"1989","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB200","doi-asserted-by":"crossref","first-page":"631","DOI":"10.1137\/0913035","article-title":"Bi-CGSTAB: A fast and smoothly converging variant of Bi\u2013CG for the solution of non-symmetric linear systems","volume":"13","author":"van der Vorst","year":"1992","journal-title":"SIAM J. Sci. Statist. Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB201","unstructured":"H.A. van der Vorst, J.M. 
van Kats, The performance of some linear algebra algorithms in FORTRAN on CRAY-1 and Cyber-205 supercomputers, Technical report, Academisch Computer Centrum, Utrecht, 1984"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB202","unstructured":"A.C.N. van Duin, Parallel Sparse Matrix Computations, Ph.D. thesis, Leiden University, Leiden, The Netherlands, 1998"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB203","doi-asserted-by":"crossref","unstructured":"M.B. van Gijzen, Iterative solution methods for linear equations in finite element computations, Ph.D. thesis, Delft University of Technology, Delft, The Netherlands, 1994","DOI":"10.1007\/BFb0046706"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB204","doi-asserted-by":"crossref","first-page":"1927","DOI":"10.1016\/S0167-8191(98)00084-2","article-title":"Parallelism in ILU-preconditioned GMRES","volume":"24","author":"Vuik","year":"1998","journal-title":"Parallel Comput."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB205","doi-asserted-by":"crossref","first-page":"170","DOI":"10.1145\/355945.355947","article-title":"A parallel method for tridiagonal equations","volume":"7","author":"Wang","year":"1989","journal-title":"ACM Trans. Math. Softw."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB206","doi-asserted-by":"crossref","first-page":"533","DOI":"10.1002\/nla.1680010603","article-title":"Parallel block preconditioning based on SSOR and MILU","volume":"1","author":"Washio","year":"1994","journal-title":"Numer. Linear Algebra Appl."},{"key":"10.1016\/S0167-8191(99)00077-0_BIB207","unstructured":"R.C. Whaley, Lapack working note 73 : Basic linear algebra communication subprograms: Analysis and implementation across multiple parallel architectures. Technical Report CS-94-234, Computer Science Department, University of Tennessee, Knoxville, Tennessee, May 1994"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB208","doi-asserted-by":"crossref","unstructured":"J.H. Wilkinson, C. Reinsch, Handbook for automatic computation, Linear Algebra, vol. 
II, Springer, Berlin, 1971","DOI":"10.1007\/978-3-642-86940-2"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB209","unstructured":"J. Zhang, A sparse approximate inverse technique for parallel preconditioning of general sparse matrices, Technical Report 281-98, Department of Computer Science, University of Kentucky, KY, 1998"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB210","unstructured":"Z. Zlatev, J. Wa\u015bniewski, P.C. Hansen, Tz. Ostromsky, PARASPAR: a package for the solution of large linear algebraic equations on parallel computers with shared memory, Technical Report 95-10, Tech Univ Denmark, Lyngby, 1995"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB211","unstructured":"Z. Zlatev, J. Wa\u015bniewski, K. Schaumburg, Introduction to PARASPAR, solution of large and sparse systems of linear algebraic equations, specialised for parallel computers with shared memory. Technical Report 93-02, Tech Univ Denmark, Lyngby, 1993"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB212","unstructured":"E. Zmijewski, Sparse Cholesky Factorization on a Multiprocessor, Ph.D. 
thesis, Cornell University, 1987"},{"key":"10.1016\/S0167-8191(99)00077-0_BIB213","doi-asserted-by":"crossref","first-page":"199","DOI":"10.1016\/0167-8191(88)90039-7","article-title":"A parallel algorithm for sparse symbolic Cholesky factorization on a multiprocessor","volume":"7","author":"Zmijewski","year":"1988","journal-title":"Parallel Comput."}],"container-title":["Parallel Computing"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S0167819199000770?httpAccept=text\/xml","content-type":"text\/xml","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/api.elsevier.com\/content\/article\/PII:S0167819199000770?httpAccept=text\/plain","content-type":"text\/plain","content-version":"vor","intended-application":"text-mining"}],"deposited":{"date-parts":[[2021,5,6]],"date-time":"2021-05-06T07:13:48Z","timestamp":1620285228000},"score":1,"resource":{"primary":{"URL":"https:\/\/linkinghub.elsevier.com\/retrieve\/pii\/S0167819199000770"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[1999,12]]},"references-count":213,"journal-issue":{"issue":"13-14","published-print":{"date-parts":[[1999,12]]}},"alternative-id":["S0167819199000770"],"URL":"https:\/\/doi.org\/10.1016\/s0167-8191(99)00077-0","relation":{},"ISSN":["0167-8191"],"issn-type":[{"value":"0167-8191","type":"print"}],"subject":[],"published":{"date-parts":[[1999,12]]}}}