A High Performance Multifrontal Code for Linear Solution of Structures Using Multi-Core Microprocessors |
| |
作者单位: | Computer Aided Structural Engineering Center,School of Civil and Environmental Engineering Georgia Institute of Technology |
| |
摘 要: | A multifrontal code is introduced for the efficient solution of the linear system of equations arising from the analysis of structures. The factorization phase is reduced into a series of interleaved element assembly and dense matrix operations for which the BLAS3 kernels are used. A similar approach is generalized for the forward and back substitution phases for the efficient solution of structures having multiple load conditions. The program performs all assembly and solution steps in parallel. Examples are presented which demonstrate the code’s performance on single and dual core processor computers.
|
关 键 词: | multifrontal method Cholesky decomposition high performance computing finite element method multi-core programming BLAS3 parallel computing |
A High Performance Multifrontal Code for Linear Solution of Structures Using Multi—Core Microprocessors |
| |
Authors: | Efe Guney Kenneth Will |
| |
Institution: | Computer Aided Structural Engineering Center, School of Civil and Environmental Engineering Georgia Institute of Technology |
| |
Abstract: | A multifrontal code is introduced for the efficient solution of the linear system of equations arising from the analysis of structures. The factorization phase is reduced into a series of interleaved element assembly and dense matrix operations for which the BLAS3 kernels are used. A similar approach is generalized for the forward and back substitution phases for the efficient solution of structures having multiple load conditions. The program performs all assembly and solution steps in parallel. Examples are presented which demonstrate the code's performance on single and dual core processor computers. |
| |
Keywords: | multifrontal method Cholesky decomposition high performance computing finite element method multi-core programming BLAS3 parallel computing |
本文献已被 CNKI 维普 万方数据 等数据库收录! |