Carolin Penke

Forschungszentrum Jülich · Jülich Supercomputing Centre (JSC)

Doctor of Philosophy
GPU Infrastructure @ OpenGPT-X

Publications: 17
Reads: 1,081
Citations: 36

Publications (17)
Preprint
Full-text available
The rapid advancement of machine learning (ML) technologies has driven the development of specialized hardware accelerators designed to facilitate more efficient model training. This paper introduces the CARAML benchmark suite, which is employed to assess performance and energy consumption during the training of transformer-based large language mod...
Preprint
Full-text available
Benchmarks are essential in the design of modern HPC installations, as they define key aspects of system components. Beyond synthetic workloads, it is crucial to include real applications that represent user requirements into benchmark suites, to guarantee high usability and widespread adoption of a new system. Given the significant investments in...
Preprint
We devise a spectral divide-and-conquer scheme for matrices that are self-adjoint with respect to a given indefinite scalar product (i.e. pseudosymmetric matrices). The pseudosymmetric structure of the matrix is preserved in the spectral division, such that the method can be applied recursively to achieve full diagonalization. The method is well-sui...
Article
Optical properties of materials related to light absorption and scattering are explained by the excitation of electrons. The Bethe–Salpeter equation is the state-of-the-art approach to describe these processes from first principles (ab initio), i.e. without the need for empirical data in the model. To harness the predictive power of the equation, i...
Preprint
Full-text available
We present methods for computing the generalized polar decomposition of a matrix based on the dynamically weighted Halley (DWH) iteration. This method is well established for computing the standard polar decomposition. A stable implementation is available, where matrix inversion is avoided and QR decompositions are used instead. We establish a natu...
Article
Full-text available
For a given matrix, we are interested in computing GR decompositions A = GR, where G is an isometry with respect to given scalar products. The orthogonal QR decomposition is the representative for the Euclidean scalar product. For a signature matrix, a respective factorization is given as the hyperbolic QR decomposition. Considering a skew‐symmetri...
Preprint
Full-text available
Optical properties of materials related to light absorption and scattering are explained by the excitation of electrons. The Bethe-Salpeter equation is the state-of-the-art approach to describe these processes from first principles (ab initio), i.e. without the need for empirical data in the model. To harness the predictive power of the equation, i...
Preprint
Full-text available
For a given matrix, we are interested in computing GR decompositions $A=GR$, where $G$ is an isometry with respect to given scalar products. The orthogonal QR decomposition is the representative for the Euclidean scalar product. For a signature matrix, a respective factorization is given as the hyperbolic QR decomposition. Considering a skew-symmet...
Article
We present a high-performance solver for dense skew-symmetric matrix eigenvalue problems. Our work is motivated by applications in computational quantum physics, where one solution approach to solve the Bethe-Salpeter equation involves the solution of a large, dense, skew-symmetric eigenvalue problem. The computed eigenpairs can be used to compute...
Preprint
Full-text available
We present a high-performance solver for dense skew-symmetric matrix eigenvalue problems. Our work is motivated by applications in computational quantum physics, where one solution approach to solve the so-called Bethe-Salpeter equation involves the solution of a large, dense, skew-symmetric eigenvalue problem. The computed eigenpairs can be used t...
Article
The Bethe‐Salpeter eigenvalue problem arises in the computation of the electronic structure of many‐body physical systems. The resulting matrix is complex, admits a certain block structure and can become extremely large. This raises the need for structure‐preserving algorithms running in parallel on high performance compute clusters. In this paper...
Conference Paper
The LAPACK routines GEQRT2 and GEQRT3 can be used to compute the QR decomposition of a matrix of size m×n as well as the storage-efficient representation of the orthogonal factor. A GPU-accelerated algorithm is presented that expands a blocked CPU-GPU hybrid QR decomposition to compute the triangular matrix T. The storage-efficient representation...
Article
Full-text available
The solution of linear systems of equations with many right hand sides is mostly seen as a trivial extension of solving a linear system and the algorithmic developments mostly focus on the efficient computation of the LU decomposition. This is, however, not regarding the case where many right hand sides increase the runtime influence of the forward...
