CodePDF Available

OpenLB Release 1.4: Open Source Lattice Boltzmann Code

Authors:
A preview of the PDF is not available

Files (2)

Content uploaded by Mathias J. Krause
Author content
Content uploaded by Stephan Simonis
Author content
... The capabilities of the present method with respect to turbulence recovery are isolated along the benchmark tests and critically assessed. The results obtained with the present MRT LBM TLES implemented in OpenLB [25,7] are compared to the TDDM [20] implemented in the computational fluid dynamics (CFD) code Nek5000 [26] which is based on spectral element methods (SEM). Moreover, characteristic features of the MRT LBM TLES are quantitatively analyzed via subgrid activity and energy spectrum error measurements [27]. ...
... the total dissipation rate tot (t) = − dk dt (24) and the resolved (enstrophy-based) dissipation rate [31] res (t) = 2π νζ (25) are extracted. Subtraction leads to the model dissipation rate ...
... For clarity, Table 1 summarizes the typical conversion factors for LBM including some of parameter choices used in the present work. All LBM computations are carried out with OpenLB [7,25] on at most 75 nodes with two Intel Xeon E5-2660 v3 each. ...
Article
Full-text available
We provide a first investigation of using lattice Boltzmann methods (LBM) for temporal large eddy simulation (TLES). The temporal direct deconvolution model (TDDM) is injected as a closure for the filtered discrete velocity Bhatnagar–Gross–Krook (BGK) Boltzmann equation with orthogonal multiple-relaxation-time (MRT) collision. The novel combination of methods is calibrated for decaying homogeneous isotropic turbulence. The Taylor–Green vortex flow is used as a benchmark for the Reynolds numbers 800 and 3000. Various turbulence quantities are numerically evaluated. The numerical results obtained with the MRT LBM are validated against a well-established spectral element method both, with and without the proposed turbulence model. A qualitatively good agreement to reference results of direct numerical simulation is observed in terms of dissipation rate, energy spectrum and dissipation spectrum. Whereas the latter two show marginal differences compared to underresolved simulations without TLES, the dissipation rate exhibits substantial improvement through the model in capturing the peak region for low and intermediate resolutions. The consistency of the TDDM with first and second order discretization is numerically demonstrated for single-relaxation-time and MRT collision via computing the total dissipation rate error. Measuring the interaction between subgrid activity and energy spectrum error serves as a proof of concept for the proposed MRT LBM TLES. Conclusively, the model recovers and enhances the expected numerical features of LBM with respect to the target equation.
... Nevertheless, only few papers [17,32,33,49,57,61] provide some comparison on how floating-point formats affect the accuracy of the LBM and mostly find only insignificant differences between FP64 and FP32 except at very low velocity and where floating-point round-off leads to spontaneous symmetry breaking. Besides the question of accuracy, a quantitative performance comparison across different hardware microarchitectures is missing as the vast majority of LBM software is either written only for CPUs [62][63][64][65][66][67][68][69][70][71][72][73][74] or only for Nvidia GPUs or CPUs and Nvidia GPUs [16][17][18][19][20][21][22][23][24][25][26]. ...
... This offers a large benefit, most prominently on FP16 accuracy, by substantially reducing numerical loss of significance at no additional computational cost. Since it is also beneficial for regular FP32 accuracy, it is already widely used in LBM codes such as our FluidX3D [6-10], OpenLB [63][64][65][66], ESPResSo [22][23][24], Palabos [67][68][69][70][71] and some versions of waLBerla [50]. In the appendix in section 8.2 we provide the entire algorithm without and with DDFshifting for comparison and in section 8.3 we clarify our notation. ...
... For GPUs, the most efficient streaming step implementation [58] is the one-step-pull scheme (A-B pattern) with two copies of the DDFs in memory, because the non-coalesced memory read penalty is lower than non-coalesced write penalty on GPUs [10, 27,30,32,34,35,48], see figure 20 in the appendix. While most LBM implementations are limited to one particular hardware platform -either CPUs [62][63][64][65][66][67][68][69][70][71][72][73][74], Nvidia GPUs , CPUs and Nvidia GPUs [16][17][18][19][20][21][22][23][24][25][26] or mobile SoCs [116,117] -only few use OpenCL [5][6][7][8][9][10][11][12][13][14][15]. ...
Preprint
Full-text available
Fluid dynamics simulations with the lattice Boltzmann method (LBM) are very memory-intensive. Alongside reduction in memory footprint, significant performance benefits can be achieved by using FP32 (single) precision compared to FP64 (double) precision, especially on GPUs. Here, we evaluate the possibility to use even FP16 and Posit16 (half) precision for storing fluid populations, while still carrying arithmetic operations in FP32. For this, we first show that the commonly occurring number range in the LBM is a lot smaller than the FP16 number range. Based on this observation, we develop novel 16-bit formats - based on a modified IEEE-754 and on a modified Posit standard - that are specifically tailored to the needs of the LBM. We then carry out an in-depth characterization of LBM accuracy for six different test systems with increasing complexity: Poiseuille flow, Taylor-Green vortices, Karman vortex streets, lid-driven cavity, a microcapsule in shear flow (utilizing the immersed-boundary method) and finally the impact of a raindrop (based on a Volume-of-Fluid approach). We find that the difference in accuracy between FP64 and FP32 is negligible in almost all cases, and that for a large number of cases even 16-bit is sufficient. Finally, we provide a detailed performance analysis of all precision levels on a large number of hardware microarchitectures and show that significant speedup is achieved with mixed FP32/16-bit.
... , 0) with derivative ∂α i ∂α i = 1 at the i-th component. Any arithmetic operation of two variables is then applied to the values and to their partial derivatives, according to the basic derivation rules, e.g., the result of a multiplication x · y is then In the framework of the open source CFD library OpenLB [33,34], AD was implemented via operator overloading in C++: for every arithmetic operation, the corresponding operation for the data type ADf<double,n> that stores value as well as partial derivatives was implemented. All functions and classes were templatized with regard to the underlying primitive data type so that they can be instantiated using the data type ADf<double,n> and then can yield function values as well as their partial derivatives. ...
... The simulations were performed within the open source LBM-library OpenLB [33,34], employing the numerical methods specified in Section 2.3. The process parameters correspond to phosphate or ink diluted in water. ...
Article
Full-text available
As micromixers offer the cheap and simple mixing of fluids and suspensions, they have become a key device in microfluidics. Their mixing performance can be significantly increased by periodically varying the inlet pressure, which leads to a non-static flow and improved mixing process. In this work, a micromixer with a T-junction and a meandering channel is considered. A periodic pulse function for the inlet pressure is numerically optimized with regard to frequency, amplitude and shape. Thereunto, fluid flow and adsorptive concentration are simulated three-dimensionally with a lattice Boltzmann method (LBM) in OpenLB. Its implementation is then combined with forward automatic differentiation (AD), which allows for the generic application of fast gradient-based optimization schemes. The mixing quality is shown to be increased by 21.4% in comparison to the static, passive regime. Methodically, the results confirm the suitability of the combination of LBM and AD to solve process-scale optimization problems and the improved accuracy of AD over difference quotient approaches in this context.
... This data-holding Cell was refactored by OpenLB 1.4 [63] to only provide a view of the data maintained in field arrays (cf. Section 2.2.2.1). ...
Preprint
Full-text available
OpenLB is an object-oriented implementation of LBM. It is the first implementation of a generic platform for LBM programming, which is shared with the open source community (GPLv2). Since the first release in 2007, the code has been continuously improved and extended which is documented by thirteen releases as well as the corresponding release notes which are available on the OpenLB website (https://www.openlb.net). The OpenLB code is written in C++ and is used by application programmers as well as developers, with the ability to implement custom models OpenLB supports complex data structures that allow simulations in complex geometries and parallel execution using MPI, OpenMP and CUDA on high-performance computers. The source code uses the concepts of interfaces and templates, so that efficient, direct and intuitive implementations of the LBM become possible. The efficiency and scalability has been checked and proved by code reviews. This user manual and a source code documentation by DoxyGen are available on the OpenLB project website.
... All computations are done with OpenLB release 1.4 [19] on several HPC machines, using for example 16 nodes with five quad-core Intel Xeon E5-2609v2 cores each, or up to 75 nodes with respectively two deca-core Intel Xeon E5-2660v3. ...
Article
We use free energy lattice Boltzmann methods to simulate shear and extensional flows of a binary fluid in two and three dimensions. To this end, two classical configurations are digitally twinned, namely a parallel-band device for binary shear flow and a four-roller apparatus for binary extensional flow. The free energy lattice Boltzmann method and the test cases are implemented in the open-source parallel C++ framework OpenLB and evaluated for several non-dimensional numbers. Characteristic deformations are captured, where breakup mechanisms occur for critical capillary regimes. Though the known mass leakage for small droplet-domain ratios and large Cahn numbers is observed, suitable mesh sizes show good agreement to analytical predictions and reference results.
... The simulations were obtained by OpenLB software, release 1.4-0 [8], [9]. This software has been developed at Karlsruhe Institute of Technology (KIT) by the Lattice Boltzmann Research Group (LBRG) of Dr. Krause. ...
Conference Paper
Full-text available
Lattice Boltzmann method (LBM) is a mesoscale numerical method suitable for incompressible fluid flow simulations and its popularity has increased in recent years. It is applicable for many engineering applications dealing with fluid flow and the method can be fully parallelised which is beneficial for calculation on CPU clusters or GPUs. In this paper, the LBM is tested on the benchmark geometry of a constricted tube to verify the laminar-turbulent transition, which can appear in many engineering and biological systems (lungs, arteries). The motivation for this study is to identify suitable models for simulation of human airways which consists of several bifurcations and constricted channels. For this research, the CPU cluster Salomon from IT4Innovations was used. The flow regime through a cosine-curved constriction given by Reynolds number 2000 was simulated with the Lattice Boltzmann Smagorinsky subgrid model. To reveal the unsteady flow character, a period of 5 seconds was simulated. The results of the time-averaged (last 2 seconds) axial velocities on the tube axes and at 6 determined cross sections in the post-stenotic part were compared with already published CFD simulations using the conventional finite volume method FVM (several LES and RANS turbulence models) and experimental data. To reduce numerical errors caused by discretization, the grid independent test for different three types of geometries was done. The LBM predicts the velocities in the wall vicinity well, but in the constriction the velocities are higher and it underestimates the velocities in the core flow behind the constriction compared to the remaining models and experimental data. At the moment, the results from this benchmark simulation will be used for education purposes. However, it is also the initial step for the application of the LBM to solve the flow through the human airways and the deposition of aerosols there.
... All computations are done with OpenLB release 1.4 [18] on several HPC machines, either using up to 16 nodes with five quad-core Intel Xeon E5-2609v2 cores each, or a maximum of 75 nodes with respectively two deca-core Intel Xeon E5-2660v3. ...
Preprint
Full-text available
We use free energy lattice Boltzmann methods (FRE LBM) to simulate shear and extensional flow of a binary mixture in two and three dimensions. To this end, two classical configurations are digitally twinned, namely a parallel-band device for binary shear flow and a four-roller apparatus for binary extensional flow. The FRE LBM and the test cases are implemented in the open-source C++ framework OpenLB and evaluated for several non-dimensional numbers. Characteristic deformations are captured, where breakup mechanisms occur for critical capillary regimes. Though the known mass leakage for small droplet-domain ratios is observed, suitable mesh sizes show good agreement to analytical predictions and reference results.
Preprint
Full-text available
OpenLB is a generic implementation of lattice Boltzmann methods (LBM) that is shared with the open source community under the terms of the GPLv2 license. Since the first release in 2007, the code continues to be improved and extended, resulting in fifteen releases and counting. The OpenLB framework is written in C++ and covers the full scope of simulations – from pre-processing over parallel and efficient execution to post-processing of results. It offers both the possibility of setting up new simulation cases using the existing rich collection of models and of implementing new custom models. OpenLB supports MPI, OpenMP, AVX(51)2 vectorization and CUDA for parallel execution on systems ranging from low-end smartphones over multi-GPU workstation up to supercomputers.
Thesis
Full-text available
Wall-flow filters are applied in the exhaust treatment of internal combustion engines, where they are used for the removal of emitted particulate matter (PM). Such filters consist of oppositely arranged inlet and outlet channels separated by porous substrate walls. When traversing the filter, the entering exhaust gas is forced to pass through those walls while being stripped from the majority of the contained particles. This way, the introduced solid material forms a permeable deposition layer on the filter substrate with continuously increasing thickness. Over time, the pressure drop over the filter increases, respectively, leading to the necessity of the filter's regeneration. During such a regeneration process, the coherent layer can break up into individual layer fragments, which potentially rearrange inside single filter channels. This can, eventually, lead to the formation of specific deposition patterns, which affect a filter's pressure drop, its ash storage capacity and the overall separation efficiency. The rearrangement process is governed by the interaction of hydrodynamic and adhesive forces, which can only be accurately modelled by taking the surfaces of individual layer fragments into account. Despite its well-known significance on the filtration process, the formation of specific deposition patterns can still not consistently be attributed to particular influence factors and appropriate calculation models that enable a quantification of respective factors do not exist. This present work, therefore, approaches a fundamental investigation of particle-layer rearrangement events in ceramic wall-flow filters by means of surface resolved particle simulations with lattice Boltzmann methods (LBM). The main goal is the determination of the sensitivity and the quantification of the impact of engineering-related influence factors on the transient rearrangement process in wall-flow filters. A versatile and easily extendable implementation of the lattice Boltzmann method is provided by the open source software OpenLB. It is therefore used and extended accordingly in the present work. This way, the conservation of the developed methodology is ensured in a publicly available resource that enables full reproducibility of the obtained results. The development and application of an appropriate simulation model is considered in four consecutive solution steps, which are based on four separate works. The first step is dedicated to the model development by providing a simulation approach, which is capable of capturing the time dependent development of the gaseous flow field inside a wall-flow filter during its regeneration. Its validity and applicability is then assessed by conducting grid convergence studies and comparing it to a reference solution. A static fragmented particle layer is examined regarding the spatial distribution of the acting hydrodynamic forces and the impact of fragment clusters at different channel positions. Afterwards, some first transient simulations of fragment detachment are conducted in a low velocity regime and the impact of assorted layer properties is examined. A second step is intended to provide closer insights into the behaviour of individual particle layer fragments during the filter regeneration. In this context, the temporal evolution of the hydrodynamic forces acting on a fragment's surface are examined in detail for different fragment densities and detachment positions. Relevant key quantities, such as the impact velocity at the channel's back wall and the stopping distance, are defined, determined and interpreted with respect to predictions regarding the resulting deposition patterns. At first, a physically sensible substrate permeability is determined by performing aligned experimental and numerical pressure drop studies. Afterwards, the transient behaviour of single particle layer fragments with different densities are examined in detail with respect to all acting forces. The fragment's pneumatic transport through the channel is then investigated for different starting positions, substrate permeabilities, inflow velocities and average densities of the introduced particulate matter. Eventually, the general movement characteristics are assessed with respect to their qualitative accordance with experimental observations. In a third step, the developed LBM approach is applied to an extended velocity range that covers as much as possible of the physically relevant domain, which includes averaged channel inflow velocities of up to 80 m/s. After an adaption of the boundary conditions, three different scenarios are investigated, by considering the particle-free flow only, a single layer fragment attached to one of the substrate walls and a fragmented particle layer. At first, a comprehensive quantification of the stability and accuracy of both particle-free and particle-including flows, considering static, impermeable deposition-layer fragments, is conducted. Afterwards, the local effects of a fragmented particle layer are investigated for different static situations along the channel. In order to derive predictions on the detachment likelihood of individual layer fragments and their mutual influence, the spatial distribution of hydrodynamic forces is evaluated additionally. A final step is specifically dedicated to the investigation of the process of plug formation, by additionally considering the fragments' dynamic behaviour following the layer fragmentation. This includes the detachment and transport of the fragments along the channel, as well as the subsequent formation of a channel-plug. In order to accurately account for fragment contact during their transport, the model is extended by a previously developed discrete contact method. With the updated model, the influence of the fragmented layer topology, the fragment properties and the operating conditions is evaluated and relevant key quantities are determined. The results attained with those four steps represent a comprehensive quantification of the impact of relevant influence factors on various aspects of the transient rearrangement process in wall-flow filters. In that way, the present work represents a significant contribution to the persisting research gaps in this field. It additionally, provides potential opportunities for modifications of the filtration process, which could lead to an optimization of engine performance, fuel consumption and service life of wall-flow filters.
ResearchGate has not been able to resolve any references for this publication.