Large-scale simulations pose significant challenges not only to the solver itself but also to the pre- and postprocessing framework. Hence, we present generally applicable improvements to enhance the performance of those tools and thus increase the feasibility of large-scale jobs and convergence studies. To accomplish this, we use a shared memory approach implemented in the Message Passing Interface (MPI) libraries. Additionally, we improve the read and write performance of the flow solver during runtime to minimize the load imposed on the file system. A detailed discussion of the current performance and scaling behavior is given for up to 262144 processes. FLEXI shows excellent scalability for all tested features. We conclude by showing selected applications, where we use the introduced improvements to maximize performance.