Tobias Glasmachers’s research while affiliated with Ruhr University Bochum and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (127)


FIGURE 5-6. Immediate and cumulative reward metrics of the advanced sorting environment with seasonal input (for pattern, see Fig. 4, right). The top panel (Fig. 5) shows PPO Agent actions, and the bottom panel (Fig. 6) shows Rule-Based Agent actions. The left panels show the current reward metrics over 50 timesteps, including reward (green), speed (blue), accuracy (purple), occupancy (red), and sorting mode (yellow), coded as basic (0), positive sorting (0.5), and negative sorting (1.0). The right panel illustrates the cumulative reward metrics over the same timesteps.
FIGURE 7. Comparison of benchmarking performance ("reward") of multiple RL algorithms in different setups (A, B, C, D) of the sorting environment (see Table 1). Each value depicts the mean of evaluations in ten distinct environments.
SortingEnv: An Extendable RL-Environment for an Industrial Sorting Process
  • Preprint
  • File available

March 2025

·

9 Reads

Tom Maus

·

·

Tobias Glasmachers

We present a novel reinforcement learning (RL) environment designed to both optimize industrial sorting systems and study agent behavior in evolving spaces. In simulating material flow within a sorting process our environment follows the idea of a digital twin, with operational parameters like belt speed and occupancy level. To reflect real-world challenges, we integrate common upgrades to industrial setups, like new sensors or advanced machinery. It thus includes two variants: a basic version focusing on discrete belt speed adjustments and an advanced version introducing multiple sorting modes and enhanced material composition observations. We detail the observation spaces, state update mechanisms, and reward functions for both environments. We further evaluate the efficiency of common RL algorithms like Proximal Policy Optimization (PPO), Deep-Q-Networks (DQN), and Advantage Actor Critic (A2C) in comparison to a classical rule-based agent (RBA). This framework not only aids in optimizing industrial processes but also provides a foundation for studying agent behavior and transferability in evolving environments, offering insights into model performance and practical implications for real-world RL applications.

Download


Variable Metric Evolution Strategies for High-dimensional Multi-Objective Optimization

December 2024

·

6 Reads

We design a class of variable metric evolution strategies well suited for high-dimensional problems. We target problems with many variables, not (necessarily) with many objectives. The construction combines two independent developments: efficient algorithms for scaling covariance matrix adaptation to high dimensions, and evolution strategies for multi-objective optimization. In order to design a specific instance of the class we first develop a (1+1) version of the limited memory matrix adaptation evolution strategy and then use an established standard construction to turn a population thereof into a state-of-the-art multi-objective optimizer with indicator-based selection. The method compares favorably to adaptation of the full covariance matrix.


Deep-learning-based identification of individual motion characteristics from upper-limb trajectories towards disorder stage evaluation

December 2024

·

3 Reads

The identification of individual movement characteristics sets the foundation for the assessment of personal rehabilitation progress and can provide diagnostic information on levels and stages of movement disorders. This work presents a preliminary study for differentiating individual motion patterns using a dataset of 3D upper-limb transport trajectories measured in task-space. Identifying individuals by deep time series learning can be a key step to abstracting individual motion properties. In this study, a classification accuracy of about 95% is reached for a subset of nine, and about 78% for the full set of 31 individuals. This provides insights into the separability of patient attributes by exerting a simple standardized task to be transferred to portable systems.





Figure 4: Power spectral density comparison between generated and ground truth signals of MI-EEG signals and alpha-EEG signals. a1,2 and 3) represent the comparison between power spectral densities of generated and ground truth signals based on three randomly selected prompts of MI-EEG signals. b1, 2 and 3 represent the comparison between power spectral densities of generated and ground truth signals based on three randomly selected prompts of alpha-EEG signals.
Figure 5: Univariate signal generation of neural signals with unfiltered prompt of 150 samples. a) and b) show two examples of neural signal generation based on randomly selected unseen input prompts of MI-EEG dataset. Whereas c) and d) show two examples of neural signal generation based on randomly selected unseen input prompts of alpha waves dataset. For each example, first column shows the input prompt in blue whereas the generated signal in red color. The second column shows the frequency spectra of the input prompt generated by STFT whereas the third column represents the frequence spectra of generated signal.
Figure 6: Example 1. Multivariate signal generation of neural signals with unfiltered prompt of 150 samples. a), b) and c) show an example of the multi-electrode neural signal generation based on the unseen prompt from alpha waves dataset. Here each row corresponds to the neural signal generation of the respective electrode. For each electrode, the first column represents the input prompt in blue whereas the generated signal in red color. Columns two and three show the frequency spectra of input prompts and the generated signals respectively.
Figure 7: Example 2. Multivariate signal generation of neural signals with unfiltered prompt of 150 samples. a), b) and c) show an example of the multi-electrode neural signal generation based on the unseen prompt from alpha waves dataset. Here each row corresponds to the neural signal generation of the respective electrode. For each electrode, the first column represents the input prompt in blue whereas the generated signal in red color. Columns two and three show the frequency spectra of input prompts and the generated signals respectively.
GET: A Generative EEG Transformer for continuous context-based neural

June 2024

·

122 Reads

·

·

·

[...]

·

Generating continuous electroencephalography (EEG) signals through advanced artificial neural networks presents a novel opportunity to enhance brain-computer interface (BCI) technology. This capability has the potential to significantly enhance applications ranging from simulating dynamic brain activity and data augmentation to improving real-time epilepsy detection and BCI inference. By harnessing generative transformer neural networks, specifically designed for EEG signal generation, we can revolutionize the interpretation and interaction with neural data. Generative AI has demonstrated significant success across various domains, from natural language processing (NLP) and computer vision to content creation in visual arts and music. It distinguishes itself by using large-scale datasets to construct context windows during pre-training, a technique that has proven particularly effective in NLP, where models are fine-tuned for specific downstream tasks after extensive foundational training. However, the application of generative AI in the field of BCIs, particularly through the development of continuous, context-rich neural signal generators, has been limited. To address this, we introduce the Generative EEG Transformer (GET), a model leveraging transformer architecture tailored for EEG data. The GET model is pre-trained on diverse EEG datasets, including motor imagery and alpha wave datasets, enabling it to produce high-fidelity neural signals that maintain contextual integrity. Our empirical findings indicate that GET not only faithfully reproduces the frequency spectrum of the training data and input prompts but also robustly generates continuous neural signals. By adopting the successful training strategies of the NLP domain for BCIs, the GET sets a new standard for the development and application of neural signal generation technologies.


Weighted Initialisation of Evolutionary Instrument and Pitch Detection in Polyphonic Music

March 2024

·

22 Reads

Lecture Notes in Computer Science

Current state-of-the-art methods for instrument and pitch detection in polyphonic music often require large datasets and long training times; resources which are sparse in the field of music information retrieval, presenting a need for unsupervised alternative methods that do not require such prerequisites. We present a modification to an evolutionary algorithm for polyphonic music approximation through synthesis that uses spectral information to initialise populations with probable pitches. This algorithm can perform joint instrument and pitch detection on polyphonic music pieces without any of the aforementioned constraints. Sets of tuples of (instrument, style, pitch) are graded with a COSH distance fitness function and finally determine the algorithm’s instrument and pitch labels for a given part of a music piece. Further investigation into this fitness function indicates that it tends to create false positives which may conceal the true potential of our modified approach. Regardless of that, our modification still shows significantly faster convergence speed and slightly improved pitch and instrument detection errors over the baseline algorithm on both single onset and full piece experiments.


Figure 2. This figure offers an analysis of volume flows in a waste sorting facility with two presses. It juxtaposes the volume measured in the bunker (red) and belt (blue) (a), shows the selection of press 1 (blue) or press 2 (red) based on baling event frequency (b), presents the variation (trimmed) in the bunker/belt ratio around its mean (c), and quantifies bales pressed per baling event (d).
Volume Determination Challenges in Waste Sorting Facilities: Observations and Strategies

March 2024

·

42 Reads

·

2 Citations

In this case study on volume determination in waste sorting facilities, we evaluate the effectiveness of ultrasonic sensors and address waste-material-specific challenges. Although ultrasonic sensors offer a cost-effective automation solution, their accuracy is affected by irregular waste shapes, varied compositions, and environmental factors. Notable inconsistencies in volume measurements between storage bunkers and conveyor belts underscore the need for a comprehensive approach to standardize bale production. With prediction reliability being constrained by limited datasets, undocumented modifications to machine settings, and sensor failures, this task renders a challenging application area for machine learning. We explore related research and present dataset analyses from three distinct waste sorting facilities in Europe, addressing issues such as sensor usability, data quality, and material specifics. Our analysis suggests promising strategies and future directions for enhancing waste volume measurement accuracy, ultimately aiming to advance sustainable waste management.


Citations (46)


... Traditional methods, such as rule-based models, rely on predefined rules that must be modified to manage upgrades [6]. However, these methods lack the flexibility and adaptability required for dynamic and evolving environments, which may show a considerable amount of sensor noise as well [11]. RL offers a powerful alternative, capable of handling complex, unpredictable environments by learning optimal strategies through interaction with the environment [1]. ...

Reference:

SortingEnv: An Extendable RL-Environment for an Industrial Sorting Process
Volume Determination Challenges in Waste Sorting Facilities: Observations and Strategies

... However, gaming benchmarks often lack the applicability to real-world problems in terms of complexity, stochasticity, and safety constraints that are usually present in industrial setups. In real-world industrial settings, the repercussions of poor decisions can be immediate and severe, requiring a more cautious approach to exploration [3][4][5]. ...

ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation
  • Citing Chapter
  • February 2024

Lecture Notes in Computer Science

... EEG has attracted significant attention for various applications and research fields, including brain-computer interfaces (BCIs) [1], emotion classification [2], seizure detection [3], monitoring sleep stages [4], and the exploration of diverse mental disorders and cognitive functions [5]. EEG signals are inherently dynamic and stochastic, with short-and longrange dependencies, influenced by individual characteristics and environmental factors [6]. The biological complexities, artifacts from physiological and non-physiological sources, motion artifacts, and environmental noise pose significant challenges to EEG interpretability. ...

ConTraNet: A hybrid network for improving the classification of EEG and EMG signals with limited training data
  • Citing Article
  • November 2023

Computers in Biology and Medicine

... Artificial intelligence (AI) has emerged as a powerful tool in various sectors [29][30][31][32][33], including agriculture [34][35][36], offering new and innovative solutions for disease detection and management [37,38]. In particular, deep learning (DL) techniques [39], such as convolutional neural networks (CNNs) [40], have shown exceptional performance in image classification tasks by effectively extracting spatial and hierarchical features from image data. ...

Deep learning-based scan range optimization can reduce radiation exposure in coronary CT angiography

European Radiology

... Some further conclusions about controlling the complex system of bunkers to presses may involve training an RL-agent to learn hidden dynamics of the systems and find the best points in time for emptying the bunker at an optimal volume, given the current sensory data and historical data as input. A recent study proposed an RL-based environment for regulating this specific part of a waste sorting facility [29]. To address the possible effect of data scarcity, investigating whether limited data acts as a constraint by further reducing data amounts and analyzing the resultant impact might prove insightful. ...

ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation

... A pragmatic solution to these challenges could be to construct EMG signals directly from joint kinematics data. For example, an RNN-based model has been proposed to predict upper limb muscle activity using motion parameters such as joint angle and velocity [37]. More recently, a hybrid generative model has been developed to synthesize intramuscular electromyography signals from kinematic data, effectively capturing the spatial and temporal dynamics of muscle activity [38]. ...

The concepts of muscle activity generation driven by upper limb kinematics

BioMedical Engineering OnLine

... • Knowledge Representation Difference. The lack of disclosure of agents' internal details results in a distributed black-box optimization problem [5]. Traditional federated aggregation methods (such as FedAvg [28]) are not applicable due to the agent heterogeneity. ...

DiBB: distributing black-box optimization
  • Citing Conference Paper
  • July 2022

... It is important to record the SA of a large population of neurons to understand the relationship between neuronal activity and behavioral functionality e.g. relationship between imagining a hand movement and the SA of corresponding brain areas (Aflalo et al., 2015), (Ali et al., 2021), (Ali et al., 2022), and (Klaes et al., 2015a). Henceforth, instead of a single microelectrode array, often multiple microelectrode arrays with several hundred channels are implanted simultaneously. ...

ConTraNet: A single end-to-end hybrid network for EEG-based and EMG-based human machine interfaces

... We suppose then that Π X •F ((φ, ξ),α((φ, ξ), u)) = F (φ, α(φ, u)) (1. 3) for every (φ, ξ, u) ∈ X × Y × U, where Π X : X × Y → X is the canonical projection of X × Y on X. The Markov chain {(φ t , ξ t )} t∈N is said to be redundant, whereas {φ t } t∈N is said to be projected. ...

Global Linear Convergence of Evolution Strategies on More than Smooth Strongly Convex Functions
  • Citing Article
  • June 2022

SIAM Journal on Optimization

... The decoding of these signals can be improved by including adversarial data augmentation algorithms. An average accuracy of 89% may be achieved on classification tasks [44]. Conversely, the impressive results of the Direct Neural Interface (DNI) come at a high computational cost. ...

Enhancing the decoding accuracy of EEG signals by the introduction of anchored-STFT and adversarial data augmentation method