Armin Nikpour’s research while affiliated with The University of Sydney and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (103)


Study recruitment pathway. Study recruitment pathway showing sequencing of referral, screening and consenting procedures, in addition to randomisation into the two study groups, before merging into an identical follow-up pathway. ASM, antiseizure medication; ML, machine learning; UC, usual care.
Personalised selection of medication for newly diagnosed adult epilepsy: study protocol of a first-in-class, double-blind, randomised controlled trial
  • Article
  • Full-text available

April 2025

·

57 Reads

Daniel Thom

·

Richard Shek-kwan Chang

·

·

[...]

·

Patrick Kwan

Introduction Selection of antiseizure medications (ASMs) for newly diagnosed epilepsy remains largely a trial-and-error process. We have developed a machine learning (ML) model using retrospective data collected from five international cohorts that predicts response to different ASMs as the initial treatment for individual adults with new-onset epilepsy. This study aims to prospectively evaluate this model in Australia using a randomised controlled trial design. Methods and analysis At least 234 adult patients with newly diagnosed epilepsy will be recruited from 14 centres in Australia. Patients will be randomised 1:1 to the ML group or usual care group. The ML group will receive the ASM recommended by the model unless it is considered contraindicated by the neurologist. The usual care group will receive the ASM selected by the neurologist alone. Both the patient and neurologists conducting the follow-up will be blinded to the group assignment. Both groups will be followed up for 52 weeks to assess treatment outcomes. Additional information on adverse events, quality of life, mood and use of healthcare services and productivity will be collected using validated questionnaires. Acceptability of the model will also be assessed. The primary outcome will be the proportion of participants who achieve seizure-freedom (defined as no seizures during the 12-month follow-up period) while taking the initially prescribed ASM. Secondary outcomes include time to treatment failure, time to first seizure after randomisation, changes in mood assessment score and quality of life score, direct healthcare costs, and loss of productivity during the treatment period. This trial will provide class I evidence for the effectiveness of a ML model as a decision support tool for neurologists to select the first ASM for adults with newly diagnosed epilepsy. Ethics and dissemination This study is approved by the Alfred Health Human Research Ethics Committee (Project 130/23). Findings will be presented in academic conferences and submitted to peer-reviewed journals for publication. Trial registration number ACTRN12623000209695.

Download

Tiny dLIF: A dendritic spiking neural network enabling a time-domain energy-efficient seizure detection system.

March 2025

·

6 Reads

·

1 Citation

Epilepsy poses a significant global health challenge, driving the need for reliable diagnostic tools like scalp electroencephalogram (EEG), subscalp EEG, and intracranial EEG (iEEG) for accurate seizure detection, localization, and modulation for treating seizures. However, these techniques often rely on feature extraction techniques such as Short Time Fourier Transform (STFT) for efficiency in seizure detection. Drawing inspiration from brain architecture, we investigate biologically plausible algorithms, specifically emphasizing time-domain inputs with low computational overhead. Our novel approach features two hidden layer dendrites with Leaky Integrate-and-Fire (dLIF) spiking neurons, containing fewer than 300K parameters and occupying a mere 1.5 MB of memory. Our proposed network is tested and successfully generalized on four datasets from the USA and Europe, recorded with different front-end electronics. USA datasets are scalp EEG in adults and children, and European datasets are iEEG in adults. All datasets are from patients living with epilepsy. Our model exhibits robust performance across different datasets through rigorous training and validation. We achieved AUROC scores of 81.0% and 91.0% in two datasets. Additionally, we obtained AUPRC and F1 Score metrics of 91.9% and 88.9% for one dataset, respectively. We also conducted out-of-sample generalization by training on adult patient data, and testing on children’s data, achieving an AUROC of 75.1% for epilepsy detection. This highlights its effectiveness across continental datasets with diverse brain modalities, regardless of montage or age specificity. It underscores the importance of embracing system heterogeneity to enhance efficiency, thus eliminating the need for computationally expensive feature engineering techniques like Fast Fourier Transform (FFT) and STFT.


Representation of the past and future in neuro-AI. The multilayer perceptron was built based on non-learnable activation functions and relies on deep, fully connected networks for accurate performance (A). On the contrary, KAN emerges as a potential solution for more explainable, shallow, and efficient architectures at the core level (B).
of the datasets being used. The TUH dataset was set as the training and validation dataset and subsequently used for inference in both RPAH and EPILEPSIAE datasets (A). Testing across all datasets incorporates extensive background and seizure session data, facilitating comprehensive analysis of model efficacy in diverse clinical settings (B). Types of sinsights into the number ofeizures across the RPAH dataset (C) and EPILEPSIAE dataset (D).
KAN–EEG seizure system. We analysed different kinds of seizures using the TUH dataset for training. We then preprocessed our model using ICA and STFT. Subsequently, we incorporate a shallow KAN algorithm that leads to efficient results.
In-sample results on the TUH dataset. Our model was trained on 400 h of data, significantly less than the data used to train other models, such as ConvLSTM and transformer, with 752 and 910, respectively (A). A comparison of a KAN–EEG (Architecture I−764−256-O (table 2) training with other MLP-based models [32,34,46–48] demonstrates better performance with less training data.
KAN–EEG: towards replacing backbone–MLP for an effective seizure detection system

March 2025

·

22 Reads

·

8 Citations

The landscape of artificial intelligence (AI) research is witnessing a transformative shift with the emergence of the Kolmogorov–Arnold network (KAN), presenting a novel architectural paradigm aimed to redefine the structural foundations of AI models, which are based on multilayer perceptron (MLP). Through rigorous experimentation and evaluation, we introduce the KAN–electroencephalogram (EEG) model, a tailored design for efficient seizure detection. Our proposed network is tested and successfully generalized on three different datasets, one from the USA, one from Europe, and one from Oceania, recorded with different front-end hardware. All datasets are scalp EEG in adults and are from patients living with epilepsy. Our empirical findings reveal that while both architectures demonstrate commendable performance in seizure detection, the KAN model exhibits high-level out-of-sample generalization across datasets from diverse geographical regions, underscoring its inherent efficacy and adaptability at the backbone level. Furthermore, we demonstrate the resilience of the KAN architecture to model size reduction and shallow network configurations, highlighting its versatility and efficiency by preventing over-fitting in-sample datasets. This study advances our understanding of innovative neural network architectures and underscores the pioneering potential of KANs in critical domains such as medical diagnostics.


(A) Coronal/sagittal and axial T1 weight MRI. (B) Coronal/Sagittal and Axial PET-co-registered MRI showed hypometabolism in the right insula cortex. (C) On average montage, there are interictal discharges in the right temporal region during sleep stage 2. (D) On bipolar montage, the EEG seizure is not localized but lateralized to the right hemisphere during VEEG monitoring (The blue line was clinical seizure onset).
SEEG electrode implantation was based on the semiology and EEG findings, which were the right hemisphere and the insular-opercular region (the symptoms of hypersalivation and oral sensation).
Postoperative 3-D imaging: the location of the Z and Y electrodes are shown on the T1-weighted volumetric preoperative MRI, which is fused with the postoperative CT using curry 9.0 software. (A) Z 3-4. (B) Y 3-4. (C) On bipolar montages, EEG seizure onset in Z 2-3 (green arrow) and clinical seizure onset (red arrow). (D) Zoom up EEG seizure onset-Z2-3 with LVFA.
(A) Pre-RFTC -discharges in Y 3-4 & Z 2-4 on bipolar montage. (B) Post RFTC- flat lines in Y 3-4 & Z 3-4 on bipolar montage. (C) Z electrode, 3-4 (red arrow) & Y electrode, 3-4 (blue arrow) on CT scan post-RFTC. Two months post-T1 weight MRI results show no visible lesion. Therefore, Z and Y electrodes were placed to indicate their location, (D) Z 3-4. (E) Y 3-4.
Case Report: Ictal hypersalivation: a stereoelectroencephalography exploration

February 2025

·

9 Reads

Epilepsy is a chronic neurological condition with various etiologies, and recurrent unprovoked seizures characterize it. Hypersalivation is a recognized symptom of insular-opercular epilepsies. A wide range of symptoms can occur during a seizure, including hypersalivation, autonomic responses, oropharyngeal sensations, visceral sensations, somatosensory disturbances, and emotional manifestations. In this case study, we examine a unique scenario of a patient who experienced predominantly salivary seizures. Hypersalivation, pill-rolling movements, and lip-smacking characterized these seizures. Importantly, the patient became seizure-free after undergoing radiofrequency thermocoagulation (RFTC) with the assistance of Stereoelectroencephalography (SEEG). Our discussion will focus on the treatment approach involving SEEG-guided RFTC and the careful identification of the brain cortex responsible for triggering excessive salivation during seizures.


Liquid-Dendrite Spiking Neural Network for Edge Devices: A 130 K-Parameter, 535 KB Model for Time-Domain Epileptic Seizure Detection

February 2025

·

2 Reads

Epilepsy is a significant global health issue, requiring dependable diagnostic tools like scalp encephalogram (scalp-EEG), sub-scalp EEG, and intracranial EEG (iEEG) for precise seizure detection and treatment. AI has emerged as a powerful tool in this domain, offering the potential for real-time, responsive monitoring. Traditional methods often rely on feature extraction techniques like Short-Time Fourier Transform (STFT), which can increase power consumption, making them less suitable for deployment on edge devices. While large models can improve accuracy without STFT, their size also limits their practicality for edge applications. This study introduces Liquid-Dendrite, a novel bio-inspired model for seizure detection, leveraging Liquid-Time Constant Spiking Neurons (LTC-SN) and dendrites spiking neurons (dSN) with heterogeneous time-constants. The model comprises two hidden layers with dendritic neurons and one layer of liquid-time constant networks. Our model achieves a memory efficacy of 535 KB with 130 K trainable parameters. The model was tested across the most noteworthy epilepsy datasets for scalp EEG (TUH and CHB-MIT) and iEEG (EPILEPSIAE). Our model demonstrated commendable performance, achieving AUROC scores of 83%, 96%, and 93%, respectively, outperforming some existing models in an energy and memory-efficient way. Moreover, we conducted a robustness test by blacking out EEG channels at the inference stage, where we showed the ability of our network to work with fewer channels. We could deploy our tiny model and perform inference at the edge of the Raspberry Pi 5 without the need for additional quantization. This highlights the potential of Neuro-Inspired AI for efficient, small-scale, and energy-embedded AI systems across different brain modalities.


Advancing privacy-aware machine learning on sensitive data via edge-based continual µ-training for personalized large models

January 2025

·

3 Reads

·

1 Citation

This paper introduces an innovative method for fine-tuning a large multi-label model for abnormality detection, utilizing a smaller trainer and advanced knowledge distillation techniques. It studies the effects of fine-tuning on various abnormalities, noting different improvements based on the Original Model’s performance in specific tasks. The experimental setup, optimized for on-device inference and fine-tuning with limited computational resources, demonstrates moderate yet promising enhancements in model performance post-fine-tuning. Key insights from the study include the significance of aligning the µ-Trainer’s behavior with the Original Model and the influence of hyper-parameters like batch size on fine-tuning outcomes. The research acknowledges limitations such as the limited exploration of loss functions in multi-label models and constraints in architectural design, suggesting potential avenues for future investigation. While the proposed Naive Continual Fine-tuning Process is in its early stages, we highlight this paper’s potential model personalization on long-term data. Moreover, weight transfer in our system is exclusively for fine-tuning; hence, it improves user privacy protection by failing data reconstruction attempts from weights, like an issue with Federated learning models. Our on-device fine-tuning prevents the transferring of data or gradients from the edge of the network to their server. Despite modest performance improvements after fine-tuning, these working layers represent a small fraction (0.7%) of the total weights in the Original Model and 1.6% in the µ-Trainer. This study establishes a foundational framework for advancing personalized model adaptation, on-device inference and fine-tuning while emphasizing the importance of safeguarding data privacy in model development.


Fig. 3 Results of SAR analysis. (a) Right sagittal view of the SAR values. (b) The frontal view shows the SAR values and distribution. (c) Transverse view of the SAR analysis. (d) SAR volume exceeding 0.1 W/kg. (e) Left sagittal view of the temperature rise analysis. The color map for the SAR analysis is on a decibel (dB) scale, with the maximum value of 1.7 W/kg mapped to the 0 dB reference. The color map for the temperature analysis uses a linear scale, mapped from 25°C to 38°C.
Fig. 6 Experiment setup illustration. (a) experiment setup 3D model. (b) Shows the stents we used in the experiment with the cable connection. (c) The actual setup will be with the vessel, bone, and saline. When running the experiment, we have a clamp cover to clamp the stent to a position with designed Dt and Dsgp. The cover has been removed in this figure to show the internal structure. (d) The actual capacitive plate with the connection cable is shown. (e) shows the overview of the 3D printed testing container with our testing vector network analyzer (VNA). (f) shows the full system efficiency testing setup with the MCU-based PWM generator (MCU), Gate driver and Power amplifier board(AMP), transmission coil (T Coil), receiving coil (R Coil), testing power supplier (PS), AC to DC circuits (AC-DC), and the voltage-controlled current source (CS) as the load to measure peak harvested power. The data collection and load control device, the Analog Discovery 2 (AD2) device, and the PC used to program the MCU and connect the AD2 device are also shown in the photo.
Full system testing result.
Simulation result compare with experiment result.
Metrics against state-of-the-art.
A Wireless Power Transfer System for Leadless Endovascular Electrocorticography

January 2025

·

29 Reads

Purpose :Wireless power transfer (WPT) for stent-based medical devices in the brain, such as endovascular electrocorticography (endoECoG) devices, faces challenges. Typical stent-based endoECoG consists of electrodes placed on a stent and connected with a set of long lead wires to a subcutaneous chest implant responsible for wireless energy harvesting and data telemetry. Eliminating the long lead wires is not a trivial or straightforward task, introducing a great set of challenges. This work demonstrates a feasible method to deliver power directly to a standard medical stent without modifying its structure, mechanically or electrically. Methods :The proposed system employs a subcutaneous relay that converts inductive coupling to capacitive coupling. This solution not only enhances power transfer efficiency while maintaining minimal invasiveness but also addresses the challenges of unstable contact impedance with capacitive coupling WPT. Experimental validation was performed using real skin, bone, and vessel tissues, and finite element simulations were conducted to confirm model accuracy. Results :Experiments demonstrated over 45 mW of power delivery without exceeding safety limits, sufficient for powering endoECoG devices and biosignal sensors. The system achieved 7.26\% DC-to-DC efficiency, the highest reported for stent-based implants without additional transceivers or specialized stent designs. Results closely matched simulations, confirming practical viability. Safety assessments, including specific absorption rate (SAR) analysis and temperature rise simulations, showed compliance with regulatory standards and minimal risk to surrounding tissues. Conclusion :This work demonstrates a reasonably efficient and safe power delivery to stent-based implants in the brain, considering the anatomical challenges regarding the surgical delivery, paving the way for fully wireless, minimally invasive neuroprosthetic devices. The external device does not require close skin contact, making it suitable for long-term applications and improving patient comfort. Future efforts will optimize system components and address manufacturing challenges to facilitate clinical translation.


A leadless power transfer and wireless telemetry solutions for an endovascular electrocorticography

November 2024

·

51 Reads

·

2 Citations

Objective. Endovascular brain-computer interfaces (eBCIs) offer a minimally invasive way to connect the brain to external devices, merging neuroscience, engineering, and medical technology. Currently, solutions for endovascular electrocorticography (ECoG) include a stent in the brain with sensing electrodes, a chest implant to accommodate electronic components to provide power and data telemetry, and a long (tens of centimeters) cable travel through vessels with a set of wires in between. Removing this long cable is the key to the clinical viability of eBCIS as it carries risks and limitations, especially for patients with fragile vasculature. Approach. This work introduces a wireless and leadless telemetry and power transfer solution for ECoG. The proposed solution includes an optical telemetry module and a focused ultrasound (FUS) power transfer system. The proposed system can be miniaturised to fit in an endovascular stent, removing the need for long, intrusive cables. Main results. The optical telemetry achieves data transmission speeds of over 2 Mbit/s, capable of supporting 41 ECoG channels at a 2 kHz sampling rate with 24-bit resolution. The FUS power transfer system delivers up to 10 mW of power to the implant through the scalp(6 mm), skull(10 mm), and subdural space(5 mm), adhering to safety limits. Testing on bovine tissue (10 mm thick bone, 7 mm thick skin) confirmed the system’s efficacy. Significance. This leadless and wireless solution eliminates the need for long cables and auxiliary implants, potentially reducing complications and enhancing the clinical applicability of eBCIs. The proposed system represents a step forward in enabling safer and more effective ECoG for a broader range of patients.


Fig. 3. Neuromodulation approaches. a) An open-loop system with an expert who occasionally reviews the effectiveness of the system and adjusts the stimulation parameters accordingly. Such system employs cyclic stimulation regardless of the current state of the target (e.g. brain state). b) A closed-loop system with external computing for accessing the state of the target to condition the stimulation. The external computing component can be in the form of a portable device, e.g. tablet, or a local computer. The recording device continuously streams data (e.g. EEG signals) to the external computing where trained algorithms are executed to determine the target's state. The deployed algorithms on the external computing component can be updated occasionally by involving a review from an expert(s) and big data/cloud computing (retraining). c) The computing component is embedded within the device, which eliminates the need for continuous streaming of data to the outside world (44, 97). However, the on-device algorithms need to be occasionally updated to reflect the change in physiological signals (e.g. change of seizure patterns in epileptic patients). The device must also have sufficient memory to store the signals for the expert(s) to review and for the retraining that takes place in the cloud. d) A neuromorphic neuromodulation system where the medical device can run and retrain its algorithm by itself without relying on external computing resources. The system utilizes labels generated by a detection model that has performance on-par with a human expert (101-103) and a continuous learning strategy that as a recorder to train a prediction model. The rapid improvements in neuromorphic computing (104, 105) have made on-device active learning possible. It's noteworthy that such system will intermittently transmit relevant snapshots or markers of the recorded bio-signal to comply safety-efficacy standards.
A benchmark of neuromorphic chips. This list is not extensive but considers the breakthroughs.
Neuromorphic neuromodulation: Towards the next generation of closed-loop neurostimulation

October 2024

·

45 Reads

·

3 Citations

PNAS Nexus

Neuromodulation techniques have emerged as promising approaches for treating a wide range of neurological disorders, precisely delivering electrical stimulation to modulate abnormal neuronal activity. While leveraging the unique capabilities of artificial intelligence (AI) holds immense potential for responsive neurostimulation, it appears as an extremely challenging proposition where real-time (low-latency) processing, low power consumption, and heat constraints are limiting factors. The use of sophisticated AI-driven models for personalized neurostimulation depends on the back-telemetry of data to external systems (e.g. cloud-based medical mesosystems and ecosystems). While this can be a solution, integrating continuous learning within implantable neuromodulation devices for several applications, such as seizure prediction in epilepsy, is an open question. We believe neuromorphic architectures hold an outstanding potential to open new avenues for sophisticated on-chip analysis of neural signals and AI-driven personalized treatments. With more than three orders of magnitude reduction in the total data required for data processing and feature extraction, the high power- and memory-efficiency of neuromorphic computing to hardware-firmware co-design can be considered as the solution-in-the-making to resource-constraint implantable neuromodulation systems. This perspective introduces the concept of Neuromorphic Neuromodulation, a new breed of closed-loop responsive feedback system. It highlights its potential to revolutionize implantable brain-machine microsystems for patient-specific treatment.


KAN-EEG: Towards Replacing Backbone-MLP for an Effective Seizure Detection System

June 2024

·

25 Reads

·

4 Citations

The landscape of artificial intelligence (AI) research is witnessing a transformative shift with the emergence of the Kolmogorov-Arnold Network (KAN), presenting a novel architectural paradigm aimed to redefine the structural foundations of AI models, which are based on Multilayer Perceptron (MLP). Through rigorous experimentation and meticulous evaluation, we introduce the KAN-EEG model, a tailored design for efficient seizure detection. Our proposed network is tested and successfully generalized on three different datasets, one from the USA, one from Europe, and one from Oceania, recorded with different front-end hardware. All datasets are scalp Electroencephalogram (EEG) in adults and are from patients living with epilepsy. Our empirical findings reveal that while both architectures demonstrate commendable performance in seizure detection, the KAN model exhibits high-level out-of-sample generalization across datasets from diverse geographical regions, underscoring its inherent adaptability and efficacy at the backbone level. Furthermore, we demonstrate the resilience of the KAN architecture to model size reduction and shallow network configurations, highlighting its versatility and efficiency by preventing over-fitting in-sample datasets. This study advances our understanding of innovative neural network architectures and underscores the pioneering potential of KANs in critical domains such as medical diagnostics.


Citations (66)


... KANs have been incorporated into Convolutional Neural Networks (CNNs) [58,59], graph neural networks [60,61,62,63], and PointNet [64,65]. This approach has proven effective across a variety of fields, including physics-informed machine learning [66,67,68,69,70,71,72,73], deep operator networks [74, 67], neural ordinary differential equations [75,76], image classification [58,77,78,79,80,81,82,83,84,85], image segmentation [86, 87], image detection [88], audio classification [83], and numerous other scientific and industrial applications [89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108]. In this article, we focus on the contributions of KANs to the field of physics-informed deep learning. ...

Reference:

Physics-informed KAN PointNet: Deep learning for simultaneous solutions to inverse problems in incompressible flow on numerous irregular geometries
KAN–EEG: towards replacing backbone–MLP for an effective seizure detection system

... Moreover, transferring either data or gradients can rapidly drain the device's battery (if battery-powered), as data transmission, which often has to be wireless, consumes substantial energy [6]. Our previous work introduced a novel method for finetuning large foundational models without requiring data or gradient transfer [7]. This approach facilitates finetuning large models directly on resource-limited devices, enhancing data security and efficiency. ...

Advancing privacy-aware machine learning on sensitive data via edge-based continual µ-training for personalized large models

... inductive coupling [12], magnetic resonance coupling [13], far-field radiofrequency (RF) energy transfer [14], optical power delivery [15,16], and acoustic power transfer [15]. ...

A leadless power transfer and wireless telemetry solutions for an endovascular electrocorticography

... These implants must deliver stimuli with precisely defined amplitude and timing, adjusting to ongoing brain activity. Achieving this depends on understanding how different neural pathways contribute to sensory processing, cognition, and action, then using realtime algorithms to interpret signals and convert them into suitable stimulation protocols (Contreras et al., 2024;Chiappalone et al., 2022). ...

Neuromorphic neuromodulation: Towards the next generation of closed-loop neurostimulation

PNAS Nexus

... In this study, KAN, which is not yet widely used and which has few studies [1,4,6,[11][12][13][14][15]. The performance of KAN and CNN models in image classification has been evaluated using various metrics such as sensitivity, specificity, precision, Acc, F1, and MCC. ...

KAN-EEG: Towards Replacing Backbone-MLP for an Effective Seizure Detection System
  • Citing Preprint
  • June 2024

... We evaluated its performance on different EEG signatures to assess its adaptability to significant changes in seizure patterns. By generating five synthetic EEG datasets featuring different seizure signatures, we illustrated the model's ability to adapt without forgetting previous patterns, achieving an AUROC of nearly 0.80 (134). This feasibility proof paves the way for future studies focusing on integrating artificial meta-plastic behavior with SNN compatibilities for seizure prediction, addressing buffer limitations in the AURA system. ...

Metaplastic-EEG: Continuous Training on Brain-Signals
  • Citing Preprint
  • May 2024

... 53 Similarly, in a study by Ouchida and colleagues, ictal bradycardia and asystole were identified in a 57-year-old patient with left temporal lobe epilepsy; the patient was initially treated with pacemaker implantation followed by left temporal lobectomy. 54 Treatments applied in cases of ictal asystole and symptomatic ictal bradycardia have resulted in a reduction in syncopal episodes. Examining the long-term outcomes of various treatment approaches such as pacemaker implantation and epilepsy surgery may contribute to reducing SUDEP risk. ...

Syncope vs. Seizure: Ictal Bradycardia and Ictal Asystole

... BPTT is computationally intensive and susceptible to gradient vanishing/explosion. To overcome these challenges, we aim to incorporate forward propagation through time (FPTT), which has shown promise in previous EEG seizure detection studies [37]. This approach aims to enhance robustness for continuous learning with lower computational overhead, advancing our goal towards smart implantable system [43]. ...

Biological plausible algorithm for seizure detection: Toward AI-enabled electroceuticals at the edge

... This approach facilitates finetuning large models directly on resource-limited devices, enhancing data security and efficiency. However, recent advancements in machine learning have led to the development of smaller models that can deliver performance comparable to larger models [8]. Despite their smaller model size, fine-tuning these models on resourceconstrained edge devices poses a significant challenge. ...

Cardiac abnormality detection with a tiny diagonal state space model based on sequential liquid neural processing unit

... We identified the Diagonal State Space Sequence (S4D) model as highly effective for processing sequential data [12,13]. Additionally, we explored the capabilities of specialized neurons, such as Neural Circuit Policies (NCP) [14,15,16] and Kolmogorov-Arnold Networks (KAN) [17,18], both of which demonstrated promising results. Building on these findings, we successfully integrated the S4D model with NCP neurons to create an extremely compact model that delivers strong performance on timeseries data [8]. ...

On-device edge-learning for cardiac abnormality detection using a bio-inspired and spiking shallow network