Skip to content

Research

EEG-GAN: A Generative EEG Augmentation Toolkit for Enhancing Neural Classification.

Williams, Weinhardt, Hewson, Plomecka, Langer, & Musslick (2025). bioRxiv

Abstract:

Electroencephalography (EEG) is a widely applied method for decoding neural activity, offering insights into cognitive function and driving advancements in neurotechnology. However, decoding EEG data remains challenging, as classification algorithms typically require large datasets that are expensive and time-consuming to collect. Recent advances in generative artificial intelligence have enabled the creation of realistic synthetic EEG data, yet no method has consistently demonstrated that such synthetic data can lead to improvements in EEG decodability across diverse datasets. Here, we introduce EEG-GAN, an open-source generative adversarial network (GAN) designed to augment EEG data. In the most comprehensive evaluation study to date, we assessed its capacity to generate realistic EEG samples and enhance classification performance across four datasets, five classifiers, and seven sample sizes, while benchmarking it against six established augmentation techniques. We found that EEG-GAN, when trained to generate raw single-trial EEG signals, produced signals that reproduce grand-averaged waveforms and time-frequency patterns of the original data. Furthermore, training classifiers on additional synthetic data improved their ability to decode held-out empirical data. EEG-GAN achieved up to a 16% improvement in decoding accuracy, with enhancements consistent across datasets but varying among classifiers. Data augmentations were particularly effective for smaller sample sizes (30 and below), significantly improving 70% of these classification analyses and only significantly impairing 4% of analyses. Moreover, EEG-GAN significantly outperformed all benchmark techniques in 69% of the comparisons across datasets, classifiers, and sample sizes and was only significantly outperformed in 3% of comparisons. These findings establish EEG-GAN as a robust toolkit for generating realistic EEG data, which can effectively reduce the costs associated with real-world EEG data collection for neural decoding tasks.

Script and Data Availability

Script Repos

Data and Models