Generative Adversarial Networks for Synthetic Data Generation in Diabetic Patient Research: Techniques, Applications, and Challenges.
Synthetic data generation is a strategy used to address the lack and complex process to acquire clinical data and information, in particular in type 2 diabetes mellitus (T2DM) research. T2DM is characterized by chronic hyperglycemia with macrovascular and microvascular complications. Nevertheless, despite the importance of data to improve diagnostic accuracy, better treatments, and personalized patient care, medical datasets are often restricted by ethical and privacy constraints. In this sense, this chapter evaluates four synthetic data generation techniques, Gaussian Mixture Models (GMM), Generative Adversarial Networks (GAN), Wasserstein GAN (WGAN), and Variational Autoencoders (VAE). The quality of the generated data was assessed through statistical divergence metrics-specifically Jensen-Shannon (JSD) and Kullback-Leibler (KLD)and by analyzing their impact on classification performance. The results indicate that GMM achieved the lowest JSD, showing the best overall distributional similarity, while WGAN obtained the lowest KLD, suggesting a closer alignment in information content with real data. Additionally, GAN and WGAN demonstrated the highest predictive performance in classification tasks, indicating that they better preserved essential relationships within the data. These findings confirm that generative strategies of using synthetic data to improve T2DM research are feasible, offering an alternative to develop diagnosis tools without compromising patient confidentiality. It is possible to conclude that the generation method selection depends on the type of data and research objective, maximizing statistical similarity, optimizing performance, or balancing both aims. Synthetic data generation approaches represent a feasible approach to expand balanced and quality datasets to advance in personalized healthcare for diabetes patients.
Authors
García-Domínguez García-Domínguez, Acosta-Jiménez Acosta-Jiménez, Gonzalez-Curiel Gonzalez-Curiel, Villagrana-Bañuelos Villagrana-Bañuelos, Acosta-Cruz Acosta-Cruz, Galván-Tejada Galván-Tejada, Galván-Tejada Galván-Tejada
View on Pubmed