Longitudinal health data for machine learning research and education
These realistic synthetic datasets can be downloaded freely, for example to develop offline reinforcement learning algorithms.
Realistic open data created using the latest advances in generative models
Read the research paper on the use of Generative Adversarial Networks (GANs) to create synthetic datasets and their evaluation in terms of accuracy and disclosure risk.
Download the datasets
Select a dataset to download or view documentation.
This dataset comprises vital signs, lab tests, administered fluid boluses and vasopressors for 3,910 patients with acute hypotension in the intensive care unit.
Antiretroviral Therapy in HIV
This dataset comprises viral loads, CD4 counts, and drug regimen information for 8,916 patients with Human Immunodeficiency Virus (HIV).
The Team and Resources behind Health Gym
Learn more about us.
Be part of the community
here is how to reach out.
Join our community on GitHub or contact us