Clinical Trial Outcomes
Synthetic patient-level rows with fields: patient_id, age, sex, trial_arm, baseline_value, endpoint_value, ….
This resource represents a fully synthetic cohort patterned after healthcare scenarios: there are no real patients or protected health information, only statistically plausible records for method development and reproducible benchmarks.
Rows include variables such as patient_id, age, sex, trial_arm, baseline_value, endpoint_value, change_from_baseline, responder. You can inspect the full schema and representative preview below before downloading or generating a fresh cohort with the Syntherx SDK.
Teams use datasets like this for AI and statistical modeling, digital twin and pathway simulation, curriculum and sandbox environments, and cross-institutional collaborations where sharing real data is impractical.
Research Dataset — $99
Includes CSV, JSON, and Parquet — ready for ML pipelines
Variable Schema
| Column Name | Type | Description |
|---|---|---|
| patient_id | string | Unique synthetic patient identifier |
| age | number | Synthetic patient age |
| sex | string | Synthetic patient sex |
| trial_arm | string | Treatment or control group assignment |
| baseline_value | number | Baseline measurement |
| endpoint_value | number | Outcome measurement at endpoint |
| change_from_baseline | number | endpoint_value minus baseline_value (negative values indicate a decrease) |
| responder | number | Binary outcome (1 = responder, 0 = non-responder) |
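A minimal sketch of how one row could be checked against the schema above, using only the Python standard library. The `validate_row` helper and its tolerance are illustrative assumptions and are not part of the Syntherx SDK; the rule that `change_from_baseline` equals `endpoint_value - baseline_value` is inferred from the preview rows.

```python
import math

# Expected columns and acceptable Python types, taken from the schema table.
SCHEMA = {
    "patient_id": str,
    "age": (int, float),
    "sex": str,
    "trial_arm": str,
    "baseline_value": (int, float),
    "endpoint_value": (int, float),
    "change_from_baseline": (int, float),
    "responder": (int, float),
}


def validate_row(row, tol=1e-6):
    """Return a list of problems found in one row (an empty list means OK).

    Hypothetical helper for illustration only.
    """
    problems = []
    for column, expected in SCHEMA.items():
        if column not in row:
            problems.append(f"missing column: {column}")
        elif isinstance(row[column], bool) or not isinstance(row[column], expected):
            problems.append(f"wrong type for {column}")
    if problems:
        return problems

    # Assumed rule: change_from_baseline = endpoint_value - baseline_value.
    expected_change = row["endpoint_value"] - row["baseline_value"]
    if not math.isclose(row["change_from_baseline"], expected_change, abs_tol=tol):
        problems.append("change_from_baseline != endpoint_value - baseline_value")

    # responder is documented as a binary flag.
    if row["responder"] not in (0, 1):
        problems.append("responder must be 0 or 1")
    return problems


# First preview row (P000001).
example = {
    "patient_id": "P000001",
    "age": 66,
    "sex": "Female",
    "trial_arm": "treatment",
    "baseline_value": 8.5,
    "endpoint_value": 7.1,
    "change_from_baseline": -1.4,
    "responder": 1,
}
print(validate_row(example))
```

Comparing with a small absolute tolerance rather than `==` avoids false alarms from floating-point rounding in the subtraction.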
Data Preview
First 9 rows (preview only)
| patient_id | age | sex | trial_arm | baseline_value | endpoint_value | change_from_baseline | responder |
|---|---|---|---|---|---|---|---|
| P000001 | 66 | Female | treatment | 8.5 | 7.1 | -1.4 | 1 |
| P000002 | 59 | Male | control | 8.9 | 8.2 | -0.7 | 0 |
| P000003 | 71 | Female | treatment | 9.1 | 7.4 | -1.7 | 1 |
| P000004 | 63 | Male | control | 8.7 | 8 | -0.7 | 0 |
| P000005 | 54 | Female | treatment | 8.3 | 6.9 | -1.4 | 1 |
| P000006 | 68 | Male | control | 9 | 8.4 | -0.6 | 0 |
| P000007 | 60 | Female | treatment | 8.6 | 7 | -1.6 | 1 |
| P000008 | 65 | Male | control | 8.8 | 8.1 | -0.7 | 0 |
| P000009 | 57 | Female | treatment | 8.4 | 6.8 | -1.6 | 1 |
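As a rough illustration of a first exploratory step, the sketch below summarizes the nine preview rows above by trial arm using only the standard library. The `summarize_by_arm` function is a hypothetical helper, not an SDK call, and nine rows are far too few to support any conclusion about the full cohort.

```python
from collections import defaultdict
from statistics import mean

# The nine preview rows as (trial_arm, change_from_baseline, responder).
PREVIEW = [
    ("treatment", -1.4, 1),
    ("control", -0.7, 0),
    ("treatment", -1.7, 1),
    ("control", -0.7, 0),
    ("treatment", -1.4, 1),
    ("control", -0.6, 0),
    ("treatment", -1.6, 1),
    ("control", -0.7, 0),
    ("treatment", -1.6, 1),
]


def summarize_by_arm(rows):
    """Group rows by arm and report count, mean change, and responder rate."""
    grouped = defaultdict(list)
    for arm, change, responder in rows:
        grouped[arm].append((change, responder))

    summary = {}
    for arm, values in grouped.items():
        summary[arm] = {
            "n": len(values),
            "mean_change": round(mean(c for c, _ in values), 3),
            "responder_rate": mean(r for _, r in values),
        }
    return summary


for arm, stats in summarize_by_arm(PREVIEW).items():
    print(arm, stats)
```

In the preview, every treatment row is flagged as a responder and no control row is, so the responder rate separates the arms completely; a generated cohort of several thousand rows would be expected to show more overlap.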
Reproduce This Dataset
Recreate this dataset in Python (Jupyter, Kaggle, or Google Colab) using the Syntherx SDK.
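```shell
# Install the Syntherx SDK
pip install syntherx
```

```python
from syntherx import generate_dataset

# Generate a 5,000-row cohort from the clinical trial outcomes blueprint
df = generate_dataset(
    blueprint="clinical_trial_outcomes",
    rows=5000
)

# Save the generated cohort as CSV
df.to_csv("clinical_trial_outcomes.csv")
```

Use Cases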
- Build and validate AI/ML pipelines for healthcare scenarios without using real patient data.
- Train and evaluate models on structured fields such as age, sex, trial_arm, and baseline_value, with responder or change_from_baseline as the target.
- Run simulations, power analyses, and exploratory analytics in a privacy-safe sandbox.
- Prototype dashboards, ETL flows, and feature stores before touching production systems.
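The power analysis use case mentioned above can be sketched with the standard normal-approximation formula for comparing two arm means, n per arm = 2 (z_{1-α/2} + z_{power})² σ² / δ². The function name and the example values of `delta` and `sigma` are placeholders chosen for illustration; they are not estimates from this dataset.

```python
import math
from statistics import NormalDist


def sample_size_per_arm(delta, sigma, alpha=0.05, power=0.80):
    """Approximate per-arm sample size for a two-sided, two-sample z-test.

    delta: assumed difference in mean change between arms
    sigma: assumed common standard deviation of the outcome
    """
    if delta == 0:
        raise ValueError("delta must be non-zero")
    std_normal = NormalDist()
    z_alpha = std_normal.inv_cdf(1 - alpha / 2)  # critical value for the test
    z_power = std_normal.inv_cdf(power)          # quantile for the target power
    n = 2 * ((z_alpha + z_power) * sigma / delta) ** 2
    return math.ceil(n)


# Hypothetical inputs: a 0.5-unit difference with a standard deviation of 1.0.
print(sample_size_per_arm(delta=0.5, sigma=1.0))  # -> 63
```

A synthetic cohort is a convenient place to sanity-check such a calculation by simulation before applying it to a real study design, since a t-test based calculation will give slightly larger numbers than this z-approximation.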
Dataset Characteristics
- Fully synthetic — no PHI; suitable for sharing, teaching, and external collaboration.
- Schema includes 8 variables: patient_id, age, sex, trial_arm, baseline_value, endpoint_value, change_from_baseline, responder.
- Delivered in researcher-friendly formats (CSV, JSON, Parquet) for downstream tooling.
- Generated with the Syntherx simulation engine for reproducible cohort-scale draws.
Privacy-Safe Synthetic Dataset
- Contains no real patient data
- Generated using statistical simulation
- Designed for machine learning research
Related Datasets
Explore adjacent synthetic cohorts in the same domain or browse nearby clinical themes.
- Cardiology Outcomes: Synthetic patient-level cardiovascular risk factors and biomarkers for ML and outcomes research.
- Claims: Synthetic patient-level rows with fields: patient_id, age, sex, diagnosis_code, procedure_code, claim_amount, ….
- EHR: Synthetic patient-level rows with fields: patient_id, visit_date, age, sex, diagnosis, medication, ….
- Diabetes HbA1c Trial: Synthetic patient-level rows with fields: patient_id, age, sex, treatment_group, baseline_measure, outcome_measure.
Unlock the Syntherx Platform
Generate custom datasets tailored to your research and AI needs.