Synthetic Clinical Trial Datasets

This page demonstrates the structure of synthetic datasets available in this category, including variable schema, preview data, and reproducibility using the Syntherx SDK.

Example Dataset

Example Clinical Trial Schema

Schema and example preview for datasets in this category, fetched from the blueprint API.

Variable Schema

Column NameTypeDescription
patient_idstringUnique synthetic patient identifier
agenumberSynthetic patient age
sexstringSynthetic patient sex
trial_armstringTreatment or control group assignment
baseline_valuenumberBaseline measurement
endpoint_valuenumberOutcome measurement at endpoint
change_from_baselinenumberDifference between endpoint and baseline
respondernumberBinary outcome (1 = responder, 0 = non-responder)

Data Preview

First 9 rows (preview only)

patient_idagesextrial_armbaseline_valueendpoint_valuechange_from_baselineresponder
P00000166Femaletreatment8.57.1-1.41
P00000259Malecontrol8.98.2-0.70
P00000371Femaletreatment9.17.4-1.71
P00000463Malecontrol8.78-0.70
P00000554Femaletreatment8.36.9-1.41
P00000668Malecontrol98.4-0.60
P00000760Femaletreatment8.67-1.61
P00000865Malecontrol8.88.1-0.70
P00000957Femaletreatment8.46.8-1.61

Reproduce This Dataset

Recreate this dataset in Python (Jupyter, Kaggle, or Google Colab) using the Syntherx SDK.

# Install Syntherx SDK
pip install syntherx

from syntherx import generate_dataset

df = generate_dataset(
    blueprint="clinical_trial_outcomes",
    rows=5000
)

df.to_csv("clinical_trial_outcomes.csv")

Use Cases

  • Clinical trial simulation
  • Endpoint analysis and modeling
  • Treatment effect evaluation

Privacy-Safe Synthetic Dataset

  • Contains no real patient data
  • Generated using statistical simulation
  • Designed for machine learning research

No datasets in this category yet. Browse all datasets

Unlock the Syntherx Platform

Generate custom datasets tailored to your research and AI needs.

Generate custom datasets