Synthetic Metabolic Datasets

This page demonstrates the structure of synthetic datasets available in this category, including variable schema, preview data, and reproducibility using the Syntherx SDK.

Example Dataset

Metabolic Outcomes Trial

Variable schema and example preview from the blueprint definition.

Variable Schema

Column NameTypeDescription
patient_idstringUnique synthetic patient identifier
agenumberSynthetic patient age
sexstringSynthetic patient sex
treatment_groupstringTreatment or control group
baseline_measurenumberBaseline metabolic marker (e.g. composite index)
outcome_measurenumberOutcome metabolic marker at follow-up

Data Preview

First 9 rows (preview only)

patient_idagesextreatment_groupbaseline_measureoutcome_measure
P00000171Femaletreatment8.36.8
P00000243Malecontrol97.1
P00000359Femaletreatment8.87.5
P00000467Malecontrol9.17.5
P00000544Malecontrol8.97
P00000665Maletreatment7.96.9
P00000751Femaletreatment9.68.3
P00000853Malecontrol8.78
P00000951Femaletreatment8.67.4

Includes CSV, JSON, and Parquet — ready for ML pipelines

Reproduce This Dataset

Recreate this dataset in Python (Jupyter, Kaggle, or Google Colab) using the Syntherx SDK

# Install Syntherx SDK
pip install syntherx

from syntherx import generate_dataset

df = generate_dataset(
    blueprint="metabolic_outcomes_trial",
    rows=5000
)

df.to_csv("metabolic_outcomes_trial.csv")

Use Cases

  • Metabolic syndrome and biomarker modeling
  • Intervention response and longitudinal outcomes
  • Risk prediction for cardiometabolic endpoints

Privacy-Safe Synthetic Dataset

  • Contains no real patient data
  • Generated using statistical simulation
  • Designed for machine learning research

Purchase Dataset

Research Dataset — $99

Secure checkout via Stripe.

Includes CSV, JSON, and Parquet — ready for ML pipelines

Unlock the Syntherx Platform

Generate custom datasets tailored to your research and AI needs.

Generate custom datasets
← Back to all datasets