Skip to content

chitresh99/Synthee

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Synthee

Synthee is a AI synthetic data generation service designed to automate and accelerate dataset creation for machine learning, analytics, and testing. With Synthee, users can generate structured and customizable synthetic datasets with ease using powerful AI models and prompt-driven workflows.


Features

  • AI-Powered Prompt Generation – Generate contextually accurate data from natural language prompts.
  • Custom Schema Support – Define your own fields, types, and constraints.
  • Streamlit UI – Easy-to-use frontend for quick testing and dataset generation.
  • Export to CSV – Download your datasets for instant use in ML or analytics pipelines.

How It Works

  1. Input a prompt describing the kind of dataset you want.
  2. Optionally define a schema and constraints.
  3. Synthee uses AI to generate data based on the prompt and your preferences.
  4. Download the dataset as a CSV file or preview it in-browser.

Example Use Cases

  • Generate tabular synthetic datasets for machine learning prototyping.
  • Simulate customer data for frontend or backend testing.

Tech Stack

Frontend: Streamlit

Backend: Python

AI Model Access: OpenRouter (routes to models like Meta LLaMA, DeepSeek, etc.)

Models Used: Meta LLaMA, DeepSeek

About

AI powered synthetic data generation service for creating structured datasets from prompts.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages