Explore Projects

Discover 31 open source projects

Active filters (1):
Search: syntheticร—
Clear all

Showing 1-20 of 31 projects

stefan-jansen/machine-learning-for-trading

Code for machine learning-based algorithmic trading strategies and workflows.

16.7K
Archived
Jupyter Notebook
Machine Learning Ops
#finance#investment#trading

openstatusHQ/openstatus

Open-source status page with uptime monitoring and API monitoring as code

8.4K
Active
TypeScript
API Frameworks
Monitoring
Next.js
#monitoring#api-monitoring#uptime

7kms/react-illustration-series

An illustration series that aims to explain the inner workings of the React library in detail.

7.9K
Archived
TypeScript
Component Libraries (React)
Tutorials & Courses
React
#react#fiber#hook

clovaai/donut

Donut is an OCR-free Document Understanding Transformer and Synthetic Document Generator for computer vision and document AI tasks.

6.8K
Archived
Python
React
#document-ai#computer-vision#open-source

datajuicer/data-juicer

A Python library for processing and analyzing data with foundation models and large language models.

6.0K
Active
Python
LLM Frameworks
ETL & Pipelines
Python
#data-processing#data-analysis#foundation-models

lk-geimfari/mimesis

Mimesis is a fast Python library for generating fake data in multiple languages for testing and development purposes.

4.8K
Active
Python
Databases
Testing
#data-generation#fake-data#testing

Kiln-AI/Kiln

Build, Evaluate, and Optimize AI Systems

4.7K
Active
Python
AI Editors/Agents/Copilot
#AI#chain-of-thought#collaboration

Belval/TextRecognitionDataGenerator

A synthetic data generator for text recognition, useful for training AI-powered text detection and OCR models.

3.6K
Archived
Python
Computer Vision
Python
#text-recognition#ocr#synthetic-data

sdv-dev/SDV

Generates synthetic tabular data for machine learning and AI applications

3.4K
Active
Python
AI Code Generation
Next.js
#synthetic-data-generation#tabular-data#machine-learning

DLR-RM/BlenderProc

A Python library for generating photorealistic training images using the Blender 3D software.

3.4K
Active
Python
Computer Vision
Backend Frameworks
Python
#computer-vision#3d-graphics#blender

NAalytics/Assemblies-of-putative-SARS-CoV2-spike-encoding-mRNA-sequences-for-vaccines-BNT-162b2-and-mRNA-1273

An open-source project that provides experimental sequence information for the RNA components of the Moderna and Pfizer/BioNTech COVID-19 vaccines.

3.4K
Archived
File Storage
Databases
#covid-19#vaccines#rna-sequencing

pgmpy/pgmpy

Python library for Causal AI and Bayesian networks

3.2K
Active
Python
React
#causal-inference#bayesian-networks#probabilistic-inference

synthetichealth/synthea

Synthea is an open-source synthetic patient population simulator for generating realistic healthcare data.

3.0K
Stable
Java
Databases
API Frameworks
#healthcare#data-simulation#fhir

Eladlev/AutoPrompt

A framework for prompt tuning using Intent-based Prompt Calibration, focused on AI coding tools.

2.9K
Stable
Python
LLM Frameworks
Prompt Engineering
Python
#prompt-tuning#intent-based-calibration#llm

hitsz-ids/synthetic-data-generator

A specialized Python framework for generating high-quality structured tabular data for AI and ML applications.

2.4K
Active
Python
Synthetic Data
Databases
Python
#data-generation#tabular-data#machine-learning

ankush-me/SynthText

Open-source Python library for generating synthetic text images, useful for computer vision tasks.

2.1K
Archived
Python
Computer Vision
Backend Frameworks
Python
#computer-vision#image-generation#data-augmentation

opendatalab/DocLayout-YOLO

An open-source library that enhances document layout analysis using diverse synthetic data and adaptive perception.

2.0K
Experimental
Python
Computer Vision
API Frameworks
Python
#document-layout-analysis#computer-vision#synthetic-data

apple/ml-hypersim

A photorealistic synthetic dataset for holistic indoor scene understanding using machine learning.

2.0K
Active
Python
Computer Vision
#computer-vision#machine-learning#dataset

bespokelabsai/curator

A Python library for synthetic data curation and structured data extraction for machine learning models.

1.6K
Active
Python
Synthetic Data
LLM Frameworks
Python
#machine-learning#data-generation#data-curation

huggingface/aisheets

Build, enrich, and transform datasets using AI models with no code

1.6K
Stable
TypeScript
LLM Frameworks
AI SDKs & Wrappers
TypeScript
#ai#llms#nocode
2

Stay in the loop

Get weekly updates on trending AI coding tools and projects.