Explore Projects

Discover 17 open source projects

Active filters (1):
Search: data-augmentation×
Clear all

Showing 1-17 of 17 projects

snorkel-team/snorkel

A powerful system for quickly generating high-quality training data with weak supervision for AI/ML projects.

5.9K
Archived
Python
LLM Frameworks
Data Pipelines
Python
#data-augmentation#weak-supervision#machine-learning

NVIDIA/DALI

A highly optimized GPU-accelerated library for accelerating deep learning training and inference applications.

5.6K
Active
C++
GPU
Data Processing
PyTorch
#gpu#data-processing#deep-learning

ZhaoJ9014/face.evoLVe

A high-performance face recognition library for developers built on PaddlePaddle and PyTorch.

3.6K
Experimental
Python
Computer Vision
Machine Learning
PyTorch
#face-recognition#computer-vision#deep-learning

QData/TextAttack

TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in NLP.

3.4K
Experimental
Python
Adversarial Attacks & Security
Data Augmentation
Python
#adversarial-attacks#data-augmentation#natural-language-processing

webdataset/webdataset

A high-performance I/O system for large deep learning problems with strong PyTorch support.

3.0K
Experimental
Python
ML Ops
ETL & Pipelines
PyTorch
#data-augmentation#deep-learning#pytorch

TorchIO-project/torchio

TorchIO is a Python library for efficient medical image preprocessing and data augmentation for AI applications.

2.4K
Active
Python
Computer Vision
Databases
PyTorch
#medical-imaging#data-augmentation#computer-vision

425776024/nlpcda

A one-key Chinese data augmentation package for NLP and BERT model training.

1.9K
Experimental
Python
React
#data-augmentation#chinese-data-augmentation#nlp

visual-layer/fastdup

Accelerate data curation and augmentation with this scalable, free tool for image and video analysis.

1.8K
Stable
Python
Computer Vision
ETL & Pipelines
Python
#data-augmentation#data-curation#image-processing

jasonwei20/eda_nlp

Data augmentation for NLP using CNN and RNN, presented at EMNLP 2019

1.6K
Archived
Python
Python
#data-augmentation#nlp#text-classification

AgaMiko/data-augmentation-review

A comprehensive collection of data augmentation resources and techniques for developers working with AI tools.

1.6K
Archived
Data Augmentation
Tutorials & Courses
#data-augmentation#machine-learning#ai-tools

yongzhuo/nlp_xiaojiang

A comprehensive NLP toolkit for Chinese language processing, including chatbots, text similarity, classification, and more.

1.5K
Archived
Python
LLM Frameworks
API Frameworks
PyTorch
#nlp#chatbot#text-classification

LirongWu/awesome-graph-self-supervised-learning

A library for self-supervised learning on graphs, providing contrastive, generative, and predictive pretext tasks.

1.4K
Archived
Representation Learning
Graph Databases
#graph-neural-networks#self-supervised-learning#pre-training

zhanlaoban/EDA_NLP_for_Chinese

A Python library for easy data augmentation of Chinese text corpora using the EDA (Easy Data Augmentation) technique.

1.4K
Archived
Python
Text Augmentation
#chinese#data-augmentation#text-classification

Tebmer/Awesome-Knowledge-Distillation-of-LLMs

A comprehensive survey on knowledge distillation techniques for large language models.

1.3K
Experimental
LLM Frameworks
Tutorials & Courses
#knowledge-distillation#large-language-model#survey

Paperspace/DataAugmentationForObjectDetection

A Jupyter Notebook library for applying data augmentation techniques to improve object detection models.

1.2K
Archived
Jupyter Notebook
Computer Vision
Data Pipelines
#data-augmentation#object-detection#computer-vision

iver56/torch-audiomentations

A fast and customizable PyTorch library for audio data augmentation, useful for deep learning applications.

1.1K
Stable
Python
Audio
Caching
PyTorch
#audio-processing#data-augmentation#machine-learning

quqxui/Awesome-LLM4IE-Papers

A curated collection of papers on generative information extraction using large language models.

1.1K
Archived
LLM Frameworks
Information Extraction
#large-language-models#information-extraction#event-extraction

Stay in the loop

Get weekly updates on trending AI coding tools and projects.