AI-powered dataset management and preprocessing library for ML projects
Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.
Finding duplicate images made easy with AI-powered image deduplication.
Gathers text and metadata from the web using crawling, scraping, and extraction techniques.
A comprehensive Chinese NLP preprocessing and parsing package with high accuracy, efficiency, and ease of use.
A delightful machine learning tool that allows you to train, test, and use models without writing code.
A Python library for text preprocessing, representation, and visualization in machine learning and NLP projects.
A plugin framework for CSS preprocessing in Node.js.
A versatile NLP toolkit for text mining and preprocessing, supporting tasks like sentiment analysis, entity extraction, and keyword summarization.
A repository for the 100 Knocks of Data Science Preprocessing, focused on structured data processing.
TorchIO is a Python library for efficient medical image preprocessing and data augmentation for AI applications.
A Svelte preprocessor with support for various languages and a focus on developer productivity.
MLBox is a powerful automated machine learning Python library that simplifies and accelerates the machine learning workflow.
Compile almost any preprocessing language with live browser refresh.
A Python toolkit for deep learning and healthcare applications, with support for clinical data and electronic health records.
An automated time series forecasting library for Python with advanced features like deep learning and feature engineering.
This project provides advanced voiceprint recognition models and data preprocessing methods using PyTorch.
Starter code for solving real-world text data problems using NLP techniques like Gensim Word2Vec and text classification.
NVTabular is a feature engineering and preprocessing library for tabular data used in recommender systems.
A PyTorch-based audio processing library for spectrograms, CQT, and neural network-based preprocessing.