Explore Projects

Discover 1,473 open-source projects

Active filters (1):
Search: process

Showing 81-100 of 1,473 projects

apache/hadoop

Apache Hadoop is a popular open-source distributed computing framework for processing and storing large datasets.

15.5K
Active
Java
API Frameworks
#distributed-computing #big-data #nosql

gpujs/gpu.js

A JavaScript library for GPU-accelerated parallel computations, enabling high-performance graphics and data processing on the web.

15.4K
Experimental
JavaScript
Frontend Frameworks
React
#gpu #parallel-computing #javascript

albumentations-team/albumentations

Fast and flexible image augmentation library for computer vision and machine learning projects.

15.3K
Experimental
Python
Computer Vision
Python
#augmentation #deep-learning #image-processing

getmaxun/maxun

Turn websites into clean data pipelines & structured APIs in minutes with a low-code web scraping tool.

15.2K
Active
TypeScript
API Clients & Testing
React
#web-scraping #automation #no-code

apache/doris

Apache Doris is a high-performance, unified analytics database for real-time data processing.

15.1K
Active
Java
Databases
Spark
#database #olap #real-time

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition #voice-activity-detection #audio-visual-speech-recognition

zhisheng17/flink-learning

This is a comprehensive learning resource for the Flink stream processing framework, covering concepts, principles, and real-world use cases.

15.1K
Experimental
Java
Databases
#stream-processing #flink #kafka

graykode/nlp-tutorial

A tutorial for natural language processing (NLP) using deep learning frameworks like PyTorch and TensorFlow.

14.9K
Archived
Jupyter Notebook
LLM Frameworks
PyTorch
#natural-language-processing #deep-learning #attention-mechanisms

elastic/logstash

Logstash is a powerful open-source data processing pipeline that can ingest, transform, and output data from a variety of sources.

14.8K
Active
Java
API Frameworks
Java
#etl #logging #real-time-processing

FujiwaraChoki/MoneyPrinterV2

Automates the process of making money online through various techniques.

14.6K
Experimental
Python
General Utilities
Python
#automation #money #outreach

jimp-dev/jimp

An image processing library for Node.js with zero external or native dependencies, written in TypeScript.

14.6K
Stable
TypeScript
Backend Frameworks
Node
#image-processing #javascript #typescript

nltk/nltk

Natural Language Toolkit (NLTK) is a comprehensive Python library for NLP tasks.

14.5K
Active
Python
React
#natural-language-processing #machine-learning #nlp

Zulko/moviepy

A Python library for creating, editing, and compositing videos using a simple and intuitive API.

14.4K
Stable
Python
API Frameworks
Python
#video-editing #animation #python

flairNLP/flair

A simple, state-of-the-art NLP framework for tasks like named entity recognition and semantic role labeling.

14.4K
Stable
Python
NLP Frameworks
PyTorch
#natural-language-processing #machine-learning #sequence-labeling

Intervention/image

A powerful PHP library for processing and manipulating images with a wide range of features.

14.3K
Active
PHP
API Frameworks
PHP
#image-processing #gd #imagick

oxnr/awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

14.3K
Stable
Databases
#big-data #data-analytics #data-science

languagetool-org/languagetool

A Java-based open-source library that provides style and grammar checking for over 25 languages.

14.1K
Active
Java
Linters & Formatters
#grammar-checker #style-checker #proofreading

Unstructured-IO/unstructured

Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.

14.1K
Active
HTML
Document Processing
#document-processing #data-pipelines #natural-language-processing

fengyuanchen/cropperjs

A JavaScript library for cropping images, providing a simple and customizable image cropping experience.

13.8K
Stable
TypeScript
Component Libraries (React)
React
#cropper #image-processing #image-editor

dask/dask

Dask is a Python library for parallel computing and distributed data processing, providing a scalable alternative to NumPy and Pandas.

13.8K
Active
Python
Databases
Python
#parallel-computing #distributed-data-processing #data-analysis
