Showing 81-100 of 1,375 projects
Apache Hadoop is a popular open-source distributed computing framework for processing and storing large datasets.
A JavaScript library for GPU-accelerated parallel computations, enabling high-performance graphics and data processing on the web.
Fast and flexible image augmentation library for computer vision and machine learning projects.
Turn websites into clean data pipelines & structured APIs in minutes with a low-code web scraping tool.
Apache Doris is a high-performance, unified analytics database for real-time data processing.
A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.
This is a comprehensive learning resource for the Flink stream processing framework, covering concepts, principles, and real-world use cases.
A tutorial for natural language processing (NLP) using deep learning frameworks like PyTorch and TensorFlow.
Logstash is a powerful open-source data processing pipeline that can ingest, transform, and output data from a variety of sources.
Automates the process of making money online through various techniques.
An image processing library for Node.js with zero external or native dependencies, written in TypeScript.
Natural Language Toolkit (NLTK) is a comprehensive Python library for NLP tasks.
A Python library for creating, editing, and compositing videos using a simple and intuitive API.
A simple, state-of-the-art NLP framework for tasks like named entity recognition and semantic role labeling.
A powerful PHP library for processing and manipulating images with a wide range of features.
A curated list of awesome big data frameworks, resources and other awesomeness.
A Java-based open-source library that provides style and grammar checking for over 25 languages.
Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.
A JavaScript library for cropping images, providing a simple and customizable image cropping experience.
Dask is a Python library for parallel computing and distributed data processing, providing a scalable alternative to NumPy and Pandas.
Get weekly updates on trending AI coding tools and projects.