Explore Projects

Discover 382 open source projects

Active filters (1):
Search: datasetsร—
Clear all

Showing 61-80 of 382 projects

apache/hbase

Apache HBase is a distributed, scalable, fault-tolerant database for large datasets built on top of HDFS.

5.6K
Active
Java
Databases
#database#distributed#scalable

lonePatient/awesome-pretrained-chinese-nlp-models

A curated collection of high-quality, pretrained Chinese NLP models for various tasks and applications.

5.5K
Stable
Python
LLM Frameworks
API Frameworks
Python
#bert#chinese#nlp

OpenCSGs/csghub

An open-source platform for managing large language models, datasets, and agents with features similar to Hugging Face.

5.5K
Active
Vue
LLM Frameworks
LLM Wrappers & SDKs
Vue
#ai#llm#dataset

goto456/stopwords

A Chinese stopwords dataset for natural language processing and text analysis tasks.

5.5K
Archived
API Frameworks
Databases
#natural-language-processing#text-analysis#chinese

zhaoxin94/awesome-domain-adaptation

A curated list of resources related to domain adaptation, a technique used to improve AI model performance on new datasets.

5.4K
Stable
ML Ops
Tutorials & Courses
#domain-adaptation#transfer-learning#machine-learning

potree/potree

A WebGL-based point cloud viewer for large datasets, suitable for 3D visualization and spatial analysis.

5.3K
Active
JavaScript
Charts & Visualization
Frontend Frameworks
React
#3d-visualization#point-cloud#spatial-analysis

isl-org/MiDaS

Code for robust monocular depth estimation, a key computer vision task for AI-powered apps.

5.3K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#depth-estimation#computer-vision#pytorch

philackm/ScrollableGraphView

An adaptive scrollable graph view for iOS to visualise simple discrete datasets, written in Swift.

5.3K
Archived
Swift
Charts & Visualization
iOS
#charts#visualization#graphs

ownthink/KnowledgeGraphData

This is a large-scale open-source Chinese knowledge graph dataset for AI and machine learning applications.

5.2K
Archived
Python
LLM Frameworks
Databases
#knowledge-graph#chinese#machine-learning

minimaxir/textgenrnn

A Python library for easily training text-generating neural networks on any text dataset.

4.9K
Archived
Python
LLM Frameworks
CLI Tools
Python
#text-generation#deep-learning#machine-learning

togethercomputer/RedPajama-Data

A repository for preparing large datasets for training large language models (LLMs).

4.9K
Archived
Python
LLM Frameworks
Datasets
Python
#language-models#dataset-preparation#cli-tool

bukosabino/ta

Technical Analysis Library using Pandas and Numpy for financial data analysis and trading strategies.

4.9K
Archived
Jupyter Notebook
Pandas, Numpy
Backend Frameworks
Pandas
#financial-analysis#technical-analysis#trading-strategies

argilla-io/argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

4.9K
Active
Python
LLM Frameworks
Agents & Orchestration
Python
#active-learning#annotation-tool#human-in-the-loop

pudo/dataset

Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.

4.9K
Experimental
Python
ORMs & Query Builders
CLI Tools
Python
#database#sql#orm

jazzband/tablib

A Python module for working with tabular datasets in various formats like XLS, CSV, JSON, and YAML.

4.8K
Active
Python
API Frameworks
Data Formats
Python
#tabular-data#data-manipulation#data-export

weiaicunzai/pytorch-cifar100

A PyTorch repository for practicing image classification on the CIFAR-100 dataset using various deep learning models.

4.8K
Archived
Python
Computer Vision
Datasets
PyTorch
#cifar100#image-classification#deep-learning

Kiln-AI/Kiln

Build, Evaluate, and Optimize AI Systems

4.7K
Active
Python
AI Editors/Agents/Copilot
#AI#chain-of-thought#collaboration

SPLWare/esProc

esProc SPL is a JVM-based programming language for structured data computation, serving as both a data analysis tool and an embedded computing engine.

4.7K
Active
Java
Databases
Dataset
#cluster-computing#sql#database

leoxiaobin/deep-high-resolution-net.pytorch

An official PyTorch implementation of a deep learning model for human pose estimation

4.5K
Archived
Cuda
Computer Vision
API Frameworks
PyTorch
#computer-vision#pose-estimation#deep-learning

hyunwoongko/transformer

A PyTorch implementation of the Transformer architecture, a key component of modern language models.

4.5K
Experimental
Python
LLM Frameworks
Frontend Frameworks
PyTorch
#transformer#attention-mechanism#language-modeling
1...35...20

Stay in the loop

Get weekly updates on trending AI coding tools and projects.