Explore Projects

Discover 117 open source projects

Active filters (1):

Search: pretraining×

Clear all

Showing 1-20 of 117 projects

huggingface/transformers

Model framework for state-of-the-art ML models in text, vision, audio, and multimodal tasks.

157.4K

Active

Python

LLM Frameworks

Agents & Orchestration

PyTorch

#transformers#huggingface#deep-learning

fighting41love/funNLP

Comprehensive Chinese NLP resource collection for developers

79.2K

Archived

Python

LLM Frameworks

RAG & Vector

Python

#nlp#chinese-nlp#ai-resources

opendatalab/MinerU

Converts complex documents into LLM-ready formats for agentic workflows

55.5K

Active

Python

Agents & Orchestration

Agent Coordination

Python

#document-analysis#pdf-extraction#llm-workflows

huggingface/pytorch-image-models

A collection of PyTorch image encoders/backbones with training, evaluation, and inference scripts.

36.4K

Active

Python

LLM Frameworks

Full-Stack Frameworks

Next.js

#PyTorch#Image Models#Deep Learning

explosion/spaCy

Industrial-strength NLP library for Python with pretrained models and fast processing

33.3K

Stable

Python

LLM Frameworks

CLI Tools

spaCy

#nlp#machine-learning#python

openai/CLIP

CLIP is a neural network for zero-shot image-text matching and understanding

32.7K

Archived

Jupyter Notebook

Computer Vision

PyTorch

#image-text-matching#zero-shot-learning#computer-vision

Lightning-AI/pytorch-lightning

PyTorch Lightning simplifies deep learning training and deployment at scale.

30.9K

Active

Python

ML Ops

Inference

PyTorch

#pytorch#deep-learning#ml-ops

deezer/spleeter

Source separation library for audio processing with pretrained models

28.1K

Experimental

Python

AI Voice & Speech

CLI Tools

Python

#audio-processing#source-separation#pretrained-models

karpathy/minGPT

Minimal PyTorch GPT re-implementation for training and inference

23.8K

Archived

Python

LLM Frameworks

Example Projects

PyTorch

#gpt#pytorch#llm

QwenLM/Qwen

Qwen is a large language model series by Alibaba Cloud with multiple variants and capabilities.

20.6K

Active

Python

LLM Frameworks

Inference

Hugging Face

#large-language-model#alibaba-cloud#qwen

deepseek-ai/Janus

Janus-Series: Unified Multimodal Understanding and Generation Models for AI-powered vibe coders.

17.7K

Experimental

Python

LLM Frameworks

Python

#foundation-models#multimodal#unified-model

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K

Active

Python

AI Voice & Speech

PyTorch

#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

tensorflow/tfjs-models

A collection of pre-trained machine learning models for TensorFlow.js, a library for running ML in the browser and on Node.js.

14.8K

Active

TypeScript

ML Ops

React

#tensorflow#machine-learning#models

LlamaFamily/Llama-Chinese

A Chinese community for learning and building with the Llama large language model, with open-source and commercial-ready resources.

14.7K

Experimental

Python

LLM Frameworks

Python

#llama#llm#pretraining

mlfoundations/open_clip

Open source implementation of CLIP, a contrastive learning model for multi-modal tasks like zero-shot classification.

13.5K

Stable

Python

Computer Vision

PyTorch

#computer-vision#contrastive-learning#pretrained-model

Lightning-AI/litgpt

A collection of high-performance large language models (LLMs) with recipes to pretrain, finetune, and deploy at scale.

13.2K

Active

Python

LLM Frameworks

Python

#ai#artificial-intelligence#large-language-models

PaddlePaddle/PaddleNLP

Easy-to-use and powerful LLM and SLM library with awesome model zoo for natural language processing.

12.9K

Stable

Python

LLM Frameworks

React

#llm#nlp#transformers

qubvel-org/segmentation_models.pytorch

Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.

11.4K

Stable

Python

Computer Vision

PyTorch

#image-segmentation#semantic-segmentation#pretrained-weights

salesforce/LAVIS

LAVIS is a comprehensive library for multimodal deep learning, including image captioning, visual question answering, and more.

11.2K

Archived

Jupyter Notebook

Vision-Language Transformer

PyTorch

#deep-learning#multimodal-learning#vision-language

bigscience-workshop/petals

A distributed system for running large language models (LLMs) on personal devices, enabling faster fine-tuning and inference.

10.0K

Archived

Python

LLM Frameworks

PyTorch

#llm#distributed-computing#fine-tuning

2 3 4 5 6

Stay in the loop

Get weekly updates on trending AI coding tools and projects.