Explore Projects

Discover 116 open source projects

Active filters (1):
Search: attention×
Clear all

Showing 1-20 of 116 projects

fighting41love/funNLP

Comprehensive Chinese NLP resource collection for developers

79.2K
Archived
Python
LLM Frameworks
RAG & Vector
Python
#nlp#chinese-nlp#ai-resources

labmlai/annotated_deep_learning_paper_implementations

Deep learning paper implementations with side-by-side notes and explanations

65.9K
Active
Python
Fine-tuning
Computer Vision
PyTorch
#deep-learning#pytorch#transformers

sgl-project/sglang

High-performance serving framework for large language and multimodal models

24.1K
Active
Python
Inference
LLM Frameworks
Python
#llm#inference#serving

Dao-AILab/flash-attention

Optimized attention mechanism for deep learning

22.5K
Active
Python
Inference
Computer Vision
PyTorch
#flash-attention#deep-learning#pytorch

QwenLM/Qwen

Qwen is a large language model series by Alibaba Cloud with multiple variants and capabilities.

20.6K
Active
Python
LLM Frameworks
Inference
Hugging Face
#large-language-model#alibaba-cloud#qwen

datawhalechina/leedl-tutorial

A tutorial on deep learning by renowned professor Li Hongy, covering a wide range of AI and machine learning topics.

16.4K
Stable
Jupyter Notebook
LLM Frameworks
Jupyter Notebook
#deep-learning#machine-learning#tutorial

ddbourgin/numpy-ml

A comprehensive machine learning library in Python with implementations of various algorithms and models.

16.3K
Archived
Python
ML Ops
#machine-learning#numpy#deep-learning

graykode/nlp-tutorial

A tutorial for natural language processing (NLP) using deep learning frameworks like PyTorch and TensorFlow.

14.9K
Archived
Jupyter Notebook
LLM Frameworks
PyTorch
#natural-language-processing#deep-learning#attention-mechanisms

BlinkDL/RWKV-LM

RWKV is an RNN-based language model with high performance, fast training, and flexible transformer-like architecture.

14.4K
Active
Python
LLM Frameworks
PyTorch
#language-model#rnn#transformer

deepseek-ai/FlashMLA

Efficient multi-head latent attention kernels for AI coding tools and frameworks.

12.5K
Active
C++
LLM Frameworks
C++
#latent-attention#multi-head-attention#ai-coding

xmu-xiaoma666/External-Attention-pytorch

A PyTorch library that implements various attention mechanisms and other neural network components for visual tasks.

12.2K
Archived
Python
Computer Vision
PyTorch
#attention#computer-vision#neural-networks

xlite-dev/LeetCUDA

LeetCUDA is a comprehensive collection of modern CUDA learning resources, including 200+ CUDA kernels, Tensor Cores, HGEMM, and FA-2 MMA.

9.8K
Active
Cuda
ML Ops
PyTorch
#cuda#cuda-toolkit#cuda-demo

jadore801120/attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model, a popular deep learning architecture for natural language processing.

9.6K
Archived
Python
LLM Frameworks
PyTorch
#natural-language-processing#transformer-model#deep-learning

brightmart/text_classification

A comprehensive library of text classification models and techniques built with deep learning.

8.0K
Archived
Python
LLM Frameworks
API Frameworks
TensorFlow
#text-classification#deep-learning#nlp

jessevig/bertviz

BertViz is a Python library for visualizing attention in transformer models like BERT, GPT-2, and RoBERTa.

7.9K
Active
Python
Visualization
Visualization
PyTorch
#nlp#transformer#attention-visualization

mit-han-lab/streaming-llm

Efficient streaming language models with attention sinks for AI-powered coding tools and applications.

7.2K
Archived
Python
LLM Frameworks
AI Code Generation
Python
#streaming#language-models#attention-mechanism

ymcui/Chinese-LLaMA-Alpaca-2

An open-source Chinese version of the LLaMA and Alpaca language models with 64K long context support for advanced NLP applications.

7.2K
Experimental
Python
LLM Frameworks
Fine-tuning
Python
#llm#alpaca#llama

InternLM/InternLM

Official release of the InternLM series of large language models focused on building AI tools and chatbots.

7.2K
Stable
Python
LLM Frameworks
Fine-tuning
Python
#chatbot#llm#fine-tuning

zhouhaoyi/Informer2020

A transformer-based time series forecasting library for vibe coders using AI tools.

6.4K
Experimental
Python
Forecasting
Transformer
PyTorch
#deep-learning#time-series#self-attention

649453932/Chinese-Text-Classification-Pytorch

A collection of popular Chinese text classification models implemented in PyTorch, ready to use out-of-the-box.

5.7K
Archived
Python
LLM Frameworks
API Frameworks
PyTorch
#text-classification#nlp#chinese-nlp

Stay in the loop

Get weekly updates on trending AI coding tools and projects.