Explore Projects

Discover 6 open source projects

Active filters (1):
Search: megatronร—
Clear all

Showing 1-6 of 6 projects

NVIDIA/Megatron-LM

An ongoing research project focused on training large language models at scale using transformers.

15.5K
Active
Python
LLM Frameworks
PyTorch
#large-language-models#transformers#model-parallelism

modelscope/ms-swift

A Python library for using and fine-tuning over 900 large language models and multimodal models for various AI tasks.

12.9K
Active
Python
LLM Frameworks
Python
#llm#multimodal#fine-tuning

EleutherAI/gpt-neox

An open-source implementation of large language models with a focus on model parallelism and efficiency.

7.4K
Stable
Python
LLM Frameworks
API Frameworks
PyTorch
#language-model#transformers#deepspeed

Orchestra-Research/AI-research-SKILLs

Comprehensive open-source library of AI research and engineering skills for any AI model.

4.4K
Active
TeX
LLM Frameworks
Agents & Orchestration
#ai#machine-learning#claude

alibaba/Pai-Megatron-Patch

Official repo for Pai-Megatron-Patch, a large language model and visual language model training framework developed by Alibaba Cloud.

1.5K
Stable
Python
LLM Frameworks
ML Ops
Python
#large-language-model#visual-language-model#distributed-training

bigscience-workshop/Megatron-DeepSpeed

An ongoing research project for training transformer language models at scale, including BERT and GPT-2.

1.4K
Archived
Python
LLM Frameworks
None
#language-model#transformer#large-language-model

Stay in the loop

Get weekly updates on trending AI coding tools and projects.