Explore Projects

Discover 23 open source projects

Active filters (1):
Search: rlhfร—
Clear all

Showing 1-20 of 23 projects

hiyouga/LlamaFactory

Fine-tuning framework for 100+ LLMs & VLMs

67.9K
Active
Python
Fine-tuning
#llm#fine-tuning#ai

LAION-AI/Open-Assistant

Open Assistant is a chat-based AI project aimed at providing access to large language models for improving language innovation.

37.4K
Archived
Python
Agents & Orchestration
Next.js
#ai#assistant#chatgpt

ymcui/Chinese-LLaMA-Alpaca-2

An open-source Chinese version of the LLaMA and Alpaca language models with 64K long context support for advanced NLP applications.

7.2K
Experimental
Python
LLM Frameworks
Fine-tuning
Python
#llm#alpaca#llama

InternLM/InternLM

Official release of the InternLM series of large language models focused on building AI tools and chatbots.

7.2K
Stable
Python
LLM Frameworks
Fine-tuning
Python
#chatbot#llm#fine-tuning

huggingface/alignment-handbook

A handbook of robust recipes to align language models with human and AI preferences.

5.5K
Stable
Python
LLM Frameworks
Agents & Orchestration
PyTorch
#llm#rlhf#transformers

argilla-io/argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

4.9K
Active
Python
LLM Frameworks
Agents & Orchestration
Python
#active-learning#annotation-tool#human-in-the-loop

shibing624/MedicalGPT

This repository allows developers to train their own medical language models using the ChatGPT training pipeline.

4.8K
Active
Python
LLM Frameworks
Fine-tuning
Python
#chatgpt#gpt#llama

transformerlab/transformerlab-app

Open-source platform for frontier AI/ML workflows, including diffusion models, LLMs, and more.

4.8K
Active
Python
LLM Frameworks
Diffusion
Python
#diffusion-models#llms#transformers

CarperAI/trlx

A distributed training framework for language models using Reinforcement Learning via Human Feedback (RLHF)

4.7K
Archived
Python
LLM Frameworks
Fine-tuning
PyTorch
#machine-learning#reinforcement-learning#language-models

Kiln-AI/Kiln

Build, Evaluate, and Optimize AI Systems

4.7K
Active
Python
AI Editors/Agents/Copilot
#AI#chain-of-thought#collaboration

opendilab/awesome-RLHF

A curated list of resources for reinforcement learning with human feedback (RLHF), a key technique for developing AI systems.

4.3K
Stable
LLM Frameworks
Tutorials & Courses
#reinforcement-learning#human-feedback#large-language-models

hiyouga/ChatGLM-Efficient-Tuning

Efficient fine-tuning of the ChatGLM-6B language model using PEFT, enabling vibe coders to customize LLMs for their needs.

3.7K
Archived
Python
LLM Frameworks
Fine-tuning
PyTorch
#chatglm#language-model#fine-tuning

Docta-ai/docta

A Python library that helps diagnose and curate datasets for data-centric AI applications.

3.5K
Archived
Python
LLM Frameworks
Caching
#data-curation#data-diagnosis#language-model

alibaba/ROLL

A library for efficient and user-friendly reinforcement learning with large language models

2.9K
Active
Python
LLM Frameworks
API Frameworks
React
#reinforcement-learning#large-language-models#efficient-scaling

qibin0506/Cortex

Complete LLM training implementation: pretraining, SFT, DPO, and RLHF from scratch in Python

2.4K
Active
Python
Fine-tuning
LLM Frameworks
PyTorch
#llm-pretraining#rlhf#dpo

HarderThenHarder/transformers_tasks

A library of NLP algorithms and utilities for text classification, generation, extraction, and more using the Transformers library.

2.4K
Archived
Jupyter Notebook
LLM Frameworks
ORMs & Query Builders
PyTorch
#nlp#text-classification#text-generation

tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models with human-validated, high-quality, cheap, and fast evaluation.

2.0K
Stable
Jupyter Notebook
LLM Frameworks
Evaluation
Jupyter Notebook
#deep-learning#foundation-models#large-language-models

anthropics/hh-rlhf

Repository containing human preference data for training a helpful and harmless AI assistant.

1.8K
Experimental
LLM Frameworks
Agents & Orchestration
#language-model#reinforcement-learning#human-feedback

natolambert/rlhf-book

Textbook on reinforcement learning from human feedback, focused on AI alignment research.

1.7K
Active
TeX
LLM Frameworks
Books & Guides
#ai#alignment#rlhf

zai-org/ImageReward

A repository for ImageReward, a learning and evaluating human preferences for text-to-image generation

1.6K
Stable
Python
React
#authentication#diffusion-models#generative-model
2

Stay in the loop

Get weekly updates on trending AI coding tools and projects.