Explore Projects

Discover 23 open source projects

Active filters (1):

Search: rlhf×

Clear all

Showing 1-20 of 23 projects

hiyouga/LlamaFactory

Fine-tuning framework for 100+ LLMs & VLMs

67.9K

Active

Python

Fine-tuning

#llm#fine-tuning#ai

LAION-AI/Open-Assistant

Open Assistant is a chat-based AI project aimed at providing access to large language models for improving language innovation.

37.4K

Archived

Python

Agents & Orchestration

Next.js

#ai#assistant#chatgpt

ymcui/Chinese-LLaMA-Alpaca-2

An open-source Chinese version of the LLaMA and Alpaca language models with 64K long context support for advanced NLP applications.

7.2K

Experimental

Python

LLM Frameworks

Fine-tuning

Python

#llm#alpaca#llama

InternLM/InternLM

Official release of the InternLM series of large language models focused on building AI tools and chatbots.

7.2K

Stable

Python

LLM Frameworks

Fine-tuning

Python

#chatbot#llm#fine-tuning

huggingface/alignment-handbook

A handbook of robust recipes to align language models with human and AI preferences.

5.5K

Stable

Python

LLM Frameworks

Agents & Orchestration

PyTorch

#llm#rlhf#transformers

argilla-io/argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets

4.9K

Active

Python

LLM Frameworks

Agents & Orchestration

Python

#active-learning#annotation-tool#human-in-the-loop

shibing624/MedicalGPT

This repository allows developers to train their own medical language models using the ChatGPT training pipeline.

4.8K

Active

Python

LLM Frameworks

Fine-tuning

Python

#chatgpt#gpt#llama

transformerlab/transformerlab-app

Open-source platform for frontier AI/ML workflows, including diffusion models, LLMs, and more.

4.8K

Active

Python

LLM Frameworks

Diffusion

Python

#diffusion-models#llms#transformers

CarperAI/trlx

A distributed training framework for language models using Reinforcement Learning via Human Feedback (RLHF)

4.7K

Archived

Python

LLM Frameworks

Fine-tuning

PyTorch

#machine-learning#reinforcement-learning#language-models

Kiln-AI/Kiln

Build, Evaluate, and Optimize AI Systems

4.7K

Active

Python

AI Editors/Agents/Copilot

#AI#chain-of-thought#collaboration

opendilab/awesome-RLHF

A curated list of resources for reinforcement learning with human feedback (RLHF), a key technique for developing AI systems.

4.3K

Stable

LLM Frameworks

Tutorials & Courses

#reinforcement-learning#human-feedback#large-language-models

hiyouga/ChatGLM-Efficient-Tuning

Efficient fine-tuning of the ChatGLM-6B language model using PEFT, enabling vibe coders to customize LLMs for their needs.

3.7K

Archived

Python

LLM Frameworks

Fine-tuning

PyTorch

#chatglm#language-model#fine-tuning

Docta-ai/docta

A Python library that helps diagnose and curate datasets for data-centric AI applications.

3.5K

Archived

Python

LLM Frameworks

Caching

#data-curation#data-diagnosis#language-model

alibaba/ROLL

A library for efficient and user-friendly reinforcement learning with large language models

2.9K

Active

Python

LLM Frameworks

API Frameworks

React

#reinforcement-learning#large-language-models#efficient-scaling

qibin0506/Cortex

Complete LLM training implementation: pretraining, SFT, DPO, and RLHF from scratch in Python

2.4K

Active

Python

Fine-tuning

LLM Frameworks

PyTorch

#llm-pretraining#rlhf#dpo

HarderThenHarder/transformers_tasks

A library of NLP algorithms and utilities for text classification, generation, extraction, and more using the Transformers library.

2.4K

Archived

Jupyter Notebook

LLM Frameworks

ORMs & Query Builders

PyTorch

#nlp#text-classification#text-generation

tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models with human-validated, high-quality, cheap, and fast evaluation.

2.0K

Stable

Jupyter Notebook

LLM Frameworks

Evaluation

Jupyter Notebook

#deep-learning#foundation-models#large-language-models

anthropics/hh-rlhf

Repository containing human preference data for training a helpful and harmless AI assistant.

1.8K

Experimental

LLM Frameworks

Agents & Orchestration

#language-model#reinforcement-learning#human-feedback

natolambert/rlhf-book

Textbook on reinforcement learning from human feedback, focused on AI alignment research.

1.7K

Active

TeX

LLM Frameworks

Books & Guides

#ai#alignment#rlhf

zai-org/ImageReward

A repository for ImageReward, a learning and evaluating human preferences for text-to-image generation

1.6K

Stable

Python

React

#authentication#diffusion-models#generative-model

Stay in the loop

Get weekly updates on trending AI coding tools and projects.