#

human-feedback

Here are 15 public repositories matching this topic...

huggingface / data-is-better-together

Let's build better datasets, together!

community machine-learning datasets human-feedback

Updated Jun 6, 2024
Jupyter Notebook

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

reinforcement-learning deep-learning deep-reinforcement-learning large-language-models human-feedback rlhf

Updated May 26, 2024

ZiyiZhang27 / tdpo

[ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases"

reinforcement-learning alignment text-to-image diffusion-models stable-diffusion human-feedback rlhf

Updated May 20, 2024
Python

trubrics-sdk

trubrics / trubrics-sdk

Product analytics for AI Assistants

machine-learning mlops streamlit ml-monitoring llm human-feedback llmops model-feedback

Updated May 13, 2024
Python

gao-g / prelude

Aligning LLM Agents by Learning Latent Preference from User Edits

transformers alignment user-feedback edits interpretability preference-learning gpt4 llm llms human-feedback

Updated May 1, 2024
Python

HannahKirk / prism-alignment

The Prism Alignment Project

dataset alignment multicultural sociotechnical human-feedback-data human-feedback

Updated Apr 25, 2024
Jupyter Notebook

yk7333 / d3po

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

reinforcement-learning diffusion-models human-feedback

Updated Apr 6, 2024
Python

conceptofmind / LaMDA-rlhf-pytorch

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

machine-learning reinforcement-learning deep-learning transformers artificial-intelligence attention-mechanism human-feedback

Updated Feb 24, 2024
Python

lucidrains / PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

reinforcement-learning deep-learning transformers artificial-intelligence attention-mechanisms human-feedback

Updated Jan 14, 2024
Python

PKU-Alignment / beavertails

BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).

safety llama gpt datasets language-model beaver ai-safety human-feedback-data llm llms human-feedback rlhf large-language-model safe-rlhf

Updated Oct 27, 2023
Makefile

AlaaLab / pathologist-in-the-loop

[ NeurIPS 2023 ] Official Codebase for "Aligning Synthetic Medical Images with Clinical Knowledge using Human Feedback"

synthetic-data human-feedback rlhf pathology-images

Updated Oct 19, 2023
Python

wxjiao / ParroT

The ParroT framework to enhance and regulate the Translation Abilities during Chat based on open-sourced LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human written translation and evaluation data.

machine-translation llama lora contrastive gpt-4 chatgpt human-feedback instruction-tuning bloomz error-guided

Updated Oct 12, 2023
Python

victor-iyi / rlhf-trl

Reinforcement Learning from Human Feedback with 🤗 TRL

reinforcment-learning human-feedback rlhf

Updated Jun 14, 2023
Python

01Kevin01 / awesome-RLHF-Turkish

A curated list of reinforcement learning with human feedback resources[awesome-RLHF-Turkish] (continually updated)

ai artificial-intelligence turkish-language general-language-model human-feedback rlhf value-alignment awesome-rlhf rlhf-turkish

Updated Apr 27, 2023

xrsrke / instructGOOSE

Implementation of Reinforcement Learning from Human Feedback (RLHF)

reinforcement-learning chatgpt human-feedback rlhf instructgpt

Updated Apr 7, 2023
Jupyter Notebook

Improve this page

Add a description, image, and links to the human-feedback topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the human-feedback topic, visit your repo's landing page and select "manage topics."