golsun / DialogRPT Star 336 Code Issues Pull requests EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data" dialog transformers pytorch dataset pretrained-models conversational-ai gpt-2 dialog-datasets dialogpt human-feedback-data Updated Aug 31, 2023 Python
PKU-Alignment / beavertails Star 81 Code Issues Pull requests Discussions BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs). safety llama gpt datasets language-model beaver ai-safety human-feedback-data llm llms human-feedback rlhf large-language-model safe-rlhf Updated Oct 27, 2023 Makefile
HannahKirk / prism-alignment Star 15 Code Issues Pull requests The Prism Alignment Project dataset alignment multicultural sociotechnical human-feedback-data human-feedback Updated Apr 25, 2024 Jupyter Notebook
nrimsky / Feedbackr Star 1 Code Issues Pull requests Easily collect yes/no feedback on language model outputs from humans python django human-feedback-data Updated Mar 2, 2023 Python