TheTravellingEngineer/bloom-560m-RLHF-v2

Lovesickness · September 5, 2023, 8:07am

The base model is bigscience/bloom-560m. It was finetuned using RLHF and the dataset and the model prompt is similar to the original model. This repo contains the merged fp16 model.

Legal Disclaimer: This model is bound by the usage restrictions of the original BLOOM model. And comes with no warranty or gurantees of any kind.

license:
- bigscience-bloom-rail-1.0
datasets:
- Anthropic/hh-rlhf
language:
- en
reference: GitHub - hiyouga/LLaMA-Efficient-Tuning: Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)