Releases · shibing624/MedicalGPT

11 Jun 03:25

shibing624

2.1.0

99bdd2f

2.1.0 Latest

v2.1 release:

Added support for fine-tuning the Qwen2 model series.

What's Changed

Add a summary of Chinese datasets in the formats this project supports by @ZhuangXialie in #370
Change Llama tokenizer from LlamaTokenizer to AutoTokenizer by @princepride in #380

New Contributors

@princepride made their first contribution in #380

Full Changelog: 2.0.0...2.1.0

Contributors

princepride and ZhuangXialie

27 Apr 05:36

shibing624

2.0.0

2336bfe

2.0.0

v2.0 release:

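Added support for fine-tuning the Meta Llama 3 model series.
Released shibing624/DPO-En-Zh-20k-Preference, a preference dataset suitable for ORPO/DPO/RM models.
Fine-tuned the llama-3-8b-instruct-262k model with ORPO, producing the model weights at https://huggingface.co/shibing624/llama-3-8b-instruct-262k-chinese and the corresponding LoRA weights at https://huggingface.co/shibing624/llama-3-8b-instruct-262k-chinese-lora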

What's Changed

Updates for readme and demo ipynb and a small update for deprecated function by @ker2xu in #360
Typo by @ker2xu in #362
add max_length and max_prompt_length by @ZhuangXialie in #367

New Contributors

@ker2xu made their first contribution in #360
@ZhuangXialie made their first contribution in #367

Full Changelog: 1.9.0...2.0.0

Contributors

ker2xu and ZhuangXialie

17 Apr 09:01

shibing624

1.9.0

9f61e99

1.9.0

v1.9 release:

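Added support for ORPO; see run_orpo.sh for detailed usage. ORPO is an optimization method that needs no reference model: the LLM learns instruction following and human-preference alignment at the same time, so ORPO can be trained directly from a base model. Training is simpler than the SFT+DPO pipeline, but it needs comparatively more preference data.
Added support for fine-tuning qwen1.5 and cohere models, with the corresponding templates.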
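
To make the "no reference model" point concrete, here is a minimal sketch of the ORPO objective in plain Python. It is not the code behind run_orpo.sh. The function names, the weight `lam=0.1`, and the use of length-normalized log-probabilities as inputs are assumptions made for illustration.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp).

    avg_logp is the mean per-token log-probability of a response,
    so it must be strictly negative.
    """
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_loss(chosen_avg_logp: float, rejected_avg_logp: float, lam: float = 0.1) -> float:
    """SFT loss on the chosen response plus a weighted odds-ratio penalty.

    Only the policy's own log-probabilities are needed: no reference model.
    lam is a hypothetical weight chosen for this sketch.
    """
    # Supervised term: negative log-likelihood of the preferred response.
    sft_term = -chosen_avg_logp
    # Log odds ratio between the preferred and the rejected response.
    log_ratio = log_odds(chosen_avg_logp) - log_odds(rejected_avg_logp)
    # -log(sigmoid(x)) written as log(1 + exp(-x)).
    odds_ratio_term = math.log1p(math.exp(-log_ratio))
    return sft_term + lam * odds_ratio_term


# The loss is lower when the preferred response is the more likely one.
good = orpo_loss(chosen_avg_logp=-0.5, rejected_avg_logp=-2.0)
bad = orpo_loss(chosen_avg_logp=-2.0, rejected_avg_logp=-0.5)
print(good < bad)
```

Because both terms come from a single forward pass of the model being trained, one training stage covers what SFT followed by DPO does in two.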

What's Changed

Update transformers in requirements.txt by @dividez in #321

Full Changelog: 1.8.0...1.9.0

Contributors

dividez

26 Jan 10:20

shibing624

1.8.0

14098d4

v1.8.0

v1.8 release:

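Added support for fine-tuning the Mixtral 8x7B mixture-of-experts (MoE) model. When fine-tuning with LoRA in SFT, you can enable 4-bit quantization and QLoRA with --load_in_4bit True --qlora True to save GPU memory. Setting --target_modules q_proj,k_proj,v_proj,o_proj is recommended: this avoids quantizing the MLP layers of the MoE expert networks, which are sparse and lose performance when quantized.
Added support for fine-tuning deepseek, deepseekcoder, and orion models, with the corresponding templates.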
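
The sketch below shows which modules the recommended --target_modules list would select. The module names are illustrative, written in the style of the Hugging Face Mixtral implementation, and matching on the final name component is a simplifying assumption about how adapter libraries resolve targets. `select_targets` is a hypothetical helper, not part of this project.

```python
# Recommended value of --target_modules from the release note.
TARGET_MODULES = ["q_proj", "k_proj", "v_proj", "o_proj"]

# Illustrative module names for one decoder layer (assumed naming).
MODULE_NAMES = [
    "model.layers.0.self_attn.q_proj",
    "model.layers.0.self_attn.k_proj",
    "model.layers.0.self_attn.v_proj",
    "model.layers.0.self_attn.o_proj",
    "model.layers.0.block_sparse_moe.gate",
    "model.layers.0.block_sparse_moe.experts.0.w1",
    "model.layers.0.block_sparse_moe.experts.0.w2",
    "model.layers.0.block_sparse_moe.experts.0.w3",
]


def select_targets(names, targets):
    """Keep the names whose last dotted component is one of the targets."""
    return [name for name in names if name.rsplit(".", 1)[-1] in targets]


selected = select_targets(MODULE_NAMES, TARGET_MODULES)
skipped = [name for name in MODULE_NAMES if name not in selected]

print("selected:", selected)
print("skipped:", skipped)
```

Under these assumptions only the four attention projections are selected; the router gate and the expert MLP weights (w1, w2, w3) are left out.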

Full Changelog: 1.7.0...1.8.0

14 Jan 04:09

shibing624

1.7.0

f0c0956

v1.7.0

v1.7 release:

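Added ChatPDF, a retrieval-augmented generation (RAG) feature for file-based question answering, implemented in chatpdf.py. It combines a fine-tuned LLM with knowledge-base files to improve answer accuracy on domain-specific questions. Run python chatpdf.py to use RAG question answering.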

Full Changelog: 1.6.0...1.7.0

23 Oct 08:01

shibing624

1.6.0

ed30fe0

v1.6.0

v1.6 release:

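Added RoPE interpolation to extend the context length of GPT models. With position interpolation, further training on additional data gives the model long-text capability. Train with the --rope_scaling linear argument.
Added FlashAttention-2 support for LLaMA models. If you are using an RTX4090, A100, or H100 GPU, pass --flash_attn to enable FlashAttention-2.
Added $S^2$-Attn, proposed by LongLoRA, to give the model long-text capability. Pass --shift_attn during SFT to enable it.
Added support for NEFTune, an SFT training method that adds noise to the embeddings (see the NEFTune paper). Pass --neft_alpha to enable it, for example --neft_alpha 5.
PT (incremental pretraining) now supports QLoRA. If you are using an RTX4090, A100, or H100 GPU, nf4 is supported; pass --qlora True --load_in_kbits 4 to enable QLoRA training.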
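
Two of the options above reduce to short formulas, sketched here in plain Python rather than with tensors. This is not the project's training code: the function names, the RoPE base of 10000, and the list-of-lists embedding layout are assumptions for illustration. Linear RoPE scaling divides each position by a scaling factor before computing rotation angles, and NEFTune adds uniform noise scaled by alpha / sqrt(seq_len * dim) to the embeddings.

```python
import math
import random


def rope_angles(position, dim, base=10000.0, scaling_factor=1.0):
    """Rotation angles for one position under linear position interpolation.

    Dividing the position by scaling_factor squeezes a longer sequence
    into the position range the model saw during pretraining.
    """
    scaled_position = position / scaling_factor
    return [scaled_position * base ** (-2 * i / dim) for i in range(dim // 2)]


def neftune_noise(embeddings, neft_alpha, rng=None):
    """Add uniform noise in [-1, 1], scaled by neft_alpha / sqrt(seq_len * dim)."""
    if rng is None:
        rng = random.Random(0)  # fixed seed so the sketch is reproducible
    seq_len = len(embeddings)
    dim = len(embeddings[0])
    scale = neft_alpha / math.sqrt(seq_len * dim)
    return [[x + rng.uniform(-1.0, 1.0) * scale for x in row] for row in embeddings]


# With a factor of 2, position 4096 gets the same angles as position 2048 unscaled.
print(rope_angles(4096, dim=8, scaling_factor=2.0) == rope_angles(2048, dim=8))

# Noise on a 4-token, 8-dimensional zero embedding with neft_alpha = 5.
noisy = neftune_noise([[0.0] * 8 for _ in range(4)], neft_alpha=5)
print(max(abs(x) for row in noisy for x in row) <= 5 / math.sqrt(4 * 8))
```

In actual training the noise would be applied to embedding tensors and only during training steps, not at inference.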