shibing624 / MedicalGPT Public

Notifications You must be signed in to change notification settings
Fork 422
Star 2.7k

Code
Issues 35
Pull requests
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

Issues: shibing624/MedicalGPT

sft微调chatglm2，合并时报错

#68 by shawnlihst was closed Jul 5, 2023

Closed 3

ChatGLM全参数二次预训练过程中，loss马上变为0，val_loss = nan

#125 by gloryyoung was closed Aug 9, 2023

Closed 13

请教增量预训练后的两个问题：1）token长尾 2）group texts

#83 by Zagreus-lzy was closed Oct 22, 2023

Closed 10

Labels 9 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

35 Open 319 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

关于llama3的权重转换 question

Further information is requested

#378 opened May 26, 2024 by tszslovewanpu

医学大模型全流程体验 question

Further information is requested

#377 opened May 20, 2024 by YoshuaBengio

运行pretraining.py时报错：RuntimeError: CUDA error: device-side assert triggered bug

Something isn't working

#376 opened May 16, 2024 by Wenting1227

ppo训练时出现问题：UserWarning: KL divergence is starting to become negative: -233.50 question

Further information is requested

#374 opened May 15, 2024 by user2311717757

vocab扩展后的模型合并问题 question

Further information is requested

#373 opened May 15, 2024 by sungatetop

AMD 执行 run_pt.sh失败 bug

Something isn't working

#371 opened May 8, 2024 by liuyang6055

对chat模型进行二次预训练后，自问自答 question

Further information is requested

#366 opened Apr 24, 2024 by wsl1014

orpo脚本NoneType问题 bug

Something isn't working

#363 opened Apr 22, 2024 by songyao199681

reward_modeling咨询 question

Further information is requested

#361 opened Apr 20, 2024 by tuqingwen

Regarding RLHF and DPO training data question

Further information is requested

#358 opened Apr 3, 2024 by Aniketto16

使用deepspeed 全参数sft后，inference 回答的都为空，有解决办法吗 question

Further information is requested

#357 opened Apr 2, 2024 by Yian320

ValueError: operands could not be broadcast together with remapped shapes [original->remapped]: (3,2) and requested shape (1,2) bug

Something isn't working

#356 opened Mar 27, 2024 by Riapy

lora模型合并 question

Further information is requested

#355 opened Mar 26, 2024 by sevenandseven

扩充词表后能否直接进行SFT呢？ question

Further information is requested

#352 opened Mar 24, 2024 by HaotianLiu123

预训练后模型出现自问自答、输出未知序列、重复口吃现象 question

Further information is requested

#351 opened Mar 21, 2024 by Peter-of-Astora

增量预训练效果评估 question

Further information is requested

#349 opened Mar 14, 2024 by qibao77

llama进行rm训练的时候，出现问题ValueError: weight is on the meta device, we need a value to put in on cpu. bug

Something isn't working

#347 opened Mar 14, 2024 by cove1011

使用qwen进行pretrain的时候出现了问题：Cannot copy out of meta tensor; no data! bug

Something isn't working

#346 opened Mar 12, 2024 by cove1011

单机多卡sft deepspeed zero3 训练一直卡在训练阶段 question

Further information is requested

#330 opened Feb 12, 2024 by lainxx

请问，pt阶段，基础模型比较大(Yi-67B)，多机多卡用那种训练比较好呢？ question

Further information is requested

#315 opened Jan 23, 2024 by listwebit

请教DPO多轮对话的问题 question

Further information is requested

#293 opened Dec 26, 2023 by chloefresh

在单机多卡监督微调时使用的策略是DP还是DDP？ question

Further information is requested

#291 opened Dec 18, 2023 by CNUIGB

请问大佬，Reward model验证分类评分，一个问题回传两个tensor? question

Further information is requested

#284 opened Dec 11, 2023 by waycup7

关于GLM3微调细节 question

Further information is requested

#281 opened Dec 5, 2023 by DeMoth-1

大佬，使用自己数据进行增量预训练时，loss不降反增。 question

Further information is requested

#280 opened Dec 5, 2023 by SevenMpp

Previous 1 2 Next

Previous Next

ProTip! Find all open issues with in progress development work with linked:pr.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly