[LLM] support QWen1.5-Moe #8338

DrownFish19 · 2024-04-28T08:45:04Z

PR types

New features

PR changes

Models

Description

Add the QWen1.5-MoE model.
Support models whose names share a prefix, such as QWen and QWen2Moe (both start with QWen). Names are matched longest first, so the more specific name wins over the shorter one.
Support SFT and LoRA.
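
The longest-first matching described above can be illustrated with a minimal sketch. This is not the code from this PR: the function name `match_model_prefix` and the `MODEL_PREFIXES` list are hypothetical, and the sketch only assumes that a class name is resolved by checking which registered name it starts with.

```python
from typing import Iterable, Optional

# Hypothetical registry of model name prefixes (illustrative only,
# not PaddleNLP's actual mapping).
MODEL_PREFIXES = ["QWen", "QWen2Moe"]


def match_model_prefix(class_name: str, prefixes: Iterable[str]) -> Optional[str]:
    """Return the longest registered prefix that class_name starts with."""
    # Try longer names first so "QWen2Moe" is checked before "QWen".
    for prefix in sorted(prefixes, key=len, reverse=True):
        if class_name.startswith(prefix):
            return prefix
    return None


print(match_model_prefix("QWen2MoeForCausalLM", MODEL_PREFIXES))  # QWen2Moe
print(match_model_prefix("QWenLMHeadModel", MODEL_PREFIXES))      # QWen
print(match_model_prefix("LlamaModel", MODEL_PREFIXES))           # None
```

Without the length-based ordering, a class such as `QWen2MoeForCausalLM` could match the shorter `QWen` entry first and resolve to the wrong model family.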

paddle-bot · 2024-04-28T08:45:09Z

Thanks for your contribution!

codecov · 2024-05-06T03:46:48Z

Codecov Report

Attention: Patch coverage is 66.55556%, with 301 lines in your changes missing coverage. Please review.

Project coverage is 54.39%. Comparing base (3aa92ce) to head (58af3ec).
Report is 51 commits behind head on develop.

Files	Patch %	Lines
paddlenlp/transformers/qwen2moe/modeling.py	72.33%	197 Missing ⚠️
paddlenlp/transformers/qwen2moe/tokenizer.py	22.96%	104 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8338      +/-   ##
===========================================
- Coverage    55.36%   54.39%   -0.98%     
===========================================
  Files          614      621       +7     
  Lines        96016    97254    +1238     
===========================================
- Hits         53164    52904     -260     
- Misses       42852    44350    +1498


DesmonDay · 2024-05-24T07:34:22Z

paddlenlp/transformers/qwen2moe/__init__.py

+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+from .configuration import QWen2MoeConfig


Would QWen2MoEConfig be a better name? Consider changing every Moe to MoE.

DesmonDay · 2024-05-24T09:00:55Z

paddlenlp/transformers/qwen2moe/modeling.py

@@ -0,0 +1,1580 @@
+# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.


2023 -> 2024; please update all occurrences.

DrownFish19 added 23 commits April 17, 2024 10:58

add Qwen2Moe

36ab9a7

update default config

3913e11

Merge remote-tracking branch 'paddlenlp/develop' into dev_add_qwen1.5…

0aa1aca

…-moe

update QWen2Moe modeling

a29e90d

update modeling

d514dff

update ckpt name

1e98323

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

f81bb43

support same prefix model name for auto modeling

37dd2d5

update qwen2moe testing

d12938a

update qwen2moe modeling and config

8cc49fc

update qwen2moe import

9c8222e

fix mlp hidden_size

4d6ff87

update qkv bias convert

f350a2f

update modeling init_weight

c53690d

update _get_name_mappings

9d12995

update _get_name_mappings and _init_weight

dba0f74

add tokenizer

e487606

update modeling

cd9c753

update modeling

10407c4

update tokenizer

beb0f4c

update modeling and tokenizer

beefee9

fix index_add_ error

82ba345

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

d522ee4

DrownFish19 added 4 commits April 28, 2024 11:08

fix

4a1b2e3

Merge branch 'dev_add_qwen1.5-moe' of github.com:DrownFish19/PaddleNL…

526a9db

…P into dev_add_qwen1.5-moe

Merge branch 'PaddlePaddle:develop' into dev_add_qwen1.5-moe

0c9d5ec

update comments

2bb3aba

update lora weights

f203983

add todo

58af3ec

ZHUI closed this May 24, 2024

ZHUI reopened this May 24, 2024

DesmonDay reviewed May 24, 2024
