Skip to content

[QA] Why is InternLM2-Chat-SFT based on InternLM2-Base instead of InternLM2? #606

Answered by ZwwWayne
underspirit asked this question in Q&A
Discussion options

You must be logged in to vote

This is simply due to the time issue. InternLM2 and InternLM2-Chat are trained parallelly due to limited time for release. Furthermore, InternLM2 and InternLM2-Chat are optimized for different capability dimensions as you can see from the evaluation results.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by ZwwWayne
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
question Further information is requested
2 participants
Converted from issue

This discussion was converted from issue #603 on January 17, 2024 11:43.