Skip to content

Releases: NVIDIA/FasterTransformer

v5.3 release

23 Jan 13:01
d21dc02
Compare
Choose a tag to compare
release/v5.3_tag

Update CMakeLists.txt

release/v5.2.1

01 Jan 07:02
f0b5b86
Compare
Choose a tag to compare

Fix some bugs of v5.2

release/v5.2_bug_fix

06 Dec 06:32
Compare
Choose a tag to compare
release/v5.2_bug_fix_tag

fix: fix fmha kernel assert bug

v5.2 release

03 Dec 00:58
Compare
Choose a tag to compare
release/v5.2_tag

fix: add cutlass submodule

v5.1.1 bug fix

17 Oct 06:44
aa3aaf6
Compare
Choose a tag to compare
  1. fix stop criterion.
  2. fix bug of attention mask chosen when enabling shared context opt
  3. fix swin qk scale
  4. fix bug of repetition penalty of t5 under beam search
  5. fix bug of gpt_guide.md
  6. fix bug of decoder_masked_multihead_attention_template

v5.1 T5 triton bug fix

23 Aug 01:21
e709732
Compare
Choose a tag to compare

Fix the bug of model parallelism setting of T5 on v5.1

v5.1 release

16 Aug 03:02
bc21406
Compare
Choose a tag to compare
release/v5.1_tag

feat: update v5.1 (#281)

v5.0 release

15 Apr 16:15
Compare
Choose a tag to compare
release/v5.0_tag

feat: update v5.0

release/v1.0_tag: Merge pull request #123 from pmixer/del_v1_duplicated_files

03 Apr 00:27
a1248a3
Compare
Choose a tag to compare

release/v4.0_tag: Merge pull request #54 from NVIDIA/main

05 Apr 07:39
791d956
Compare
Choose a tag to compare
Update the v4.0 with new modification