Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
gitnlp authored May 13, 2024
1 parent 06980b1 commit df76db8
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions retnet/README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,6 @@
# Retentive Network: The Successor to Transformer for Large Language Models

- May 2024: Gated RetNet (i.e., RetNet-3) as part of YOCO / [You Only Cache Once: Decoder-Decoder Architectures for Language Models](https://arxiv.org/abs/2405.05254)
- Code release: [https://github.com/microsoft/torchscale](https://github.com/microsoft/torchscale)
- July 2023: release preprint [Retentive Network: A Successor to Transformer for Large Language Models](https://arxiv.org/abs/2307.08621)

Expand Down

0 comments on commit df76db8

Please sign in to comment.