Skip to content

Commit

Permalink
minor edits
Browse files Browse the repository at this point in the history
  • Loading branch information
dingo-actual committed May 4, 2024
1 parent ffa34ca commit aa5ff84
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion readme.md
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,8 @@ Infini-Transformer ([https://arxiv.org/abs/2404.07143](https://arxiv.org/abs/240
- Support for multiple downstream tasks, including text classification, question answering, and language generation
- Efficient fine-tuning for task-specific adaptation
- Includes a Mixture-of-Depths ([https://arxiv.org/abs/2404.02258](https://arxiv.org/abs/2404.02258)) transformer layer that incorporates Infini-Attention
- Implementation of RoPE ([https://arxiv.org/abs/2104.09864](https://arxiv.org/abs/2104.09864)) that conforms to Infini-Attention's and Mixture-of-Depth's memory-efficient designs (YaRN and PoSE to be added soon)
- Implementation of RoPE ([https://arxiv.org/abs/2104.09864](https://arxiv.org/abs/2104.09864)) that conforms to Infini-Attention's and Mixture-of-Depth's memory-efficient designs (PoSE to be added soon)
- - Implementation of YaRN ([https://arxiv.org/abs/2309.00071](https://arxiv.org/abs/2309.00071)) that conforms to Infini-Attention's and Mixture-of-Depth's memory-efficient designs (PoSE to be added soon)

## Directory structure

Expand Down

0 comments on commit aa5ff84

Please sign in to comment.