diff --git a/README.md b/README.md index 64578c4..d2c58e5 100644 --- a/README.md +++ b/README.md @@ -191,6 +191,8 @@ output = model(input_ids) print(output.shape) # Expected shape: (batch_size, target_length, d_model) ``` +## Advanced Usage + ## Results My implementation of LongRoPE achieves the following results: