-
Notifications
You must be signed in to change notification settings - Fork 28.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update falcon mamba card #37253
Update falcon mamba card #37253
Conversation
Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Super nice job! 🤩
@@ -14,97 +14,105 @@ rendered properly in your Markdown viewer. | |||
|
|||
--> | |||
|
|||
# FalconMamba | |||
|
|||
<div class="flex flex-wrap space-x-1"> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Wrap this with the below to align it to the right:
<div style="float: right;">
badges
</div>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oh! Forgot about that. Adding.
Co-authored-by: Steven Liu <[email protected]>
Co-authored-by: Steven Liu <[email protected]>
Co-authored-by: Steven Liu <[email protected]>
Fixed comments, ready for review. |
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, one last comment!
## Notes | ||
|
||
- FalconMamba is based on the Mamba architecture. The same [tips and best practices](./mamba) for Mamba models are relevant here. | ||
- The architecture is compatible with `torch.compile` for faster generation via `model = torch.compile(model)`. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we can remove these notes as they're not that useful. The model is automatically compiled when we set cache_implementation="static"
in generate
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Removed!
* feat: edit falcon mamba card * fix: edit statement on falconmamba arch * Update docs/source/en/model_doc/falcon_mamba.md Co-authored-by: Steven Liu <[email protected]> * Update docs/source/en/model_doc/falcon_mamba.md Co-authored-by: Steven Liu <[email protected]> * Update docs/source/en/model_doc/falcon_mamba.md Co-authored-by: Steven Liu <[email protected]> * fix: add right indent for tags * fix: remove notas --------- Co-authored-by: Steven Liu <[email protected]>
#36979
Before submitting
Pull Request section?
to it if that's the case.
documentation guidelines, and
here are tips on formatting docstrings.