feat: Implement FP32 accumulation for matmul by peri044 · Pull Request #3110 · pytorch/TensorRT
Navigation Menu
{{ message }}
- Notifications You must be signed in to change notification settings
- Fork 384
Conversation
Copy link
Collaborator
peri044
commented
Aug 21, 2024
peri044
commented
Description
Implement FP32 accumulation for matmul layers
Type of change
Please delete options that are not relevant and/or add your own.
- Bug fix (non-breaking change which fixes an issue)
- New feature (non-breaking change which adds functionality)
- Breaking change (fix or feature that would cause existing functionality to not work as expected)
- This change requires a documentation update
Checklist:
- My code follows the style guidelines of this project (You can use the linters)
- I have performed a self-review of my own code
- I have commented my code, particularly in hard-to-understand areas and hacks
- I have made corresponding changes to the documentation
- I have added tests to verify my fix or my feature
- New and existing unit tests pass locally with my changes
- I have added the relevant labels to my PR in so that relevant reviewers are notified
peri044 added 30 commits
June 12, 2024 17:24…llm_examples_main
github-actions
bot
added
component: core
labels
Aug 30, 2024peri044 added 2 commits
August 30, 2024 15:47
peri044
changed the base branch from
llm_examples_main
to
main
peri044 added 2 commits
September 24, 2024 11:54
github-actions
bot
removed
component: tests
labels
Sep 24, 2024
peri044
mentioned this pull request
7 tasks
github-actions
bot
added
documentation
component: torch_compile labels
Sep 30, 2024peri044 added 3 commits
October 1, 2024 14:17narendasan reviewed Oct 7, 2024
examples/dynamo/torch_export_gpt2.py
Outdated
Show resolved
Hide resolved
examples/dynamo/torch_export_gpt2.py Outdated Show resolved Hide resolved
narendasan reviewed Oct 7, 2024
py/torch_tensorrt/dynamo/_compiler.py
Show resolved
Hide resolved
py/torch_tensorrt/dynamo/_compiler.py Show resolved Hide resolved
narendasan reviewed Oct 7, 2024
py/torch_tensorrt/dynamo/conversion/impl/elementwise/base.py
Show resolved
Hide resolved
py/torch_tensorrt/dynamo/conversion/impl/elementwise/base.py Show resolved Hide resolved
peri044 added 6 commits
October 7, 2024 13:51
peri044
merged commit
38771e6
into
main
lanluo-nvidia
mentioned this pull request
7 tasks
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode characters
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment