feat: Refactor LLM model zoo and add KV cache support by peri044 · Pull Request #3527 · pytorch/TensorRT
and others added 9 commits
April 25, 2025 21:03
peri044
changed the title
feat : caching attempts
feat: caching attempts
Chengzhe Xu and others added 3 commits
May 27, 2025 19:13
peri044
changed the title
feat: caching attempts
feat: Refactor LLM optimization and add KV cache support
peri044
changed the title
feat: Refactor LLM optimization and add KV cache support
feat: Refactor LLM model zoo and add KV cache support
peri044 added a commit that referenced this pull request
Jul 8, 2025This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode characters