feat: Refactor LLM model zoo and add KV cache support by peri044 · Pull Request #3527 · pytorch/TensorRT

and others added 9 commits

April 25, 2025 21:03

@peri044 peri044 added the WIP (Work is in progress, pull request should not be merged yet) label

May 20, 2025

@peri044 peri044 changed the title from "feat : caching attempts" to "feat: caching attempts"

May 20, 2025

Chengzhe Xu and others added 3 commits

May 27, 2025 19:13

@peri044 peri044 changed the title from "feat: caching attempts" to "feat: Refactor LLM optimization and add KV cache support"

Jun 13, 2025

@peri044 peri044 changed the title from "feat: Refactor LLM optimization and add KV cache support" to "feat: Refactor LLM model zoo and add KV cache support"

Jun 13, 2025

peri044 added a commit that referenced this pull request

Jul 8, 2025
Signed-off-by: Dheeraj Peri <peri.dheeraj@gmail.com>