Medusa/ROADMAP.md at main · FasterDecoding/Medusa

Roadmap

Functionality

  • Batched inference
  • Fine-grained KV cache management
  • Explore tree sparsity
  • Fine-tune Medusa heads together with LM head from scratch
  • Distill from any model without access to the original training data

Integration

Local Deployment

Serving