ranonrkm - Overview
Doctoral Student at InfiniAI lab, CMU ECE; Ex Research Fellow at MSR India
- Carnegie Mellon University
- Pittsburgh, PA
- ranonrkm.github.io
Pinned
- Forked from microsoft/DiskANN
Scalable graph based indices for approximate nearest neighbor search
C++ 1
- Speculative decoding for high-throughput long-context inference
JavaScript
- MagicDec: Breaking the Latency-Throughput Tradeoff for Long Contexts with Speculative Decoding
JavaScript