lambda7xx - Overview

View lambda7xx's full-sized avatar

Xiao lambda7xx

千里之行, 始于足下 Build Systems Think AI in System, Think System in AI.

  • Shanghai Jiao Tong University

  • Shanghai

Organizations

@cs61

Block or report lambda7xx

👋 Hi, I’m Xiao. I’m now seeking opportunities in computer systems and machine learning systems, or related industry positions. Please feel free to reach out if you have an opening. Thanks!

Research Interests:

I am very interested in building Operating Systems and Distributed Systems for AI.

Publications:

  • ICSE-SEIP'23
  • Eurosys'24
  • ASPLOS'24
  • Autellix(NSDI'26)

Projects

  • Autellix: high throuhput LLM Agent serving system
  • DeepScaler: RL LLM training
  • Nexus: Intra-GPU PD disaggregated LLM serving system

Google Scholar

Google Scholar

📫 Feel free to email me at lambda7xx@gmail.com if you are interested in my work.

Pinned Loading

  1. paper and its code for AI System

    357 23

  2. A high-throughput and memory-efficient inference and serving engine for LLMs

    Python 74.8k 15k

  3. SGLang is a high-performance serving framework for large language models and multimodal models.

    Python 25.3k 5.1k

  4. Forked from kvcache-ai/ktransformers

    A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

    Python