Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
Projects | Hao AI Lab @ UCSD
[go: Go Back, main page]

DistCA

Core Attention Disaggregation for Efficient Long-context Language Model Training

JacobiForcing

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

d3LLM

Ultra-Fast Diffusion LLM 🚀

Dynasor

Making Reasoning Models More Token-Efficient

LMGame Bench

Evaluating LLM Reasoning through Live Computer Games

vLLM-LTR

Efficient LLM Scheduling by Learning to Rank

MuxServe

Serving Multiple LLMs with Flexible Spatial-Temporal Multiplexing

CLLM

Consistency Large Language Models: A Family of Efficient Parallel Decoders

DistServe

Maximizing Goodput in LLM Serving using Prefill-Decode Disaggregation