MorphServe

Workload-aware LLM serving via runtime layer swapping and KV cache resizing.