One paper on LLM serving over preemptible instances was accepted to ASPLOS 2024. :tada: