atoniolo76

Hey! I'm interested in distributed systems, LLM load balancing and inference, and networking on multi-node GPU clusters. Recently, I developed and benchmarked efficient KV-aware routing for cross-region chatbots. You can read the pre-print here. I also make coding tutorials from time to time.