moving-bits-for-ai
Topic | Replies | Views | Activity | |
---|---|---|---|---|
Crux: GPU-Efficient Communication Scheduling for Deep Learning Training |
![]() |
0 | 303 | July 30, 2024 |
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving |
![]() |
0 | 112 | July 30, 2024 |
Rethinking Machine Learning Collective Communication as a Multi-Commodity Flow Problem |
![]() |
0 | 110 | July 30, 2024 |
RDMA over Ethernet for Distributed Training at Meta Scale |
![]() |
0 | 336 | July 30, 2024 |