moving-bits-for-ai
Topic | Replies | Views | Activity
---|---|---|---
Crux: GPU-Efficient Communication Scheduling for Deep Learning Training | 0 | 276 | July 30, 2024
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving | 0 | 105 | July 30, 2024
Rethinking Machine Learning Collective Communication as a Multi-Commodity Flow Problem | 0 | 104 | July 30, 2024
RDMA over Ethernet for Distributed Training at Meta Scale | 0 | 310 | July 30, 2024