moving-bits-for-ai
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Crux: GPU-Efficient Communication Scheduling for Deep Learning Training |
|
0 | 336 | July 30, 2024 |
| CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving |
|
0 | 174 | July 30, 2024 |
| Rethinking Machine Learning Collective Communication as a Multi-Commodity Flow Problem |
|
0 | 134 | July 30, 2024 |
| RDMA over Ethernet for Distributed Training at Meta Scale |
|
0 | 391 | July 30, 2024 |