moving-bits-for-ai
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Crux: GPU-Efficient Communication Scheduling for Deep Learning Training |
|
0 | 307 | July 30, 2024 |
| CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving |
|
0 | 116 | July 30, 2024 |
| Rethinking Machine Learning Collective Communication as a Multi-Commodity Flow Problem |
|
0 | 113 | July 30, 2024 |
| RDMA over Ethernet for Distributed Training at Meta Scale |
|
0 | 351 | July 30, 2024 |