moving-bits-for-ai
| Topic | Replies | Views | Activity |
|---|---|---|---|
| Crux: GPU-Efficient Communication Scheduling for Deep Learning Training | 0 | 311 | July 30, 2024 |
| CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving | 0 | 137 | July 30, 2024 |
| Rethinking Machine Learning Collective Communication as a Multi-Commodity Flow Problem | 0 | 122 | July 30, 2024 |
| RDMA over Ethernet for Distributed Training at Meta Scale | 0 | 378 | July 30, 2024 |