Optimal Kernel Orchestration for Tensor Programs with Korch - Muyan Hu uwsampl 460 подписчиков Скачать
FlexGen:High-throughput Generative Inference of Large Language Models with a Single GPU - Ying Sheng Скачать
Verified Tensor-Program Optimization Via High-Level Scheduling Rewrites | SAMPL Talk 2022/04/21 Скачать
Decoupling Algorithm from Hardware Customizations for Software-Defined Reconfigurable Computing Скачать