Execution-Centric Characterization of FP8 Matrix Cores, Asynchronous Execution, and Structured Sparsity on AMD MI300A
Aaron Jarmusch, Connor Vitz, Sunita Chandrasekaran
arXiv preprint · February 2026 Preprint
AMD MI300AFP8Structured SparsityMicrobenchmarksGPU ComputingHPC