Tutorials
These tutorials introduce Ave through small kernels that showcase the language and runtime.
- Tutorial 1: AXPY Introduce `@ave.jit`, typed tensors, thread indexing, and kernel launch geometry.
- Tutorial 2: GEMM Start with naive BF16 `A @ B^T`, then add shared-memory tiling and vectorized tile loads.
- Tutorial 3: Intrinsics Use AMDGPU MFMA, runtime pointer views, and raw-buffer loads for generic GEMM kernels.