CausalFlow Logo CausalFlow.ai
Home Blogs
August 2025 GPU Optimization

Optimizing FP4 Mixed-Precision Inference on AMD GPUs

Learn how we developed Petit, a collection of optimized FP16/BF16 x FP4 mixed-precision GPU kernels for AMD GPUs, achieving 1.74x faster inference and up to 3.7x performance improvements.

Read more

© 2024 - 2025. CausalFlow Inc. All Rights Reserved.