The 8-bit floating-point (FP8) data format has been increasingly adopted in neural network (NN) computations due to its superior dynamic range compared to traditional INT8. However, FP8-based multiplication, a core operation in NNs, still incurs significant power consumption. To address this issue, this paper presents an FPGA-based approximate multiplier design for FP8. Firstly, we conduct a bit-level analysis of the approximation method. Based on this analysis, we implement a fine-grained optimized design on mainstream FPGAs (AMD and Altera) using primitives and templates combined with physical layout constraints. Then, the accuracy and resource utilization of the FP8 approximate multiplier are evaluated and analyzed. The results indicate that, compared to previous FPGA-based 8-bit designs, our design achieves the minimal LUT consumption. Finally, we integrate the design into the inference phase of a representative NN model, demonstrating its excellent power efficiency. To the best of our knowledge, this is the first FPGA-based FP8 approximate multiplier design, which can serve as a benchmark for future designs and comparisons of FPGA-based low-precision floating-point approximate multipliers. The code of this work is available in our GitLab.
Chen, R, Yangxintong, L, Bao, H, Liu, J, Zhu, Y, Tang, S, Ling, M & da Silva, B 2025, FPGA-Based Approximate Multiplier for FP8. in Proceedings - 2025 IEEE 33rd Annual International Symposium on Field-Programmable Custom Computing Machines, FCCM 2025. Proceedings - 2025 IEEE 33rd Annual International Symposium on Field-Programmable Custom Computing Machines, FCCM 2025, IEEE, pp. 227-235. https://doi.org/10.1109/FCCM62733.2025.00079
Chen, R., Yangxintong, L., Bao, H., Liu, J., Zhu, Y., Tang, S., Ling, M., & da Silva, B. (2025). FPGA-Based Approximate Multiplier for FP8. In Proceedings - 2025 IEEE 33rd Annual International Symposium on Field-Programmable Custom Computing Machines, FCCM 2025 (pp. 227-235). (Proceedings - 2025 IEEE 33rd Annual International Symposium on Field-Programmable Custom Computing Machines, FCCM 2025). IEEE. https://doi.org/10.1109/FCCM62733.2025.00079
@inproceedings{b342220fd82d4166b18cbd8c11fe5a15,
title = "FPGA-Based Approximate Multiplier for FP8",
abstract = "The 8-bit floating-point (FP8) data format has been increasingly adopted in neural network (NN) computations due to its superior dynamic range compared to traditional INT8. However, FP8-based multiplication, a core operation in NNs, still incurs significant power consumption. To address this issue, this paper presents an FPGA-based approximate multiplier design for FP8. Firstly, we conduct a bit-level analysis of the approximation method. Based on this analysis, we implement a fine-grained optimized design on mainstream FPGAs (AMD and Altera) using primitives and templates combined with physical layout constraints. Then, the accuracy and resource utilization of the FP8 approximate multiplier are evaluated and analyzed. The results indicate that, compared to previous FPGA-based 8-bit designs, our design achieves the minimal LUT consumption. Finally, we integrate the design into the inference phase of a representative NN model, demonstrating its excellent power efficiency. To the best of our knowledge, this is the first FPGA-based FP8 approximate multiplier design, which can serve as a benchmark for future designs and comparisons of FPGA-based low-precision floating-point approximate multipliers. The code of this work is available in our GitLab.",
author = "Ruiqi Chen and Lyu Yangxintong and Han Bao and Jiayu Liu and Yanxiang Zhu and Shidi Tang and Ming Ling and {da Silva}, Bruno",
note = "Publisher Copyright: {\textcopyright} 2025 IEEE.",
year = "2025",
month = may,
day = "28",
doi = "10.1109/FCCM62733.2025.00079",
language = "English",
series = "Proceedings - 2025 IEEE 33rd Annual International Symposium on Field-Programmable Custom Computing Machines, FCCM 2025",
publisher = "IEEE",
pages = "227--235",
booktitle = "Proceedings - 2025 IEEE 33rd Annual International Symposium on Field-Programmable Custom Computing Machines, FCCM 2025",
}