SMAUG
Simulating Machine Learning Applications on gem5-Aladdin
|
Use these macros to portably convert between vectors of single-precision and half-precision floats, instead of directly calling the FP16 library functions or the manual asm implementations. Depending on the compilation mode (e.g. native vs gem5 vs LLVM-Tracer) and hardware support, different underlying implementations will be selected. More...
Use these macros to portably convert between vectors of single-precision and half-precision floats, instead of directly calling the FP16 library functions or the manual asm implementations. Depending on the compilation mode (e.g. native vs gem5 vs LLVM-Tracer) and hardware support, different underlying implementations will be selected.
_CVT_PS_PH_128: Convert 4-wide vectors from single to half precision. _CVT_PH_PS_128: Convert 4-wide vectors from half to single precision. _CVT_PS_PH_256: Convert 8-wide vectors from single to half precision. _CVT_PH_PS_256: Convert 8-wide vectors from half to single precision.