6. gc-flops
This tool allows you to benchmark the number of floating point operations per second (FLOPS) on one or more IPU processors. The tool supports Mk2 architectures only.
6.1. Precision
The --fp
and -p
command options select the floating point precision.
You can choose between FP16
(the default), which is 16-bit floating point also known as
half-precision floating point, or FP32
for single-precision floating point.
6.2. Device
The --device-id
and -d
command options will let you specify the IPU device to benchmark.
Note
The tool reports benchmark results in gigaFLOPS. The clock speed of the IPU affects these results. The tool measures and reports the clock speed to the user. For Multi IPU device, the tool reports the lowest value from all IPUs.
6.2.1. Usage
6.3. Allowed options
|
Floating point precision [FP16|FP32|FP8] (default: FP16) |
|
Device id |
|
Emit JSON output |
|
Permit low clock-rate |
|
Produce help message |
|
Version number |