5.7. PopXL
3.1.0
New features
Added support for float8 inference (host-based conversion, variables, constants, view changers, cast, matmul and conv).
Added support for replicated tensor sharding (RTS) and replica-grouped initialisation with multiple instances for remote variables.
Added the following ops:
scatter_reducepowsubsamplebatch_norm_inference
Added the
diagmethod to thetensorclass.
Bug Fixes
None
Other improvements
Disabled storing variable data with executable caching.
Used
subsampleop to handle slicing with step > 1.
Known issues
None
Compatibility changes
None
3.0.0
New features
Add abs, cos and sin operations providing additional operation coverage.
Add ability to compile without acquiring IPUs providing improved IPU resource utilisation.
Add
DecomposeSumtransform.Add support for
ifoperation allowing for more expressive control flow.Add ability to broadcast binary operations.
Enable support for all stride configurations collectives and replicated variables.
The ability to stride replicas in collectives is useful for advanced use cases like Tensor Model Parallel (TMP).
For more details refer to Available operations in popxl.ops.collectives in the section on Supported operations in the PopXL User Guide and API (experimental).
Bug Fixes
None
Other improvements
Improvements to PopXL User Guide and API (experimental):
add description of environment variables
improved sections for custom operations
Known issues
None
Compatibility changes
Change
replicated_all_gatherop so that the output tensor has shape[size, *shape].