5.9. PopART
3.4.0
New features
Add createHostTransferableTensorWithOffset session option (experimental). This can prevent out of memory errors for models with a large number of small input tensors.
Enable custom ops created using the Python API to accept a list of float values as attributes.
Bug Fixes
None
Other improvements
Reduce the compilation time when using overlapped I/O and for models with a large number of inputs.
Remove the limit on the number of I/O tiles. The number of I/O tiles was previously limited to 192.
Known issues
None
Compatibility changes
None
3.3.0
New features
Added support for the largest=false option in the TopK operation.
Added the NormalizeImage op that normalises the data and pads to four channels on-device after the data has been transferred to the IPU. This can enable better performance for subsequent convolution operations.
Added the stashAllTensorsInferencePipeline option to the SessionOptions class to enable all tensors to be stashed when doing inference. This may improve performance for certain use cases.
Added the ability to fine-tune the auto virtual graph result with the virtualGraphSplitRatios option in the SessionOptions class.
Added the SplineBasis and SplineWeighting ops to support the PyTorch Geometric SplineConv operator.
Switched use of the deprecated deviceIteration to deviceIterations.
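As an illustration of the largest=false option for TopK listed above, the following is a minimal pure-Python sketch of the selection semantics as defined for the ONNX TopK operator: with largest=false the k smallest values are returned in ascending order, with ties broken by the lower index. The function name top_k and the sample data are hypothetical and are not part of the PopART API. The sketch only models the behaviour on a 1-D list and says nothing about how the op is implemented on the IPU.

```python
import heapq


def top_k(values, k, largest=True):
    """Return (values, indices) of the k largest or k smallest entries.

    Hypothetical helper modelling TopK on a 1-D list; not a PopART call.
    """
    # Negate the value when selecting the largest entries so that one
    # "smallest first" ordering serves both modes. The index is the
    # second sort key, so equal values keep the lower index first.
    sign = -1 if largest else 1
    picked = heapq.nsmallest(
        k, range(len(values)), key=lambda i: (sign * values[i], i)
    )
    return [values[i] for i in picked], picked


scores = [0.9, 0.1, 0.5, 0.3]

# largest=False selects the two smallest scores, in ascending order.
smallest_values, smallest_indices = top_k(scores, 2, largest=False)

# largest=True selects the two largest scores, in descending order.
largest_values, largest_indices = top_k(scores, 2, largest=True)
```

Here smallest_values is [0.1, 0.3] with indices [1, 3], while largest_values is [0.9, 0.5] with indices [0, 2].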
Bug Fixes
Fixed the Loop and Scan ops so that any constant tensors are implicitly added into the subgraphs of these ops as inputs.
Fixed the overflow in the Clip operator.
Other improvements
None
Known issues
None