5.9. PopART
3.4.0
New features
Add the createHostTransferableTensorWithOffset session option (experimental). This can prevent out-of-memory errors for models with a large number of small input tensors.
Enable custom ops created using the Python API to accept a list of float values as attributes.
Bug Fixes
None
Other improvements
Reduce the compilation time when using overlapped I/O and for models with a large number of inputs.
Remove the limit on the number of I/O tiles. The number of I/O tiles was previously limited to 192.
Known issues
None
Compatibility changes
None
3.3.0
New features
Added support for the largest=false option in the TopK operation.
Added the NormalizeImage op that normalises the data and pads to four channels on-device after the data has been transferred to the IPU. This can enable better performance for subsequent convolution operations.
Added the stashAllTensorsInferencePipeline option to the SessionOptions class to enable all tensors to be stashed when doing inference. This may improve performance for certain use cases.
Added the ability to fine-tune the auto virtual graph result with the virtualGraphSplitRatios option in the SessionOptions class.
Added the SplineBasis and SplineWeighting ops to support the PyTorch Geometric SplineConv operator.
Switched use of the deprecated deviceIteration to deviceIterations.
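As an illustration of what the largest=false option changes, the following plain-Python sketch mimics TopK semantics on a 1-D list. It is not PopART code: the function name top_k and its signature are hypothetical, and the tie-breaking rule (lower index first) is taken from the ONNX TopK specification and assumed to apply here.

```python
def top_k(values, k, largest=True):
    """Reference sketch of TopK on a 1-D list (hypothetical helper, not the PopART API).

    Returns (selected_values, selected_indices). With largest=True the k
    largest entries are returned in descending order; with largest=False
    the k smallest entries are returned in ascending order. Ties are
    broken by the lower index, as in the ONNX TopK specification.
    """
    if not 0 <= k <= len(values):
        raise ValueError("k must be between 0 and len(values)")

    # Sort indices by value (negated when the largest entries are wanted),
    # then by index so that equal values keep a deterministic order.
    def sort_key(i):
        return (-values[i] if largest else values[i], i)

    picked = sorted(range(len(values)), key=sort_key)[:k]
    return [values[i] for i in picked], picked


scores = [0.9, 0.1, 0.5, 0.1, 0.7]
smallest_values, smallest_indices = top_k(scores, 2, largest=False)
largest_values, largest_indices = top_k(scores, 2, largest=True)
```

With largest=False the call selects the two smallest scores (indices 1 and 3), whereas the default selects the two largest (indices 0 and 4).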
Bug Fixes
Fixed the Loop and Scan ops so that any constant tensors are implicitly added into the subgraphs of these ops as inputs.
Fixed the overflow in the Clip operator.
Other improvements
None
Known issues
None