Search — Targeting the IPU from TensorFlow 1

Version: 3.1.0

1. Introduction
- 1.1. Document overview
2. Tutorial
3. Targeting the Poplar XLA device
4. Compiling and pre-compiling executables
- 4.1. Caching of compiled executables
- 4.2. Pre-compiling executables
  - 4.2.1. Unsupported Operations
5. Training a model
6. Efficient IPU I/O
- 6.1. Prefetch elements
- 6.2. I/O Tiles
7. Example using IPUEstimator
8. Example using IPUPipelineEstimator
9. Distributed training
- 9.1. PopDistStrategy examples
10. Half-precision floating point and stochastic rounding
11. IPU-optimised operations
12. IPU Outlined Functions
- 12.1. Usage
- 12.2. Examples
  - 12.2.1. Models with common structures
  - 12.2.2. Serializing large operations
13. Writing custom operations
- 13.1. Custom operation on the IPU
- 13.2. Custom host CPU operations
  - 13.2.1. Gradient callback
14. IPU host embeddings
15. IPU embedded application runtime
16. Exporting precompiled models for TensorFlow Serving
17. Retrieving information about compilation and execution
- 17.1. TensorFlow options for reporting
- 17.2. XLA graph file naming
18. IPU TensorFlow Addons
- 18.1. Introduction
- 18.2. IPU SavedModel CLI
19. TensorFlow API changes
20. TensorFlow Python API
21. TensorFlow operators supported by the IPU
22. IPU TensorFlow Addons API changes
23. IPU TensorFlow Addons Python API
- 23.1. TensorFlow layers
  - 23.1.1. TensorFlow layers made for IPU TensorFlow
- 23.2. TensorFlow optimizers
  - 23.2.1. Optimizers made for IPU TensorFlow
24. Resources
25. Trademarks & copyright