Search help

Note: Searching from the top-level index page will search all documents. Searching from a specific document will search only that document.

Find an exact phrase: Wrap your search phrase in "" (double quotes) to only get results where the phrase is exactly matched. For example "PyTorch for the IPU" or "replicated tensor sharding"
Prefix query: Add an * (asterisk) at the end of any word to indicate a prefix query. This will return results containing all words with the specific prefix. For example tensor*

Fuzzy search: Use ~N (tilde followed by a number) at the end of any word for a fuzzy search. This will return results that are similar to the search word. N specifies the “edit distance” (fuzziness) of the match. For example Polibs~1
Words close to each other: ~N (tilde followed by a number) after a phrase (in quotes) returns results where the words are close to each other. N is the maximum number of positions allowed between matching words. For example "ipu version"~2
Logical operators. You can use the following logical operators in a search:
- + signifies AND operation
- | signifies OR operation
- - negates a single word or phrase (returns results without that word or phrase)
- () controls operator precedence

1. Overview

Usually, a model inference service will be deployed to a Kubernetes cluster to provide scalable and highly available services. This document describes how Kubernetes manages IPU resources with Graphcore’s Kubernetes IPU device plugin. The Kubernetes IPU device plugin is a DaemonSet for Kubernetes, which:

Exposes the number of IPUs of each node in the cluster
Allocates one or more IPUs for a Pod
Inspects the health of the IPUs

For more information about compiling models to run on an IPU refer to the IPU Inference Toolkit User Guide.