Network Requirements
The core scheduling component Volcano is deployed on the Kubernetes master node. To ensure service stability, the following suggestions are provided based on Kubernetes deployment requirements. You can adjust the requirements based on your service characteristics.
- Separate the master node from the worker and storage nodes. You are advised to deploy the master node on an independent server.
- If the cluster scale is large or high service reliability is required, deploy the master node in multi-node mode.
Deployment Logic
Figure 1 Deployment logic


Nodes in a data center cluster are classified into the following types:
- Master node: manages clusters, distributes training or inference jobs to each worker node for execution, and supports the master node-related cluster scheduling components.
- Worker node: executes training or inference jobs and supports the worker node-related cluster scheduling components.
- Storage node: stores datasets and trained models.
The functions of each network plane are as follows:
- Service plane: manages Kubernetes cluster services.
- Storage plane: reads a training dataset from storage nodes. Due to bandwidth requirements, you are advised to deploy an independent network to connect training nodes to storage nodes.
- Parameter plane: exchanges parameters between training nodes and connects training nodes for distributed training. For details, see the networking scheme of the solution.
Parent topic: Hardware