Affinity Scheduling Policies
The table below describes the characteristics and resource usage rules of an inference server (with Atlas 300I Duo inference cards).
Policy Name |
Policy Description |
|---|---|
Affinity scheduling by inference card |
The Ascend AI processors on one Atlas 300I Duo inference card are preferred. If one or two Ascend AI processors need to be allocated, ensure that the processor(s) is (are) selected from one Atlas 300I Duo inference card. The node with one available Atlas 300I Duo inference card is the best and then two. |
Distributed inference scheduling by Ascend AI processor |
The job must be scheduled to the entire Atlas 300I Duo inference card. If the number of Ascend AI processors required by the job is an odd number, the job is preferentially scheduled to the Atlas 300I Duo inference card with one remaining AscendAI processor. |