GetTrainingDataTraceSwitch

Description

Allows an external caller to query the status of the dynamic profiling (trace) switches for each type of training data.

Prototype

rpc GetTrainingDataTraceSwitch (DataStatusReq) returns (DataStatusRes)

Input Parameters

Parameter

Type (Defined by Protobuf)

Description

DataStatusReq

message DataStatusReq {
    string jobNsName = 1;
}

jobNsName: namespace and name of the job to be queried, separated by a slash (/), for example, default/test-pytorch.
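
The namespace/name format of jobNsName can be checked on the client side before the request is sent. The following is a minimal sketch only; split_job_ns_name is a hypothetical helper and is not part of the API, and the rule that both parts must be non-empty with exactly one slash is an assumption based on the example above.

```python
from typing import Tuple


def split_job_ns_name(job_ns_name: str) -> Tuple[str, str]:
    """Split '<namespace>/<name>' into (namespace, name).

    Hypothetical client-side helper. Assumes exactly one slash and
    non-empty parts on both sides; raises ValueError otherwise.
    """
    namespace, sep, name = job_ns_name.partition("/")
    if not sep or not namespace or not name or "/" in name:
        raise ValueError(
            f"expected '<namespace>/<name>', got {job_ns_name!r}"
        )
    return namespace, name


if __name__ == "__main__":
    print(split_job_ns_name("default/test-pytorch"))
    # prints ('default', 'test-pytorch')
```

Validating locally avoids a round trip that would presumably end in the invalid-input-parameter return code.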

Return Value

Parameter

Type (Defined by Protobuf)

Description

DataStatusRes

message DataStatusRes {
    string message = 1;
    ProfilingSwitch profilingSwitch = 2;
    int32 code = 3;
}

message: result message of the API call

profilingSwitch: details of each switch

  • CommunicationOperator: communication operator
  • Step: step latency
  • SaveCheckpoint: time taken by saveCheckpoint
  • FP: forward propagation data
  • DataLoader: time taken by DataLoader

code: return code of the API call
  • 1 (300): invalid input parameter
  • 2 (404): ConfigMap not found
  • 3 (500): server error
  • 4 (200): normal response
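
A caller typically branches on the code field of DataStatusRes. The sketch below shows one way to do so. It assumes that the number in parentheses (300, 404, 500, 200) is the value carried in code; the source does not state this explicitly. RETURN_CODES, describe_code, and is_success are hypothetical names, not part of the API.

```python
# Assumption: the parenthesized numbers in the list above are the
# values returned in DataStatusRes.code.
RETURN_CODES = {
    200: "normal response",
    300: "invalid input parameter",
    404: "ConfigMap not found",
    500: "server error",
}


def describe_code(code: int) -> str:
    """Return a readable description for a return code (hypothetical helper)."""
    return RETURN_CODES.get(code, f"unknown return code {code}")


def is_success(code: int) -> bool:
    """True only for the normal-response code."""
    return code == 200


if __name__ == "__main__":
    for code in (200, 300, 404, 500):
        print(code, describe_code(code), is_success(code))
```

With this approach, profilingSwitch would be read only when is_success returns True; for any other code, the message field is the place to look for details.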