Elastic-Agent
(断点续训相关接口)
mindx_elastic.__version__
mindx_elastic.api.patch_torch_methods(内部接口,严禁调用)
mindx_elastic.recover_manager.DLRecoverManager
report_stop_complete(code: int, msg: str, fault_ranks: dict) -> int
report_recover_strategy(fault_ranks: dict, strategy_list: list) -> int
report_recover_status(code: int, msg: str, fault_ranks: dict, strategy: str) -> int
report_process_fault(fault_ranks: dict) -> int
返回码说明
父主题:
API参考