(Optional) Using DPC to Access Storage to Accelerate Checkpoint Loading
Verify that the following conditions are met:
- The Distributed Parallel Client (DPC) file system is used to access the storage.
- The NDS 1.0 package (/opt/oceanstor/dataturbo/sdk/lib/libdpc_nds.so) is successfully installed.
- The training process can access the preceding .so file. If the process runs in a container, the file must be accessible from inside the container.
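The installation and access conditions can be checked from inside the training environment (for example, inside the container) before a job starts. The following is a minimal sketch, not an official tool: the function name nds_lib_accessible is hypothetical, and the script only tests that the .so file exists and is readable by the current process. It does not verify that the DPC file system is in use.

```python
import os

# Path of the NDS 1.0 library, as given in the conditions above.
NDS_LIB = "/opt/oceanstor/dataturbo/sdk/lib/libdpc_nds.so"


def nds_lib_accessible(path: str = NDS_LIB) -> bool:
    """Return True if path is a regular file that the current process can read.

    Hypothetical helper for illustration only.
    """
    return os.path.isfile(path) and os.access(path, os.R_OK)


if __name__ == "__main__":
    status = "accessible" if nds_lib_accessible() else "missing or unreadable"
    print(f"{NDS_LIB}: {status}")
```

Run the script with the same user and in the same container as the training process so that the result reflects what the training process sees.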
If all the preceding conditions are met, the passthrough read function of NDS 1.0 is automatically enabled to accelerate checkpoint loading.
To confirm that NDS 1.0 is successfully loaded, look for the following message in the log:
"initial and open nds file driver success"
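The log check can be scripted. The following is a minimal sketch that assumes the log is a plain-text file. The function name nds_loaded is hypothetical, and the location of the log file is not specified in this section, so the caller must supply it.

```python
# Success message quoted from the section above.
NDS_SUCCESS_MARKER = "initial and open nds file driver success"


def nds_loaded(log_path: str) -> bool:
    """Return True if any line of the log file contains the success message.

    Hypothetical helper; log_path must be supplied by the caller.
    """
    # errors="replace" keeps the scan going if the log contains invalid bytes.
    with open(log_path, "r", errors="replace") as log:
        return any(NDS_SUCCESS_MARKER in line for line in log)
```

If the function returns False, the message was not found in the given file, which may mean either that NDS 1.0 was not loaded or that a different log file should be inspected.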
For more information about NDS 1.0, see OceanStor DataTurbo 25.x.x DTFS User Guide.
If the DPC file system is used to access the storage and the NDS 1.0 package (/opt/oceanstor/dataturbo/sdk/lib/libdpc_nds.so) is successfully installed, the function works normally when the permission on this file is set to 444 (read-only for the owner, group, and others). Exercise caution when setting the permission on this file before starting training.
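The current permission on the file can be inspected before it is changed. The following is a minimal sketch using the Python standard library. The function name has_mode_444 is hypothetical, and the script only reads the permission bits without modifying them.

```python
import os
import stat

# Path of the NDS 1.0 library, as given in the section above.
NDS_LIB = "/opt/oceanstor/dataturbo/sdk/lib/libdpc_nds.so"


def has_mode_444(path: str = NDS_LIB) -> bool:
    """Return True if the permission bits of path are exactly 444 (r--r--r--).

    Hypothetical helper for illustration only.
    """
    # S_IMODE strips the file-type bits and keeps only the permission bits.
    mode = stat.S_IMODE(os.stat(path).st_mode)
    return mode == 0o444
```

The helper raises FileNotFoundError if the file does not exist, so a missing library is not mistaken for a permission mismatch.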