Model Training and Saving

After the Removing Fixed Randomness (deterministic computing disabled) is executed, reduce the interval for saving the model as much as possible. For example, save the model every five steps to ensure that the precision exception can be reproduced within a reasonable period of time. Obtain the model file of the latest step after the loss exception occurs and compare it with the model file of the same number of steps during normal training.