Use the same parameter but produce different results in Bayesian Sweep

Well, finally find the problem. I used the LSTM in my code. And there are some “non-determinism issues for RNN functions on some versions of cuDNN and CUDA.”