Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The most typical reason argmax (temp 0) is non-deterministic is that your request is running batched with other people requests. The number and size of these affects the matrix sizes and thus tiling decisions. Then you get different floating point order and thus different results.

Nvidia gives some guarantees about deterministic results of their kernels but that only applies when you have exact same input data and this is not the case when in-flight batching.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: