I believe the answer is: Change very little from day to day
The stability of the test scores is most likely influenced by the reliability of testing method that imposed to the subjects.
Due to this reliability, the data that extracted from the subjects is much more accurate and often lead to same conclusion from day to day.