Your post was interesting. Now forgive me if this sounds like an inexact analogy.
When USB became a popular interface it was quickly noticed that the asynchronous delivery-the music arriving in packets- was adversely affecting the sound due to timing errors. Essentially this was solved by machines storing the packets and then reclocking them. In fact there were CD players that would put about 10 seconds of music in a buffer before playing it, even before streaming dominated the landscape.
Essentially what I see in your post is the buffering of streaming content. Am I off base here?

