Haven't had time to read all the responses but wanted to weigh in as an audio engineer/recording engineer/mix engineer:
There are some fantastic comments in here that combined make up most of my viewpoint which is based around real world testing and listening.
I agree there is no way to hear "the original sound" of the engineer who mixed it unless you have the same room dimensions and materials, same absorption and same system. When I started investigating the massive difference even cabling can make in an acoustically treated space, I realized there's no way to really hear what was originally heard in the studio. BUT, who cares ?? As a matter of fact, many systems nowadays that are well synergized will sound BETTER than the studio in which they were recorded. As someone who has toured the big boy studios and heard their systems, carefully matching components to your room/speakers and for your preferences, can yield results better than many studios if done right.
I agree with those saying why do you have to sacrifice one or the other? A good system should be highly resolute while providing a lot of "musicality" at the same time. Yes there is a bit of a tradeoff with gear but super solid gear should be able to convey both the details and the musicality of the signal.
As a side note, a good engineer should have 20-20 khz dialed in fairly flat (with respect to treble and bass curve preferences, my mid and sub bass are elevated for instance as I make mostly electronic music which requires careful analysis with bass frequencies) on their main rig, and it's recommended to have two other playback systems or devices to test mixes on. I use tiny speakers and headphones for my 2nd and 3rd source for a forest for the trees perspective with the little speakers and the headphones for ultra detail and for those who listen primarily with headphones.
Just a little insight into how engineers actually come about the final sound you hear. Most of the time it has been mixed and tested on multiple systems with massively varying responses in order to test the content on average joe playback systems

