Yes I should clarify I guess. My thought is the Yvette or older Sophia 3 with a pair of subs high passed (JL CR-1 etc) at 60hz would play in the same league as Wilson's much bigger models. I don’t like monitors with subs at all. Just to different of a dynamic impact for me. The advantage is you can work around room nodes much better unless you can get this big speakers 7’ off the wall.
But what is scale? Image size? Which is defined by dispersion and room interaction (time alignment, reflections etc)? Or it is the impact of moving more air? So much of the sound comes from the mids and tweeters so how much more air is being moved by a very large speaker vs large speaker with subs. Honest question, I really don’t know.
My guess is that it comes into room and time delay of the sound combined with dynamic impact. Big drivers just move more air for less displacement. I know my headphones don’t do it and the driver to room size could not get any bigger than headphones so maybe it is more phase, room/ time delay.
But what is scale? Image size? Which is defined by dispersion and room interaction (time alignment, reflections etc)? Or it is the impact of moving more air? So much of the sound comes from the mids and tweeters so how much more air is being moved by a very large speaker vs large speaker with subs. Honest question, I really don’t know.
My guess is that it comes into room and time delay of the sound combined with dynamic impact. Big drivers just move more air for less displacement. I know my headphones don’t do it and the driver to room size could not get any bigger than headphones so maybe it is more phase, room/ time delay.

