OP, in my opinion, the key issue isn't AI involvement but the questions themselves. Sonic comparisons are system-dependent variables: amplification, speakers, room acoustics, cables, and listener perception and taste all affect the outcome. If you don't specify those parameters, the AI can only synthesize reviewers' and posters' generalizations. And it will do it confidently with attractive descriptions.
But in all of that word salad, it will be unlikely to flag whether your questions are missing critical information. That's a characteristic AI limitation.
So...why not try asking AI an additional question to the one you're posing –
"Gemini, before answering, identify what information about my system and listening context is missing from this question that would be necessary to give me a more accurate and personalized response."
That kind of metacognitive prompt turns AI into an interlocutor rather than an oracle. It forces it to reveal its own assumptions before generating what might otherwise be a fluent but under-specific answer.

