MM vs. MC: What Happens When People Jump Into MC Too Early? 


MM vs. MC: What Happens When People Jump Into MC Too Early? 

The decision to purchase an expensive moving-coil cartridge rarely happens immediately. Most music lovers arrive at that point only after spending some time with vinyl.

Perhaps they started with an entry-level MM cartridge, or they already own an affordable MC cartridge. Or perhaps they started with a carefully selected vintage MM cartridge, as I suggested in my article – The Best Vinyl Sound For $1000, and discovered just how much performance can be achieved without spending a fortune.

But after a year or two of listening, reading forums, watching reviews, and talking to other audiophiles, a question inevitably appears:

What am I missing?

What would happen if I installed one of those expensive MC cartridges that audiophiles praise so enthusiastically?

What about Lyra Delos, or some highly praised retipped vintage MC cartridge?

Before we go any further, it is worth mentioning another group of audiophiles—more experienced vinyl enthusiasts—the ones who have already spent several years with analog playback and dream of eventually building a serious turntable with two tonearms, just like the systems they admire on Instagram.

Whenever I asked people planning such systems what tonearms they intended to install, the answer was remarkably consistent:

“One light tonearm and one heavy tonearm.”

In my experience, both approaches lead to expensive detours.

The first mistake is trying to jump into the MC segment before you can afford a solution that really works.

The second is spending a fortune on a two-arm turntable without fully understanding what those tonearms are supposed to accomplish, rather than pursuing a more versatile path.

To explain these mistakes, we need to go back to the beginning.

Cartridges Evolved Together with Records

Phono cartridges did not develop in isolation. They evolved alongside the record itself.

Nobody would seriously consider playing a shellac 78 RPM record with a modern microline MC cartridge. Those records were designed for stylus profiles that, by modern standards, look almost like sewing needles. In many ways, the same principle applies throughout the history of vinyl playback.

As record manufacturing evolved, groove geometry changed. Grooves became narrower, cutting techniques became more sophisticated, and playback equipment became increasingly refined.

Simultaneously, stylus profiles grew more complex. Cantilevers became lighter and stiffer. Magnet structures became more advanced. Cartridge designers continuously searched for new ways to extract more information from the groove.

Many of the moving-coil cartridges that appeared during the 1970s and early 1980s were genuine advances over the moving-magnet designs of their era.

Evaluating those cartridges today, however, is difficult. Unlike vintage MM cartridges, many of which can still be restored to near-original condition simply by installing a modern replacement stylus, a moving-coil cartridge is a much more integrated device. The stylus assembly is not a user-replaceable component but part of the cartridge’s entire mechanical and electromagnetic structure.

As a result, original factory styli for most vintage MC cartridges are long gone.

Restoration typically requires retipping by a specialist, often using a completely different stylus profile, cantilever material, or suspension system than the original design. Sometimes the results are excellent. But what emerges from the process is no longer the same cartridge that left the factory decades ago.

However, cartridge technology did not stop in 1980.

Advances in magnet materials, stylus geometry, coil design, and manufacturing precision have continued for decades. As a result, with only a handful of exceptions, even many of the most celebrated vintage MC cartridges struggle to compete directly with the best modern designs.

What Happens When People Upgrade Too Soon?

What usually happens when people jump into the MC segment with only $1-2K and buy something like Lyra Delos or a similarly priced cartridge?

Some records immediately sound better.

This is especially true of later pressings, where groove geometry became narrower and more sophisticated. Suddenly there is more air, more detail, a deeper soundstage, and a stronger sense of separation between instruments.

The owner smiles and says:

“I knew it. MC really is better.”

Then something else happens.

He pulls out an older record—perhaps Grand Funk Railroad, Nazareth, or Led Zeppelin.

And the reaction becomes less enthusiastic.

The sound is detailed, but somehow less convincing. The music feels more analytical, less cohesive, less emotionally connected. Instead of hearing a performance, the listener begins hearing individual elements of the recording.

Many of the records that define the golden age of vinyl were recorded between the late 1960s and the mid-1980s. Their groove structure and cutting techniques differ substantially from later digitally mastered records, and while the fine stylus of many MC cartridges can extract more information from thin grooves than thicker MM styli, when it comes to the older records with wider grooves – less delicate MM cartridges often produce a more coherent and natural presentation. If you let me use an automobile analogy again, this is like driving a sports car on a highway and a 4x4 off-road.

The Two-Tonearm Solution

This duality brought many music lovers to the two-tonearm solution. Between 2015 and 2018, I built a dozen vinyl systems around exactly this concept. One tonearm carried a cartridge, such as a Shure V15 or an ADC XLM, for earlier records. The second initially carried an Audio-Technica AT33, but its tonal balance was surprisingly similar to the ADC XLM. To make the setup more interesting, I developed my own product, Denon Goliath - a modified Denon 103 that I extensively reworked with retipping and a custom wooden-body conversion. The result deserves a separate article.

It was through the Denon Goliath that I first rediscovered later recordings such as Iron Maiden albums from 1986–1988. For the first time, I heard how much information these records actually contained. Compared to the MM cartridges I had been using previously, it felt as though ten to fifteen percent of the information in the groove had not been retrieved.

These two-tonearm systems were remarkably versatile. They handled both older and newer records with equal confidence and provided an elegant solution to the problem. I used vintage Victor or Denon direct-drive motors and custom plinths to build those turntables, and for that period, they were a perfect audiophile product within a budget below $6000.

And yes – one tonearm on that type of player was light for high-compliance vintage MM cartridges, and the other was heavy for a low-compliance Denon.

Why is it a costly mistake to use the same solution today?

The economics now are very different. Many vintage motors have reached the end of their practical service life and require extensive restoration. A modern turntable capable of accommodating two tonearms, together with the required armboards and tonearms, can easily exceed $15,000–20,000 before you even start thinking about cartridges.

At the same time, today we have My Sonic Lab cartridges that combine very high resolution with a surprisingly natural and balanced presentation across a much wider range of records than was previously possible.

And this is the key point.

Today, buying a versatile My Sonic Lab cartridge and one good tonearm for it is a much smarter decision than a two-tonearm solution. Even the entry-level My Sonic Lab models can compete successfully with combinations that would previously have required two separate tonearms.

As a result, the traditional solution of maintaining separate cartridges for different eras of records becomes much harder to justify when working within a limited budget.

For listeners seeking a turntable capable of reproducing an entire record collection with equal confidence, the most sensible strategy today is often to aim directly for a My Sonic Lab cartridge and a single 12-inch tonearm with low-to-medium effective mass. In my practice, the Moerch DP8 12-inch red dot worked best, but common SME tonearms work perfectly well too, except for heavier models such as the 3012R or M2-12R.

 A Quick Word About Lyra

No discussion of MC cartridges would be complete without mentioning Lyra.

On this website, you will find a separate review of the Lyra Delos, one of the most popular first MC cartridges. In that article, I explain why I rarely recommend it. However, that does not mean it is a bad cartridge. In fact, for many listeners it can be exactly the right choice.

Many entry-level and mid-level high-end systems suffer from a lack of overall transparency. This is often caused not by the cartridge itself, but by the system as a whole. Loudspeakers may have a high Effective Drive Mass, amplifiers may deliver insufficient current, and the entire system may sound slightly closed-in.

In such systems, the energetic and highly detailed character of a cartridge like the Lyra Delos can be extremely beneficial. The cartridge effectively cuts through the system’s overall lack of openness, creating more air, more apparent detail, and a greater sense of excitement.

This is precisely why opinions about Lyra cartridges are often so polarized.

One listener installs a Lyra and immediately falls in love. Another listener hears the same cartridge and finds it overly analytical or fatiguing. Both listeners may be correct. The difference is the system.

So, Should You Upgrade to MC?

My answer is yes, absolutely.

A properly chosen MC cartridge can absolutely deliver a meaningful improvement in analog playback.

The mistake was upgrading without sufficient funds and purchasing an MC cartridge that lacks versatility.

If your system is already highly transparent—sensitive loudspeakers, low Effective Drive Mass, sufficient amplifier current—I would suggest looking toward a cartridge no less than the My Sonic Lab Eminent EX or better, such as the My Sonic Lab Hyper Eminent.

If your system still lacks overall transparency, a cartridge such as the Lyra Kleos may be a good choice, but I recommend addressing the lack of transparency first and then moving to the My Sonic Lab Hyper Eminent anyway.

If your budget for an upgrade is less than $4K, which would allow you to afford Lyra Kleos or a My Sonic Lab Eminent Ex, I would not rush into MC cartridges at all.

Don’t forget that if your phonostage doesn’t properly support MC cartridges, you will need a SUP and a cable to connect, which adds at least $1-2K more.

A properly restored vintage MM cartridge, such as a Shure V15 or an ADC XLM, remains one of the smartest purchases in analog audio below the $4K threshold. More affordable MC cartridges improve certain aspects of reproduction while simultaneously sacrificing others. You gain something on one group of records and lose something on another, and the result is often not a breakthrough, but a compromise.

If your budget allows, skip the intermediate steps and move directly toward a truly versatile design such as the My Sonic Lab.

Does a two-tonarm solution make any sense today?

Oh, yes, but on a much higher level, where lies a very different world of pushing analog playback to the limit. In this game, the goal is no longer versatility.

The goal becomes specialization.

Coming back to the automobile analogy again – BMW X6 is excellent both on a highway and a moderate cross-country, but if you want to set lap records at a racetrack and compete in the Camel Trophy, you need two totally different cars – let’s say a Ferrari and a Defender.

Different cartridges and different tonearms are optimized for different records from different recording eras. This approach can extract the absolute maximum from the medium, but it also requires a completely different level of investment. The threshold begins at roughly $35K, and with the necessary infrastructure – step-up transformers and cabling – it can easily reach $ 45K–$50K. And in this game, the formula “one light-mass tonearm and one heavy tonearm” doesn’t always apply. Some of the best contemporary setups need two heavy tonearms because both cartridges, specialized for records from early and later eras, are low-compliance.

The core of this article is that choosing between MM and MC cartridges is not the real question.

The real question is whether you are optimizing your setup for budget, versatility, or specialization.

At the first level, the goal is straightforward: achieve the best possible sound within a comfortable budget. And vintage MM cartridges like the Shure V15, ADC XLM, and others with Jico styli outperform most existing low-priced MC cartridges, both contemporary and vintage.

The second level begins when the goal of achieving better sound meets a meaningful upgrade that works equally well across a wide range of records. Here's where My Sonic Lab cartridges stand out as one of the most compelling solutions available. If you already know you have no interest in building a $35,000-plus analog front end in the future and that extracting the absolute last few percent of performance from vinyl is not your goal, then a simpler, more universal approach makes perfect sense. In that case, I would focus on turntables that can accommodate a quality 12-inch tonearm but do not necessarily require multiple tonearms.

How Far Down the Rabbit Hole Do You Want to Go?

If, on the other hand, you believe that analog playback may become a long-term passion and that one day you may want to pursue the specialized approach discussed in this article, it may be wise to choose a turntable platform with greater long-term flexibility.

In that scenario, a single universal cartridge such as a My Sonic Lab can serve as an excellent starting point. Later, as priorities and experience evolve, the same turntable can become the foundation for a more specialized two- or even three-tonearm system.

The challenge is that the right answer depends not only on your budget, but also on your goals, your record collection, and how you plan to enjoy music in the years ahead.

If you would like to hear all the cartridges mentioned yourself rather than read about them, I encourage you to visit the REFERENCE VINYL RECORDINGS page.

Reference Vinyl Setup No. 1 in that collection was built around the very concept discussed in this article: a turntable with two specialized tonearms, one optimized for older records and the other for later pressings. Incidentally, that turntable used two heavy-mass tonearms, not the traditional “one light and one heavy” combination.

The Reference Track 1 and Reference Track 2 collections contain recordings made with a wide variety of cartridges, including vintage MC designs and modern cartridges from different price categories.

Well-known and relatively expensive MC cartridges occupy the last two positions. Yet despite their price and brand name, they are outperformed by cartridges ranked much higher—including a humble Shure V15 mounted on a very modest turntable. More importantly, that particular Shure was used in a stock configuration. It did not benefit from many of the upgrades I would normally recommend, such as a top-level JICO stylus with a boron cantilever, a wooden headshell, or mounting it on a 12-inch Moerch. Had that Shure received all those upgrades, it would have been higher in the list, closer to the top 5 positions occupied by the best modern MC carts.

I hope the information in this article—and the accompanying reference recordings—helps you make more informed decisions about your analog front end.

Paul Gerbert, Independent Audio Consultant

Helping audiophiles navigate an expensive and confusing hobby through smarter decisions and long-term planning.

Disclaimer: I don't sell My Sonic Lab cartridges, Lyra cartridges or any other gear mentioned in this article.

colossalsound

Raul, in your treatise on SUTs and their effects on cartridge performance, particularly high frequency response, are you inferring that high frequency response is profoundly influenced by the impedance “seen” by the cartridge as that relates to the cartridge internal impedance and assuming the ratio between the impedance seen and the cartridge internal Z is at least 10:1? At ratios higher than 10:1, I would assume it doesn’t make much difference in most cases. (A listener to any particular combination of cartridge, SUT, and phono might say differently.)

I don’t use SUTs and have never owned one, so I have no opinion on the subject. But I must say that the combination of an ypsilon phono, EMIA SUT, and ZYX UNIverse cartridge was stunningly good one evening at a friend’s home.

@lewm I don't doubt in any way the very good impression being reported about receiving a demo of a Cart' > SUT > Phon' at a friend's residence. 

If you were to be present with a mind full of equations and only a singular deciphering of what those equation values mean in the tangible and audible demonstration. There is a strong likelihood that the moment shared, seemingly indelible as an encounter, will not have been realised.

Math and sub-categories associated with it are thought, which when used, acts as a ubiquitous form of communicating. When used it serves as rigorous reasoning used to understand what the world is around us, as well as being used as the design created to solve complex problems. Math and dib categories, when used, are typically expressed through written presentation.

Math commonly results in a realisation of a Tangible manifestation.

The environment and the interaction within an environmental space is the main contributors to impacting on the senses. The senses are the Nature given tools that create attraction and repulsion.

Some individuals are repulsed by the Bark on a Tree; whereas the Math known for the same Tree will pretty much create awe and wonder for its life function.

Sight, Hearing, Smell, Touch, Taste are what the design is for the human being to find attraction and repulsion; go eat a Toadstool with Red Spots on its pileus (no please don't do that). 

Sound is no different; it is either an attraction or a repulsion, and the finest sound created according to Math is also with the potential to be classed as repulsive to certain persons.

Humans are complex, and knowledge really does influence, maybe just maybe, for certain types the Math has importance, and there is a personality developed that must prioritise the Math as part of an assessment typically assessed by the affect on the senses. 

Hi @billstevenson   : Everithing you think but hubris,please don't do that misunderstood.

Where belongs and start MUSIC? : in and through transient response where HARMONICS are developed , we don't be almost aware of the fundamental frequencies but just all the harmonics developed.

Please read the information of experts ( unfortunatelly the link gones ):

 

There's Life Above 20 Kilohertz!
A Survey of Musical Instrument Spectra to 102.4 KHz


James Boyk
California Institute of Technology
Music Lab, 0-51 Caltech, Pasadena, CA 91125, USA
Tel: +626 395-4590, E-mail: [email protected]
Home: http://www.cco.caltech.edu/~musiclab

Copyright © 1992, 1997 James Boyk. All rights reserved.

 

Abstract
       At least one member of each instrument family (strings, woodwinds, brass and percussion) produces energy to 40 kHz or above, and the spectra of some instruments reach this work's measurement limit of 102.4 kHz. Harmonics of muted trumpet extend to 80 kHz; violin and oboe, to above 40 kHz; and a cymbal crash was still strong at 100 kHz. In these particular examples, the proportion of energy above 20 kHz is, for the muted trumpet, 2 percent; violin, 0.04 percent; oboe, 0.01 percent; and cymbals, 40 percent. Instruments surveyed are trumpet with Harmon ("wah-wah") and straight mutes; French horn muted, unmuted and bell up; violin sul ponticello and double-stopped; oboe; claves; triangle; a drum rimshot; crash cymbals; piano; jangling keys; and sibilant speech. A discussion of the significance of these results describes others' work on perception of air- and bone-conducted ultrasound; and points out that even if ultrasound be taken as having no effect on perception of live sound, yet its presence may still pose a problem to the audio equipment designer and recording engineer.


I. Introduction
Each musical instrument family — strings, winds, brass and percussion — has at least one member which produces energy to 40 kHz or above. Some of the spectra reach this work's measurement limit of 102.4 kHz.
       Harmonics of French horn can extend to above 90 kHz; trumpet, to above 80; violin and oboe, to above 40; and a cymbal crash shows no sign of running out of energy at 100 kHz. Also shown in this paper are samples from sibilant speech, claves, a drum rimshot, triangle, jangling keys, and piano.
       The proportion of energy above 20 kilohertz is low for most instruments; but for one trumpet sample it is 2%; for another, 0.5%; for claves, 3.8%; for a speech sibilant, 1.7%; and for the cymbal crash, 40%. The cymbal's energy shows no sign of stopping at the measurement limit, so its percentage may be much higher.
      The spectra in this paper were found by recording each instrument's sound into a spectrum analyzer, then "prospecting" moment by moment through the recordings. Two instruments (clarinet and vibraphone) showed no ultrasonics, and so are absent here. Other instruments' sounds extended high up though at low energy. A few combined ultrasonic extension with power.
      The mere existence of this energy is the point of this paper, and most of the discussion just explains why I think that the spectra are correct, within the limits described below. At the end, however, I cite others' work on perception of air- and bone-conducted ultrasound, and offer a few remarks on the possible relevance of our spectra to human perception and music recording.
 

II. Explanation of trumpet spectra in Figures 1(a) & 1(b)
The upper trace in Figure 1(a) shows the spectrum of a concert B-flat played on a trumpet with a Harmon ("wah-wah") mute, as captured by an Aco/Pacific quarter-inch microphone four feet away and analyzed with a Hewlett-Packard model 3567A FFT spectrum analyzer. This and all other instruments were played in normal concert fashion. (For details of instruments and players, see Appendix A.)
      The lower trace shows the background with the trumpet silent; this is dominated by the microphone's "self-noise," as shown in section VIII, below. Of course this background is present when the trumpet plays; and that is why the upper trace is identified as "Trumpet + Background."
      Are the trumpet peaks actually harmonics? To find out, we'd like to place markers at harmonic frequencies. To be easily readable, though, such a graph would have to be huge, so Figure 1(b) provides excerpts from it.
      The first excerpt shows the spectrum up to 8 kHz; the second, from 15 to 32 kHz; the third, from 38 to 53 kHz. Note the 100th harmonic at 46,560 Hz and the 108th at 50,263 Hz. (The vertical scale has been adjusted separately in each excerpt to make it easier to judge the presence or absence of harmonics. Figure 1(a) shows the overall relationships of level.) It is clear that the peaks are indeed harmonics (and equally clear in the omitted portions of the frequency spectrum, for this and the other spectra).
      The fourth excerpt shows that by 55 kHz, the harmonics are vanishing. Note that, as seen in Figure 1(a), the trumpet is still 12 to 15 dB above the background at this frequency; so the energy seen at 55 kHz, though non-harmonic, is still trumpet sound. To be conservative, however, I don't claim this portion of the spectrum as part of the sound; and Table I says only that harmonics are visible to "above 50 kHz." Similarly, where the last column in Table I shows that 0.5% of the total energy is above 20 kHz; this is calculated only to the 50 kHz limit given for the harmonics.
      In Figures 2(b) through 9(b), as in this one, the last excerpt will show the region where visible harmonics vanish.

 
III. More trumpet, horn, violin, and oboe
In the same way as just described for Figure 1, Figures 2 through 9 give information about other instruments whose sound has harmonics.
      Skipping Figure 1(c) for the moment, in Figure 2 we see another sample of trumpet with Harmon mute, 20 dB lower in level than the sample in Figure 1, yet with harmonics extending higher, and with a higher percentage of its total energy in the harmonics. (See Table I.) Figure 3 shows trumpet with straight mute. Here the harmonics extend higher yet, to above 85 kHz.
      Figures 45, and 6 give three examples of French horn, played respectively "bell up," with mute, and in normal fashion. One hundred or more harmonics are visible in each!
      Figure 7 shows a violin "double-stop", that is, two notes played simultaneously. Since each note produces its own harmonic series, Figure 7(b) uses markers of two different shapes to show the two harmonic series.
      Figure 8 shows a single violin note played sul ponticello, that is, with the bow very close to the bridge. This gives a distinctive squeaky-scratchy sound which composers sometimes specify, as for example Beethoven in the C-sharp Minor string quarter, Opus 131. Even in this mezzo-piano (medium-soft) note, harmonics are still visible past 40 kHz. (Due to absence of mind, I took no sample of normal violin sound playing a single note normally.)
      Figure 9 shows an oboe note. It is striking how the harmonics suddenly drop in level after the 40th at 43 kHz.
      Not shown are any clarinet or vibraphone samples, because, as mentioned above, I could find no harmonic activity above 20 kHz anywhere in several samples of each, despite the closest "prospecting" with spectrum analyzer. These were the only instruments of the group that did not show such activity.
 

IV. Microphone and analyzer distortion
We return now to Figure 1 to ask whether the harmonics are spurious. Are they perhaps caused by overload of the microphone or analyzer? The waveform from which the spectrum was derived is shown in Figure 1(c), between the "Begin" and "End" points. Gross microphone overload would be shown by "flat-topping," which is absent. Nor was the analyzer overloaded on this or other samples. [1]
      Microphone distortion short of gross overload is not a factor, either, according to information supplied by the makers of the microphones. 
[2] Capsule distortion is primarily 2nd harmonic, and falls 20 dB with every 20 dB drop in level down to 136 dB SPL (23 dB higher than any of my samples), continuing to fall below that level.
      Since distortion is predominantly second harmonic [2], a spectral peak at 50 kHz, if due to distortion, would be the second harmonic of 25 kHz. If the 50 kHz peak were found to be at the 0.1% level, that is, 60 dB below the 25 kHz peak, then it might be due to distortion—if the distortion were indeed as high as 0.1%.
      But in fact, the 50 kHz region in Figure 1(a) is 25 dB higher than this. Coupling this with the undoubted fact that the distortion is lower than 0.1%, the energy seen in the 50 kHz region is certainly not due to microphone distortion.
      Similar reasoning applied to Figures 2 through 9 leads to the conclusion that microphone distortion is not a factor in any of them, nor by extension in Figures 10-16. However, in addition to such reasoning, I wished to test the microphones directly. Verifying the performance of the capsules (the diaphragm assemblies) is beyond the capability of my equipment; however, these units are widely regarded as a "gold standard" and their performance claims universally accepted as true. I rely on this.
      I was able to test the preamps, however. I injected test signals into them via a B&K adapter, with a small capacitance to mimic the presence of the microphone capsule. Test signals were the following:

      a. B&K 2639: preamp: A pure tone at 550 Hz, at a level 1 dB higher than that of the loudest musical-instrument sample.
      b. B&K 2639: The same, but 1 dB lower than the softest musical sample.
      c. B&K 2639 and Aco 4012 preamps: Tone cluster at 500, 1000 and 1500 Hz, 1 dB higher than the loudest musical sample.
      d. B&K 2639 and Aco 4012 preamps: The same cluster, 1 dB lower than the softest musical sample.

      Figures 18(a) and (b) show the performance of the B&K preamp to signals (c) and (d) respectively. The preamp is clearly free of harmonics at both high and low levels. The small bump at 85 kHz in the low-level test is breakthrough from the switching power supply. [3] I don't know the source of the even smaller bump at 50 kHz. Both are so small that they may be ignored, however.
      The behavior of the Aco 4012 preamp was indistinguishable from the B&K 2639 at the higher level, and superior at the lower.
      Note that these tests of the preamps are also tests of the H-P 3567A FFT analyzer. From the clean results, one may conclude that neither preamps nor analyzer are creating a false appearance in any of the spectra in this paper.

 
V. Room acoustics and rattles
I assume that the room acoustics are linear, and thus cannot create spurious frequencies in the spectrum. On the other hand, the room does contain objects which could conceivably rattle at ultrasonic frequencies, including loudspeakers, vacuum tubes, fluorescent light fixtures, metal chassis, and so on. What is more, the time samples analyzed for instruments with harmonics were generally long enough (31.25 milliseconds) for the microphone to pick up not only the direct sound of the instrument but also many reflections, and conceivably rattles, from around the room. (This is another reason that I do not claim greater high-frequency extension for these spectra than validated by visible harmonics.)
      It is impossible however that these hypothetical rattles would fall precisely at frequencies that were harmonics for all the variety of fundamental pitches shown in the various Figures, so I discount rattles as a source of spurious harmonics.

 
VI. Instruments without harmonics
For sounds with no harmonics, the argument just given cannot eliminate the possibility of room contamination; so instead I attempted to eliminate the room from the sound. To do this, I analyzed only the very beginning of a sound (so the room was not already excited), and cut short the time record before the first reflection could return from the nearest surface. A shorter time record means coarser resolution of frequency; but since we're no longer looking for harmonic peaks, this does not matter. This procedure was followed for Figures 11 through 16; but first I present Figure 10, in which the speech sibilant of interest happened to come after the beginning of the sound, so any rattles might already have been excited. I believe Figure 10 to be all right nonetheless, for two reasons. First, the microphone was much closer to the desired source than to any possible rattle except those in the microphone mount, boom, or cable. Second, the spectrum presents a coherent picture. One would expect rattles to be at one or a few specific frequencies or narrow frequency bands; but this spectrum smoothly covers a very broad band. I rely on the latter point also to support my ignoring the possibility of rattles from the microphone cable, shock mount and boom in Figures 10-16.
       In Figure 10, since the room was already excited by the sound preceding the analyzed segment, there was no point in limiting the length of the time record; so I used a 31.25-millisecond record for a high-resolution analysis of 32 Hz per spectral line. I hoped this might reveal rattles more clearly; but none showed up. In Figures 11-16, as described, I analyzed the beginning of the sounds and cut off the time records before the first reflections.
      Figure 11(a) shows the spectrum of a claves strike; 11(b) shows the 60-microsecond rise from a standing start to 104 dB. If one discounts the first tiny wiggle, the rise takes just 30 microseconds.
      Figures 12(a) and (b) show a jazz-style rimshot on a remarkably beautiful-sounding drum. (See Appendix A for identification of instruments and musicians.)
      Figure 13 shows crash cymbals. Note that the energy at 20, 30 and 40 kHz is higher than at 2, 3 and 4 kHz respectively; and that at 100 kHz it is still far above the background. I had never heard crash cymbals up close before, but I now think that this sound would be adequate for a sound-track of either the Big Bang or the Apocalypse!
      Figure 14 shows a strike of a ten-inch triangle.
      Figure 15 shows keys jangling. Recording engineers often use this sound to test their equipment, and one can see that it is indeed demanding, with the energy at an elevated level from 7 kHz to above 40 kHz.
      Figure 16 shows a high note on the piano (G-sharp 72, where the notes are numbered 1 to 88 from lowest to highest). I took the hardwood floor of the concert room as part of the instrument, since a piano is always on a floor; so I cut off the time-capture not before the floor reflection but before the reflection from the nearest wall, 12 feet away. Note that the partials are not harmonic, as one sees clearly in 16(b). I'm not sure how to divide responsibility for this inharmonicity between the strings and the soundboard, both of which can vibrate non-harmonically (the former acting as a 'bar' rather than an ideal vibrating string; the latter, because the solutions for a two-dimensional system, as the soundboard essentially is, are inherently non-harmonic).
      Whatever the cause, even at middle C on the piano (not shown), the first seven partials do look harmonic; but higher partials of middle C do go increasingly sharp, and the 17th partial is where the 18th harmonic would be. (The way this non-harmonicity functions in piano sound, and perhaps in the meaning of piano music, might make an interesting study.)

 
VII. Aliasing and "window splatter"
Returning to the topic of possible errors in the spectra, note that a spectrum may be corrupted in more subtle ways than those already mentioned. Aliasing, for example, is the spurious appearance below the Nyquist frequency of energy actually above that frequency.
      In the case of the H-P 3567A spectrum analyzer, the Nyquist frequency is 131,072 Hz; that is, the analyzer samples 262,144 times per second. Of course the analyzer has an 'anti-alias' filter, which eliminates aliasing as a problem. But it's interesting to note that, since my point is the mere existence of ultrasonic energy from musical instruments, aliasing would be no problem even in the absence of the filter; for the presence of aliased energy would mean that the musical instrument sound extends above 131 kHz, which would make my point even more strongly.
      Besides aliasing, I also considered "window splatter." The phenomenon by which a pure sine wave appears in Fourier analysis not as a single narrow spectral line but broadened and with bumps decreasing in level on either side—this is well-known. Extending the idea, one can imagine that even if a spectrum had no content above 20 kHz, it might nevertheless look as though it did because of the adding-up of "bumps" from energy below 20 kHz. I call this putative phenomenon "window-splatter." (More generally, window-splatter means that every point of the spectrum is potentially affected by every other point; but we are concerned here only with the issue of energy above 20 kHz.)
      To calculate the effect of window-splatter, we should construct an artificial spectrum that is flat up to 20 kHz, then convolve this truncated spectrum with the Fourier Transform of the Hann window. The result will be a spectrum whose energy above 20 kHz will be due entirely to window-splatter.
      Luckily, it turns out that window-splatter is an insignificant source of corruption for this work. (Not non-existent, but insignificant.) Using the "Math" function of the H-P 3567A analyzer, I created a spectrum whose value was 1 at all points up to just short of 20 kHz. At 20 kHz and above, the value was 0. After the convolution, the point at 20 kHz was indeed raised to -12 dB relative to the constant spectrum, but points at higher frequencies were at -150 dB, indistinguishable from the computation noise. (Figure 17.)

 
VIII. Correcting for the microphones' response curves
With regard for the points already discussed — microphone and input overload, microphone distortion, room acoustics and rattles, aliasing and "window-splatter" — I see no reason to doubt the existence of the ultrasonic energy. I did however correct the figures to allow for the unflatness of the microphones' responses.
      Each microphone has not one but a family of responses: on-axis or random-incidence, each with protective grid on or off. The on-axis response with the grid off is very flat to 100 kHz; with grid on, it is not flat, nor even known beyond 70 kHz. The random-incidence response is not flat with grid on or off; and the two curves differ.
      I first considered how to correct the background spectra (the lower curve in Figure 1(a), for example). I reflected that if the microphone had no "self-noise," then the background I measured would consist only of ambient sound in the room. Since this comes from all angles in a random fashion (as verified in measurements not shown), and is therefore captured according to the microphone's random-incidence response curve, the correct spectrum would be obtained by applying the opposite of that curve to the measured background. In other words, if the random-incidence response curve supplied by the manufacturer is down by 5 decibels at a certain frequency, I should raise the background spectrum by 5 dB at that frequency to get a correct reading.
      If on the other hand the room were silent, so that the measured background came entirely from microphone self-noise, then no correction would be necessary, since self-noise is generated electrically in the microphone, and has nothing to do with the presence or absence of the grid, nor with the angle of a source relative to the microphone.
      Since the room is quiet and the microphone, like any quarter-inch microphone, is noisy, I thought it likely that microphone noise dominated. To check this, I compared two background measurements taken with the quarter-inch B&K 4135 microphone, one measurement with grid on and the other with grid off. These two spectra were identical; they superimposed even when viewed using a vertical scale of only 0.6 dB per division.
      If I assumed that the background were dominated by room sound, I would "correct" each trace to allow for the unflatness of its particular microphone frequency response. The grid-on and grid-off random-incidence responses of the B&K 4135 differ by 3 dB at 30 kHz, 4 dB at 40 kHz, 1½ dB at 50 kHz and presumably by substantial amounts at higher frequencies, where the calibration of one of the curves is not known (that is, not supplied by the manufacturer).
      Since the background spectra I obtained are identical, "correcting" them based on two different curves would give different spectra for the same acoustic background. This nonsensical result implies that the assumption was wrong and the measured background is indeed dominated by microphone self-noise.
      I conclude that microphone self-noise indeed dominates the background measurements, and therefore that they are good over the full band without correction.

Now I consider the musical instrument spectra; for example, the upper trace in Figure 1(a). Here I take the applicable curve to be the on-axis microphone response, since the microphones were always pointed at the instruments. While the on-axis "grid-off" curve is very flat to 100 kHz, the "grid-on" curve deviates by as much as 6 dB, and moreover is not known beyond 70 kHz. When correcting the response of a "grid-on" measurement, I freeze the correction at the 70 kHz level; that is, I make it constant from 70 kHz on up.
      It is likely that the grid-on response continues to roll off beyond 70 kHz, and that therefore a true correction would not freeze at 70 kHz but would continue increasing. This would raise the apparent level of the high-frequency energy. Thus, by freezing the correction, I am taking a conservative approach.
      It is even possible, contrary to my assumption of two paragraphs ago, that the on-axis curves may not be the appropriate ones to use at all. At very high frequencies, because of the shortness of the wavelengths and the acoustic "liveness" of the room, the instrument may be picked up more in a random-incidence fashion. If this be the case, then because the random-incidence response is down 9½ dB at 70 kHz, I should raise the measured spectrum by 9½ dB at that frequency, and by appropriate amounts at other frequencies.
      Thus, at very high frequencies the true spectra may be 10 or even 20 dB higher than the curves shown here. As I do not have facilities to decide in which fashion the microphone is picking up the instruments, I use the on-axis curves to be conservative.
      Using the "Math" facility of the H-P 3567A analyzer, all the spectra have been corrected as described here, to within ±0.5 dB.

 


 

Table I.  Ultrasonic Extension and Energy of Some Musical Instruments

A summary of this paper's findings. Column one refers to the figure showing the spectrum in question. Column two identifies the instrument. Column three gives the sound pressure level measured at the microphone. Column four gives the measured frequency extension: For instruments with harmonics, this is the highest frequency where harmonics are still present; for those without harmonics, the highest frequency where the sound is still at least 10 dB above the background. (See text.) The last column tells what percentage of the total energy is contained in the range between 20 kHz and the limit given in the previous column.

 
Instruments With Harmonics

Fig.   Instrument         SPL     Harmonics       Percentage
                          (dB)    Visible To       of Power
                                  What Freq.?    Above 20 kHz

1. Trumpet (Harmon mute) 96. >50 kHz 0.5 2. Trumpet (Harmon mute) 76. >80 " 2. 3. Trumpet (straight mute) 83. >85 " 0.7 4. French horn (bell up) 113. >90 " 0.03 5. French horn (mute) 99. >65 " 0.05 6. French horn 105. >55 " 0.1 7. Violin (double-stop) 87. >50 " 0.04 8. Violin (sul ponticello) 77. >35 " 0.02 9. Oboe 84. >40 " 0.01

 
Instruments Without Harmonics

Fig.   Instrument           SPL   10 dB Above    Percentage
                            (dB)   Bkgnd. to      of Power
                                  What Freq.?   Above 20 kHz

10. Speech Sibilant 72. >40 kHz 1.7 11. Claves 104. >102 " 3.8 12. Rimshot 73. >90 " 6. 13. Crash Cymbal 108. >102 " 40. 14. Triangle 96. >90 " 1. 15. Keys jangling 71. >60 " 68. 16. Piano 111. >70 " 0.02

 


 
IX. Results
Table I summarizes the results. Instruments with harmonics (Figures 1 to 9) are claimed to have energy to the highest frequencies where harmonics are still visible. Those without harmonics (Figures 10 to 16) are claimed to have energy to the frequency where they are still 10 dB above the background. These frequencies are listed in the fourth column of the table, while the last column tells what percentage of the total energy of each sample lies below these frequencies but above 20 kHz. That is, a figure of 0.5 in the last column means that half of one percent of the energy is above 20 kHz. As described above, every step has been taken to make these figures conservative, and the real figures may well be substantially higher.
      For the samples which include room reflections (Figures 1 to 10), I do not claim that our spectra are the "absolute" spectra that would be found in anechoic measurement, because the spectra may have been altered by room resonances. As my point is simply the existence of the ultrasonic energy, however, this does not matter.
      Since Figures 11 to 16 exclude room reflections, their spectra should indeed be quantitatively accurate to within the few-decibel total error of the analysis chain.

 
X. Significance of the results
Given the existence of musical-instrument energy above 20 kilohertz, it is natural to ask whether the energy matters to human perception or music recording. The common view is that energy above 20 kHz does not matter, but AES preprint 3207 by Oohashi et al. claims that reproduced sound above 26 kHz "induces activation of alpha-EEG (electroencephalogram) rhythms that persist in the absence of high frequency stimulation, and can affect perception of sound quality.[4]
      Oohashi and his colleagues recorded gamelan to a bandwidth of 60 kHz, and played back the recording to listeners through a speaker system with an extra tweeter for the range above 26 kHz. This tweeter was driven by its own amplifier, and the 26 kHz electronic crossover before the amplifier used steep filters. The experimenters found that the listeners' EEGs and their subjective ratings of the sound quality were affected by whether this "ultra-tweeter" was on or off, even though the listeners explicitly denied that the reproduced sound was affected by the ultra-tweeter, and also denied, when presented with the ultrasonics alone, that any sound at all was being played.
      From the fact that changes in subjects' EEGs "persist in the absence of high frequency stimulation," Oohashi and his colleagues infer that in audio comparisons, a substantial silent period is required between successive samples to avoid the second evaluation's being corrupted by "hangover" of reaction to the first.
      The preprint gives photos of EEG results for only three of sixteen subjects. I hope that more will be published.

In a paper published in Science, Lenhardt et al. report that "bone-conducted ultrasonic hearing has been found capable of supporting frequency discrimination and speech detection in normal, older hearing-impaired, and profoundly deaf human subjects." [5] They speculate that the saccule may be involved, this being "an otolithic organ that responds to acceleration and gravity and may be responsible for transduction of sound after destruction of the cochlea," and they further point out that the saccule has neural cross-connections with the cochlea. [6]

Even if we assume that air-conducted ultrasound does not affect direct perception of live sound, it might still affect us indirectly through interfering with the recording process. Every recording engineer knows that speech sibilants (Figure 10), jangling key rings (Figure 15), and muted trumpets (Figures 1 to 3) can expose problems in recording equipment. If the problems come from energy below 20 kHz, then the recording engineer simply needs better equipment. But if the problems prove to come from the energy beyond 20 kHz, then what's needed is either filtering, which is difficult to carry out without sonically harmful side effects; or wider bandwidth in the entire recording chain, including the storage medium; or a combination of the two.
      On the other hand, if the assumption of the previous paragraph be wrong — if it is determined that sound components beyond 20 kHz do matter to human musical perception and pleasure — then for highest fidelity, the option of filtering would have to be rejected, and recording chains and storage media of wider bandwidth would be needed.

 
XI. What Next?
A natural next step would be to measure the ultrasonic content of orchestral sound as heard from normal listening or recording distances. This will automatically allow for the absorption of ultrasonics by the air. The project will be expensive, because musicians' union rules require players to be paid at recording rates, which are several times ordinary "scale," whenever a live microphone is present; and I anticipate difficulty in having these rules waived for our research. We solicit funding for this project!

 
Acknowledgements
This project went from a long-held idea of mine to reality because of the enthusiasm of Scott Kelly, Sandee Perez and Hovel Babikian, students in my Caltech course "Projects in Music & Science," EE/Mu 107. I am grateful for their substantial help in getting started.
      Without the help of my friend Prof. Gerald Jay Sussman of MIT, the work would not have been finished. As a visiting faculty member at Caltech during 1991-92, and continuing since then, he has contributed his deep knowledge, lucid teaching, and experienced counsel.
      Sincere gratitude also to:
      Caltech's Robert McEliece, Barry Megdal and the late Ed Posner for interest and support;
      Hewlett Packard Corporate Gifts; Rick Walker (former student in EE/Mu 107) for his good offices; Mac MacDonald for patient and knowledgeable field support; and Fred Cruger, Paul Gallagher, Norm Olsen and Bob Youden;
      Aco/Pacific Co. and Noland Lewis for equipment and information;
      Bruel & Kjaer Instruments, and Erling Frederiksen and Joe Chou for information;
      "Anonymous," whose $10,000 gift allowed the purchase of crucial equipment;
      Doug Sax, Ernst Schoenthal;
      Denise Bovet, for calculations of inharmonicity of piano strings;
      the late Bart Locanthi, who was generous with his interest and knowledge;
      Julie Sussman, for careful reading and useful comments;
      and Daniel W. Martin, editor-in-chief of the Journal of the Acoustical Society of America; Patricia M. Macdonald, executive editor of the Audio Engineering Society Journal; and the anonymous AES reviewers; for their generous assistance and good suggestions. (This does not imply endorsement of this paper by these individuals or by their organizations or publications.)
      This paper is being published long after the work was completed because of the difficulty of creating publication-quality graphs from the data. I spent a lot of time and money discovering half a dozen programs that would not make acceptable graphs. Finally, Caltech undergrad Peter Oakley learned to use Matlab to do the job, and carried out the work reliably and creatively.
      The Web (HTML) programming of this paper was done by David Boyk, of MegaHard Design.

 

 

Btw,James Boyk is a piano player and I own at least 2 of his recordings.

 

I think that these information is an answer to your post.

 

Btw, I never post anyrhing where I'm not sure I understand or experienced.I never post something thatcomes onlyfrommy imagination.

 

The main issue is the same: we all have different IGNORANCE levels and what we post come from those ignorance levels.

 

R.

 

 

Post removed 

Raul,

Thank you for the background information that you rely on to support your opinions.  I am familiar with Oohashi's work and his research into the possibility that sound waves above the audible spectrum might cause physical reactions in subject audiences.  You should know that his results are controversial because the statistical sampling data do not support his theories.  Also there is countervailing data published years ago by JBL demonstrating that frequencies above ~15 kHz do not contribute meaningful musical information.  This was done by playing recordings of various instruments with wide frequency range, then randomly cutting off high frequencies.  This would be done back and forth several times to give listeners a chance to detect the change.  Very few listeners could detect cut offs even as low as 10 kHz, and none above 15 kHz.  It seems clear that you take the significance of harmonics at high frequency above the audible range very seriously.  I argue that they are not significant.   I apologize to you for the offense given by harsh language and withdraw the word hubris.  At the same time, I take umbrage at your assumption of ignorance on my part.