Audiophilia

Humans are hard-wired to find patterns.

Even when there are none.

Explaining those patterns (at least, the ones which are real) is what science is all about. But, even there, lie pitfalls. Have you really controlled for all of the variable which might have led to the result?

A certain audiophile and journalist posted a pair of files, containing a ~43sec clip of music, and challenged his readers to see if they could hear the difference between them. Sure enough, “File A” sounds a touch brighter than “File B.” A lively discussion ensued, before he revealed the “reason” for the difference:

File B was recorded, from his turntable, via a straight-through cable.
File A was recorded, from his turntable, via a cable that passed through a switchbox.

Somehow or other, the switchbox (or the associated cabling) was responsible for the added brightness. An even more lively discussion ensued.

Now, if you open up the two files in Audacity, you discover something interesting: File A gets from the beginning of the musical clip to the same point at the end of the clip 43 milliseconds faster than File B does. That’s a 0.1% difference in speed (and hence pitch) of the recording. Such a difference, while too small to be directly discernible as a change in pitch, ought to be clearly perceptible as “brighter.”

On the other hand, I would contend that putting a switchbox in the signal path cannot possibly cause the information, traveling down the wire, to have shorter duration. The far more likely explanation was that there was a 0.1% variation in the speed of Mr. Fremer’s turntable between the two recordings.

I used Audacity’s “Change speed …” effect to slow down File A (by -0.099 percent), clipped the result (so that the two files have exactly the same duration), and posted them below.

One file.
The other file.

See if you can tell which one is Mr. Fremer’s File A, and which is his File B, and, more importantly, whether you can detect a difference in the brightness (or edginess or whatever other audiophiliac descriptions you’d like to attach to them), now that the speed difference has been corrected.

Update: (2/9/2014)

As I said in the comments, and as anyone who opens up the files in Audacity immediately discovers, it is easy to tell which file is which, by examining their waveforms. Here are Files “B” (upper) and “D” (lower)

(click on any image for a larger view). It’s not obvious, from this picture that they’re the same (bear with me, about that). But, if we think they are, it’s easy enough to check.

First, zoom in as far as you can.
Then use the time-shift tool, to align the two waveforms precisely, at some point in the clip. It’s best to focus on a segment with some high frequency (rapidly varying) content. Fortunately, the pops and clicks on Mr. Fremer’s vinyl record give us plenty to choose from.
Now select one of the wave-forms, and choose “Effect“→”Invert”.
Finally, select both and choose “Tracks“→”Mix and Render”.

If we’re correct, the two traces should precisely cancel each other out.

And they do (except for the little bits at the beginning and end that I trimmed in creating File “D”). Note that I magnified the vertical scale by a factor of 5, so that you can really see the perfect cancellation.

You can repeat the same procedure for Files “B” and “C” but, no matter how carefully you perform step 2, you can never get anything close to perfect cancellation. So, without even listening to the files, Mr. Fremer (after he got over his initial misapprehension that “C” and “D” were the same) was able to confidently determine which was which.

In a way, that’s rather disappointing, because it doesn’t really tell us anything about how different “C” and “D” are, and whether fixing the speed of the former diminished, in any way, the perceptible differences between them. As Mr. Fremer says,

As to how the two files sound, I didn’t have time last night to listen but will do so today. Of course I know which is which so I’m not sure what my result might prove.

But, since we have Audacity fired up, let’s see what the story is.

Though I said that, for the clip as a whole, there’s no way to line the tracks, so as to achieve cancellation, on a short-enough timescale, you can get very good (though, of course, not perfect) cancellation. The cancellation doesn’t persist – the tracks wander in and out of phase with each other^☭, due (at least in part, if not in toto) to the wow-and-flutter of Mr. Fremer’s turntable.

Here’s an example (chosen because Mr. Fremer seemed to think that there’s a flagrant disparity in the waveforms, right in the middle of this excerpt).

File C (upper) and D (lower), from 32.750s to 33.000s.

Following the procedure outlined above, we align the two clips at the center, and attempt to cancel the waveforms:

They cancel very well at the center, but progressively poorly as you move to either end, where the two tracks wander out-of-phase. Notice the sharp spikes. If you zoom in, you’ll notice that these are actually S-shaped: they’re the result of superposing two musical peaks (one of which we inverted, of course) that have gotten slightly out-of-phase with each other. They cancel at the center, but not at either end, where they have ceased to overlap. Of course, not just the peaks but everything else has also gone out-of-phase, so these S-shaped spikes sit on top of an incomprehensible hash.

You can repeat the process for other short excerpts, with similar-looking results.

Now, “Andy”, below, says he heard a systematic difference between the files, similar to what others reported for Fremer’s Files “A” and “B.” That bears further investigation. But I’ve said to Mr. Fremer that, if he really wants to get to the bottom of what differences, if any, the cables contributed to these recordings, it would be best to eliminate the wow-and-flutter that is clearly responsible for most (if not all) of the visible differences displayed here.

He should start with a digital source (like, say, one of the 24/96 FLAC files we’ve been discussing), played back through the two different cables he wants to test. That source, at least, won’t vary (in a random and uncontrollable fashion) from one playback to the next.

Repeatability is another one of those things that we strive for in Science.

^☭ In fact, it is that wandering in-and-out of phase that is the most glaring difference between the files and it (rather than the more sophisticated procedure that I outlined above) is what makes it trivial, for even a casual observer, to pick out which file is which.

Posted by distler at February 7, 2014 12:27 PM

TrackBack URL for this Entry: https://golem.ph.utexas.edu/cgi-bin/MT-3.0/dxy-tb.fcgi/2694

Re: Audiophilia

So I have listened to these two samples. I have invited co-workers to listen to them and I have invited guests of our establishment to listen to them. All of us have used the very same computer station, volume, settings etc. and none of the listeners other than myself were introduced to the context of this listening. All of us (admittedly, only 5 people) have said we hear them as being different. All of us have also stated individually, again without prompting or context added, that File C is a brighter/harsher/more revealing version and that File D is warmer/fuller/stuffier sounding. I don’t know what result you are hoping/expecting/predicting but that is what 5 people thought here. I hope you update this at some point with your own ‘tell’ of what you are testing. Do you hear a difference?

Posted by: Andy on February 8, 2014 10:46 AM | Permalink | Reply to this

I hope you update this at some point with your own ‘tell’ of what you are testing.

I will … eventually. But I want to wait a bit, until Mr. Fremer (and some more of the folks who participated in his original listening test) weigh in.

Yesterday, Mr. Fremer was saying that he thought Files C and D “appear to be identical.” but that he was going to give them a closer listen, today. They are different, but the differences are more subtle than in his original test (where “A” was too fast).

The questions are:

Are the differences audible-enough to be picked-out, consistently, by the listeners?
Can the listeners correlate the differences, that they hear, with those of Fremer’s original Files A and B? I.e., having eliminated the speed difference, are there audible differences that (unlike the speed difference) might be reasonably ascribed to the cables used in his test?

Thanks for your (and your colleagues’) vote(s)!

Posted by: Jacques Distler on February 8, 2014 11:45 AM | Permalink | PGP Sig | Reply to this

When I wrote the two files “appear to be identical” I was being literal. The “C and D” waveforms looked identical yet when I loaded my “A and B” waveforms to Audacity they were visually distinctive. Mr. Distler asked me to slow down the .1% fast file and look again. Again, I could see differences I could not see on his files “C and D”. Having been falsely accused of ‘doing something’ to my files I was not asking these questions because I thought Mr. Distler had done something to his. He sent a time-spread version of his that did shows slight differences but still I wonder why the more coarse renderings of my files, with one speed-correct still look different while his looked identical.

I certainly don’t subscribe to the “digital is perfect” school so I don’t know what speed shifting on Audacity might contribute to any sonic change and/or masking, or what other factors might be involved but once this test is completed there will be others involving but one variable: the first of which will be the same file with a .1% pitch difference being the only variable.

Posted by: Michael Fremer on February 8, 2014 1:43 PM | Permalink | Reply to this

Screen dithering

… still I wonder why the more coarse renderings of my files, with one speed-correct still look different while his looked identical.

Screen dithering.

At low resolution, even the same waveform will look different, depending on precisely where it is on-screen, because the locations of the points on the graph are rounded to the nearest pixel.

If you hit the “magnify” button a few times, the changes due to screen dithering go away, and you can concentrate on the actual differences in the wave form.

Of course I know which is which so I’m not sure what my result might prove.

That’s too bad. It would have been nice to know what the two files sounded like to someone who hadn’t first peeked at them in Audacity.

Posted by: Jacques Distler on February 8, 2014 1:55 PM | Permalink | PGP Sig | Reply to this

To be honest, I can not hear a brightness difference at all. Perhaps if my headphones were handy…. but on a couple of mediocre speakers attached to my computer, they are pretty similar.

But more importantly… what the heck happens to Samson next in the tale?

-cvj

Posted by: cvj on February 9, 2014 7:52 PM | Permalink | Reply to this

I tried the files on my headphones but I still can’t find the difference. I guess I should buy more expensive ones. The differences might be there but they’re very hard to detect with amateur gear.

Posted by: John on February 20, 2014 6:52 AM | Permalink | Reply to this

It is interesting to do the following experiment.

With AUDACITY play at the same time file A+B. The result is an out of phase, very clear to hear.

Now play C+D. The result is again an out of phase but very different to the A+B file.

Any clue?

Posted by: Antonio Garcia on February 25, 2017 10:16 AM | Permalink | Reply to this

Musings

Skip to the Main Content

February 7, 2014