----------------------------------------------------------------------------- Barbershop Spectrogram 1/10/99 ----------------------------------------------------------------------------- The other day, Daniel Wolf recommended a spectrogram program and I downloaded it. It works really well. It seems we have reason to be happy that we live in a time when people can trade such valuable stuff without having to bother with value abstraction... but I digress. To test the program, I thought I'd try some Barbershop. For the test, I picked a brief excerpt of song by the Happiness Emporium. This group did most of their recordings (including this one, from their album "That's Entertainment!") back in the 70's, when Barbershop technique wasn't a shadow of what it is today. However, they have a nice sound, and I thought I'd give them a try. Using Cool Edit, I extracted the CD audio to 16-bit 44.1 WAV format, and mixed it down from stereo to mono. I then ran it thru Spectrogram, fiddling with the settings until I got a clear picture. The values used to produce the image full.bmp are: Attenuation=0 Palette=CB Freq Scale=Log FFT Size (points)=8192 Freq Resolution (hertz)=5.4 Band (hertz)=10-22050 Time Scale (ms)=12 Spectrum Average (ms)=1 Toggle Grid=off This example contains a nice C7 chord, and I cropped it with Cool Edit and spectrogrized it with the same options as above, except with a time scale of 4 milliseconds. Note: I downsampled full.wav to 22K with Cool Edit to make it smaller for you all. Up.wav remains at 44K. Note: When I save bmps with the grid on, the grid moves relative to the rest of the graph. This is why the bmp files on my web site don't have grids. Does this happen to you? Then, I opened Notepad with the idea that I'd record the frequencies of the first 9 peaks (starting at the bottom of the graph) in this chord that I could get a signal strength of greater than -40 dbs out of. I picked _9_ arbitrarily. I started at the bottom because I knew that's where most of the parts would be. I didn't take the readings from the same time on the graph. Rather, I took each reading at the strongest point in each peak (it happened that all of these fell very near each other, in about the third quarter of the total time). The crosshairs have a resolution of 1cps, and I did some careful work with the mouse. If there was a range of frequencies that shared the same strength, I took the one closest to the middle of the range, taking the higher frequency if the range was odd. After I had done all that and closed Spectrogram, I turned my text file into the chart up.txt. I am at a loss to explain the results. With the transform limited to a frequency resolution worse than 5 hertz, I should not have gotten, nor did I expect to get, anywhere near the accuracy I did. It may be noticed that I do not have the lead part labeled in the up.txt chart. At first I thought that maybe the baritone was at peak3 and the lead at peak4, with peak2 being 5-3. However, this would be an unusual voicing, and listening reveals that the Baritone is singing peak2, and the lead is singing a 5/2 above the bass. This peak is in the spectrogram (up.bmp) but not the chart because I couldn't get more than -40 db's out of it.