Audio encoding for BBB

Background

Basic PCM (Pulse Code Modulated) is the digital representation of a sampled analog signal. Specifically for audio purposes, the typical format that the dat used to store this raw data is typically WAV file is the raw file which stores the data without any compressions techniques. This is in the form as shown below:

Pulse-Code Modulation Waveform for 4-bit data.

These raw signals are very easily managed since no additional processing needs to be done to output this signal to an DAC. However, the raw data files are extremely space-inefficient. This website discusses in detail what a normal sized audio file would typical cost in terms of size for a particular sample/bit rate. It shows how storing the raw audio of a song would be ~ 30Mbytes, while an equivalent MP3 would be about 2.82Mbytes.

For audio signals, the typical sample rate is typically 8kHz (2x speech bandwidth). Which is less than the sample rate for songs due to the full human hearing range going up to 20kHz, the sample rate is 44kHz or more. Consider storing a 3 minute speech track as a raw WAV file without compression will result in the following size.

Size
= (#bits/sample) * (sample freq) * (length in seconds)
= 16*8000*3*60
= 23040000bits
= 2.88 Mbytes

Although this seems small, by encoding this equivalent file as an MP3, we will see alot more size saving.

Mu-Law Encoding.

There are multiple methods to encode audio, one such method which is very simple to implement is the mu-law. This type of encoding takes advantage of the non-linear hearing capabilities of the human ear or a poor dynamic range. An example is how one who is listening to a rock concert won't be able to hear the whisper of someone next to them.

Utilizing this non-linearity, the mu law algorithm disregards the lower significant bits. This article discusses mu law in a more detailed fashion. Using the mu-law, a file which would typically require 9 to 16 bits per sample (2 bytes per sample) would now require 8 bits (1 byte per sample). This effectively reduces the size of the file by 50%.

Implementation Alogorithm

The following are the steps to perform mu-law encoding, Implementation details will be added at a later time.

Save Sign Bit
Clip value to ensure no overflow when bias is added
Add a bias to ensure there is a 1 within the Exponent Region (most significant 8 bits to the right of the sign bit)
Determine the Mantissa Region (The next most significant 4 bits to the right of the most significant "1" within the exponent region)
Using a 3 bit encoding for the position of the most significant "1" within the exponent region. (Most significant = position 7, least significant = position 0)
8bit encoding SPPPMMMM (P => binary encoding for the most significant 1, M = Mantissa

Example 11931 (0010 1110 1001 1011)

Save Sign Bit: (S) = 0
Clip Amplitude: Clip to 2^(N-1) - Bias = 2^15-132 = 32636 (No need to clip in this case)
Add Bias: 11931 + 132 = 12063 (0010 1111 0001 1111 )
Determine Exp Region: 0010 1111 0001 1111 --> Most significant 1 is in position 6 (110) of Exp Region
Determine Mantissa Region: 0101 1110 Mantissa = 0111
Mu-Law Encoding Value: SBBBMMMM = 01100111

Testing Limits of BeagleBone Black

Overview

This is to document the limitations of the Beaglebone black. One caveat is that Adafruit Python BBIO Library was used. Any overhead from this may limit the behavior of the BBB. This will be consistently updated with new material.

Table of Contents

PWM

PWM

PWM module available on the BBB is from the sitara core (AM 3359). It is a ehrPWM which according to documentation uses at max a 200Mhz reference clock. However, this only tells me the absolute maximum the PWM can run at. To investigate the actual performance limits, I ran an experiment where the frequency of the PWM module was varied with a 50% Duty Cycle.

Theoretically, we would want the square pulse where it would settle to the final value, but this may not matter we it was used to drive a switch.

5MHz PWM

Square waveform preserved, usable as a PWM.

8MHz PWM

9MHz PWM

Ringing Effect Still here

Results

Safe frequencies to use --> <9MHz: For audio purposes, this is much more than needed.

Next Steps:

Determine if there are duty cycle limitations to the PWM at higher frequencies.
Determine maximum frequency where square wave is majorly affected by the ringing and wont settle to a final value.

Foray into the world of Beaglebone Black...

There was one choice I had to make during my final year as an engineering student. Choose a thesis that would allow me to transition into grad school, or choose something that would allow me to try my hand at something that was out of my comfort zone. I took the latter choice, and embarked on the journey of designing a system which would allow for me to make a whole audio system to interface with the Beaglebone Black.

The Goal:

Build an open-source Hardware shield for the beaglebone black that would allow high quality Video/Audio recording.
Build an open-source Software Library to support the use of the Audio Video Shield

The task is daunting considering my lack of experience in analog design.

Hobey Ho. Let the fun begin