In [22]:

%pylab inline
from __future__ import print_function
from __future__ import division

Populating the interactive namespace from numpy and matplotlib

Dirac Comb¶

http://en.wikipedia.org/wiki/Dirac_delta_function

http://en.wikipedia.org/wiki/Dirac_comb

An impulse train (Dirac comb) can be expressed as an infinite sum of harmonics of the same amplitude, whose fundamental is the frequency of the impulse train.

In [23]:

linspace(0, 1, 10)

Out[23]:

array([ 0.        ,  0.11111111,  0.22222222,  0.33333333,  0.44444444,
        0.55555556,  0.66666667,  0.77777778,  0.88888889,  1.        ])

$$ x(t) = cos(\omega t)$$

In [24]:

wt = linspace(0, 6 * pi, 20000)

In [25]:

oscillation = cos(wt)
plot(oscillation);

How do I get the next harmonic i.e. double the frequency of this sinusoid?

$$ x(t) = cos(2 \pi f t)$$

In [26]:

f = 3
w = 2 * pi * f

In [27]:

phase_t = linspace(0, w, 50000)
impulse = cos(phase_t) + cos(2*phase_t)
impulse /= 2
plot(impulse);

In [28]:

impulse = cos(phase_t) + cos(2*phase_t) + cos(3*phase_t) + cos(4*phase_t)
impulse /= 4
plot(impulse);

In [29]:

N = 100
phase = linspace(0, 6*pi, 50000)
harmonics = arange(N) + 1
impulse = zeros_like(phase)
for harmonic in harmonics:
    impulse += cos(harmonic * phase)
    
impulse /= N
plot(impulse);

In [30]:

plot(impulse)
xlim((0, 5000))

Out[30]:

(0, 5000)

Why do we use cosine? Isn't sine the same thing?

In [31]:

N = 100
phase = linspace(0, 6*pi, 50000)
harmonics = arange(N) + 1
impulse = zeros_like(phase)
for harmonic in harmonics:
    impulse += sin(harmonic * phase)
    
impulse /= N
plot(impulse);

Sampling¶

Sampling can be described as a multiplication between the Dirac comb and the signal:

In [32]:

comb = [1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0] * 25
comb [-1] = 1 # unholy trick to make some things simpler later...
# plot(comb)
stem(comb)
ylim(0, 1.1)
xticks(())
title("Dirac Comb")

Out[32]:

<matplotlib.text.Text at 0x7fa4960a1510>

To be strict, this that I just created is the Kronecker delta because it's discrete, but let's assume it's continuous

https://en.wikipedia.org/wiki/Kronecker_delta

Now let's get a function that we can sample:

In [33]:

from scipy.special import jn
jn(0, 0), jn(0, 1)

Out[33]:

(1.0, 0.76519768655796661)

Bessel functions: http://en.wikipedia.org/wiki/Bessel_function

In [34]:

jn(1,0), jn(7,0.3), jn(1, 1)

Out[34]:

(0.0, 3.3805443102187532e-10, 0.44005058574493355)

Our "Continuous" function:

In [35]:

x = linspace(0,10, 500)
plot(x, jn(1,x))
title('Bessel function of the first type, order 1');

In [36]:

x = linspace(0,10, 500)
plot(x, jn(1,x))
plot(x, jn(1,x) * comb);

In [37]:

plot(x, jn(1,x) * comb);

Now just keep the values when the Dirac comb == 1

In [38]:

sampled = jn(1,x) * comb
samples = []
for i in range(len(comb)):
    if comb[i] == 1:
        samples.append(sampled[i])

plot(samples, 'o');

In [39]:

len(samples)

Out[39]:

In [40]:

x = linspace(0,10, 500)
x_sampled = linspace(0,10, 26)
plot(x_sampled, samples)
plot(x, jn(1,x));

In [41]:

plot(x_sampled, samples, 'x-')
plot(x, jn(1,x));
xlim((0.5, 3))
ylim((0.3, 0.65))

Out[41]:

(0.3, 0.65)

In [42]:

from scipy.interpolate import interp1d
interpf = interp1d(linspace(0,10, len(samples)), samples)
sqe = (jn(1,x) - interpf(x))**2
plot(interpf(x))
plot(jn(1,x))

twinx()
plot(sqe, 'r')
axis(ymax= sqe.max())
ylabel('squared error', color='r',fontsize=18);

In [43]:

sqe = (jn(1,x) - interpf(x))**2
plot(interpf(x))
plot(jn(1,x))
ylim((0.3, 0.65))

twinx()
axis(ymax= sqe.max())
gca().set_ylabel('squared error', color='r',fontsize=18)
plot(sqe, 'r')
xlim((50,150));

In [44]:

MSE = sqe.mean()
print(MSE)

1.79419961959e-05

Mean squared error is a common way to quantify the difference between two signals:

$MSE = \frac{1}{n}\sum_{i=1}^n(X_1 - X_2)^2$

*Note:* This does not mean that the digitized signal has that error!

It just means that if we reconstruct the signal by drawing straight lines we get this error.

Sampling theorem¶

If a function $x(t)$ contains no frequencies higher than B hertz, it is completely determined by giving its ordinates at a series of points spaced 1/(2B) seconds apart.

In [45]:

x = linspace(0, 2*pi, 10)
x0 = linspace(0, 2*pi, 100)
plot(x, sin(x))
plot(x0, sin(x0))

Out[45]:

[<matplotlib.lines.Line2D at 0x7fa495573ad0>]

But.... isn't there loss here? Even though the samples are spaced closer than 1/B???

.

Although there is a difference between the different sampled signals, no information from the original signal has been lost!

i.e. when we do the ADC there will be no difference between either. (in theory... it's another thing in practice!)

Foldover/Aliasing¶

But there will be a difference if there are less than two points per sine oscillation, i.e. when the frequency we are sampling is higher that sr/2 (Nyquist frequency).

In [46]:

phs = linspace(0, 10 * 2 * pi, 300)
plot(sin(phs))

Out[46]:

[<matplotlib.lines.Line2D at 0x7fa4955fd3d0>]

Anything less than 20 points will cause problems:

In [47]:

phs = linspace(0, 10 * 2 * pi, 300)
plot(phs, sin(phs))
phs = linspace(0, 10 * 2 * pi, 12)
plot(phs, sin(phs), 'o--')

Out[47]:

[<matplotlib.lines.Line2D at 0x7fa495fc5190>]

In [48]:

phs = linspace(0, 10 * 2 * pi, 300)
plot(phs, sin(phs))
phs = linspace(0, 10 * 2 * pi, 21)
plot(phs, sin(phs), 'o--')

Out[48]:

[<matplotlib.lines.Line2D at 0x7fa4956e0f90>]

In discrete sampling twice the Nyquist frequency is the same as DC (i.e. frequency 0)...

In [49]:

phs = linspace(0, 10 * 2 * pi, 300)
plot(phs, cos(phs))
phs = linspace(0, 10 * 2 * pi, 11)
plot(phs, cos(phs), 'o--')

Out[49]:

[<matplotlib.lines.Line2D at 0x7fa494341b50>]

The frequency of the foldover component is:

$$ f_{ALIAS} = \frac{f_s}{2} - (f_0 - \frac{f_s}{2})$$

i.e. fold/mirror the frequency around the Nyquist frequency.

$$ f_{ALIAS} = f_s - f_0$$

More strictly:

$$ f_{ALIAS} = f_s - (f_0\pmod {f_s})$$

The frequency wraps around the sampling frequency.

Quantization¶

Once the signal has been sampled, a value needs to be assigned to it.

In [50]:

x = linspace(0, 2*pi, 300)
f = sin(x)
plot(f);

In [51]:

f = sin(3 * x)
f2 = (f*2).astype(int)
plot(f) 
plot(f2); 

These 3 different values can be encoded in 2 bits.

In [52]:

f = sin(x)
f2 = (f*2).astype(int)
f4 = (f*4).astype(int)
plot(f)
plot(f2)
plot(f4/3.0)
legend(['sine', '2-bit', '3-bit']);

In [53]:

2**2, 2**3

Out[53]:

(4, 8)

In [54]:

2**16

Out[54]:

In [55]:

#integer representations
x = linspace(0, 2*pi, 100000)
f = sin(x)
N = 16 # number of bits
max_value = 2**(N-1) - 1
f16 = (f*(max_value)).astype(int16)
plot(f16);

In [56]:

plot(f16, 'x-' )
xlim((0, 50))
ylim((0, 80))

Out[56]:

(0, 80)

In [57]:

N = 5 # number of bits
max_value = 2**(N-1) - 1
f8 = (f*(max_value)).astype(int8)
plot(f8)

Out[57]:

[<matplotlib.lines.Line2D at 0x7fa4940ef3d0>]

In [58]:

plot(f8, 'o')
xlim((0,20))
ylim((0, 10))
grid();

In [59]:

2**24

Out[59]:

16777216

Dynamic range¶

In [60]:

N = 16
20 * log10((2 ** (N - 1))/1)

def dynrange(N):
    return 20 * log10((2 ** (N - 1))/1)

In [61]:

print(dynrange(8), dynrange(16), dynrange(24))

42.144199393 90.3089986992 138.473798005

In [62]:

20*log10(0.5)

Out[62]:

-6.0205999132796242

In [63]:

20*log10(60/30)

Out[63]:

6.0205999132796242

Half the linear amplitude scale is only 6 dB!

Amplitude encoding¶

In [64]:

22050 * 16

Out[64]:

In [65]:

352800 * 60

Out[65]:

21168000

In [66]:

21168000/(8 * 1024)

Out[66]:

2583.984375

When the values for amplitude are stored directly from the linear measurements of energy, this form of "encoding" is known as LPCM (Linear Pulse Code Modulation)

http://en.wikipedia.org/wiki/Pulse-code_modulation

Differential Pulse Code modulation stores the difference between samples

In [67]:

from scipy.io import wavfile

sr,audio = wavfile.read('passport.wav')
plot(audio)
print(audio.max(), audio.min(), sr)

22542 -21853 44100

In [68]:

2**16

Out[68]:

In [69]:

audio.dtype

Out[69]:

dtype('int16')

In [70]:

x = linspace(0, 2*pi, 50)
plot(sin(x), 'o')
plot(diff(sin(x)))

Out[70]:

[<matplotlib.lines.Line2D at 0x7fa493fcbed0>]

In [71]:

dpcm = diff(audio)
plot(dpcm)
dpcm.max(), dpcm.min()

Out[71]:

(15460, -15785)

In [72]:

log(max(dpcm.max(), abs(dpcm.min())))/log(2)

Out[72]:

13.946267154895326

In [73]:

log(max(audio.max(), abs(audio.min())))/log(2)

Out[73]:

14.460327425907881

ADPCM (Adaptive DPCM) uses different resolutions depending on what it needs.

Delta modulation encodes using only 1 bit to describe the change, and so requires a higher sampling rate. e.g. DSD

http://en.wikipedia.org/wiki/Direct_Stream_Digital

A-law and $\mu\ \ $-law¶

The amplitude scale can be encoded "warped", i.e. non-linearly

$$F(x) = sgn(x) \frac{\ln(1+ \mu |x|)}{\ln(1+\mu)}~~~~-1 \leq x \leq 1$$

http://en.wikipedia.org/wiki/Mu-law_algorithm

The higher amplitudes are compressed to make the quantization non-linear.

In [74]:

x = linspace(-1,1, 100)
ylabel('out')
xlabel('in')
plot(x, x)
grid()

In [75]:

x = linspace(-1,1, 100)
bits = 8
mu = (2**bits) - 1

mu_shaping = sign(x)*log(1 + mu *abs(x))/log(1+mu)
plot(x, x, label= 'linear')
plot(x, mu_shaping, label= 'mu-law out')

legend(loc='best')
grid()

In [76]:

x = linspace(0, 2*pi, 100)
y = sin(x)

def mu_law(insig, nbits = 8):
    mu = (2**nbits) - 1
    return sign(insig)*log(1 + mu *abs(insig))/log(1+mu)

plot(x, y)
plot(x, mu_law(y))

Out[76]:

[<matplotlib.lines.Line2D at 0x7fa4960e5790>]

A-law:

$$F(x) = sgn(x) \begin{cases} {A |x| \over 1 + \ln(A)}, & |x| < {1 \over A} \frac{1+ \ln(A |x|)}{1 + \ln(A)}, & {1 \over A} \leq |x| \leq 1, \end{cases}$$

http://en.wikipedia.org/w/index.php?title=A-law_algorithm&action=edit

In [77]:

x = linspace(-1,1, 100)
A = 87.5
x = linspace(-1,1, 100)
a_shaping = sign(x) * where(abs(x) < 1/A, A*abs(x)/(1+log(A)), (1 + log(A*abs(x)))/(1+log(A)))
plot(x, x,label = 'in')
plot(x, mu_shaping, label= 'mu-law out')
plot(x, a_shaping, label = 'A-law out')


legend(loc='best')

Out[77]:

<matplotlib.legend.Legend at 0x7fa495766290>

In [78]:

plot(x, label = 'in')
plot(x, mu_shaping, label= 'mu-law out')
plot(x, a_shaping, label = 'A-law out')

legend(loc='best')

xlim((0,0.5))
ylim((0,1))
     

Out[78]:

(0, 1)

Floating point formats can be thought of as "adaptive"

Floating point formats are represented using mantissa and exponent. 32-bit floats: 24-bit mantissa and 8 bit exponent.

Digital to Analog conversion¶

Sample and hold¶

In [79]:

samplehold = interp1d(linspace(0,10, len(samples)), samples, kind='zero')
new_x = linspace(0, 10, 500)
plot(new_x, samplehold(new_x))

Out[79]:

[<matplotlib.lines.Line2D at 0x7fa493efdfd0>]

Low-pass filter¶

In [80]:

from scipy.signal import butter, lfilter

b, a = butter(4, 0.2, 'low')
lopassed = lfilter(b, a, samplehold(new_x))
plot(lopassed)

Out[80]:

[<matplotlib.lines.Line2D at 0x7fa495552510>]

In [81]:

b, a = butter(4, 0.1, 'low')
lopassed = lfilter(b, a, samplehold(new_x))
plot(lopassed)

b, a = butter(4, 0.05, 'low')
lopassed = lfilter(b, a, samplehold(new_x))
plot(lopassed, lw='4')

Out[81]:

[<matplotlib.lines.Line2D at 0x7fa494238ad0>]

In [82]:

from scipy.signal import bessel

In [84]:

b, a = bessel(10, 0.05, 'low')
lopassed = lfilter(b, a, samplehold(new_x))
plot(lopassed)

Out[84]:

[<matplotlib.lines.Line2D at 0x7fa48ff58ad0>]

In [85]:

plot(new_x*50, jn(1,new_x))
plot(lopassed)

Out[85]:

[<matplotlib.lines.Line2D at 0x7fa48ff9d150>]

In [86]:

plot(new_x*50, jn(1,new_x))
plot(lopassed[58:])

Out[86]:

[<matplotlib.lines.Line2D at 0x7fa48fe34250>]

By: Andrés Cabrera mantaraya36@gmail.com For MAT course MAT 201A at UCSB

This ipython notebook is licensed under the CC-BY-NC-SA license: http://creativecommons.org/licenses/by-nc-sa/4.0/