Linear Predictive
Coding (LPC) is a speech analysis technique which is one of the
most useful methods for encoding good quality speech at a low
bit rate. It supplies an extremely accurate and relatively efficient
calculation to estimate speech parameters. [52]
There are several
procedures in this technique.
Prediction:
This is a process of determining the relationships that enable
the value of the next output sample to be calculated as a combination
of previous output samples in a sampled data system. That is,
to be predicted according to the trends of the past values. That
is why it is called Linear.
Compression:
Using Linear Prediction, a sound can be reduced to a spectral
envelope, a small bit of other data, and an excitation source.
Pitch/Speed
Shifting: To alter the pitch or speed of a sound independently
from one to another. LPC corrects problems like munchkinization
by maintaining the correct amplitudes over their associated frequencies,
which holds the formants in their correct positions. Then sounds
can be sped up, slowed down or have their pitches raised or lowered
with the sound remaining completely natural.
Cross-Synthesis:
To recreate an actual sound from a LPC analysed signal, some
other information like pitch and excitation must be measured.
If a voice is re-synthesised using its LPC analysis without the
pitch information, the voice will sound very robotic. By replacing
the excitation sound with another sound you can create a completely
new sound. One classic example is cross-synthesising an orchestra
with a person's voice. The result is a talking orchestra!
[53]
The detailed procedures
are listed in the following diagram (Figure
A-1).

Figure
A-1 LPC Diagram[53]
Pulse Wave:
In the LPC diagram, this represents a pulse-type waveform which
is used as an exciter for recreating the pitched portions of the
original sound.
Noise: In
the LPC diagram, this represents a source of random noise which
is used as an exciter for recreating the non-pitched portions
of the original sound.
Balance: This
unit of the LPC system balances the amount of each type of exciter,
to create the perfect amount of non-pitched, and pitched sound.
The values for balancing are measured and stored as "Threshold"
with the LPC coefficients and pitch information.
Pitch: This
path represents the original pitch information needed to recreate
the original signal from the LPC analysis.
Threshold: This
is the information for the Balance unit which determines the balance
of the pitched and non-pitched exciters.
Amplitude: This
information is used to scale the exciter signal, and in turn the
overall output, to match the original signal's amplitude shifts.
All-Pole Filter:
The All-Pole Filter is a filter which will shape the spectrum
of the incoming signal according to the LPC analysis of the original
sound's spectrum.
Coefficients
(B's): These are the results of the original LPC analysis.
These numbers must be used by the All-Pass Filter to shape the
incoming exciters so their spectrum will match accordingly.
[53]