Escolar Documentos
Profissional Documentos
Cultura Documentos
‘HF Radio Systems and Techniques’, 7-10 July 1997, Conference Publication No. 41 1, 0 IEE, 1997
c
L ’82
next segment. The old signal segment is weighted with the
For a uniform change in the time scale, the time to falling portion of the windowing function while: the new
corresponding to the original articulation rate is mapped to segment is weighted with the rising portion of the
the transformed time t’, through the mapping t’, = p to.The windowing function.
case p > 1 corresponds to slowing down the articulation
rate by means of time-scale expansion, while the case p < Time-scale expansion is achieved by repeating the speech
1 corresponds to speeding up the articulation rate by means segments excised from the previous ones within the
of time-scale compression. Speech events which take place tolerance interval in the same manner explained above.
at a time t‘, according to new time scale will have occurred The time alignment between the successive windows with
at p ’ t 6 in the original time scale.The WSOLA algorithm respect to the signal similarity removes the phase
is a time domain process. It seeks to find a segment of the discontinuities. Therefore, time-scale modified waveform
input signal that will be overlapped with and added to the by the WSOLA algorithm can maintain maximal similarity
previous segment which lies within a prescribed tolerance to the original waveform across its segment joints.
interval around the synthesis instant. The position of the
best segment is determined by finding the value A=Am
lying within a tolerance region [-&m...A,ax] around the
analysis instant and which maximizes the cross-correlation SYSTEM IMPLEMENTATION
coefficients between the previous segment and the segment
under consideration. The basic synthesis equation used by
the WSOLA procedure is: An ATDD system was implemented using a SO MHz
TMS320C3 1 floating-point digital signal processor with
512 kilobytes each of RAM and ROM. The computer
system and external inputloutput audio were interfaced
using a 14-bit linear AID and DIA with a 8 kHz audio
sampling frequency. The processor board has digital ports
where v(n) is the square of a windowing function, w(n), Ak for connection to the radio Wrx control lines and external
are the shift factors within the range [-A,,...&,,], x(n) and timing reference signals. The digitized audio was stored in
y(n) are the input and output signal samples respectively, a 200 ms long buffer and compressed into bursts of 90 ms
L, represents the consecutive window positions, i.e., the in real-time. The remaining 10 ms reserved for in band
synthesis instants, and ?(LA represents analysis instants. timing reference signals. The radio channel was
w(n) is a Hann window’ with 50% overlap. The operation multiplexed into 100 ms transmission and reception slots.
of the WSOLA technique is illustrated in Figure 1 and DSP software was written in “C” language for the
explained below. implementation of the WSOLA algorithm. Figure 2 shows
the ATDD system and HF radio equipment connections
In the implementation of this algorithm, regularly spaced for one end of a radio link. The other end of the ATDD
synthesis instants L, = kL are chosen. Proceeding in a left- link has the same configuration of radio and ATDD
to-right fashion (in Figure 1 ) and assuming the segment processor. Audio signals from the ATDD system were
(A) was the last segment excised from the input and added interfaced to the line-in and out ports of the radio
to the output at time instant L,-/=(k-I)L, i.e., segment (a) = equipment and the control port of the ATDD was
segment (A). WSOLA seeks to find a segment (b) that will connected to the PTT line of the radio system. Figure 3
overlap-add with (a) in a synchronized way and can be shows the compressed signals to and from the audio ports
excised from the input around time instant z’(LJ. As (A’) of the HF transmitter and receiver. The top trace shows the
will overlap-add with (A) = (a) in a natural way to form a compressed 100 ms bursts of audio fed to the transmitter
portion of the original speech, WSOLA can select (b) such while the bottom trace shows the audio output from the
that it resembles (A‘) as closely as possible and is located receiver as observed at points A and B of Figure 2
within the prescribed tolerance interval around z’(LJ in respectively.
the input wave. The position of this best segment (B) is
found by maximizing a normalized cross-correlation
similarity measure between the sample sequence Experimental Setup
underlying (A’) and the input speech. After overlap-adding
@) with (a), WSOLA proceeds to the next output segment,
where (B’) now plays the same role as (A’) in the previous The experiments employed a transmit site near Ottawa and
step. In the overlap and adding process each overlapping a receive site 210 km southwest of this location near Lake
signal segment is weighted with one half of the windowing Ontario, for a period of four days during October 1996.
function for smooth transition from one segment to the Test signals were transmitted at a frequency of 4 MHz
using a horizontal multi-band fan dipole antenna at a
power level of 100 Watts and were received using a
‘The term Hann window is conventionally horizontal dipole antenna. Since the ATDD systlem relies
known as Hanning window, after Julius Von Hann. on the accurate synchronization of transmission and
283
I , , A- Tx input
I R- Rx O u t D u t