Escolar Documentos
Profissional Documentos
Cultura Documentos
Diversity
NiNA+ Voice Test Result Description
Manual
Manual
The firmware of the instrument makes use of several valuable open source software packages. For information, see the "Open
Source Acknowledgement" on the user documentation CD-ROM (included in delivery).
Rohde & Schwarz would like to thank the open source community for their valuable contribution to embedded computing.
SwissQual AG
Allmendweg 8, 4528 Zuchwil, Switzerland
Phone: +41 32 686 65 65
Fax:+41 32 686 65 66
E-mail: info@swissqual.com
Internet: http://www.swissqual.com/
Printed in Germany Subject to change Data without tolerance limits is not binding.
R&S is a registered trademark of Rohde & Schwarz GmbH & Co. KG.
Trade names are trademarks of the owners.
SwissQual has made every effort to ensure that eventual instructions contained in the document are adequate and free of errors and
omissions. SwissQual will, if necessary, explain issues which may not be covered by the documents. SwissQuals liability for any
errors in the documents is limited to the correction of errors and the aforementioned advisory services.
Copyright 2000 - 2013 SwissQual AG. All rights reserved.
No part of this publication may be copied, distributed, transmitted, transcribed, stored in a retrieval system, or translated into any
human or computer language without the prior written permission of SwissQual AG.
Confidential materials.
All information in this document is regarded as commercial valuable, protected and privileged intellectual property, and is provided
under the terms of existing Non-Disclosure Agreements or as commercial-in-confidence material.
When you refer to a SwissQual technology or product, you must acknowledge the respective text or logo trademark somewhere in
your text.
SwissQual, Seven.Five, SQuad, QualiPoc, NetQual, VQuad, Diversity as well as the following logos are registered trademarks of SwissQual AG.
Diversity ExplorerTM, Diversity RangerTM, Diversity UnattendedTM, NiNA+TM, NiNATM, NQAgentTM, NQCommTM, NQDITM, NQTMTM,
NQViewTM, NQWebTM, QPControlTM, QPViewTM, QualiPoc FreeriderTM, QualiPoc iQTM, QualiPoc MobileTM, QualiPoc StaticTM, QualiWatch-MTM, QualiWatch-STM, SystemInspectorTM, TestManagerTM, VMonTM, VQuad-HDTM are trademarks of SwissQual AG.
The following abbreviations are used throughout this manual: R&S___ is abbreviated as R&S ___.
SwissQual... Diversity
Contents
Contents
1 Introduction............................................................................................ 5
2 Listening Quality.................................................................................... 6
2.1
2.2
2.3
2.3.1
2.3.2
3.2
3.3
3.3.1
SwissQual... Diversity
Contents
SwissQual... Diversity
Introduction
1 Introduction
This document describes the technical background, the application scenarios as well
as the parameters that are measured with the single ended NiNA+ voice quality measurement. The application used was the SwissQual QoS Measurement System.
NiNA+ provides an opportunity for assessing the signal quality of a signal transmitted
via a telecommunications network without the knowledge of the originally transmitted
signal. The speech quality is determined by only using the output signal. SwissQuals
NiNA+ solution can be applied for rating of any arbitrary connection where a selfanswering far-end side is playing back human speech, for example, weather forecast
or similar. Since, NiNA+ can be applied on the mobile unit, the radio link forms part of
the tested connection. Of course, by using NiNA+ any fixed line connection, even
Voice over IP, can be rated.
Furthermore, the NiNA+ method is not restricted to end-to-end measurements; it can
be used at any arbitrary location in the transmission chain. It can be used for quality
monitoring at any electrical measuring point within a real established voice link, for
example, in a VoIP Gateway or a at an E1/T1 interface. The calculated score reflects
the true speech quality from the perspective of the end-user as if using a conventional
shaped handset at this measuring point.
SwissQual... Diversity
Listening Quality
Definition of Listening Quality
2 Listening Quality
For network operators or equipment manufacturers, it is important to know where and
why there is speech quality degradation. Since listening quality is a major factor determining customer satisfaction, encoding techniques must be designed for optimal
speech quality. In order to assess the quality of speech encoding techniques, largescale auditory tests are commonly employed. However, it is very difficult to reproduce
results obtained in such a way. Furthermore, such results are depending on the level of
motivation of the individual test candidates. It is, therefore, a big advantage to have an
automated method capable of physically measuring speech quality parameters and
producing results, which correlates as closely as possible with subjectively acquired
results.
Listening quality is a vague term compared with bit rate, echo or loudness. Since customer satisfaction can be measured directly by the quality of the transmitted speech,
encoding techniques must be selected and optimized based on their listening quality.
Listening Quality: covers the listening situation between the two calling parties,
where one party is talking and the other party is listening (non active)
Talking Quality: perceived quality by the talker during own speech activity (mainly
influenced by echoes and side tones)
Detailed definitions of these dimensions and test scenarios for auditory tests can be
found in ITU-T P.800 series.
SwissQual... Diversity
Listening Quality
Subjective and Objective Quality Assessment
Evaluation and rating is conducted on the received signal alone. (singleended method, might be a test call to a answering machine or even live monitoring)
SwissQual... Diversity
Listening Quality
Subjective and Objective Quality Assessment
Send signal
Receive signal
Experience
Expectation
Semantic
Human listener
Quality rating
Quality rating
SwissQual... Diversity
Listening Quality
Assessment of Intrusive-/Non-Intrusive Calls
Intrusive and double-ended: Both ends of the connection are under control and a
defined audio signal will be transmitted in this test connection.
Intrusive and single-ended: A test connection will be established to any answering station which is playing back a voice signal, for example, a weather forecast.
Here the same model is applied as the Non-intrusive In-service Monitoring.
SwissQual... Diversity
Listening Quality
Assessment of Intrusive-/Non-Intrusive Calls
The single ended models often look for pre-defined distortions by applying conventional signal analysis methods. This means, they are looking for background noises,
interruptions, frame repeats and so on. More advanced solutions try to reconstruct a
reference speech signal from the distorted one and apply similar psycho-acoustic
based methods for comparisons like the intrusive and double-ended methods.
Of course, the accuracy of a single ended approach is lower than that of an intrusive
and double ended approach. However, due to the advanced integrated speech extraction and the psycho-acoustic based calculations, the single-ended approach is now
accurate enough to be applied in real environments.
A non-intrusive, single-ended algorithm has two base applications, namely:
Gain a more realistic overview about the speech quality as perceived by the
subscribers
This is because the impact to speech quality coming from the sending side, for
example, Background noise, is included in the measurement and end result.
10
SwissQual... Diversity
11
SwissQual... Diversity
Finally, a set of quality describing characteristics are calculated and mapped into the
MOS-LQO.
Based on these characteristics also the cause analysis and the signal classification is
done.
For comparison, it runs nearly 100 times fast than ITU-T P.563 and even more than 20
times faster than SwissQuals speed optimized solution for P.563.
SwissQuals NiNA+ solution runs also on Windows 32bit platform. This low complexity makes NiNA+ to an ideal component at low performing platforms such as mobile
phone operating systems and digital signal processors.
Furthermore, the NiNA+ method has the following useful requirements on the speech
signal to be assessed to avoid false predictions or malfunctions.
12
SwissQual... Diversity
Speech Level: NiNA+ accepts range of active speech level from -16 dBov down to
-45dBov.
Higher levels will lead to annoying clippings of the higher amplitudes. However, if
the high speech level is caused by the network under test, it should be considered
in the quality but if the clipping is caused by measurement interface, it will lead to
artificial quality impacts.
Likewise, measurements with low speech level will have a decreasing signal-noiseratio caused by the limited digital resolution of the used A/D converter in the measurement environment. This will also lead to additional quality impacts. SwissQuals QoS system will ensure the proper level adjustment for all supported cellular
phones and ISDN/PSTN cards. Only in the transparent mode by using arbitrary terminals the customer it self has to control the correct level adjustment. For that reason speech levels, which are out of the recommended range, will be highlighted in
red color by analyzing the results in SwissQuals NQDI data interface. Files with a
speech level of below -65dBov will be not analyzed and a warning message will be
displayed.
Accuracy of predicted Listening Quality: The accuracy of the NiNA+ model was
by using large speech databases covering the complete scope of todays public
switched telephone networks.
The performance against well-known databases from the ITU-T set is shown
below. Due to the target applications from SwissQuals QoS system, a strong focus
was set for an outstanding performance in real live network connections, such as
the mentioned test real GSM with handset variations. The numbers are describing
the correlation coefficient between the MOS values obtained in the auditory tests
and the predicted scores by NiNA+. Therefore a third-order mapping was applied
before calculation of the correlation. The results below are comparing the NiNA+
performance with the current ITU-T standard P.563.
Table 3-1: Correlation coefficients between MOS values obtained in auditory tests and scores of
NiNA+
Speech Database
ITU-T P.563
NiNA+
0.902
0.905
0.842
0.918
0.916
0.857
0.929
0.903
0.895
0.925
0.935
Real VoIP
0.950
13
SwissQual... Diversity
Fig. 3-1: NiNA+ Listening Quality values for noise-free speech transmissions
This database shown in figure 3-1 is taken from the G.729 characterization phase of
ITU-T and consists of a wide range of existing codecs and combinations thereof. The
results given are on a so-called per-condition basis, which means the results of four
samples transmitted through the same application scenario were averaged.
14
SwissQual... Diversity
Fig. 3-2: NiNA+ Listening Quality values in GSM connections using real handsets
This database shown in figure 3-2 is taken from a subjective test performed or ITU-T
within the P.563 competition phase. It was organized by SwissQual in the Deutsche
Telekom Laboratories in Berlin. Compared to the common ITU-T databases, where
simulated speech files are used this test contains speech recordings in real GSM circuits. The speech signals were inserted in the handset microphone using an artificial
mouth in different acoustical environments.
15
SwissQual... Diversity
Sequences without or to less speech activity will be also analyzed but they will be signalized separately within SwissQuals QoS systems and instead of the results the information Silence or Speech Activity too low will be presented.
Typically, of most interest to the users is the Listening Quality value gained by figure 2-1 applying NiNA+. In line with ITU-T Recommendation P.800.1 it is called MOSLQO where the LQO stands for Listening Quality Objective. The MOS-LQO is defined
in range 1 to 5 where 1 is standing for bad and 5 for excellent speech quality. In real
measurements, the value will scarcely exceed 4.5.
In addition to the MOS-LQO, further analysis can be done by analyzing the average
section in NQDI.
The following values are presented in an average section:
16
SwissQual... Diversity
Static SNR in dB
Amplitude Clipping in %
Speech Activity in %
DC Offset in %
Pitch frequency in Hz
Signal Class
The MOS-LQO is truly the main result of the analysis and gives an overview about the
quality in a single number result. To give a bit more feeling about the results, which can
be expected, the following table lists results obtained by analyzing coded speech with
typical speech codecs.
Table 3-2: Typical MOS values of auditory tests and NiNA+
Codec
G.711
4.3
4.4
G.729
3.8
3.8
G.728
3.7
3.7
G.726 (32kbit/s)
3.9
3.8
GSM-FR
3.5
3.2
GSM-EFR
3.9
3.8
"Speech Level" shows the R.M.S. level of all frames containing active speech.
Because silent intervals and speech pauses will be not considered, it is a good
measure for the actual speech level control in the channel. The Speech Level is
presented in dB rel. to the Overload Point (32768 for 16Bit quantization) and is
close to Active Speech Level according ITU-T P.56.
"Static SNR" gives brief information about the signal-to-noise ratio of the signal.
17
SwissQual... Diversity
Here the ratio between the active speech and the estimated background noise floor
is calculated.
"Pitch Frequency" is a value which represents a pitch frequency of the input signal
in Hz.
The Pitch Frequency in case of speech signal is the fundamental oscillation of the
talkers vocal tract. Typical pitch frequencies for female speakers are in the range
100 to 200 Hz and for male speaker in a range 50 to 150 Hz. Even the Pitch Frequency is out of the telephone pass-band, it can be recovered by analyzing harmonic oscillations in upper frequencies.
"DC Offset": This number shows a constant value of the input signal in percentage.
Human ear can not perceive a DC Offset. The DC-Offset will not influence the quality score because it will neither be transmitted by the transducer in the terminal nor
perceived by a human ear. But a certain amount of DC Offset (>0.5%) signalizes
problems in the terminal interface or in the transmission channel itself.
"Amplitude Clipping": The latest versions of NiNA+ present the Amplitude Clipping
as a separate value.
In this case the corresponding label is enabled. This value describes roughly an
estimated amount of amplitude clipping. Since, no reference signal is available and
the hard saturation in the time signal might be affected by filtering, hence, this figure will only react on severe detectable clippings.
Noisy speech
No speech
It is possible to see more then one cause (code) in the average section. There are
eight different problem codes:
" Background noise" is signalized if the Noise Level is higher than -50 dB or the
static SNR is below 20 dB
"Modulated Noise" occurs when the segmental SNR is under-run a defined multidimensional threshold. It signalizes mainly signal-form speech codecs.
"Interruptions flag" is set to true if one or more signal interruptions are detected in a
speech signal
"Level problem" occurs if the signal level exceeds the nominal level for more then
10 dB. Likewise, this problem will be also signalized if the signal level will fall 12dB
below nominal level. Nominal speech level is -26 dBov (dB to digital overload
point).
"DC Offset" problem is shown when the DC offset of speech signal has exceeded
the predefined thresholds of +/- 0.2 %.
"Amplitude clipping" is shown if the saturation of the signal will lead to significant
distortions.
18
SwissQual... Diversity
The next step in the analysis is done by looking at the signal envelope as well as by
listening to the live recordings.
The envelope below presents the signal in the common time domain format figure 3-5.
Also here the experienced user can obtain some information as peaks and amplitude
clippings.
Furthermore, the NQDI presentation sheet gives the possibility to play back the
received sample by using the default or a specified audio player as well as several
options to export the results into external tables or text documents.
19