Escolar Documentos
Profissional Documentos
Cultura Documentos
7603/s40601-014-0015-7
GSTF Journal on Computing (JoC) Vol.4 No.3, October 2015
I. INTRODUCTION
DOI: 10.5176/2251-3043_4.3.328
The Author(s) 2015. This article is published with open access by the GSTF
23
The Author(s) 2015. This article is published with open access by the GSTF
24
The Author(s) 2015. This article is published with open access by the GSTF
25
b) Physical features
( )2 < 4002
=1
The Author(s) 2015. This article is published with open access by the GSTF
26
c) Perceptual features
The Author(s) 2015. This article is published with open access by the GSTF
27
The Author(s) 2015. This article is published with open access by the GSTF
28
B. Audio Identification
1) Overview
Audio identification is very challenging task compared
to audio classification since we have to specifically match
unknown audio object with thousands of pre-installed audio
objects whereas in audio classification we classify any audio
object into small number of pre-defined classes. As we
discussed earlier most of the researches have joined these two
together in order to get better result. First we classify
unknown audio object and identify its class then we can
match this unknown audio object among other pre install
object in the same class. By doing this we can speed up the
process by omitting the unrelated class of audio objects as
well as obtain better results.
In this section we focus only on the identification part.
According to the past literature, we can provide high level
overview of overall process done by most of the researches.
Look at the Figure 7. In here feature extraction part is exactly
same as the feature extraction of audio classifications which
we have already discussed. The key thing is to discuss here is
that how to create audio archives and searching
mechanisms. Thesetwo things we will discuss later in
detail.Apart from that almost all researches have framed
audio object into sets of overlapping frames. The reason for
doing it is a very important thing. Usually we have to identify
audio object like a song when a small part of it is presented.
This small part can be come from any place from the original
track. In this case we dont know the offset of that part. To
address this problem we can use framing thing. Look at the
Figure 8.
The Author(s) 2015. This article is published with open access by the GSTF
29
The Author(s) 2015. This article is published with open access by the GSTF
30
The Author(s) 2015. This article is published with open access by the GSTF
31
n,m(
,).
The Author(s) 2015. This article is published with open access by the GSTF
32
[2]
[3]
[4]
[5]
[6]
[7]
The Author(s) 2015. This article is published with open access by the GSTF
33
[8]
[9]
AUTHORS PROFIEL
The Author(s) 2015. This article is published with open access by the GSTF
34