Você está na página 1de 4

Bird Species Recognition based on SVM Classifier

and Decision Tree


Baowen Qiao1,2 Zuofeng Zhou, Hongtao Yang, Jianzhong Cao
1
Shaanxi normal university Xi'an Institute of Optics and Precision Mechanics of CAS
School of physics and information technology Xi'an, P.R.China
2
Xi'an Institute of Optics and Precision Mechanics of CAS zfzhou@opt.ac.cn
Xi'an, P.R.China cjz@opt.ac.cn
514091777@qq.com

Abstract—Bird species recognition is a challenging problem due classification is low, the correct rate is only 17% when they
to the variant illumination and different view point of camera. In distinguish 17 species of birds.
this paper, a new feature which is the ratio between the distance
of the eye to the root of beak and the distance of the width of the In this paper, we propose a novel bird recognition method
beak is used to distinguish the different bird species. Integrated based on SVM decision tree. The SVM classifier and decision
the new feature into the multi-scale decision tree and the SVM tree are combined to identify birds, and a new beak feature is
framework, a new bird species recognition algorithm is proposed presented to describe the bird. Both of them have great impact
to get the final recognition result. The Experiment results show on classification. In this paper, color feature and shape feature
that the proposed new feature can improve the correct are chosen as the feature of bird. The color features of this
classification rate about nine percent. paper include the color moment, the special color and the color
of the head. There are a lot of color space to describe the color
Keywords-component; Pattern Recognition; SVM; Decision feature such as RGB, YUV and HSV. In RGB color space,
Tree; bird species recognition each pixel is composed of three components, which are red,
green and blue, any other color is made up of these three
I. INTRODUCTION components. RGB color space is typically used to display
Science and technology have improved our quality of life, system, but RGB color space is not in conformity with the
but the rapid development of some industries has given up human visual characteristics. YUV color space is mainly used
people’s future living environment and causes an adverse for color video signal transmission because it just takes up
impact on the survival of some wild animals [1]. Taking minimal bandwidth. However, the color feature in HSV color
measures to protect environment and endangered animal is an space is able to represent the difference of color better compare
urgent task. Because birds are numerous and sensitive to with the color feature in RGB color space and YUV color
environmental changes; also, and are easier to monitor than space. And the shape features of this paper include the feature
other species, observing the behavior of birds allows us to of beak and moment invariants. According to extract the new
better evaluate our living environment. Therefore, it is beak feature which is the ratio between the distance of the eye
particularly important to identify the birds. With the to the root of beak and the distance of the width of the beak to
development of image identification technology, using modern identify the birds, the correct classification rate has increased
technology to identify birds has become an effective research about nine percent.
method. This paper proposed a new algorithm for bird
identification based on color image. II. PROPOSED APPROACH
Although pattern recognition has been more than half a In this paper, the color moment, the special color, the color
century of history, identification for bird is still rare. In last of the head, the feature of beak and moment invariants are
years, several approaches have been proposed to recognize the integrates as a feature to describe the bird. A method which
bird. Using birds’ voice for bird recognition has achieved combines the SVM classifier and decision tree and a new bird
interesting correct classification rates [2]-[5], but bird feature are proposed.
recognition based on image signals is facing the challenge
because of illumination, different point of camera, background A. SVM decision tree
and so on. In 2010, Wang [6] creatively uses the 32 Decision tree induction offers a highly practical method for
dimensional HMMD color structure descriptors and the width generalizing from examples whose class membership is known.
and height information about the beak and head of the bird, but The decision tree method is simple and easy to understand, and
the feature they propose lack of criterion, for other people, it is the model of decision tree is not only efficient but also has a
hard to extract the feature they want. In 2013, Marini [7] high accuracy of classification. Similar birds are divided into a
extracted the three components of the RGB color space set such as the bird of the same subject, it avoids the
histogram as a feature, his method can not only distinguish the interference of low similarity of birds. Furthermore, the support
birds but also split the picture. But the correct rate of vector machine (SVM) is a novel learning machine with many
merits such as fast solving and strongly generalizing ability so
that it is diffusely used in recent years. The SVM classifies the as a feature to describe the beak, but it lost the information of
data by mapping the vector from low-dimensional space to the length because of the change of the height of the beak, how
high-dimensional space using kernel function and has the to determine what kind of beak belong to is a problem.
advantage of small sample data for support vector to complete Therefore, a new beak feature has been proposed. Compare
the linear or nonlinear programming problem. According to with the width of beak, range of change of the distance
their character, the SVM classifier and binary decision tree are between the eye and the root of beak is relatively small. The
combined in paper. Multi class classifier is shown in Figure 1. beak is described by using the ratio between the distance of the
The color feature, beak feature and moment invariants are eye to the root of beak and the distance of the width of beak
blended for a feature which is used to train the SVM classifier (R-ERWB) is more effective. As shown in Figure 2, the three
and classify. points were the beak tip, the root of beak and the bird eye.
Figure 3 shows that beak features for classification have a good
effect.

Figure 2. An example of extracting three feature points of R-ERWB

Figure 1. It is 15 kinds of birds used to classify. All SVM classifiers are


trained by corresponding birds’ feature. And A-O represent fifteen different
kinds of birds

B. Feature extraction
Birds have many kinds of gestures such as side of the
camera and back to the camera, but the side of the bird’s head Figure 3. Classification results of beak feature. X-axis is R-HBWB and y-
is usually caught. In the process of identifying bird of human, axis is R-ERWB
beak is a particularly important factor, beak feature of many
birds are so various that people can identify easily. Finding Geometric moment was presented by Hu [8] in 1962, since
characteristics of the beak is an available method for it has invariance such as rotation, translation and scaling, it is
recognizing the birds. also called the moment invariants. The detail of the image can’t
be described well due to Hu moment invariants only use the
The height of the beak and the width of the beak are typical low moments. In image processing, the moment invariants are
characteristics in the characteristics of the beak in our eyes. usually represented as an important character of the object,
Due to the size of different pictures is not the same, according to this feature the target can be classified, and so on.
characteristics should take a relative value. Wang [6] proposed Hu moment invariants are commonly used to identify the large
the ratio of the height of beak and the width of beak (R-HBWB) objects in the image, moment invariants are good at the
description of the shape of the object and poor in the image Finally, different kinds of birds’ head may have different
which texture feature is complex. So moment invariants are color, the common head colors (black, white, red and blue) are
added to depict the shape feature. selected as head feature. The pixels around the bird’s eyes are
searched to get the head color, if the proportion of any color is
Aspect ratio and density are also extracted as the shape more than 10%, this color will be marked as a feature to
feature. At first, due to the merit of Canny operator such as the recognize the bird. The range of black is V<0.3, and the range
actual edge of original image can be identified as much as of white is S<0.2 and V>0.9.
possible, Canny operator is used to detect the edge information.
Then, the minimum enclosing rectangle (MER) will be found Overall, the flowchart of the proposed algorithm can be
by edge information. Aspect ratio is defined below: summarized into four steps:
r = WMER/LMER (1) (1) Input all images and divide images into two parts for
testing and training.
Wherein, WMER and LMER respectively are the width of MER
and the length of MER. (2) Convert images from RGB color space to HSV color
space and calculate the color moments, special color, head
The density is defined below: color, moment invariants and beak feature.
C = P2 / A (2) (3) Train the SVM classifier by inputting feature of the
Wherein, P is the circumference of the object boundary, and A training images.
is the area of the object. (4) Identify the bird by the corresponding SVM classifier
For special color feature, red and blue are chosen as a after the classifier of special color.
reference and each component of the bird which is segmented
from background is calculated in HSV color space. By viewing III. EXPERIMENTAL RESULT
the color distribution list [9], the range of red is H>0.83,
S>0.45 or H<0.06, S>0.45, and the range of blue is The proposed method has been evaluated on the Caltech
0.44<H<0.67, S<0.9, V>0.45. The bird is red or blue when the birds dataset (CUB-200-2011) [10] which contains 11788
red or blue pixel point is accounted for more than 8% of the images from 200 bird species in North America. There are
proportion of the bird which is segmented from background. approximately 60 images for each species. And each image is
Otherwise the bird does not have blue or red, and it will be annotated with a bounding box around the object of the bird.
divided into no special color. This experiment selected 15 bird species from the CUB-200-
2011 dataset, twenty percent of the number of images is used
Color statistical feature often reflect the whole color for test and other images are used for training. The 15 bird
characteristics of the image in image processing. It is species are shown in Figure 5.
insensitive for different posture of bird compared with shape
feature. The color characteristic of the bird is represented by
the calculated color moment in paper. The first moment, the
second moment and the third moment of three components of
the HSV color space are extracted as color statistical feature
because most of the color information is only related to the
lower order moment. The first moment reflects the mean value
of three components of the image, for example, the black bird
is very different from the red bird in the first moment of color
moment. However, the second moment and the third moment
reflect the change of color, birds with decorative pattern will
have great difference with birds without decorative pattern.
Figure 5. 15 different kinds of bird species in experiment.

To test how effective is the new beak feature in


classification, the correct classification rate with and without
decision tree for three kinds of feature that changing the beak
feature has been obtained, and the results are shown in Table I.
The correct rate can reach 83.87% when all features (color
moment, special color, head color, beak feature and moment
invariants) are used under the decision tree. Regarding decision
tree, it is clear that it has a positive impact on the classification
rate, providing an increasing of 3.23% in the classification.
However, the feature without R-ERWB is on the decline for at
Figure 4. Classification results of color moment. X-axis represents the first least 8% of the correct classification rate and the correct
order moment of the h component in the HSV space, and y-axis represents the classification rate just dropped by 2% when using the feature
second order moment of the h component in the HSV space.
without R-HBWB.
TABLE I. CORRECT CLASSIFICATION RATE WITH AND WITHOUT the confidence level is low, other methods should be taken into
DECISION TREE FOR DIFFERENT FEATURE
accounted.
Correct classification rate (%)
Feature
With decision tree Without decision tree
ACKNOWLEDGMENT
All 83.87 80.64 The presented research was supported by Light of West
Without R-ERWB 75.27 70.76 China program of the Chinese academy of sciences under
Without R-HBWB 81.72 78.49 Grant No.Y429611213, the young star science project in
Table II shows the result among three color space. Using Shaanxi province under Grant No.2016KJXX-01 and youth
the feature in HSV color space for classification has a better innovation promotion association under Grant No.2012291.
result than using the feature in RGB color space. It proves that
the color feature in HSV color space can better represent the
REFERENCES
characteristic compare with the color feature in RGB color
space and YUV color space. [1] U. D. Nadimpalli, R. R. Price, S. G. Hall, and P. Bomma, “A
comparision of image processing techniques for bird recognition,”
Biotechnology Process, 2006, vol. 22, no. 1, pp.9-13.
TABLE II. CORRECT CLASSIFICATION RATE IN DIFFERENT COLOR [2] Kwan, C., et al. "Bird classification algorithms: theory and experimental
SPACE results, ” 2004, vol. 5, pp.289-292.
Color space Correct rate (%) [3] Fagerlund, S. “Bird Species Recognition Using Support Vector
Machines,” Eurasip Journal on Advances in Signal Processing, 2007,
RGB 75.26
7.1pp.1-8.
YUV 73.12
HSV 83.87 [4] Marini, A.A. Marini, A. J. Turatti, A. S. Britto Jr., and A. L. Koerich,
“Visual and acoustic identification of bird species,” IEEE International
Conference on Acoustics, Speech and Signal Processing, 2015, pp.
IV. CONCLUSION 2309-2313.
This paper presents a method for classifying the bird based [5] Wielgat, R., et al. “On Using Prefiltration in HMM-Based Bird Species
on SVM decision tree. The proposed approach is available, Recognition,” International Conference on Signals and Electronic
correct classification rate can achieve about 84%. Systems, ICSES'12, 18-21 September 2012, Wroclaw, Poland 2012
pp.1-5.
Classification accuracy has some variation by changing the
[6] Wang, Hsien Chang, Y. S. Chen, and M. Y. Wu. “A user-augmented
beak feature. The influence of R-ERWB is bigger than R- object query system using color and shape features for Taiwan wild
HBWB which can make the correct rate drop by 10% at most. birds photos,” International Conference on Machine Learning and
The result shows that R-ERWB is a very effective feature for Cybernetics, ICMLC 2010, Qingdao, China, July 11-14, 2010,
bird classification. Method of using decision tree also has a Proceedings 2010, pp.2516-2520.
positive impact on classification, it could raise about 3% to 5% [7] Marini, Andréia, J. Facon, and A. L. Koerich. “Bird Species
correct classification rate. Classification Based on Color Features,” IEEE International Conference
on Systems, Man, and Cybernetics, 2013, pp.4336-4341.
There is still a lot to be improved to make the method better. [8] Hu, Mingkuei. “Visual pattern recognition by moment invariants,”
For example, there is a problem of error accumulation in Information Theory Ire Transactions on, 1962, pp.179-187.
decision tree, how to solve this problem is the future work. [9] List, Rrgbv. “Colour Spaces,” Paulbourke Net.
And confidence level could be proposed in next work. When [10] Wah, Catherine, et al. “The Caltech-UCSD Birds200-2011 Dataset,”
California Institute of Technology, 2011.

Você também pode gostar