Você está na página 1de 8

The Imaging Science Journal

ISSN: 1368-2199 (Print) 1743-131X (Online) Journal homepage: https://www.tandfonline.com/loi/yims20

Effect of image resolution on intensity based


scene illumination classification using neural
network

M. H. Hesamian, S. Mashohor, M. I. Saripan & W. A. Wan Adnan

To cite this article: M. H. Hesamian, S. Mashohor, M. I. Saripan & W. A. Wan Adnan (2015) Effect
of image resolution on intensity based scene illumination classification using neural network, The
Imaging Science Journal, 63:8, 433-439, DOI: 10.1179/1743131X15Y.0000000027

To link to this article: https://doi.org/10.1179/1743131X15Y.0000000027

© W.S. Maney & Son Ltd / Society 2015

Published online: 30 Jul 2015.

Submit your article to this journal

Article views: 1060

View Crossmark data

Citing articles: 1 View citing articles

Full Terms & Conditions of access and use can be found at


https://www.tandfonline.com/action/journalInformation?journalCode=yims20
RESEARCH PAPER
Effect of image resolution on intensity based
scene illumination classification using neural
network
M. H. Hesamian*, S. Mashohor, M. I. Saripan and W. A. Wan Adnan
In this paper, a framework for testing scene illumination classification with different image
resolutions is proposed. The testing aims to provide the researchers with valuable information
about the effect of image resolution on scene illumination classification using a neural network.
The experiment is done by extracting three types of features from the images. These three types
consist of statistical features, physic based features and histogram based features. It has been
demonstrated that scene illumination classification can be affected by changing the image
resolution. Despite the popular belief that high resolution images lead to better results, scene
illumination classification by the proposed method performed best using low resolution images.
At the second part of discussion, the reason behind this phenomenon is mathematically analysed
and explained.
Keywords: Scene illumination classification, Image resolution, Neural network

Introduction behind this study is to answer the above question in


scene illumination classification field and to propose the
Recently, the amount of studies on image classification optimum resolution. Finding this optimised resolution
has increased significantly due to its wide usage in many has some major achievements. First, delivering better
different applications, especially scene illumination performance and accuracy. Second, reduce the proces-
classification, which focuses on classifying the properties sing load by avoiding processing the unnecessary high
of light in the scene to prepare them for further resolution images and subsequently decrease the
applications like correction. response time of the system.
Scene illumination classification can be helpful in The relationship of image resolution and performance
object recognition and detection, white balance, color of face recognition has been the subject of many
constancy, color reproduction, illumination correction, studies.3–5 In summary, the findings of such research
intelligent vehicle and robot navigation.1,2 demonstrate that the performance of the face recog-
The huge progress in imaging systems enables users to nition system will remain constant by reducing the image
capture images with very high resolution, such as 15 to resolution; however, this trend will only continue up to a
20 megapixels, even for compact cameras for home certain threshold by decreasing the resolution, after
entertainment. Although super high resolution cameras which the performance will deteriorate.6 In Ref. 3, the
that are able to take photos with resolutions up to a authors investigated the effect of image resolution on the
couple of gigapixels are on the way to the market, performance of a face recognition system. This exper-
these high resolution images bring various side effects iment is done in three parts: face detection, face regis-
for applications that want to use them. For instance, tration and normalisation and face recognition. The
transferring, storing, retrieving, querying and processing experiment is repeated for the same images but various
will be more costly. In addition, many applications are resolution starting from 128|128 to 8|8 and each
not able to process high resolution images due to their time tested on separate face components. Their results
limitations. The hypothesis that ‘‘a higher resolution will proved that the face recognition system is not very
to better performance’’ is widespread. sensitive to image resolution, and moreover, good
The question of ‘‘Do high resolution images deliver accuracy can still be obtained by reducing the image
better performance?’’ is a general question that can be resolution to 32|32 pixels. In a separate study,7 the
modified to any image processing system or method and authors try to know ‘‘Are high resolution images are
has been previously explored by various studies to more effective in face recognition results?’’ The results
answer the aforementioned concern. The motivation achieved by this study also demonstrate that relatively
low resolution images perform better than high resol-
ution representation. In this study, a neural network is
Department of Computer and Communication Systems, Faculty of applied to accomplish the recognition with the basic
Engineering, Universiti Putra Malaysia, Serdang, Malaysia assumption of ‘‘high resolution images the amounts of
*Corresponding author, email mh.hesamian@gmail.com noise contaminated pixels are higher.’’

Ñ W.S. Maney & Son Ltd / Society 2015


MORE OpenChoice articles are open access and distributed under the
terms of the Creative Commons Attribution Non-Commercial License 3.0
Received 21 November 2014; accepted 12 July 2015
DOI 10.1179/1743131X15Y.0000000027 The Imaging Science Journal 2015 VOL 63 NO 8 433
Hesamian et al. Effect of image resolution on intensity based scene illumination classification using neural network

Nevertheless, low resolution images have been


acknowledged to be suitable for certain applications.
However, decreasing the image resolution increases the
effect of noise, which means these tiny images are not
appropriate for some general purposes.8 The experiment
done in Ref. 8 shows that full resolution images deliv-
ered better accuracy compared to resized images in
terms of non-parametric object recognition. This can be
related to the types of feature applied in the experiment.
In the multimedia field, this issue has been addressed
by various studies that focused on object or movement
recognition. For example, in order to boost the accuracy
of detecting a full 3D body scan, the approach of using
high resolution images was proposed.9 However, this
approach was later implemented by combining several
low resolution images from multiple views.10 Such a
method not only benefits from low resolution images but
also obtained more accurate representation compared to
its predecessors.
In medical imaging, one study focused on surveying
the effect of the image resolution on the accuracy of its
system. The final presented results showed that a higher
image resolution leads to better accuracy, albeit there is
some limitation in increasing the image resolution due to
the increased amount of noise.11

Methodology
The methodology that is used for this study is a neural
network based classification consisting of seven main
steps. The steps are shown in Fig. 1.
A database of 338 scene images was applied in this
study with resolution of 4368|2096. Then, the images
are presented with six different image resolutions:
2048|2048, 1024|1024, 512|512, 256|256,
128|128 and 64|64. For each image set with specific
resolution, the corresponding feature set has been
extracted. The process of resizing the images is done by
using the following two steps: first dividing the image to 1 Step by step flow chart of implemented system
equal size windows and then merging the pixel values of
each window into one single pixel. The intensity of such illumination value that could be helpful in finding light
pixel is the average intensity of all exciting pixels in the spots. The median of image illumination represents the
window7 (Fig. 1, image resizing). varying range of illumination in the image.
All original images are accompanied by the The third type of input feature for this classification
surrounding illumination value (SIV).12 The SIV as a system was the histogram quartering feature. This fea-
physics based feature presents the amount of light at the ture helps to increase the performance of classification
moment of capturing the photo (Fig. 1, feature extrac- by better histogram comparison. The illumination
tion). Using SIV can help bridge the gap between the histogram of the image was divided into four quarters
image presented data and the real data from the scene and then compared to the corresponding quarter of the
(Fig. 2). most ideal image illumination histogram (Fig. 3). The
The second class of input feature was a subset of differences are presented in four values for the four
statistical scene illumination classification approa- histogram quarters.
ches.13,14 The statistical feature set consists of mean, The proposed classifier for this study is a feed forward
mode and median of the image illumination, which were pattern recognition network with three hidden layers
all extracted from the Y component of the image and (Fig. 1, neural network classification). The number of
transferred from the RGB color model to CIE XYZ. neurons in the layers are 5, 10 and 5 for the first, second
The range of varying illumination (0 to 117000 lux) and third layers respectively. All the aforementioned
in the real scene was mapped from 0 to 255 in the extracted features are the inputs of this network. Using a
image illumination component (Y). Therefore, a direct target matrix that indicates the belonging group of the
relationship between the real scene illumination and the training samples enables the network to sort out the test
image illumination could be deduced. The mean of samples into predefined clusters (Fig. 4). Levenberg–
illumination depicts the average illumination of the Marquardt is used as a training algorithm for this
image pixels, which is equivalent to SIV and represents network, which is highly recommended as a first choice
the illumination of the real scene. The mode of illumi- supervised algorithm for those problems that have up to
nation can help in determining the most frequent a few hundreds of weight.15

434 The Imaging Science Journal 2015 VOL 63 NO 8


Hesamian et al. Effect of image resolution on intensity based scene illumination classification using neural network

2 Sample of input images with their corresponding SIV value

3 Sample of image histograms (2566256) with their corresponding quarter’s value. Vertical lines separating quarters from
each other and horizontal line refer to ideal histogram level. Horizontal axis represents CIE XYZ y component value; vertical
axis represents number of pixels in each Y component value

Starting from the highest image resolution set point for using it.18 Since any estimation may follow a
(2048|2048), the corresponding feature set was the prediction, providing an accurate measure for system
input of the neural network. In addition, a target matrix prediction, efficiency is necessary. Using the coefficient
carrying the belonging group of each image was also fed of determination (R square) as the measure of system
into the neural network for the training phase. Then, accuracy in order to compare the estimated value and
the process of classification was repeated 30 times to the proposed model is a common solution for evaluating
guarantee the accuracy of the system. the system efficiency in this aspect.19
The training algorithm performance was monitored Finally, for each image resolution set, the classifi-
by mean square error (MSE) (Fig. 1, result evaluation); cation process was done, and the results recorded in
this parameter was used as one of the evaluating and terms of accuracy rate, MSE and R square value (Fig. 1,
comparing parameters of the system.16,17 In order to iteration and result finalising). All the recorded values
have a very clear understandable estimate of the system are the average of 30 times repeat of the test. The 30
performance, the error percentage was proposed. The times repeat will minimise the inherent randomness
classification error or simply the misclassification rate is behavior in the training procedure of the neural
the ultimate measure of the performance of a classifier. network.18 Repeating the test for 30 times will ensure the
The capability of defining a target value based on results are following a normal distribution pattern and
reducing the misclassification rate is another motivating not a random one.

The Imaging Science Journal 2015 VOL 63 NO 8 435


Hesamian et al. Effect of image resolution on intensity based scene illumination classification using neural network

4 Designed neural network structure: eight inputs correspond to eight features (mean, mode, median, SIV and Q1 to Q4), and
three outputs correspond to three output clusters (low, ideal and high illumination)

Results and discussion MSE chart, the differences are clearer, for instance, from
256 to 128, a reduction of 1.14% occurs, whereas in the
The results presented in Fig. 5 depict that the misclassi- MSE, a reduction of 19.34% occurs. However, although
fication rate of the system varies by changing the image the 1.14% is not very remarkable for some applications,
resolution. A reduction of 4.62% occurs between the the 19.34% reduction in MSE for same cases demon-
lowest and highest misclassification rate, travelling strates the significant improvement delivered by
from the lowest image resolution to the highest image reducing the image resolution from 256 to 128.
resolution. This simply demonstrates the effect of image R square as a measure for goodness of fit of the mod-
resolution on misclassification rate. The system mis- eling can represent the performance of the system for
classification rate has a downward trend by decreasing predicting based on current modeling.19,20 As can be seen
the image resolution from 2048|2048 to 128|128. This in Fig. 7, the predicting power of a scene illumination
trend dramatically increases from 512|512 to classification system is also affected by changing the
128|128. Then, after that, by reducing the image image resolution. A slight continuous improvement is
resolution, the upward trend starts. As an example on achieved by discounting the image resolution from 2048
how to use the table in Fig. 5, let us assume that there is an to128. However, similar to the other two parameters,
image with resolution of 1024|1024; then, by referring the trend starts to decline by reducing the image size to
to the table, we can see that if the resolution decreased to 64. A growth of 10.31% occurred in the R square value
512|512 the misclassification rate will improve by from the worst case up to the optimal point (128|128).
0.57%, if reduced to the misclassification rate will The previous discussions clearly prove that decreasing
improve by 29.47% and finally reducing the image the resolution up to a certain threshold will increase the
resolution to 128| 128 will improve the rate by 31.99%. performance, whereas the performance will deteriorate
The same scenario happens in the MSE chart. when the threshold is crossed. The tables in Figs. 5, 6
Figure 6 clearly depicts the effect of image resolution on and 7 help to show how much improvement will be
MSE. The general trend of the MSE chart is the same as achieved in percentage by reducing the higher image
the misclassification rate, and an improvement of resolution. To study and determine what is the reason
37.23% is achieved from the worst to the best case. In the for such behavior, we need to go back to feature sets and

5 Misclassification rate analysis. On left, chart shows misclassification rate changes by changing image resolution. On right,
separated improvement of each image resolution is compared to others by percentages. Negative values in table show that
instead of reduction increase occurred

436 The Imaging Science Journal 2015 VOL 63 NO 8


Hesamian et al. Effect of image resolution on intensity based scene illumination classification using neural network

6 Mean square error analysis. On left, chart shows MSE changes by changing image resolution. On right, separated
improvement of each image resolution is compared to others by percentages in terms of MSE

7 R square analysis. On left, chart shows R square changes by changing image resolution. On right, separated improvement
of each image resolution is compared to others by percentages in terms of R square

their corresponding values for each image resolution the improvement of system performance is negligible, the
test. Table 1 shows the feature values for all image values for mode feature are highly affected by changing
resolutions. For a more detailed analysis, the table the image resolution, especially in the ideal illumination
presents the values for three samples, each belonging to samples. For the other two groups based on the
a different illumination class. exaggerated property of light intensity, the changes in
As can be seen in Table 1, the average illumination mode value are not as much as for the ideal class.
value remains constant by changing the image In order to analyse the effect of the image resolution
resolution. Since the resizing process is done by dividing changes on histogram quartering features, we need to
pixels in windows and averaging them, there would be apply the coefficient of variation (CV), which shows a
no difference in the average values in various image sizes. normalised measure for the distribution of pixel values
Hence, the average illumination value does not involve over four quarters. This statistic enables the changes in
performance improvement. In the case of the median the quarter value of the illumination histogram to be
illumination value, the changes occurred in the vicinity clearly monitored by image resolution. Figure 8 depicts
of a certain value with very little variation. Therefore, the effect of the image resolution changes on the CV of
even though the small contribution of median value in the histogram quarter features.

Table 1 Feature values for three different images from various illumination classes presented by various image resolutions

Sample Resolution Avg Mode Median Q1 Q2 Q3 Q4 CV

Low 64 54 22 53 2727 1208 132 29 1.0608


128 54 23 52 10986 4637 601 160 1.0605
256 54 21 51 43,858 18,334 2,503 831 1.054
512 54 23 50 174,465 74,096 9,506 4,077 1.047
1024 54 23 50 694,001 299,930 36,403 18,242 1.041
2048 54 15 49 2,768,017 1,206,272 141,736 78,279 1.038
Ideal 64 109 34 104 1021 1460 1381 234 0.473
128 109 47 103 5633 5314 1197 2660 0.427
256 109 157 101 17,489 22,080 20,464 5503 0.397
512 109 160 100 71,570 86,389 80,112 23,893 0.375
1024 109 86 100 292,092 341,148 315,081 100,255 0.362
2048 109 163 99 1,178,477 1,356,504 124,932 409891 0.356
High 64 158 248 167 640 466 1561 1429 0.1587
128 159 248 170 2707 1691 5844 57600 0.1472
256 159 238 171 11156 6308 24022 24050 0.1447
512 159 255 173 46043 24512 93283 98306 0.1410
1024 159 255 175 188512 100383 353208 406473 0.1393
2048 159 255 175 188512 100383 353208 406473 0.1399

The Imaging Science Journal 2015 VOL 63 NO 8 437


Hesamian et al. Effect of image resolution on intensity based scene illumination classification using neural network

8 Coefficient of variation of histogram quarter features separated by image illumination class: (A) low illumination class,
(B) ideal illumination class and (C) high illumination class

The CV for the quarters declines by increasing the image that the performance was improved by reducing the
resolution. This means that the differences in quarter image resolution, but this improvement only continued
values are decreasing by increasing the image resolution, up to a threshold. The best performance was delivered
and the values are going to be same. This phenomenon by the image set with a resolution of 128|128 by
results in reducing the discriminating power of the histo- improvement of 4.62% in misclassification rate, 31.17%
gram quartering features by the neural network. Therefore, in MSE and 8.69% in R square. As the performance
having quarter values with a higher CV (much more started to deteriorate by reducing the image resolution
differences in distribution) is the main reason for having to values of less than 128|128, this resolution setting
better performance for the lower resolutions. could be set as the threshold for image illumination
By looking at Fig. 8 again, it can be seen that the CV for classification and estimation.
the high and low illumination samples is more affected by The changes in CV for the quarter values of the illu-
the image resolution than the ideal illumination. This is mination histogram are acknowledged to be the main
related to the natural histogram shape of the illumination reason for performance changes. Increasing the image
value in each cluster. The ideal illumination histogram has resolution will lead to a decrease in the CV of the
closer values in each quarter, and by changing the image histogram quarter values. Once the CV in histogram
resolution, it has been less affected. However, the high and quarter values decrease, these values get more similar,
low illumination histograms have huge differences in the and it will lead to reduction in discrimination power of
quarters, and by changing image size, they have been NN. Finally, it will result in more misclassification, more
more influenced. Hence, the CV value in the high and low error and lower R square value. This phenomenon
illumination samples decreased the misclassification and affects the high and low illumination cluster more than
the error dramatically and much more than for the ideal the ideal illumination class due to the larger changes in
illumination. In general, it can be concluded that changing CV in the corresponding group.
the image resolution has much more effect on the classifi-
cation of high and low illumination samples than the
samples with ideal illumination. References
1. Zhou, L., Zhou, Z. and Hu, D. Scene classification using a multi-
Conclusion resolution bag-of-features model. Pattern Recognition, 2013, 46,
424–433.
The effect of image resolution on the illumination 2. Tominaga, S., Ebisui, S. and Wandell A, B. Scene illuminant
classification was investigated. The experiment shows classification: brighter is better. JOSA A, 2001, 18, 55–64.

438 The Imaging Science Journal 2015 VOL 63 NO 8


Hesamian et al. Effect of image resolution on intensity based scene illumination classification using neural network

3. Boom, B., Beumer, G., Spreeuwers, L. J. and Veldhuis, R. N. the accuracy of standard uptake values: a simulation study.
The effect of image resolution on the performance of a face J. Nucl. Med., 2004, 45, 1519–1527.
recognition system. 2006. 12. Hesamian, M., Mashohor, S., Saripan, M. I. and Wan Adnan, W.
4. Wang, J., Zhang, C. and Shum, H. -Y. Face image resolution Scene illumination classification using illumination histogram
versus face recognition performance based on two global analysis and neural network., Proc. IEEE Int. Conf. on ’Control
methods., in Proc. Asian Conf. on ’Computer vision, Jeju Island, system, computing and engineering (ICCSCE) 2013.
Korea 2004 13. Barnard, K., Cardei, V. and Funt, B. A comparison of computational
5. Lemieux, A. and Parizeau, M. Experiments on eigenfaces color constancy algorithms. I: Methodology and experiments with
robustness. Proc. 16th Int. Conf. on ’Pattern recognition, 2002, synthesized data. IEEE Trans. Image Process, 2002, 11, 972–984.
pp. 421–424. 14. Ebner, M. Color constancy, 2007 (John Wiley & Sons, New York),
6. Fookes, C., Lin, F., Chandran, V. and Sridharan, S. Evaluation of Vol. 6.
image resolution and super-resolution on face recognition 15. Hagan, M. T. and Menhaj, M. B. Training feedforward networks
performance. J. Visual Commun. Image Represent, 2012, 23, 75–93. with the Marquardt algorithm. IEEE Trans. Neural Networks,
7. Chen, L., Tokuda, N., Nagai, A. and Chen, X. Is high resolution 1994, 5, 989–993.
representation more effective for content based image 16. Pradeep, J., Srinivasan, E. and Himavathi, S. Diagonal based
classification?, Proc. Int. Joint Conf. on ’Neural networks, feature extraction for handwritten alphabets recognition system
IJCNN’06 2006, pp. 4045–4050. using neural network, arXiv preprint arXiv. 2011, 11030365.
8. Deng, J., Dong, W., Socher, R., Li, L. -J., Li, K. and Fei-Fei, L. 17. Weintraub, M., Beaufays, F., Rivlin, Z. e., Konig, Y. and Stolcke,
Imagenet: a large-scale hierarchical image database., Proc. IEEE A. Neural-network based measures of confidence for word
Conf. on ’Computer vision and pattern recognition, CVPR 2009, recognition., Proc. IEEE Int. Conf. on ’Acoustics, speech, and
pp. 248–255. signal processing 1997, 887–887.
9. Anguelov, D., Srinivasan, P., Koller, D., Thrun, S., Rodgers, J. and 18. Devijver, P. A. and Kittler, J. Pattern recognition: a statistical
Davis, J. Scape: shape completion and animation of people. ACM approach., 1982 (Prentice-Hall, London), Vol. 761.
Trans. Graphics, 2005, 408–416. 19. Shahsavari, S., Bagheri, G., Mahjub, R., Bagheri, R.,
10. Tong, J., Zhou, J., Liu, L., Pan, Z. and Yan, H. Scanning 3d full Radmehr, M., Rafiee-Tehrani, M., et al. Application of artificial
human bodies using kinects. IEEE Trans. Visual. Comput. neural networks for optimization of preparation of insulin
Graphics, 2012, 18, 643–650. nanoparticles composed of quaternized aromatic derivatives of
11. Boellaard, R., Krak, N. C., Hoekstra, O. S. and Lammertsma, chitosan. Drug Res., 2014, 64, 151–158.
A. A. Effects of noise, image resolution, and ROI definition on 20. Armstrong, J. S. Illusions in regression analysis., 2011.

The Imaging Science Journal 2015 VOL 63 NO 8 439

Você também pode gostar