Escolar Documentos
Profissional Documentos
Cultura Documentos
ABSTRACT: Electricity is a basic and essential input in all economy sectors. Until 2018, diesel was
the most used form of energy in the agricultural sector in the state of São Paulo. However, due to the
increasing use of electricity, this became the most consumed form of energy in 2019 and 2020. This
study aims to generate a mathematical model to forecast energy consumption in the referred to sector
by using regression analysis concepts associated with the Microsoft Excel Regression tool. The result
was a multiple linear regression model with low interference of the variables ‘number of consumers’
and ‘generated electricity’, but with a significant inverse influence of diesel consumption, confirming
the substitution of such energy source for electricity. There was a growth trend in the consumption of
electricity, whose development requires improvements and expansions of the entire infrastructure
(generation, transmission, and distribution) that will consequently reflect on people’s social welfare
and economic development.
Keywords: Consumption forecast. Mathematical model. Energy demand. Rural energy consumption.
Demand forecast.
1
Master of Agribusiness and Development, monclar.christovao@ifsp.edu.br
2
Professor of Applied Statistics, sandra.oliveira@unesp.br
3
Professor of Digital Circuits, mario.mollo@unesp.br
IV SIMPÓSIO INTERNACIONAL DE AGRONÉGOCIO E DESENVOLVIMENTO – SIAD
“Desafios e soluções para a redução de perdas e desperdícios de alimentos”
Tupã – SP: 25 a 27 de outubro de 2022
1. INTRODUCTION
Electricity is an essential input for the progress of a nation, and its supply continuity concerns
government authorities of many countries (MARANGONI; MAGATÃO; DE ARRUDA, 2020).
According to Fan, Xiao, and Wang (2014), the global increase in energy consumption has
caused a worldwide concern due to uncertainties in electricity supply and environmental degradation,
in addition to geopolitical issues.
The issue abovementioned is worsened by the inevitable need for investments – a limiting
factor in poor or developing countries (TAGHIZADEH-YAZDI; MOHAMMADI-BALANI, 2020).
The energy consumption forecast is a crucial tool in electricity management, maintenance
planning and investment decisions in future markets (VU; MUTTAQI; AGALGAONKAR, 2015).
It is possible to categorize consumption forecasts by short, mid and long term analysis period
and, in this regard, mathematical models used in energy consumption forecasting are essential for the
operation and planning of companies in the electricity sector (ABDULKAREEM et al., 2019).
The total electricity consumption, per sector, in the state of São Paulo in 2020, also
encompassing the energy self-producers, was 145,451 Gigawatt hours (GWh), a 3,78% decrease
against the previous year, which was 151,120 GWh. The commercial (-12.34%%) and industrial (-
2.35%) sectors showed a decrease in such period. Conversely, the agricultural (1.62%) and residential
(3.08%) sectors had a consumption growth (SÃO PAULO, 2021).
In view of the foregoing, it is relevant to conduct studies related to energy consumption in the
agribusiness segment.
2. OBJECTIVES
This study aims to generate a mathematical model to forecast electricity consumption in the
agricultural sector in the state of São Paulo. Specifically, it intends to identify the variables or the
combination of them that contribute to the energy consumption estimate in this sector.
3. METHODOLOGICAL PROCEDURES
Y = β0 + β1 . X1 + β2 . X2 + … + βn . Xn + Ɛ (1)
where: β0: linear coefficient (line intercept); β1, β2, ..., βn: angular coefficient or regression coefficient
(line slope); and Ɛ: Y experimental error (points outside the line).
Based on sampling data, Ŷ is the result of Y forecast for X observations, according to the
adjusted model of Equation (2). Thus, from paired samples of three or more variables, we obtain the
values of b0, b1, b2, ..., and bn.
Ŷ = b0 + b1 . X1 + b2 . X2 + … + bn . Xn (2)
where: b0: β0 estimator; and b1, b2, ..., bn: β1, β2, ..., βn respective estimators.
3.3 Testing the coefficient significance, the existence of regression and analysis of results
The significance of independent variables is tested individually. The test checks whether each
regression parameter βi is equal to zero (hypothesis H0). Thus, if p-value is less than or equal to the
significance level, we reject hypothesis H0 and conclude that βi ≠ 0 for a risk α.
Then, we check the existence of linear regression considering the F-test (Analysis of Variance
- ANOVA). If p-value (significance F) is less than or equal to the test significance level, we reject
hypothesis H0 that all regression parameters βi are equal to zero, and conclude there is regression,
that is, the adjusted model can explain and forecast Y.
The Determination or Explanation coefficient (R2) is a measurement of the proportion of
variation in Y (response) that is explained by X (explanatory) by the linear model adjustment, thus
being a descriptive measurement of the adjustment quality. This coefficient value is between 0 < R2
< 1, and the closer to the unit value, the better the quality of the adjustment of this linear model to the
data.
It is recommended to check some residual distribution attributes, such as:
a) Check whether the residual average is approximately equal to zero; and
b) Check the normality of variable Ɛ probability distribution by using the Shapiro-Wilk Test. If p-
value ≤ α, we reject hypothesis H0 of data normality existence for a risk (significance level) α.
It is also recommended to investigate the correlation between independent variables, two by
two, by using the Pearson Correlation Test to check the multicollinearity effect, that is, if there are
model redundant variables. The test checks if the linear correlation between two variables is null
(hypothesis H0). Thus, if p-value is less than or equal to α (test significance level), we reject
hypothesis H0 and conclude there is a significant linear correlation between the two variables.
In this study, we used the Excel Data Analysis supplement to adjust the regression model to
the data and a 10% significance level (α) for the applied hypothesis tests.
With respect to the regression model significance, as p-value is less than 10% (Significance F
and respective p-value in the ANOVA in Table 1), there is evidence that the model can explain and
forecast the response variable Y. Similarly, as p-values of X1 and X2 are less than 10% (please see t-
statistic and respective p-values in Table 1), the parameters of such variables were significant to the
model.
Additionally, the model adjustment level was assessed based on the multiple determination
coefficient (R Square and Adjusted R Square in Table 1), whose values 81.70% and 76.47%,
respectively, establish acceptable percentages of Y variation explained by the significant independent
variables (X1 and X2). Also, the Shapiro-Wilk test for residuals confirmed their normality, since p-
value is equal to 0.639 and hypothesis H0 is not rejected. Then, there is an indication that the
distribution of residuals is normal.
Both variable X1 (number of consumers) and variable X2 (diesel consumption in 103 m3)
showed negative coefficients, indicating these move in the opposite direction to the response variable
Y (annual electricity consumption in the agricultural sector in the state of São Paulo), when observing
one of them and keeping the other variable constant.
Actually, the number of consumers was growing until 2018, when it suddenly decreased the
following year, and even with the increase seen in 2020, its amount was higher compared to the years
prior to 2016. In regard to diesel, it was the most consumed form of energy in the agricultural sector
in the state of São Paulo until 2018, with its decrease from 2007 and electricity becoming the most
consumed energy in 2019 and 2020 in the sector (SÃO PAULO, 2021).
REFERENCES
ABDULKAREEM, A.; OKOROAFOR, E.J.; AWELEWA, A.; ADEKITAN, A. Pseudo-inverse matrix model for
estimating long-term annual peak electricity demand: The covenant university's experience. International Journal of
Energy Economics and Policy, 4. ed., Nigeria, v. 9, p. 103-109, May 2019. DOI: 10.32479/ijeep.7566. Available on:
https://www.scopus.com/inward/record.uri?eid=2-s2.0-
85067405770&doi=10.32479%2fijeep.7566&partnerID=40&md5=cf35b54993edabfab6f3fad136fc64d1.Access on:
Oct. 08, 2021.
FAN, C.; XIAO, F.; WANG, S. Development of prediction models for next-day building energy consumption and peak
power demand using data mining techniques. Applied Energy, Hong Kong, v. 127, p. 1-10, 2014. DOI:
10.1016/j.apenergy.2014.04.016. Available on: https://www.scopus.com/inward/record.uri?eid=2-s2.0-
84899701114&doi=10.1016%2fj.apenergy.2014.04.016&partnerID=40&md5=9384074adcfefd1d6622e290ed54d625.A
ccess on: Oct. 08, 2021.
MARANGONI, F.; MAGATÃO, L.; DE ARRUDA, L.V.R. Demand response optimization model to energy and power
expenses analysis and contract revision. Energies, 11. ed., Brazil, v. 13, p. 1-23, Jun. 2020. DOI: 10.3390/en13112803.
Available on: https://www.scopus.com/inward/record.uri?eid=2-s2.0-
85086370091&doi=10.3390%2fen13112803&partnerID=40&md5=45f9bc9a41402f2b673c99d8123bf344. Access on:
Oct. 08, 2021.
MARTINS, G. de A.; DOMINGUES, O. Estatística Geral e Aplicada. 5.ed. rev. e ampl. São Paulo: Atlas, 2014. 416p.
SÃO PAULO (State). Secretaria de Infraestrutura e Meio Ambiente. Balanço Energético do Estado de São Paulo
(BEESP) 2021: Ano Base 2020. São Paulo, 2021. 195p. Available on:
http://dadosenergeticos.energia.sp.gov.br/portalcev2/intranet/BiblioVirtual/diversos/BalancoEnergetico.pdf. Access on:
Mar. 15, 2022.
TAGHIZADEH-YAZDI, M.; MOHAMMADI-BALANI, A. A mathematical model for multi-region, multi-source,
multi-period generation expansion planning in renewable energy for country-wide generation-transmission planning.
Journal of Information Technology Management, 4. ed., Iran, v. 12, p. 215-231, 2020. DOI:
10.22059/jitm.2020.298258.2476. Available on: https://www.scopus.com/inward/record.uri?eid=2-s2.0-
85099813647&doi=10.22059%2fjitm.2020.298258.2476&partnerID=40&md5=6491aae48693ec897d81f15b2ab1c4e0.
Access on: Oct. 08, 2021.
VU, D.H.; MUTTAQI, K.M.; AGALGAONKAR, A.P. A variance inflation factor and backward elimination based
robust regression model for forecasting monthly electricity demand using climatic variables. Applied Energy,
Australia, v. 140, p. 385-394, 2015. DOI: 10.1016/j.apenergy.2014.12.011. Available on:
https://www.scopus.com/inward/record.uri?eid=2-s2.0-