1. Introduction

he process of achieving this research is mainly divided into two main parts: assessing the forecasting model and scheduling of storage device [1]. The modelling and simulation is performed in MATLAB.

2. a) Data pre-processing & Data-analysis

The function of the data pre-processing is acquiring representative data, removing unusual consumption hike and other inconsistencies, defining proper format for time stamping (day, month, hour, minute) and splitting up the data in identification and validation sets [2]. The function of the data analysis is analysing the data to find underlining mechanisms, trend and variations in the data, use of clustering to get more insight in intraday correlation [3].

3. b) Forecasting of Consumption & Storage Device Scheduling

To schedule the storage devices, it is very crucial to have prior knowledge about electricity consumption on day-ahead. Therefore, short-term forecasting is an important step to ensure better scheduling of the storage devices [4]. Pre-processed data is used to evaluate the performance of the forecasting models. Initially two forecasting models, ANN is chosen to forecast the consumption profile. This model is evaluated with respect to some evaluation criterion. The use of load forecasting is widely accepted as operational aid for the control the electric power system as well as to enhance consumer participation in local energy market through providing financial benefits.

The forecasted consumption profile of a base station will be used to achieve optimum scheduling of storage devices [5]. The main idea is to utilize the surplus of PV generated energy after mitigating selfconsumption. If a particular consumer can have an idea about the level of stored energy, it is possible to utilize the energy in various way like, load shifting, include some flexible loads to consume the extra energy, valley filling etc. However, for this research our target is to feed the extra stored energy in local market, so that the consumer can have some financial benefits from the trading.

II.

4. Methodology

The research approaches are selection and description of the electricity consumption data and the different variables, processing and detection of missing values, methods are discussed to discover the cohesion and pattern in the selected data and then procedure of evaluation is appointed [6].

5. a) Data

Real data of electricity consumption is needed for estimation and validation purposes of the forecasted result. Moreover, to predict the amount of stored energy in the storage devices on day-ahead, the PV generated energy of the corresponding base stations are also needed. As the ultimate goal is to predict the base After careful analysis, some data are found in the data set which follows a particular pattern but the average electricity consumption is less than the standard one. Figure 2.3 shows a consumption profile of a day which goes close to zero at mid-day [8].

Moreover, this particular base station has installed PV as local generation source. So, it is very much understandable this consumption profile is the net amount of energy consumed from grid after mitigating some load with PV. Thus, the electricity energy consumption data of this type of base stations cannot be used to identify the proper forecasting model.

6. . Electricity Use Data

Real energy consumption data of different BTS on a daily basis at 15 minutes of sampling is used. This data will be used to train the network and build the forecasted model. Then that forecasted model will be used to design the PV panel system and schedule the storage device [7]. All the data are taken in Wh and thus a. Normal Energy Consumption Data On average the yearly consumption of an identical base station in the Netherlands is 3500 kWh. So, daily electricity consumption on average is 9.5 kWh [37]. Consumption profile shown in the figure 2.1 is the electricity consumption pattern of a particular base station for the first week of March, 2013 with average consumption around 9 kWh per day. will be forecasted in the same unit.

Moreover, there are some consumption profile data sets with very low average daily consumption. A consumption profile of total 3.62 kWh is shown in Figure 4. At Figure4 Figure 4: Electricity consumption of some particular equipment Consumption is very low but at night it goes high. So most probably it is the consumption profile of cooling system. Though this consumption profile cannot give the proper idea about the consumer load behaviour.

ii. Qualitative Variables Beside input variables (electrical energy consumption), also the qualitative variables are important for forecasting. Qualitative variables are better known as dummy variables; do not have a natural ordering. These variables contain descriptive values, like the day of the week. Moreover, all the data are 15 minute sampled so hour information is also a qualitative input variable. Depending on the time electrical consumption varies like at working days from 9.00AM to 6.00AM consumption should be low and at night when everybody is at home consumption goes high at base station. However, this hour based consumption pattern also depends on season. Thus, seasonal effect can be an input variable for forecasting. But this research is focused on STLF and to capture the consumer behaviour, electricity consumption data of two or three months is used as training data. So most of the case seasonal identification remains same for all training data. Finally, day identification along with our identification is considered as qualitative input variables for the forecasting model.

7. b) Data Pre-processing

Real life data contains huge amount of noise and often has quality issues. Such volatility must be removed before simulation can be performed. If the input values to a forecasting model are poor, it will be hard to produce a good forecast, irrelevant of the quality of the forecast model. All the steps has to be taken into consideration before simulation as pre-processing is given below

? Duplicate data check ? Missing data check ? Filtering

8. unusual and noise from PV generation data set

The electricity consumption data of different base station of a year is given as Wh/15 min. Smart meters are used as measuring device, thus it has high possibility of missing data and duplicate data. Initially the full data set is passed through some checking algorithm to identify duplicate data and missing data as Pre-processing step. However, data with same time stamp is treated as duplicate data. For missing data on weekdays, average value of the immediate 7 weekday's consumption data on the same time sample is taken. However missing data for weekend days, average consumption of the previous 4 same days on the same time sample is calculated.

As an example, to find a missing data on weekdays at y_t it should take the average of previous seven weekdays on the same time t shows in Figure 5. The PV generated data is also measured in 15minute interval but it is very important to identify the noise or unusual production. Normally electronics based measurement devices are used to capture the data from controller [38]. So, to have unusual production peak or noise (like production level 1 or 2Wh) is very common.

Moreover, synchronize PV production and consumption data for the same consumer is also important for scheduling of stage device.

9. c) Data Analysis Day identification Number as Input of ANN

It is very important to make some difference among the different days of the week so that the models can identify the target data set according to the train data set. Initially these identification variables are coded into integer value. The days of the week is represented by 1,2,3,4,5,6,7 respectively for Monday, Tuesday, Wednesday, Thursday, Friday, Saturday and Sunday.

Correlation or patterns within the electricity use can aid the forecast if well defined. To discover patterns in the electricity use, self-organizing map (SOM) is used [9]. The motivation behind SOM analysis is to find the pattern of consumption of each day of a week. SOM clustering of seven days of a week is shown in Figure 2.7.

10. Monday

Tuesday Wednesday Thursday Friday Saturday Sunday

???? = 1 ? ? | ? ? ? ? | ? ?=1 × 100 % (b)

Its important features are:

? This measure represents the percentage of average absolute error occurred.

? It is independent of the scale of measurement, but affected by data transformation.

? It does not show the direction of error.

? MAPE does not panelize extreme deviations.

? In this measure, opposite signed errors do not offset each other.

III.

11. Model Building a) ANN based Forecasting Model

Artificial neural networks (ANNs) constitute a class of flexible nonlinear models designed to mimic biological neural systems of brain. Typically, a biological neural system consists of several layers, each with a large number of neural units (neurons) that can process the information in a parallel manner as illustrated in Figure 3.1. Like biological neuron, ANN has also multilayer structure such that the middle layer is built upon many simple nonlinear functions and able to receive multiple input signals from other neurons. The procedure of selecting optimal network architectures and their learning approaches are described for forecasting the wind speed in two different time horizon [11] [13].

12. b) Working Principle of Forecasting Model

ANNs are networks typically composed of several layers with interconnected elements called neurons. The first or lowest layer is the input layer which gathers external information. The middle or hidden layers process this information in a fairly elementary way to produce signals for the connected neurons at output layer. The neurons or nodes at the adjacent layers are usually fully connected by acyclic arcs from a lower layer (input) to a higher layer (output). Weights are the key factors of network performance. These weights are continuously updated during the training period to carry out complex nonlinear mapping [12]. In this research the developed forecasting model is trained in supervised way. Once the network is trained with appropriate training data set, it is ready to perform desire task. Complicacy of ANN mainly arises for the hidden layers which are used to build the connection between inputs and outputs. Thus, it creates an indirect relationship between inputs and outputs. Information received from the input layer is first processed in the hidden layer, and then transmitted to the output layer. So, the learning capability due to nonlinearity of the input data is mostly depend on number of hidden layers. A multilayer forecasting model with one hidden layer can perform an arbitrary convex approximation to any continuous non-linear mapping. According to the universal approximation theorem for neural networks [49], the standard multilayer feed forward network with a single hidden layer and finite number of hidden neurons is sufficient for any complex simulate nonlinear function with any desired accuracy. It concludes with the view that number of hidden layers has an influence according to the complicacy of the problem otherwise system will be more complicated, even single hidden layer requires large number of nodes. So, it is always a tradeoff of number of hidden layers depending on complicacy of the problem.

13. d) Selection of the Network

Depending on the structure of the network ANN can be classified in several models. Figure 3.4 shows different types of neural networks used for forecasting applications. Feed-forward neural network (FNN) is fast and simplest ANN because in the case of Multilayer layer network it transmits information form input layer to output layer using some simple structured hidden layer [13]. Normally it maps the static relationship between input and outputs The feed forward ANN model in Equation 1 in fact performs a non-linear functional mapping from the past observations of the time series to the future value, i.ey_t=f (y_(t-1),y_(t-2) ?,y?_(t-3),??y_(t-p) ,W)+ ?_t where Wa vector of all parameters is and f is a function determined by the network structure and connection weights [15] To estimate the connection weights, non-linear least square procedures are used, which are based on the minimization of the error function

?(?) = ? ? ? ? = ?(? ? ? ? ? ? ) ? ? ? IV.

14. Simulation Results and Analysis

The optimal structure of ANN is used to forecast the BTS electricity consumption on day-ahead. In this case 31st of October, 2012 is considered for the demonstration of the model. The imbalance power is calculated by the power deviation between day-ahead forecaster and 15 minutes ahead forecaster. The flexible entities are activated in way to reduce this imbalance power. A simple MATLAB-based GUI has been implemented to demonstrate how this tool can be userfriendly for a customer.

15. a) Assumptions

The historical data are in Wh and the time interval is 15 minutes. As input, total 92 days of previous data is taken for pre-processing [14] [15].The time range is March 1st, 2013 to May 31st 2013. The forecasted data represents the consumption for the next day, June 1st 2013.

16. b) Electricity Consumption Forecasting

Model of Feed-Forward ANN Designing a threelayer FNN model for STLF involves several major steps: 1) Determine the number of outputs: An FNN model may have only one output, which can be corresponding to the electrical load of a consumer, or several outputs, which can represent a 24-hour load profile of several BTSs. Some drawbacks of the multiple outputs FNN were discussed in [26]. 2) Determine the number of inputs: previous consumption data of a particular consumer along with day identification number is used as target and input of FNN. 3) Determine the number of hidden neurons: It is followed a trial and error method to select an optimal hidden layer structure for used data set at FNN. The number of hidden neuron layer mostly depends on: MAPE, Elapsed Time & Epoch. The lowest MAPE, lowest time and highest Epoch is found for the chosen number of layer. Observing these parameters, the best output is obtained from 25 hidden layer amongst different options.

17. Consumption forecasting on mid-summer (June 01, 2013)

Practical consumption data from 30 BTS will be used to evaluate the forecasting performance of the models. In this context, at first it is tried to forecast the consumption profile of each consumer on the same day with training data set of previous 3 months (92 days). ii.

Here ? is the space of all connection weights. The optimization techniques used for minimizing the error function Equation 2 are referred as Learning Rules. The best-known learning rule in literature is the back propagation or Generalized Delta Rule.

18. a. Forecasting with FNN

Form Table 1, it can be mentioned that FNN performed well. Some out of range data is found in MAPE which happened because those consumers had no usage of electricity in their consumer profile at some hours of the day. Considering the case of MAPE, site 5 has the best output as 15.9883%. Consumer 5 has a moderate level of error (MSE = 572.452) and a very noticeable regression mismatch as it is far away from 1 (R=0.83974) Lest MSE belongs to consumer 8 (MSE=343.6435). But the percentage error is as high as MAPE=25.3231%. In Figure 6-3 it is clearly visible that the output profile almost followed the trend. And most of the points are well predicted, though many of them are lower than the actual data Finally, another consideration is done for other consumer for optimized evaluation. In this case the overall performance has been considered. The selected one is being consumer 2. For the selected consumer, obtained parameters are at moderate level as R=0.8978; MSE=1260; MAPE=39.39. Though the R is slightly mismatched from the ideal value, MAPE has been obtained in an acceptable range. But the MSE is so large that can affect the overall performance of the forecasting model. From the Figure 4.5, the forecasted data is matched in almost every point with the actual consumption profile. The major mismatch is found at the higher peaks. Though the forecasted model has followed and detected the positions of the peaks well, but the values estimated are lower than the actual consumption. This mainly happens for the change in the The regression function plotting is lineated based on the best points to be found. The most mismatching is found around 100 to 300Whr. Noticeable percentage errors are found at the peaks from the scattered plotting. Based on the forecasted model, scheduling of the BTS storage device is designed and operated. For this research two ideal profiles are chosen. For the betterment of the demonstration, the available energy is compared with both actual consumption profile and the forecasted consumption profile.

For the demonstration consumer 2 is chosen. Figure 4.7 shows the scheduling of a storage device with the actual consumption profile. From the figure, available energy and stored energy is utilized in a balanced way. During the mid-hour of the day, amount of the available and stored energy is adequate. And the individual BTS has used the stored energy to meet its requirements.

The PV rating, in this case, is 7.5KW. So, the figure below clears out that, the consumer is using around 400W. That means, he can schedule the storing of the device a day ahead to save the rest 7.1KW of energy [16]. Forecasted consumption profile demonstrates the typical scheduling for the device. This is to determine the charging time of the storage device at the day ahead and to be aware of the amount of the energy to be used. From the figure 4.8, the needed energy for the next day can be estimated. It will be used to determine the time to store the energy and the dissipating time of the power. And based on the forecasted model, stored energy can be used to make the BTS grid free user. Thus a BTS can be grid independent and produce, store and consume its own energy.

19. Global

20. Conclusion & Future Work

The main focus of this research work was to identify and justify a optimized forecasting model to forecast the electricity consumption at BTS and comparing the results on the basis of the major error measurement parameters and establishing a reliable and most accurate FNN forecasting model. The forecasted data can be used for designing the scheduling and designing the independent power source for the BTS. This will help the consumer to use its own power to meet its requirement. Thus, it can become grid free consumer.

However, the following recommendations are suggested for improving the forecaster.

Instead of using Levenberg-Marquardt algorithm; which is computationally heavy; another learning method called Scaled Conjugate Gradient (SCG) for bigger ANN approximation and prediction. Larger iteration number of hidden neurons is preferable for finding optimum solution. In STLF, ARIMA is proved as a poor forecaster because of having a significant level of non-linearity in historical data. However, a hybrid model combined with ANN and ARIMA is needed to be investigated. The input variables for the forecasting model should be selected based on partial mutual information (PMI) algorithm instead of the last known research experience only. This method will detect nonlinear dependencies between the input and output as well as prevent the selection of inputs with redundant information. It will reduce the dimension of the input layer. Aside of using Neural Network Toolbox? of MATLAB, better flexibility would be expected. Thus, the user will have more control on every step of the program.

Figure 1: Weekly consumption pattern However, the daily consumption patter in different depending on week days and weekend, even similar week days does not have similar consumption patterns shown in Figure 2. It is very important to identify the non-linearity of the consumption pattern, because forecasting model selection is depend in the degree of non-linearity of the input data. — Figure 1. Figure 1 :

Figure 2: Daily load consumption — Figure 2. Figure 2 :

Figure 3: PV Compromised Electricity Consumption — Figure 3. Figure 3 :F

Figure 5: Average value for missing data on weekdays Moreover, to find a missing data on weekend (as an example Saturday) it has to take the average of previous 4 Saturday of the same time sample shown in Figure 2.6. — Figure 4. Figure 5 :

Figure 6: Average value for missing data on weekend Filtering unusual and noise from PV generation data set:The PV generated data is also measured in 15minute interval but it is very important to identify the noise or unusual production. Normally electronics based measurement devices are used to capture the data from controller[38]. So, to have unusual production peak or noise (like production level 1 or 2Wh) is very common. — Figure 5. Figure 6 :

Figure 7: SOM clustering of electricity use per day Figure 2.7 illustrates different electricity consumption pattern of each day. So, as an input of ANN day identification number will be different for each day. d) Evaluation Method An evaluation method is necessary for three aspects; ? Model identification ? Performance comparison between models ? Insight in model performance for practical use In order to evaluate a forecast made with a specific model, forecast-error metrics or so called performance indicators are defined. Three types of performance indicators are discussed. The first used performance indicators are the scale dependent metrics. These indicators are on the same scale as the data. One of the most used scaled performance indicator is the mean are extremely large when the actual values reaches zero [10]. When considering data which contain zeros, the metric is undefined. An example of percentage based metric is the MAPE, which is one of the most used squire error (MSE). The other category is percentage based performance indicators, which can be used to compare different data sets as they are scale independent. To overcome division by zero, a scale free error is proposed by Hyndman and Koehler in 2006 [40]. The abovementioned performance indicators are given below. In each of the forthcoming definitions ? ? is actual value, ð??" ? is the forecasted value, ? ? = ? ? ? ð??" ? is the forecast error and n is the size of the test set. Also, y ? = 1 n ? y t n t=1 is the test mean and ? 2 = 1 n?1 ? (y t ? y ?) 2 n t=1 — Figure 6. Figure 7 :?

Figure 8: Flow of information in biological neuron. — Figure 7. Figure 8 :

Figure 9: A general ANN structure with one hidden layer Input values are processed by this network to form output values as depicted in a simple non-linear — Figure 8. Figure 9 :

Figure 10: Illustration of network weights in supervised learning process c) Network ArchitectureComplicacy of ANN mainly arises for the hidden layers which are used to build the connection between inputs and outputs. Thus, it creates an indirect relationship between inputs and outputs. Information received from the input layer is first processed in the hidden layer, and then transmitted to the output layer. So, the learning capability due to nonlinearity of the input data is mostly depend on number of hidden layers. A multilayer forecasting model with one hidden layer can perform an arbitrary convex approximation to any continuous non-linear mapping. According to the universal approximation theorem for neural networks [49], the standard multilayer feed forward network with a single hidden layer and finite number of hidden neurons is sufficient for any complex simulate nonlinear function with any desired accuracy. It concludes with the view that number of hidden layers has an influence according to the complicacy of the problem otherwise system will be more complicated, even single hidden layer requires large number of nodes. So, it is always a tradeoff of number of hidden layers depending on complicacy of the problem. — Figure 9. Figure 10 :

Figure 11: The taxonomy of ANN architecture — Figure 10. Figure 11 :

Figure 12: Structure of FNN through a unidirectional information flow (only form input towards output). The inputs are fed directly to the outputs via a series of weights. Each connection may have different weights. Due to lack of any feedback, FNN must use a large number of input neurons for learning the historical data and consequently achieving good results. Despite of being simple, feed-forward networks are more efficient in complex nonlinear forecasting problem with bigger number of input variables [15].e) Mathematical ModelIn a mathematical model, has been developed referring to the Figure3.5 — Figure 11. Figure 12 :F

Figure 13: Forecasting performance of FNN for site 5 — Figure 12. Figure 13 :Figure 14 :

Figure 15: Forecasting performance of FNN for site 8 — Figure 13. Figure 15 :

Figure 16: MAPE and R of forecasted output of site 8 by FNN Figure 6-4, obtained regression function is comparatively good. The mismatched area lies in between 50 to 100Wh and the MAPE shows a scattered pattern of error. Moreover it is clear that the errors largely occurred at the peaks.Finally, another consideration is done for other consumer for optimized evaluation. In this case the overall performance has been considered. The selected — Figure 14. Figure 16 :

Figure 17: Forecasting performance of FNN for site 2 — Figure 15. Figure 17 :

Figure 16. Figure 18 :

Figure 19: PV production and available energy for site 2 — Figure 17. Figure 19 :

Figure 18. FFigure 20 :

Figure 21: PV production and available energy for site 17 — Figure 19. Figure 21 :Figure 22 :

Figure 20. Table 1 :

		i.
				( ) Volume XVII Issue II Version I
Consumer	ANN			of Researches in Engineering
1 2 3 4	R 0.94285 3.63E+03 33.9695 MSE MAPE (%) 0.89787 1.26E+04 39.39 0.78771 8.70E+03 28.5366 0.91719 1.49E+03 52.8844			Global Journal
5	0.83974 572.452		15.9883
6	0.41161 440.1884		103.0053
7	0.97297 1.27E+03 47.334
8	0.91181 343.6435		25.3231
9	0.9641	1.23E+03 26.5852
10	0.9468	537.9639	27.9004

⁴

⁵

⁶

Application of Short-Term Load Forecasting for Optimizing the Storage Devices of a Base Station

Table of contents