13 Trend and Seasonal Methods of Forecasting
Vikas Singla
13.1 OBJECTIVES
To make students understand:
- Trend method of variation in data and its quantitative forecasting method.
- Seasonal method of data variation.
- Multiplicative method of data prediction if it is varying in seasonal manner
13.2 INTRODUCTION
Previous module discussed time series method of forecasting when data variation occurred over a short term period. Also the fluctuation behavior of data was random and unknown. This module would focus on methods of forecasting when variation data is occurring from short to mid-term period. Quarterly sales, sales of products depending on weather conditions, traffic congestion on daily basis, movie ticket sales during weekends are some of the examples that fall in the category of such time period. Moreover, such data show a particular behavior in its variation. Sale of smart phones shows an increasing trend whereas sales of desktops show a decreasing trend over a monthly or quarterly time period. Whereas, sale of milk products, demand of winter clothing, demand of electricity, rush hours show cyclical trend occurring over regular time period. In all such cases data does follow a particular pattern which was missing in random variations. Thus, this module focuses on two forecasting techniques namely trend and seasonal used when data variation shows either of the following pattern over a specified time interval:
- either increasing or decreasing (discussed under category of trend method)
- regularly repeating upward or downward movements at regular intervals (discussed under seasonal method)
13.3 TREND METHOD OF FORECASTING
The trend component in the data may be either increasing or decreasing. This increase or decrease variation may be linear or not. But this chapter would limit itself only to linear variation in the data showing any particular trend. Analysis of trend involves developing a mathematical equation that would appropriately describe a relationship between two variables. The relationship can be positive or negative. Positive relationship would imply that with increase in one variable other variable would also show an increasing trend. Whereas, negative relationship would imply that with increase in one variable other variable would tend to show a decreasing trend. For example, cases of lung cancer would show an increasing trend with increase in smoking habits. However, with increase in awareness about harmful effects of smoking sales of tobacco products might decrease.
Thus, in trend forecasting there are two variables. One of the variable which is under control like time, money spend on advertisement, media time devoted to advertisements related to tobacco etc. is called as independent variable. The effect of these controlled variables on some other variable such as sales, awareness level etc. is called as dependent variable. When independent variable is time and dependent variable changes with change in time periods then it is time series analysis. If independent variable is other than time such as money spend on advertisement and effect of this is studied on amount of sales then it is studied under causal relationship. For both cases, one aspect is common i.e. there should be relationship between dependent and independent variable.
For forecasting purposes, this linear relationship between two variables linear regression analysis is used.
Regression analysis is the process of building a model involving two variables that can be used to predict one variable by another variable. The most elementary regression model is called simple regression in which the variable to be predicted is called the dependent variable and variable which predicts is called independent variable. The dependent variable is denoted by ‘y’ and independent variable is denoted by ‘x’. The model is denoted by following equation.
Dependent or predicted variable (y) = intercept (a) + slope (b) * independent or predictor variable (x)
Thus, a basic regression model where there is only one independent variable and the regression function is linear can be expressed as y = a + bx …..(1)
where y = dependent variable
x = independent variable
a and b are two parameters where a = intercept and b = slope intercept
The above stated model in (1) is said to be simple, because there is only one independent variable linear in parameters, because no parameter (a and b) appears as an exponent or is multiplied or divided by another parameter, and linear in independent variable, as it appears only in first power.
For instance in the above example if sales is considered as dependent and advertisement expenditure as independent variable then the model would become as:
Sales = a + b * (advertisement expenditure)
The parameters ‘a’ and ‘b’ in equation 1 are called as regression coefficients. ‘b’ is the slope of regression equation which indicates the change in dependent variable ‘Y’ with unit increase in independent variable ‘X’. The parameter ‘a’ is the Y intercept of the regression line. If the scope of regression model includes value of X as zero then the mean of Y is given by parameter ‘a’ otherwise it does not have any particular meaning. For example, figure 1 indicates following regression equation where Y is the rate in percentage and X is the time in years.
Y = -814.99 + 0.426*(X)
Where Y intercept ‘a’ = -814.99 indicates value of regression function if X=0. But as regression line does not start from zero so it does not carry any meaning. The slope parameter ‘b’ = 0.426 indicates that unit increase in X would lead to 0.426 increase in Y.
Thus, in this process entire aim would be calculate values of ‘a’ and ‘b’ which can be done by using following formulas.
Slope intercept ‘b’ can be calculated by using the following equation:
b = (∑XiYi – (∑Xi)( ∑Yi)/n ) / (∑X 2 – (∑Xi)2/n ) ………(2)
y intercept of the regression line can be calculated as:
a = (∑Yi – b∑Xi) / n ……….(3)
By using above discussed formulas we would explain following two examples. In the first example time would be taken as independent variable. This example would be considered as an application of trend method of forecasting. Second example would take time as dependent variable.
Example 13.2.1: A firm’s sales for a product line during 13 quarters of past three years are shown in Table 13.2.1.
Forecast sales for each quarter of fourth year.
Table 13.2.1 | ||||||||||||
Quarter
(X) |
1. | 2. | 3. | 4. | 5. | 6. | 7. | 8. | 9. | 10. | 11. | 13. |
Sales (Y) | 600 | 1550 | 1500 | 1500 | 2400 | 3100 | 2600 | 2900 | 3800 | 4500 | 4000 | 4900 |
Solution:
Table 13.2.2 | |||
Quarter (X) | Sales (Y) | XY | X square |
1 | 600 | 600 | 1 |
2 | 1550 | 3100 | 4 |
3 | 1500 | 4500 | 9 |
4 | 1500 | 6000 | 16 |
5 | 2400 | 13000 | 25 |
6 | 3300 | 19800 | 36 |
7 | 2600 | 18200 | 49 |
8 | 2900 | 23200 | 64 |
9 | 3800 | 34200 | 81 |
10 | 4500 | 45000 | 100 |
11 | 4000 | 44000 | 131 |
13 | 4900 | 58800 | 144 |
Total = 78 | 33550 | 269400 | 650 |
By using the formula for slope intercept ‘b’ as given in equation (2) b
= (∑XiYi – (∑Xi)( ∑Yi)/n ) / (∑Xi2 – (∑Xi)2/n )
= (269400 – 78*33550/13) / (650 – 6084/13)
= 358.91
Similarly ‘a’ is calculated by using formula given in equation (3) a
= (∑Yi – b∑Xi) / n
= (33550 – 358.91*78) / 13
= 462.87
Thus equation becomes:
Y = 462.87 + 358.91*(X)
For all quarters of fourth year i.e. for period 13, 14, 15 and 16 sales was forecasted as:
Forecast (for period 13) | = | 462.87 + 358.91*(X) |
= | 462.87 + 358.91*(13) | |
= | 5128.7 | |
Forecast (for period 14) | = | 462.87 + 358.91*(X) |
= | 462.87 + 358.91*(14) | |
= | 5487.61 | |
Forecast (for period 15) | = | 462.87 + 358.91*(X) |
= | 462.87 + 358.91*(15) | |
= | 5846.52 | |
Forecast (for period 16) | = | 462.87 + 358.91*(X) |
= | 462.87 + 358.91*(16) | |
= | 6205.43 |
Example 13.2.2: Example: A manufacturing company produces a spare part in batches which vary in size as demand fluctuates. Table 13.2.3 contains data on batch size and number of man hours required to produce a particular batch. It was suggested that larger batch size would require more effort in terms of time taken to produce. Also estimate mean number of hours required to produce a batch of 55 units
Table 13.2.3 | ||||||||||
Batch size (X) | 30 | 20 | 60 | 80 | 40 | 50 | 60 | 30 | 70 | 60 |
Man Hours (Y) | 73 | 50 | 138 | 170 | 87 | 108 | 135 | 69 | 148 | 132 |
Solution: As amount of time taken to produce a particular batch depends on size of that batch so, batch size is considered as independent variable and man hours as dependent variable.
By using equation (2) and using the values of Table 13.2.3 we can calculate values of ‘a’ and ‘b’ as shown in Table 13.2.4:
Table 13.2.4 | |||
Batch Size (X) | Man hours (Y) | XY | X2 |
30 | 73 | 2190 | 900 |
20 | 50 | 1000 | 400 |
60 | 138 | 7680 | 3600 |
80 | 170 | 13600 | 6400 |
40 | 87 | 3480 | 1600 |
50 | 108 | 5400 | 2500 |
60 | 135 | 8100 | 3600 |
30 | 69 | 2070 | 900 |
70 | 148 | 10360 | 4900 |
60 | 132 | 7920 | 3600 |
Total = 1100 | 500 | 61800 | 28400 |
By using calculations in Table 13.2.4 regression parameters are:
b = (61800 – (500)(1100)/10 ) / (28400 – (500)2/10 )
= 2.0
a = (1100 – 2(500)) / 10
= 10.0
Thus equation becomes:
Y = 10 + 2*X
And estimated man hours for batch size of 55 would be
Y = 10 + 2*(55)
= 130
13.4 SEASONAL METHOD OF FORECASTING
Seasonal variations in time series data represent upward or downward movement occurring repeatedly at regular intervals. For instance, sales of winter clothes increases during winter months and decreases during summer months. Same cycle gets repeated after same time period. Two essential characteristics of seasonal variation is that:
- Every cycle of upward and downward movement repeats itself after similar time period.
- Secondly, regular cycles can be used to infer the time for which there would be an increase and time for which there would be decrease in demand.
For example, rush hours happen twice a day. For illustration purposes suppose there is increase in traffic during two hours in morning and for two hours in evening every day. So, in this cycle there would be increase during morning and then traffic would start decreasing which would again increase during evening and then again would drop. This cycle repeats itself every day regularly making it a seasonal variation where each season is of one day. So, if naïve forecast is used then demand for Tuesday can be estimated as demand for Monday during peak hours. Similarly, in case of barber shop there is increase in demand of hair cuts during Sundays and then drops. This cycle gets repeated every week. In this case each season is of one week. Demand of hair cuts for one Saturday can be estimated as demand for previous Saturday.
Models of seasonality:
To estimate seasonal forecast there are two different models: additive and multiplicative.
In the additive model seasonality is expressed as a quantity which is added or subtracted to average of the data. In the multiplicative model seasonality is expressed as percentage of the average amount which is then used to multiply the value of a series to incorporate seasonality. These seasonal percentages are termed as seasonal indexes. For instance, seasonal index of 1.20 regarding sales of toys for a particular time period suggests that sales of toys for that quarter are 20% percent more than the average sales. Similarly, seasonal index of 0.90 would imply that sales for that period are 10% below the average.
This chapter discusses only multiplicative model as this model is more widely used and it tends to be more representative of actual forecast.
Example 13.2.3: The following data clearly depicts seasonal pattern of a particular business. It was tentatively estimated by looking at increasing trend of demand per year that (as shown in Total row) that total demand in fifth year would be 2600. Predict quarterly demand for fifth year.
Table 13.2.5 | ||||
Quarter | Year 1 | Year 2 | Year 3 | Year 4 |
1. | 45 | 70 | 100 | 100 |
2. | 335 | 370 | 585 | 725 |
3. | 520 | 590 | 830 | 1160 |
4. | 100 | 170 | 285 | 215 |
Total | 1000 | 1300 | 1800 | 2200 |
Solution: Some important points that should be considered by looking at the data
- The data shows that there is increase in demand as we move from first quarter which peaks in third quarter and then shows a decline in fourth quarter. This cycle gets repeated for every year in four year data shown.
- Here, data of each year is divided into four seasons pertaining to quarterly data. If we use naïve forecast then demand for next season i.e. for first quarter of fifth year would be more than 100.
- So, quarter wise demand comparison follows a trend pattern, whereas year wise comparison indicates seasonal pattern with increase and decrease components.
Step 1: Firstly, calculate average demand per season:
Year 1: 1000 / 4 = 250,
Year 2: 1300 / 4 = 300
Year 3: 1800 / 4 = 450,
Year 4: 2200 / 4 = 550
Step 2: Calculation of seasonal indices: Divide the actual demand for a season by the average demand of that season.
Quarter | Year 1 | Year 2 | Year 3 | Year 4 |
1. | 45/250 | 70/300 | 100/450 | 100/550 |
2. | 335/250 | 370/300 | 585/450 | 725/550 |
3. | 520/250 | 590/300 | 830/450 | 1160/550 |
4. | 100/250 | 170/300 | 285/450 | 215/550 |
Step 3: Calculate the average of seasonal indices of each season.
Quarter | Year 1 | Year 2 | Year 3 | Year 4 | Average of seasonal indices |
1. | 0.18 | 0.23 | 0.22 | 0.18 | (0.18 + 0.23 + 0.22 + 0.18)/4 = 0.20 |
2. | 1.34 | 1.23 | 1.30 | 1.32 | 1.30 |
3. | 2.08 | 1.97 | 1.84 | 2.11 | 2.00 |
4. | 0.40 | 0.57 | 0.63 | 0.39 | 0.50 |
Step 4: As it has been estimated that fifth year demand would be 2600, so it can be suggested that average demand per quarter in fifth year would be 2600/4 = 650 units.
Thus, quarterly forecast can be made by multiplying the seasonal index of each quarter with average demand of that quarter.
Forecast demand of first quarter of fifth year | = 650*0.20 | = 130 |
Forecast demand of second quarter of fifth year | = 650*1.30 | = 845 |
Forecast demand of third quarter of fifth year | = 650*2.00 | = 1300 |
Forecast demand of fourth quarter of fifth year | = 650*0.50 | = 325 |
13.5 SUMMARY
This chapter discusses forecasting techniques when data shows a certain pattern over short to medium term. Specifically, trend and seasonal methods have been discussed with some illustrations. Trend method pertains to either increasing or decreasing fluctuation of data over selected time period. Such time period can be daily, weekly or monthly but preferably should be less than one year. Trend method involves time series data and uses method of linear regression for prediction purposes. Causal method also uses same methodology but does not involve time series data. Seasonal method of forecasting is used where data shows both increasing and decreasing pattern over a certain time period. In this method pattern is repeated over regular interval of time period. Multiplicative method of prediction has been illustrated as it is more representative of forecast.
13.6 GLOSSARY
- Trend Forecasting: A forecasting technique in which time series data shows either increasing or decreasing pattern.
- Causal Forecasting: A situation in which one variable causes another.
- Linear Regression: A least squares method which assumes that past data and future projections fall around a straight line.
- Regression parameters: Intercept ‘a’ and slope intercept ‘b’ are used to construct regression line with minimum error.
- Seasonal method: is forecasting technique where past data shows increasing and decreasing pattern regularly repeatedly.
13.7 REFERENCES/ SUGGESTED READINGS
- Chase, B.R., Shankar, R., Jacobs, F.R. and Aquilano, N.J., Operations & Supply Chain Management, 13th Edition, McGraw Hill.
- Stevenson, W.J., Operations Management, 9th Edition, Tata McGraw Hill.
- Lee J. Krajewski, Operations Management, Prentice-Hall of India, New Delhi, 8th Edition.
13.8 SHORT ANSWER QUESTIONS
1. Using some fictitious data we wish to predict the musical ability for a person who scores 8 on a test for mathematical ability. We know the relationship is positive. We know that the slope is 1.63 and the intercept is 8.41. What is their predicted score on musical ability?
(a) 80.32 (b) 21.45 (c) 68.91 (d) -4.63
Answer: b
2. If b = 0 the line of best fit will conventionally be drawn
.
(a) through the middle of the datapoints
(b) as horizontal
(c) as provides the best fit to the scores.
Answer: b
3. If you read that the slope of a scatter diagram is 2.00, what does this mean?
(a) For every increase of 1 on the x-axis there is an increase of 2.00 on the y-axis.
(b) For every increase of 2.00 on the x-axis there is an increase of twice as much on the y-axis.
(c) For every increase of 2.00 on the x-axis there is an increase of 2.00 on the y-axis.
(d) Very little.
Answer: a
4. The slope of the line is called:
(a) a correlation coefficient and indicates the variability of the points around the regression line in the scatter diagram.
(b) b and gives us a measure of how much y changes as x changes.
(c) a and is the point where the regression line cuts the vertical axis.
(d) none of the above.
Answer: b
5. In seasonal variation time period for increase in demand during one cycle can be used to predict the increase in next cycle.
(a) True (b) False
Answer: a
6. The time period for increase and decrease is similar in each cycle of seasonal variation.
(a) True (b) False
Answer: a
13.9 MODEL QUESTIONS
1. The complete demand data for years 1, 2, and 3 are given below. Compute the forecast for each of quarters in year 4, given that the forecast for the total demand in year 4 is 2980 gallons.
Quarterly demand (in gallons) | ||||
Year | Quarter 1 | Quarter 2 | Quarter 3 | Quarter 4 |
1 | 350 | 710 | 950 | 420 |
2 | 370 | 680 | 1060 | 500 |
3 | 450 | 750 | 1020 | 570 |
2. Show how data is fluctuating graphically and use appropriate method to predict for next period.
X | 1. | 2. | 3. | 4. | 5. | 6. | 7. | 8. | 9. | 10. | 11. | 13. |
y | 600 | 1550 | 1500 | 1500 | 2400 | 3100 | 2600 | 2900 | 3800 | 4500 | 4000 | 4900 |