Forecasting of Gold Prices Volatility with Symmetric and Asymmetric Volatility Models

This paper forecasts the out-of-sample volatility of gold price changes in Turkey. Judged by both the symmetric and the asymmetric evaluation criteria, the GJR-GARCH model is the best-fitting model for forecasting gold price volatility in Turkey. The GJR-GARCH estimates yield a negative asymmetry coefficient for gold prices. This indicates that positive news in the market affects the volatility of gold prices in the next period more than negative news does.


INTRODUCTION
Volatility estimates are used by researchers to study fluctuations in international financial markets, as well as for hedging and speculative income. Volatility estimation also has a significant place in the application of asset pricing models, including foreign exchange rate risk, and in policymaking and regulation, hedging, financial risk management, option pricing, and international portfolio diversification. Although there is ample evidence on volatility forecasting performance for international stock exchanges and foreign exchange markets, there is little evidence on volatility forecasting for commodity prices (Kroner et al., 1995).
The statistical characteristics of financial time series play a key role in the development of volatility forecasts. The studies of Mandelbrot (1963) and Fama (1965) indicate that financial returns are not correlated over time, yet are not independent of each other. They also point out that large changes in the prices of financial assets tend to be followed by large changes, and small changes by small changes; in other words, volatility clusters form.
It is also known that financial return series are not normally distributed but show features such as excess kurtosis, volatility clustering, asymmetric response, and leverage.
The simple volatility model assumes that returns are independent and identically distributed, with zero mean and constant variance; these assumptions are not valid for financial return series. With the Autoregressive Conditional Heteroscedasticity (ARCH) model, Engle (1982) revealed the existence of heteroscedasticity in financial time series and argued that it should be modeled. This model was later extended by Bollerslev (1986) and named the Generalized ARCH (GARCH) model. Since these studies, ARCH models have been used frequently in volatility modeling in the finance literature.
Because financial return series differ in their statistical properties, the success of ARCH models has led to more complex ARCH derivatives. For this reason, choosing a volatility forecasting model by comparing the in-sample and out-of-sample performance of several candidate models gives more accurate results in practice.
The focus of this study is on forecasting volatility in gold prices in Turkey. In this context, the aim is to find the best performing model for gold prices among a range of volatility models (random walk, simple moving average models, exponential smoothing model, ARCH, GARCH, GJR-GARCH, and E-GARCH). The findings of the best performing model are then discussed.
The remainder of this paper is organized as follows. The second section presents the existing literature on gold price volatility forecasting. The third section describes the symmetric and asymmetric volatility forecasting methodology and the data, and discusses the forecast evaluation methods. The empirical results are presented in the fourth section. The fifth section concludes.

LITERATURE REVIEW
Kutan and Aksoy (2004) directly used the GARCH(1,1) model to examine the effect of the consumer price index on gold market returns and volatility, without investigating which model was most suitable. They concluded that gold does not react significantly to consumer price index news and is not a good protection against inflation. Capie et al. (2005) examined how gold behaves as a hedging instrument against exchange rate risk, using GARCH, threshold GARCH, and exponential GARCH methods. Among these, the GARCH(1,2) model was found to be the best model for the volatility structure. Erer (2011) examined volatility in the gold market using weekly data on the sale price of gold (TL/gr) for the period 2001-2011, applying symmetric and asymmetric conditional volatility models to the logarithmic return series of the gold bullion sale price. The most successful result was obtained with the TARCH(2,2) model. Cihangir and Ugurlu (2018) examined the volatility of gold prices in Turkey using daily data for the period 2004-2012. They used GARCH, GJR-GARCH, and EGARCH models and selected the GJR-GARCH model as the best-fitting model according to the model selection criteria. According to the GJR-GARCH results, there is no leverage effect in the Istanbul Gold Market.
Aksoy (2013) investigated the day-of-week effect on returns and volatility using Istanbul Gold Exchange gold and silver prices for the period 2008-2011. Using GARCH models, the study finds a day-of-week effect in both the returns and the volatility of gold. It also concludes that gold prices are more volatile than silver prices.

METHODOLOGY AND DATA
The observed volatility of gold prices is measured monthly for use in forecasting and estimation. The gold price data cover the period 1985:01-2018:01. Following Balaban (2004), observed volatility is defined as the standard deviation of logarithmic returns. The logarithmic return series is calculated from consecutive prices, where $P_t$ and $R_t$ are the price and the return in month t. Monthly volatility is defined as the within-month standard deviation of returns.
It is impossible to employ all available models in a single study. This study uses a wide range of time series forecasting techniques, from the naive random walk benchmark to the more sophisticated conditional heteroscedasticity models, as in Balaban, Bayar, and Faff (2006).
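A standard way to write the logarithmic return defined above, assuming the usual one-period definition (the exact notation is an assumption), is:

```latex
R_t = \ln\!\left(\frac{P_t}{P_{t-1}}\right) = \ln P_t - \ln P_{t-1}
```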
In addition, models with regime-switching specifications are excluded. While a regime-switching model is well suited to in-sample modeling, it is not readily amenable to an out-of-sample volatility forecasting exercise (Balaban, Bayar and Faff, 2006).
This study's models include a random walk, simple moving average models, an exponential smoothing model, a regression model, and symmetric and asymmetric conditional volatility models.

Random walk (RW) model:
The RW model assumes that the best forecast of this month's volatility is last month's observed volatility.

Moving average (MA-α) models:
The MA-α model states that the best forecast is an equally weighted average of the realized values in the last α months, where α = 3, 12, 30.
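The two naive benchmarks can be sketched in a few lines of Python. This is a minimal illustration, not the author's implementation: the function names and the sample volatility series are hypothetical, and the series is assumed to be ordered from oldest to newest.

```python
from statistics import mean


def rw_forecast(vol):
    """Random walk: the forecast for next month is the last observed volatility."""
    return vol[-1]


def ma_forecast(vol, alpha):
    """MA-alpha: equally weighted average of the last `alpha` observed values."""
    if alpha < 1 or alpha > len(vol):
        raise ValueError("alpha must be between 1 and len(vol)")
    return mean(vol[-alpha:])


# Hypothetical monthly volatility series (illustrative numbers only).
observed = [0.020, 0.030, 0.025, 0.040, 0.035, 0.030]

rw = rw_forecast(observed)      # last observed value
ma3 = ma_forecast(observed, 3)  # average of the last three months
```

In an out-of-sample exercise the same functions would be applied to each estimation window in turn, with α set to 3, 12, or 30.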

Exponential smoothing (ES) model:
Under the ES model, the forecast is a function of the immediately preceding forecast and the immediately preceding observed volatility. The smoothing parameter (θ) is restricted to lie between zero and one. The optimal θ is estimated by minimizing the mean squared error, with an annual update.
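A minimal Python sketch of this scheme follows. It assumes that θ is the weight on the previous forecast and (1 - θ) the weight on the previous observation, that the first forecast is initialized at the first observation, and that θ is chosen by a simple grid search; all three choices, like the function names, are assumptions made for illustration.

```python
def es_forecasts(vol, theta):
    """One-step-ahead forecasts; f[t] is the forecast of vol[t].

    Assumed recursion: f[t] = theta * f[t-1] + (1 - theta) * vol[t-1].
    """
    if not 0.0 <= theta <= 1.0:
        raise ValueError("theta must lie between zero and one")
    f = [vol[0]]  # assumed initialization: first forecast equals first observation
    for t in range(1, len(vol)):
        f.append(theta * f[t - 1] + (1.0 - theta) * vol[t - 1])
    return f


def mse(vol, forecasts):
    """Mean squared forecast error over the sample."""
    return sum((f - v) ** 2 for f, v in zip(forecasts, vol)) / len(vol)


def best_theta(vol, steps=100):
    """Grid search for the theta in [0, 1] that minimizes the MSE."""
    grid = [i / steps for i in range(steps + 1)]
    return min(grid, key=lambda th: mse(vol, es_forecasts(vol, th)))
```

With θ = 0 the recursion collapses to the random walk forecast, which gives a quick sanity check on the implementation. The annual update described in the text would correspond to calling `best_theta` again once every twelve months on the current estimation window.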

Regression model:
In the regression model, I use the parameter estimates of c and β from monthly rolling regressions to forecast next month's volatility.
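The regressors are not stated in the text. One common specification, assumed here purely for illustration, regresses observed volatility on its own first lag and uses the rolling estimates of c and β for the one-step-ahead forecast:

```latex
\sigma_t = c + \beta\,\sigma_{t-1} + \varepsilon_t,
\qquad
\hat{\sigma}_{t+1} = \hat{c} + \hat{\beta}\,\sigma_t
```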
It should be noted that, because this study investigates out-of-sample forecasts, all parameter estimates for all competing models use data from the estimation windows only.

Symmetric Conditional Volatility Models:
The use of conditional heteroscedastic models has been a common tool for modelling and forecasting volatility of financial asset returns following the introduction of the ARCH model and its generalized version, the GARCH model.
Note that the previous models use the monthly volatility series. With the conditional volatility models, however, monthly price changes are first modelled as a p-order autoregression. The autoregressive terms account for the economically minor but statistically significant autocorrelation in price changes. The monthly prediction errors ($u_t$) are assumed to be conditionally normally distributed with zero mean and variance $\sigma_t^2$, based on the information set Ψ available at time t-1. The conditional variance can then be modeled as an AR(q) process in the squares of the estimated residuals, where $v_t$ is a white noise process. If $\phi_1 = \phi_2 = \dots = \phi_q = 0$, the variance is homoscedastic.
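The two steps described above can be written out as follows. This is a sketch under assumptions: the intercept μ, the autoregressive coefficients ρ_i, the subscript on Ψ, and the intercept φ_0 are notation introduced here, while u_t, σ_t², φ_1, ..., φ_q and v_t follow the text.

```latex
\begin{align}
R_t &= \mu + \sum_{i=1}^{p} \rho_i R_{t-i} + u_t,
  \qquad u_t \mid \Psi_{t-1} \sim N\!\left(0, \sigma_t^2\right) \\
\hat{u}_t^2 &= \phi_0 + \phi_1 \hat{u}_{t-1}^2 + \dots + \phi_q \hat{u}_{t-q}^2 + v_t
\end{align}
```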

ARCH(1) model
In the autoregressive conditional heteroskedasticity ARCH(1) process, the conditional variance of $u_t$ depends on the realized value of $u_{t-1}^2$. The higher the realized value of $u_{t-1}^2$, the higher the conditional variance in period t.
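In its standard form (Engle, 1982), with coefficient names α_0 and α_1 assumed here, the ARCH(1) conditional variance is:

```latex
\sigma_t^2 = \alpha_0 + \alpha_1 u_{t-1}^2,
\qquad \alpha_0 > 0,\ \alpha_1 \ge 0
```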

GARCH(1,1) model:
Bollerslev (1986) extended Engle's ARCH model to allow the conditional variance to be modeled as an ARMA(p, q) process. In this model, the conditional variance is defined as a function of autoregressive and moving average terms. The advantage of this model over the ARCH model is that it can capture volatility persistence without the need for a large number of parameters. The most commonly used GARCH model in the finance literature is the GARCH(1,1) model, in which the current period's conditional volatility depends on the previous period's conditional volatility and the previous period's squared prediction error.
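The GARCH(1,1) dependence described above is usually written as follows (Bollerslev, 1986); the coefficient names α_0, α_1 and β_1 are assumed here:

```latex
\sigma_t^2 = \alpha_0 + \alpha_1 u_{t-1}^2 + \beta_1 \sigma_{t-1}^2
```

The sum α_1 + β_1 measures volatility persistence: the closer it is to one, the more slowly a shock to the variance dies out.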

Asymmetric Conditional Volatility Models:
In the ARCH and GARCH models, the signs of the shocks disappear because the errors are squared; only their magnitude can be interpreted. In other words, positive and negative shocks of the same magnitude have the same calculated effect on volatility. This, however, does not fully reflect a feature observed in financial asset series: a negative shock (bad news) has a greater impact on volatility than a positive shock (good news) of the same magnitude. Such asymmetries in stock returns are called the leverage effect, because a fall in the firm's stock price raises its debt-to-equity ratio. According to Dijk and Franses (2000), the conditional variance of financial asset series generally responds asymmetrically to the previous return. Also, the volatility of financial assets is high during recessions. In short, asymmetric volatility is a characteristic feature of financial time series (Li and Li, 1996). The most widely used asymmetric GARCH models are the Threshold ARCH (TARCH) model and the very similar GJR-GARCH model, introduced by Zakoian (1994) and Glosten, Jagannathan, and Runkle (1993), respectively, and the Exponential GARCH (E-GARCH) model developed by Nelson (1991).

E-GARCH(1,1) model:
The leptokurtic structure and volatility clustering that exist in financial time series can be captured effectively with the GARCH model. However, GARCH models fail to capture the asymmetry that distinguishes between negative and positive shocks in the variance structure. The exponential GARCH (EGARCH) model was developed by Nelson (1991) to address this weakness by taking into account the asymmetry in the volatility structure. The EGARCH model allows for the possibility that upward and downward movements in financial markets do not have the same effect on the predictability of the future volatility of financial assets. Downward movements are more effective than upward movements in predicting volatility. This effect, called the "leverage effect", was first put forward by Black (1976). In the E-GARCH model, which captures the claim that negative news has more impact on the volatility of financial assets than positive news, the conditional variance of a time series is a nonlinear function of the magnitude and sign of its own past values and lagged residuals (Equation 9). The condition for this model to work is that the γ parameter is statistically significant.
Accordingly, the statistically significant negative γ parameter indicates that positive return shocks generate less volatility than negative return shocks. For example, the volatility of gold prices tends to increase after negative returns and to decrease after positive returns. As a result, the presence of asymmetric volatility in the EGARCH model depends on the statistical significance of the γ parameter.
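One widely used way to write the E-GARCH(1,1) conditional variance is given below. The symbols ω, α and β are assumptions introduced here; γ is the asymmetry parameter discussed in the text.

```latex
\ln \sigma_t^2 = \omega + \beta \ln \sigma_{t-1}^2
  + \alpha \left| \frac{u_{t-1}}{\sigma_{t-1}} \right|
  + \gamma \, \frac{u_{t-1}}{\sigma_{t-1}}
```

Under this form a positive standardized shock enters with weight α + γ and a negative one with weight α - γ, so γ < 0 implies that negative shocks generate more volatility than positive shocks of the same size.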

GJR-GARCH(1,1) model:
Glosten, Jagannathan, and Runkle (1993) developed a GARCH model that takes into account the different effects of good and bad news on volatility, which is why the threshold GARCH model is also called GJR-GARCH. The GJR-GARCH, or threshold GARCH, model is an asymmetric ARCH process used in modeling volatility. In this model, $u_{t-1} = 0$ acts as a threshold, and shocks above and below this threshold have different effects on volatility.
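A common way to write the GJR-GARCH(1,1) model with the threshold at zero is shown below. The indicator variable I_{t-1} and the coefficient names α_0 and β_1 are assumptions introduced here; α_1 and γ are the parameters interpreted in the empirical section.

```latex
\sigma_t^2 = \alpha_0 + \alpha_1 u_{t-1}^2
  + \gamma \, u_{t-1}^2 I_{t-1} + \beta_1 \sigma_{t-1}^2,
\qquad
I_{t-1} =
\begin{cases}
1 & \text{if } u_{t-1} < 0 \\
0 & \text{otherwise}
\end{cases}
```

In this form a positive shock affects the conditional variance through α_1 alone, while a negative shock affects it through α_1 + γ.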

Forecast Evaluation
This study employs four commonly used symmetric error statistics: the mean error (ME), the mean absolute error (MAE), the mean squared error (MSE), and the mean absolute percentage error (MAPE). The monthly forecast error is the difference between forecast volatility and observed volatility. Symmetric statistics weight over- and under-predictions equally. However, whether volatility is under- or over-predicted is of primary importance for traders with long and short positions, as well as for option buyers and sellers. Although Poon and Granger (2003) suggest that using asymmetric evaluation criteria is advisable, only a few papers in the literature do so (Balaban, 2004; Balaban, Bayar and Faff, 2006).
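The four symmetric statistics can be computed as in the Python sketch below. It assumes that the forecast error is defined as forecast minus observed volatility, so that a positive ME signals over-prediction, and it reports MAPE as a fraction rather than a percentage; the function name and the sample numbers are hypothetical.

```python
def error_stats(forecast, observed):
    """Return ME, MAE, MSE and MAPE for paired forecast/observed series."""
    if len(forecast) != len(observed) or not forecast:
        raise ValueError("series must be non-empty and of equal length")
    # Assumed sign convention: error = forecast - observed.
    errors = [f - o for f, o in zip(forecast, observed)]
    n = len(errors)
    return {
        "ME": sum(errors) / n,
        "MAE": sum(abs(e) for e in errors) / n,
        "MSE": sum(e * e for e in errors) / n,
        # Fraction of the observed value; multiply by 100 for a percentage.
        "MAPE": sum(abs(e) / abs(o) for e, o in zip(errors, observed)) / n,
    }


# Illustrative numbers only: one over-prediction and one under-prediction.
stats = error_stats([2.0, 4.0], [1.0, 5.0])
```

In this example the ME is zero even though each forecast is off by one unit, because errors of opposite sign offset each other, whereas the MAE and MSE are both one.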
In addition, this study employs asymmetric error statistics, including the mean logarithmic error (LE) metric (Pagan and Schwert, 1990), to discriminate between under- and over-predictions.
The LE statistic is defined as in Pagan and Schwert (1990).

EMPIRICAL RESULTS
The series are summarized in Table 1, and their graphs are given in Figure 1. At the 10% significance level, there is an ARCH effect at the 1st, 3rd, and 9th lags of the gold return series, but no ARCH effect is found at the 6th and 12th lags. Table 3 presents the comparative results of the symmetric evaluation criteria and the summary statistics. The ME statistic shows whether a model under- or over-predicts on average. All models over-predict volatility except the regression model and the conditional volatility models (ARCH, GARCH, GJR-GARCH, and E-GARCH). According to the ME statistics, the MA30 model over-predicts the most, while the GJR-GARCH model under-predicts the least. However, too much weight should not be given to the ME, as negative and positive forecast errors can cancel each other out. When I ignore the ME results, the mean- and median-adjusted standard deviations of the error statistics show that the MSE statistic produces the most variable performance results among the models.
Looking at the other symmetric criteria, the GJR-GARCH model has the best performance according to the MAE and MAPE criteria. It is followed by the GARCH and ARCH models, respectively. According to the MSE criterion, the E-GARCH model has the best performance, followed by the GJR-GARCH, ARCH, and GARCH models, respectively. When all symmetric criteria are considered, the model with the consistently worst performance is the random walk model. It is followed by the MA3, MA12, and MA30 models, respectively.
It should be noted that, irrespective of the error statistic, the performance of the MA-α models is almost indistinguishable across values of α. Thus, the weighting approach does not seem to add much value. Table 4 shows the results of the asymmetric evaluation criteria, where positive and negative forecast errors are treated differently. Our second asymmetric criterion, the LE statistic, favours the GJR-GARCH model over its competitors, and particularly over the GARCH model, a symmetric conditional volatility specification. The ARCH, E-GARCH, and regression models follow them, respectively.
According to Tables 3 and 4, the optimal model for forecasting gold price volatility is the GJR-GARCH model. This finding is consistent with Cihangir and Ugurlu (2018). Erer (2011) also stated that the best performing model for gold price prediction is TARCH; apart from the model name, our results correspond. It is important to interpret the GJR-GARCH estimation results, since they contain leverage (asymmetry) information for gold prices. The full-period GJR-GARCH estimation results are therefore given in Table 5.
Notes: ***, ** and * indicate statistical significance at the 1%, 5%, and 10% levels, respectively. The estimation method is ML ARCH with normal distribution (BFGS / Marquardt steps); convergence was achieved after 25 iterations.
According to Table 5, the γ parameter is estimated as -0.315230, and this value is statistically significant. Therefore, I can say that the model works. The $\alpha_1$ parameter, which expresses the effect of positive news on the conditional variance, is estimated as 0.158442 and is statistically significant. In asymmetric models, the effect of good news is captured by $\alpha_1$ and the effect of bad news by $\alpha_1 + \gamma$. In models with a leverage effect (i.e., $\gamma > 0$) whose asymmetry parameter is statistically significant, there is a negative shock asymmetry: bad (negative) news affects the next period's volatility of gold prices more than positive news. However, the asymmetry coefficient estimated in this study is -0.315230, so $\gamma < 0$. In models with a statistically significant asymmetry coefficient $\gamma < 0$, there is a positive shock asymmetry: good (positive) news affects the volatility of gold prices in the next period more than bad (negative) news (Brooks, 2008: 408).
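The comparison between good and bad news can be checked with a short calculation using the two reported estimates. The sketch below assumes the indicator form of the GJR-GARCH(1,1) model, in which a negative shock switches on the γ term; the function name is hypothetical, and only the shock term of the variance equation is evaluated.

```python
ALPHA_1 = 0.158442  # reported estimate of alpha_1
GAMMA = -0.315230   # reported estimate of the asymmetry parameter


def shock_term(shock, alpha_1=ALPHA_1, gamma=GAMMA):
    """Shock term of the conditional variance: (alpha_1 + gamma * I) * shock**2.

    I equals 1 for a negative shock and 0 otherwise (assumed indicator form).
    """
    indicator = 1.0 if shock < 0 else 0.0
    return (alpha_1 + gamma * indicator) * shock ** 2


good_news = shock_term(1.0)   # alpha_1
bad_news = shock_term(-1.0)   # alpha_1 + gamma
```

Under these assumptions a unit positive shock contributes 0.158442 to next period's conditional variance, while a unit negative shock contributes 0.158442 - 0.315230 = -0.156788, so good news raises next-period variance by more than bad news does.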

CONCLUSIONS
In this paper, the author analyses a wide range of volatility forecasting techniques for gold prices in Turkey, using both symmetric and asymmetric evaluation criteria. To the best of our knowledge, there has been no previous evidence on the out-of-sample predictive accuracy of a broad range of time series models of volatility using gold price (TL/gr) data. The following points are worth emphasizing.
The overall rankings of the symmetric error statistics indicate that the GJR-GARCH model is clearly superior to its competitors, while both the symmetric and the asymmetric conditional volatility models perform better than the other models. The GJR-GARCH estimates yield a negative asymmetry coefficient for gold prices, which shows that positive news in the market affects the volatility of gold prices in the next period more than negative news. These results are of importance for gold price forecasting, spot and derivatives pricing, and risk management.