A similarity of multivariate time series in stocks network analysis

Correlation-based network as a model for financial markets, especially stock market, is a complex system has received much attention. There have been a lot of studies which deals with stocks network analysis, where each stock is represented by a univariate time series of its closing price, and then...

Full description

Saved in:
Bibliographic Details
Main Author: Gan, Siew Lee
Format: Thesis
Language:English
Published: 2016
Subjects:
Online Access:http://eprints.utm.my/id/eprint/78035/1/GanSiewLeePFS2016.pdf
http://eprints.utm.my/id/eprint/78035/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:98220
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.78035
record_format eprints
spelling my.utm.780352018-07-23T06:05:28Z http://eprints.utm.my/id/eprint/78035/ A similarity of multivariate time series in stocks network analysis Gan, Siew Lee QA Mathematics Correlation-based network as a model for financial markets, especially stock market, is a complex system has received much attention. There have been a lot of studies which deals with stocks network analysis, where each stock is represented by a univariate time series of its closing price, and then the similarity between two stocks are quantified by using Pearson correlation coefficient (PCC) on the logarithmic returns. However, in daily stock market activity, stock is represented by a multivariate time series during its opening, highest, lowest, and closing prices. The solely used of the information from closing price may cause the loss of information from other prices. In this thesis, all four prices are considered. The notion of multivariate time series similarity among stocks are developed. The use of Escoufier vector correlation (EVC), a multivariate generalization of PCC, is proposed to measure the similarity between stocks. Then the EVC coefficients are used to construct the stocks network in multivariate setting based on minimal spanning tree (MST). In the case study on BURSA MALAYSIA, the topological properties of stocks in EVC-based MST and in PCC-based MST are different. The total path lengths among stocks in the economic sector according to EVC-based MST is generally smaller than according to PCC-based MST. It means that with the approach of EVC-based MST, the stocks are strongly connected with other stocks in the same sector. Moreover, EVC is proposed to define the similarity between economic sectors, where each sector is represented by a multivariate time series of p components and each component is a univariate time series of stock’s closing price. To the best of our knowledge, there is no previous studies which deals with the similarity between economic sectors using this approach. The methodology for economic sectors network analysis is formulated in this thesis. The current practice of using Kruskal’s or Prim’s algorithm is to obtain MST, and then sub-dominant ultrametric (SDU) from the MST. It will consume a lot of time when the number of stocks is large. Therefore to solve this problem, an efficient algorithm is developed based on fuzzy relation approach. A comparison study based on the empirical and simulated data shows that the proposed algorithm is faster. The proposed algorithm provides not only MST and SDU, but also the forest of all MSTs. 2016-03 Thesis NonPeerReviewed application/pdf en http://eprints.utm.my/id/eprint/78035/1/GanSiewLeePFS2016.pdf Gan, Siew Lee (2016) A similarity of multivariate time series in stocks network analysis. PhD thesis, Universiti Teknologi Malaysia, Faculty of Science. http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:98220
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
language English
topic QA Mathematics
spellingShingle QA Mathematics
Gan, Siew Lee
A similarity of multivariate time series in stocks network analysis
description Correlation-based network as a model for financial markets, especially stock market, is a complex system has received much attention. There have been a lot of studies which deals with stocks network analysis, where each stock is represented by a univariate time series of its closing price, and then the similarity between two stocks are quantified by using Pearson correlation coefficient (PCC) on the logarithmic returns. However, in daily stock market activity, stock is represented by a multivariate time series during its opening, highest, lowest, and closing prices. The solely used of the information from closing price may cause the loss of information from other prices. In this thesis, all four prices are considered. The notion of multivariate time series similarity among stocks are developed. The use of Escoufier vector correlation (EVC), a multivariate generalization of PCC, is proposed to measure the similarity between stocks. Then the EVC coefficients are used to construct the stocks network in multivariate setting based on minimal spanning tree (MST). In the case study on BURSA MALAYSIA, the topological properties of stocks in EVC-based MST and in PCC-based MST are different. The total path lengths among stocks in the economic sector according to EVC-based MST is generally smaller than according to PCC-based MST. It means that with the approach of EVC-based MST, the stocks are strongly connected with other stocks in the same sector. Moreover, EVC is proposed to define the similarity between economic sectors, where each sector is represented by a multivariate time series of p components and each component is a univariate time series of stock’s closing price. To the best of our knowledge, there is no previous studies which deals with the similarity between economic sectors using this approach. The methodology for economic sectors network analysis is formulated in this thesis. The current practice of using Kruskal’s or Prim’s algorithm is to obtain MST, and then sub-dominant ultrametric (SDU) from the MST. It will consume a lot of time when the number of stocks is large. Therefore to solve this problem, an efficient algorithm is developed based on fuzzy relation approach. A comparison study based on the empirical and simulated data shows that the proposed algorithm is faster. The proposed algorithm provides not only MST and SDU, but also the forest of all MSTs.
format Thesis
author Gan, Siew Lee
author_facet Gan, Siew Lee
author_sort Gan, Siew Lee
title A similarity of multivariate time series in stocks network analysis
title_short A similarity of multivariate time series in stocks network analysis
title_full A similarity of multivariate time series in stocks network analysis
title_fullStr A similarity of multivariate time series in stocks network analysis
title_full_unstemmed A similarity of multivariate time series in stocks network analysis
title_sort similarity of multivariate time series in stocks network analysis
publishDate 2016
url http://eprints.utm.my/id/eprint/78035/1/GanSiewLeePFS2016.pdf
http://eprints.utm.my/id/eprint/78035/
http://dms.library.utm.my:8080/vital/access/manager/Repository/vital:98220
_version_ 1643657708504612864
score 13.18916