Staffordshire University logo
STORE - Staffordshire Online Repository

Stock Market Random Forest-Text Mining (SMRF-TM) Approach to Analyse Critical Indicators of Stock Market Movements

ELAGAMY, MAZEN NABIL (2017) Stock Market Random Forest-Text Mining (SMRF-TM) Approach to Analyse Critical Indicators of Stock Market Movements. Doctoral thesis, Staffordshire University.

[img]
Preview
Text
ElagamyMN_PhD thesis.pdf
Available under License All Rights Reserved.

Download (2MB) | Preview
[img] Text
ElagamyMN4285_Ethos Agreement.pdf
Restricted to Repository staff only
Available under License All Rights Reserved.

Download (70kB)

Abstract or description

The Stock Market is a significant sector of a country’s economy and has a crucial
role in the growth of commerce and industry. Hence, discovering efficient ways to analyse and visualise stock market data is considered a significant issue in modern finance. The use of data mining techniques to predict stock market movements has been extensively studied using historical market prices but such approaches are constrained to make assessments within the scope of existing information, and thus they are not able to model any random behaviour of the stock market or identify the causes behind events. One area of limited success in stock market prediction comes from textual data, which is a rich source of information. Analysing textual data related to the Stock Market may provide better understanding of random behaviours of the market.
Text Mining combined with the Random Forest algorithm offers a novel approach to the study of critical indicators, which contribute to the prediction of stock market abnormal movements. In this thesis, a Stock Market Random Forest-Text Mining system (SMRF-TM) is developed and is used to mine the critical indicators related to the 2009 Dubai stock market debt standstill. Random forest and expectation maximisation are applied to classify the extracted features into a set of meaningful and semantic classes, thus extending current approaches from three to eight classes: critical down, down, neutral, up, critical up, economic, social and political.
The study demonstrates that Random Forest has outperformed other classifiers and has achieved the best accuracy in classifying the bigram features extracted from the corpus.

Item Type: Thesis (Doctoral)
Faculty: School of Computing and Digital Technologies > Computing
Depositing User: Jeffrey HENSON
Date Deposited: 03 Apr 2018 13:18
Last Modified: 07 Jun 2018 11:51
URI: https://eprints.staffs.ac.uk/id/eprint/4285

Actions (login required)

View Item View Item

DisabledGo Staffordshire University is a recognised   Investor in People. Sustain Staffs
Legal | Freedom of Information | Site Map | Job Vacancies
Staffordshire University, College Road, Stoke-on-Trent, Staffordshire ST4 2DE t: +44 (0)1782 294000