Abstract
Stock market investments are highly rewarding but also high in risk. Modern investors use a variety of tools to make informed investment decisions. In the current digital era, the financial services industry generates data in huge volumes and immense variety at extreme speed. Because of this rapid growth in data collection and the heterogeneous, complex nature of the data, there is a need for a Big Data analytics solution that can handle stock market data. Data lakes can store large volumes of unstructured, heterogeneous raw data in a massively scalable manner, which makes them an ideal solution to the big data storage problem. The key feature of a data lake is its ability to preserve data in its original format while processing it at runtime using a schema-on-read technique. The challenge in a data lake is performing analytics, which is a significant tool for calculating and analyzing the stock market. The proposed architecture, Azure Databricks Delta Lake (ADDL) with Azure Data Lake Storage Generation 2 (ADLSG2), is used for analytical processes such as Fibonacci retracement for better stock analysis, which aids in forecasting the market price for better investment. The research therefore focuses on producing storage with both read and write capabilities by taking into consideration the Extract-Load-Transform (ELT) operation on the data source. In this experimental Databricks implementation, the runtime uses the open-source Apache Spark API and a highly improved execution engine, which results in a significant performance improvement compared with the standard Apache Spark available on the ADLS platform. Additionally, the Fibonacci retracement levels are calculated as part of the analytics, and the test close prices forecast by various ML and DL techniques, such as KNN and LSTM, are compared with the original prices of the test data to improve prediction of the forecast close price.
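The Fibonacci retracement calculation mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function name `fibonacci_retracement_levels`, the ratio set, and the choice to take the swing high and low from a list of close prices are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import Dict, Sequence

# Commonly quoted retracement ratios. This set is an assumption;
# the abstract does not say which ratios the paper uses.
FIB_RATIOS = (0.0, 0.236, 0.382, 0.5, 0.618, 0.786, 1.0)


def fibonacci_retracement_levels(
    closes: Sequence[float],
    ratios: Sequence[float] = FIB_RATIOS,
) -> Dict[float, float]:
    """Return a price level for each ratio, measured down from the high.

    The swing high and low are taken as the max and min of `closes`
    (an illustrative simplification, not the paper's stated method).
    """
    if not closes:
        raise ValueError("closes must contain at least one price")
    high = max(closes)
    low = min(closes)
    span = high - low
    # Ratio 0.0 maps to the high, ratio 1.0 maps to the low.
    return {ratio: high - ratio * span for ratio in ratios}


if __name__ == "__main__":
    sample_closes = [100.0, 120.0, 150.0, 130.0, 200.0]  # made-up prices
    for ratio, price in fibonacci_retracement_levels(sample_closes).items():
        print(f"{ratio:.3f} -> {price:.2f}")
```

In a Spark-based pipeline such as the one described, the high and low would more likely come from an aggregation over a stored price table; the plain-Python version above only shows the arithmetic.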
Original language | English |
---|---|
Title of host publication | 2023 International Conference on Self Sustainable Artificial Intelligence Systems (ICSSAS) |
Publisher | IEEE Computer Society |
Pages | 1065-1071 |
Number of pages | 7 |
ISBN (Electronic) | 979-8-3503-0085-7 |
ISBN (Print) | 979-8-3503-0086-4 |
Publication status | Published - 6 Dec 2023 |
Event | International Conference on Self Sustainable Artificial Intelligence Systems - Erode, India; Duration: 18 Oct 2023 → 20 Oct 2023 |
Conference
Conference | International Conference on Self Sustainable Artificial Intelligence Systems |
---|---|
Abbreviated title | ICSSAS 2023 |
Country/Territory | India |
City | Erode |
Period | 18/10/23 → 20/10/23 |
Keywords
- Cloud computing
- Runtime
- Costs
- Cluster Computing
- Lakes
- Writing
- Big Data Applications