This repository contains a Jupyter Notebook for predicting stock prices using Linear Regression with PySpark.
The goal of this project is to analyze historical stock data and predict future stock prices using Linear Regression. The implementation leverages PySpark, which allows efficient handling of large datasets.
- BDA_Project.ipynb β The main Jupyter Notebook containing the full implementation.
Ensure you have the following dependencies installed:
pip install pyspark pandas matplotlib scikit-learn- Clone the repository:
git clone https://github.com/vishnus1793/Big-Data-Analysis.git cd Big-Data-Analysis - Start Jupyter Notebook:
jupyter notebook
- Open
BDA_Project.ipynband execute the cells.