- Project Overview
- Data Sources
- Features
- Modeling Approach
- Installation & Setup
- Usage Guide
- Real-World Applications
- Educational Applications
- App Breakdown
The Retail Analytics Suite is a comprehensive data analysis and visualization platform that combines sales data from multiple retail sources to provide actionable insights and predictive analytics. The application analyzes Walmart sales data, Amazon sales data, and Amazon customer sentiment data to deliver a holistic view of retail performance and customer satisfaction.
- Source: Walmart store sales records
- Period: 2020-2021
- Format: CSV
- Update Frequency: Weekly
- Key Metrics: Weekly sales, store information, and economic indicators
- Link: Walmart Dataset on Kaggle
- Source: Amazon sales records
- Period: 2020-2021 fiscal year
- Format: CSV
- Update Frequency: Daily
- Key Metrics: Order details, product information, and pricing data
- Link: Amazon Sales Dataset on Kaggle
- Source: Amazon customer feedback
- Period: 2020-2021
- Format: CSV
- Update Frequency: Daily
- Key Metrics: Review content, ratings, and customer sentiment
- Link: Amazon Reviews Dataset on Kaggle
-
Sales Analysis
- Time series visualization
- Seasonal trend analysis
- Store performance comparison
- Revenue forecasting
-
Customer Sentiment Analysis
- Review sentiment scoring
- Word cloud visualization
- Category-wise sentiment analysis
- Trend analysis of customer satisfaction
-
Price Analytics
- Price distribution analysis
- Discount impact assessment
- Category-wise pricing strategies
- Competitive pricing analysis
-
Interactive Dashboards
- Dynamic filtering
- Real-time calculations
- Custom date range selection
- Multiple visualization options
- Purpose: Long-term sales forecasting with seasonal decomposition
- Implementation:
- Multiplicative seasonality model
- Yearly, weekly, and daily seasonality components
- Holiday effects incorporation
- Automatic changepoint detection
Advantages:
- Handles missing data effectively
- Captures multiple seasonality patterns
- Robust to outliers
- Automatic trend changepoint detection
- Purpose: Short to medium-term sales forecasting
- Implementation:
- Order selection: (1,1,1) for trend
- Seasonal order: (1,1,1,7) for weekly patterns
- Rolling validation approach
Advantages:
- Captures both trend and seasonality
- Provides confidence intervals
- Strong statistical foundation
- Good for short-term predictions
- Purpose: Feature-based sales prediction
- Implementation:
- 200 trees with max depth of 15
- Minimum samples split: 5
- Minimum samples leaf: 2
Advantages:
- Handles non-linear relationships
- Feature importance ranking
- Robust to outliers
- Good for complex pattern recognition
- Purpose: Customer review sentiment scoring
- Implementation:
- VADER sentiment scoring
- Custom text preprocessing
- Weighted sentence analysis
Advantages:
- Specifically tuned for social media text
- Handles emoji and punctuation
- Multiple sentiment dimensions
# Clone the repository
git clone 'https://github.com/Diparna/Sales_Forecasting/'
# Install required packages
pip install -r requirements.txt
# Run the Streamlit app
streamlit run app.py- Format Requirements: Specify CSV format
- Supported File Types: CSV files only
- Step-by-step guide for selecting data and running models
- Input parameters explanation
- Understanding the visualizations
- Reading the forecasts
- Predict demand for products to optimize inventory and avoid stockouts or overstocking
- Plan promotional campaigns based on forecasted sales trends
- Anticipate demand spikes to adjust production schedules and logistics planning
- Improve supplier relations by sharing accurate demand forecasts
- Help businesses allocate budgets effectively by forecasting revenue streams
- Aid in financial planning for startups or small businesses
- Predict seasonal trends in bookings, helping with staff planning and resource allocation
- Forecast customer preferences for targeted marketing
- Assist restaurants and cafes in purchasing raw materials by forecasting sales volumes
- Predict trends in customer dining habits for menu optimization
- Plan production schedules based on sales forecasting to reduce waste
- Align resources with expected demand for better cost management
- Demonstrate time-series analysis, ARIMA models, and machine learning applications in forecasting
- Visualize the impact of data preprocessing and model selection on predictive accuracy
- Allow students to upload their datasets to practice forecasting in real-world scenarios
- Provide hands-on experience with sales datasets for project-based learning
- Illustrate the power of predictive analytics in a simple, user-friendly manner
- Engage visitors by allowing them to tweak parameters and see how forecasts change
- Use the app as a sandbox environment for students to explore retail analytics
- Simulate scenarios to teach resource allocation based on demand predictions
- Serve as a tool for teaching Streamlit, Python, and data visualization techniques
- Show how to integrate data science with app development to build impactful tools
- Default landing page with an overview of the three main analyses
- Store Timeline
- Sales Forecasting
- Original Dataset Overview
- Engineered Features Overview
- Feature Distribution
- Correlation Analysis
- Seasonal Sales Analysis
- ARIMA Forecasting
- Summary
- Sales Prediction
- IDA (Initial Data Assessment)
- Data Transformation
- Product Overview
- Pricing Analysis
- Rating Analysis
- Product Recommendations
- Sentiment Distribution Tab:
- Distribution analysis
- Summary statistics
- Detailed Analysis Tab:
- Sentiment score distribution
- Rating Analysis Tab:
- Sentiment vs rating visualization
- Correlation metrics
- Word Clouds Tab:
- Word clouds for positive, neutral, and negative reviews