Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: VisionDepth/VisionDepth3D
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: f3c90a0
Choose a base ref
...
head repository: VisionDepth/VisionDepth3D
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: 149e3af
Choose a head ref
  • 10 commits
  • 12 files changed
  • 1 contributor

Commits on May 31, 2025

  1. Update requirements.txt

    VisionDepth authored May 31, 2025
    Configuration menu
    Copy the full SHA
    b7994f2 View commit details
    Browse the repository at this point in the history

Commits on Jul 23, 2025

  1. V3.2.5 (#48)

    * Update fr.json
    
    updated french json to include missing keys
    
    * Update de.json
    
    updated json to add missing keys
    
    * Update en.json
    
    updated json to add missing keys
    
    * Update es.json
    
    update json to add missing keys
    
    * Update render_depth.py
    
    # 🛠️ VisionDepth3D v3.2.X – Performance & Stability Update
    
    ## Changelog:
    
    ### Fixed UI Freezing During Folder Processing
    - Resolved a bug where the GUI would freeze when processing a folder of videos for depth generation.
    - Properly offloaded batch size parsing and folder selection to the main thread, with core logic in a background thread.
    
    ### Removed Redundant Batch Logic in Folder Processing
    - Cleaned up double handling of `batch_size` logic in `process_video_folder()` and `process_videos_in_folder()`.
    - Now batch size is handled only once, before launching the background thread, improving clarity and reliability.
    
    ### Refactored ONNX Model Loader
    - Rebuilt the `load_onnx_model()` function to support both rank-4 and rank-5 ONNX models.
    - Adds automatic padding for 32-frame models (e.g. Marigold) and returns predictions as `torch.tensor` objects.
    - Includes robust shape detection, better error handling, and future support for diverse ONNX pipelines.
    
    * Update VisionDepth3D.py
    
    🎬 Integrated PySceneDetect into FrameTools GUI for Scene-Based Splitting
    
    - Added support for automatic scene detection using PySceneDetect.
    - Clips can now be segmented into manageable scenes directly from the FrameTools tab.
    - Enables smarter depth processing by allowing fine-tuned 3D rendering per scene.
    - Added user-adjustable threshold slider (NOTE: fixed logic to reflect correct cut behavior).
    
    * Update merged_pipeline.py
    
    FrameTools Tab: Improved Frame Extraction Workflow
    
    - Added directory selector to choose custom output location for extracted frames.
    - Switched to FFmpeg for frame extraction, significantly improving speed and reliability.
    VisionDepth authored Jul 23, 2025
    Configuration menu
    Copy the full SHA
    72059a0 View commit details
    Browse the repository at this point in the history
  2. Update VisionDepth3D.py

    commented out inno setup snippet in GUI script
    VisionDepth authored Jul 23, 2025
    Configuration menu
    Copy the full SHA
    b090f84 View commit details
    Browse the repository at this point in the history
  3. V3.2.5 (#49)

    * Update fr.json
    
    updated french json to include missing keys
    
    * Update de.json
    
    updated json to add missing keys
    
    * Update en.json
    
    updated json to add missing keys
    
    * Update es.json
    
    update json to add missing keys
    
    * Update render_depth.py
    
    # 🛠️ VisionDepth3D v3.2.X – Performance & Stability Update
    
    ## Changelog:
    
    ### Fixed UI Freezing During Folder Processing
    - Resolved a bug where the GUI would freeze when processing a folder of videos for depth generation.
    - Properly offloaded batch size parsing and folder selection to the main thread, with core logic in a background thread.
    
    ### Removed Redundant Batch Logic in Folder Processing
    - Cleaned up double handling of `batch_size` logic in `process_video_folder()` and `process_videos_in_folder()`.
    - Now batch size is handled only once, before launching the background thread, improving clarity and reliability.
    
    ### Refactored ONNX Model Loader
    - Rebuilt the `load_onnx_model()` function to support both rank-4 and rank-5 ONNX models.
    - Adds automatic padding for 32-frame models (e.g. Marigold) and returns predictions as `torch.tensor` objects.
    - Includes robust shape detection, better error handling, and future support for diverse ONNX pipelines.
    
    * Update VisionDepth3D.py
    
    🎬 Integrated PySceneDetect into FrameTools GUI for Scene-Based Splitting
    
    - Added support for automatic scene detection using PySceneDetect.
    - Clips can now be segmented into manageable scenes directly from the FrameTools tab.
    - Enables smarter depth processing by allowing fine-tuned 3D rendering per scene.
    - Added user-adjustable threshold slider (NOTE: fixed logic to reflect correct cut behavior).
    
    * Update merged_pipeline.py
    
    FrameTools Tab: Improved Frame Extraction Workflow
    
    - Added directory selector to choose custom output location for extracted frames.
    - Switched to FFmpeg for frame extraction, significantly improving speed and reliability.
    
    * Update VisionDepth3D.py
    
    commented out inno setup caller
    
    * Update VisionDepth3D.py
    
    changed title tag to 3.2.5
    VisionDepth authored Jul 23, 2025
    Configuration menu
    Copy the full SHA
    3ad8453 View commit details
    Browse the repository at this point in the history

Commits on Aug 10, 2025

  1. V3.2.6 (#52)

    * v3.2.6 – Add support for batch and single video 3D rendering
    
    - Implemented dual input modes: render from a single video or from a list of video/depth map pairs.
    - Added mode selector dropdown (Single / Batch) with dynamic UI updates to show relevant input fields.
    - Updated label system to reflect selected inputs per mode.
    - Ensured compatibility with preview and 3D render workflows for both modes.
    - Refactored processing logic to call existing 3D render function per item in batch.
    - GUI layout improved for cleaner alignment of input controls.
    
    * v3.2.6
    
    * v3.2.6 – Add IPD factor multiplier for parallax scaling
    
    - Introduced `ipd_factor` parameter to control stereo separation intensity.
    - Applies uniform scaling to foreground, midground, and background shift values.
    - Useful for testing different IPD settings or achieving exaggerated parallax effects.
    - Included default fallback (1.0) for backward compatibility.
    - Note: This is a synthetic IPD adjustment (non-metric depth), primarily for aesthetic tuning.
    
    *will introduce into UI at a later date, set to 1.0 atm*
    
    * v3.2.6 – Add audio offset support to Audio Attacher tool
    
    - Added slider to GUI for setting custom audio offset (positive or negative)
    - Updated FFmpeg integration to shift audio stream relative to video duration
    - Useful for syncing misaligned audio during reattachment process
    
    * v3.2.6 – Add Japanese Language Support
    
    * v3.2.6 - Add missing code and update program version title
    
    Fully implemented Japanese language to work, new batch process in 3D render tab had missing elements for the language support
    VisionDepth authored Aug 10, 2025
    Configuration menu
    Copy the full SHA
    b8e7aea View commit details
    Browse the repository at this point in the history
  2. Update Changelog.md

    v3.2.6 Changelog
    VisionDepth authored Aug 10, 2025
    Configuration menu
    Copy the full SHA
    78eed9e View commit details
    Browse the repository at this point in the history
  3. V3.3 (#53)

    * Update Changelog.md
    
    * v3.3
    
    ### 3D Pipeline Improvements – v3.3
    
    - **Black Bar Handling**
      - Detects and crops letterbox bars before depth estimation.
      - Re-adds bars post-processing with neutral depth to prevent stereo artifacts.
    
    - **Depth Stability & Quality**
      - Added temporal depth filtering (EMA) for smoother frame-to-frame depth.
      - Percentile-based normalization for consistent depth range across scenes.
      - Midtone shaping to enhance perceived depth and separation.
      - Optional curvature enhancement for more natural object roundness.
    
    - **Stereo & Parallax Control**
      - Dynamic IPD scaling based on scene depth variance.
      - Shift smoothing for foreground, midground, and background layers.
      - Edge-aware masking and feathering to reduce tearing/ghosting.
      - Subject-tracked zero-parallax plane with eased floating window adjustments.
      - Optional dynamic convergence bias tied to subject position.
    
    - **Image Quality**
      - GPU-accelerated depth-of-field with multi-level Gaussian blending.
      - Brightness-preserving sharpening with anti-clipping.
    
    - **Framing & Output**
      - Per-frame optional black bar cropping in the stereo stage.
      - Aspect-ratio-safe resizing and padding for per-eye frame alignment.
      - Multiple stereo formats supported (Full-SBS, Half-SBS, VR, anaglyph, interlaced).
    
    - **Encoding & Stability**
      - FFmpeg pipeline over stdin with correct CRF/CQ handling.
      - Graceful cleanup and resource release on completion or cancel.
      - Detailed logging for crop, skip, and encoder decisions.
    
    * v3.3
    
    ### Depth Pipeline Improvements – v3.3
    
    - **Letterbox Detection & Cropping**
      - Added `ignore_letterbox_bars` option to automatically detect top/bottom black bars.
      - Crops bars before sending frames to the depth model for cleaner predictions.
      - Stores bar metadata (top/bottom size, original resolution) in a `.letterbox.json` sidecar file.
    
    - **Preserve Original Resolution**
      - After depth prediction, resizes the depth map back to the cropped region size.
      - Re-adds original bars with a neutral depth value to avoid 3D distortion.
      - Ensures depth output matches original width/height exactly.
    
    - **Crash Prevention & Safety Checks**
      - Resets bar values to zero if detections are invalid (too large or empty frames).
      - Handles fade-ins and all-black first frames without breaking processing.
    
    - **Grayscale Conversion Refactor**
      - Unified `convert_depth_to_grayscale()` into a single reusable function.
      - Works with `PIL.Image`, `torch.Tensor`, and `numpy.ndarray`.
      - Includes NaN checks, shape handling, and safe fallbacks for bad frames.
    
    - **Output Consistency**
      - Standardized grayscale depth conversion across all depth output paths.
      - Maintains uniform depth range handling for downstream 3D processing.
    
    * v3.3
    
    Version Tag Change
    VisionDepth authored Aug 10, 2025
    Configuration menu
    Copy the full SHA
    3df9af9 View commit details
    Browse the repository at this point in the history

Commits on Aug 11, 2025

  1. Update requirements.txt

    adding missing requirement
    VisionDepth authored Aug 11, 2025
    Configuration menu
    Copy the full SHA
    e90b8ba View commit details
    Browse the repository at this point in the history

Commits on Aug 12, 2025

  1. Update README.md

    Took out TensorRT installation instructions since its no longer used in the code do to compiling into .exe issues
    VisionDepth authored Aug 12, 2025
    Configuration menu
    Copy the full SHA
    9074068 View commit details
    Browse the repository at this point in the history
  2. Update README.md

    Updated install guide
    VisionDepth authored Aug 12, 2025
    Configuration menu
    Copy the full SHA
    149e3af View commit details
    Browse the repository at this point in the history
Loading