base repository: VisionDepth/VisionDepth3D
base: f3c90a0
head repository: VisionDepth/VisionDepth3D
compare: 149e3af
- 10 commits
- 12 files changed
- 1 contributor
Commits on May 31, 2025
- b7994f2
Commits on Jul 23, 2025
- 72059a0

  * Update fr.json: updated French JSON to include missing keys
  * Update de.json: updated JSON to add missing keys
  * Update en.json: updated JSON to add missing keys
  * Update es.json: updated JSON to add missing keys
  * Update render_depth.py

    # 🛠️ VisionDepth3D v3.2.X – Performance & Stability Update

    ## Changelog:

    ### Fixed UI Freezing During Folder Processing
    - Resolved a bug where the GUI would freeze when processing a folder of videos for depth generation.
    - Batch size parsing and folder selection now run on the main thread, with core logic in a background thread.

    ### Removed Redundant Batch Logic in Folder Processing
    - Cleaned up double handling of `batch_size` logic in `process_video_folder()` and `process_videos_in_folder()`.
    - Batch size is now handled only once, before launching the background thread, improving clarity and reliability.

    ### Refactored ONNX Model Loader
    - Rebuilt the `load_onnx_model()` function to support both rank-4 and rank-5 ONNX models.
    - Adds automatic padding for 32-frame models (e.g. Marigold) and returns predictions as `torch.tensor` objects.
    - Includes robust shape detection, better error handling, and future support for diverse ONNX pipelines.
  * Update VisionDepth3D.py

    🎬 Integrated PySceneDetect into FrameTools GUI for Scene-Based Splitting
    - Added support for automatic scene detection using PySceneDetect.
    - Clips can now be segmented into manageable scenes directly from the FrameTools tab.
    - Enables smarter depth processing by allowing fine-tuned 3D rendering per scene.
    - Added user-adjustable threshold slider (NOTE: fixed logic to reflect correct cut behavior).
  * Update merged_pipeline.py

    FrameTools Tab: Improved Frame Extraction Workflow
    - Added directory selector to choose custom output location for extracted frames.
    - Switched to FFmpeg for frame extraction, significantly improving speed and reliability.
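The UI-freeze fix in 72059a0 describes a common pattern: read and validate GUI inputs once on the main thread, then hand the long-running loop to a background thread. The sketch below illustrates that pattern only. The names `parse_batch_size`, `process_folder`, and `start_folder_job` are hypothetical stand-ins, not the functions in `render_depth.py`, and the default batch size of 8 is an assumption.

```python
import queue
import threading


def parse_batch_size(raw: str, default: int = 8) -> int:
    """Parse the user-entered batch size; fall back to a default on bad input.

    Assumption: the real default and validation rules may differ.
    """
    try:
        value = int(raw.strip())
    except ValueError:
        return default
    return value if value > 0 else default


def process_folder(videos, batch_size, results):
    """Hypothetical stand-in for the heavy depth-generation loop (worker thread)."""
    for name in videos:
        # Real code would run the depth model here; we only record the call.
        results.put((name, batch_size))


def start_folder_job(videos, raw_batch_size):
    """Called from the GUI thread: parse once, then launch the worker."""
    # Cheap, UI-dependent work happens exactly once, before the thread starts.
    batch_size = parse_batch_size(raw_batch_size)
    results = queue.Queue()
    worker = threading.Thread(
        target=process_folder,
        args=(videos, batch_size, results),
        daemon=True,  # do not block application exit
    )
    worker.start()
    return worker, results
```

A GUI would typically poll `results` from a timer callback rather than calling `worker.join()`, which would block the event loop again.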
- b090f84

  Commented out Inno Setup snippet in GUI script
- 3ad8453

  Repeats the 72059a0 message in full, then adds:

  * Update VisionDepth3D.py: commented out Inno Setup caller
  * Update VisionDepth3D.py: changed title tag to 3.2.5
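The ONNX loader notes repeated in this commit mention automatic padding for 32-frame models. The commit message does not say how the padding is done, so the following is a minimal sketch under one assumption: the clip is padded to a multiple of the model's clip length by repeating the last frame, and the padded predictions are trimmed afterwards. `pad_frames` and `trim_predictions` are hypothetical helpers, and plain lists stand in for frame tensors.

```python
def pad_frames(frames, clip_len=32):
    """Pad a frame list to a multiple of clip_len by repeating the last frame.

    Returns the padded list and the number of valid (original) frames.
    Assumption: repeat-last padding; the actual loader may pad differently.
    """
    if not frames:
        raise ValueError("cannot pad an empty clip")
    remainder = len(frames) % clip_len
    pad = 0 if remainder == 0 else clip_len - remainder
    return frames + [frames[-1]] * pad, len(frames)


def trim_predictions(predictions, valid_count):
    """Drop predictions that correspond to padding frames."""
    return predictions[:valid_count]
```

Keeping the valid count alongside the padded clip means the caller never has to guess which outputs came from padding.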
Commits on Aug 10, 2025
- b8e7aea

  * v3.2.6 – Add support for batch and single video 3D rendering
    - Implemented dual input modes: render from a single video or from a list of video/depth map pairs.
    - Added mode selector dropdown (Single / Batch) with dynamic UI updates to show relevant input fields.
    - Updated label system to reflect selected inputs per mode.
    - Ensured compatibility with preview and 3D render workflows for both modes.
    - Refactored processing logic to call existing 3D render function per item in batch.
    - GUI layout improved for cleaner alignment of input controls.
  * v3.2.6
  * v3.2.6 – Add IPD factor multiplier for parallax scaling
    - Introduced `ipd_factor` parameter to control stereo separation intensity.
    - Applies uniform scaling to foreground, midground, and background shift values.
    - Useful for testing different IPD settings or achieving exaggerated parallax effects.
    - Included default fallback (1.0) for backward compatibility.
    - Note: This is a synthetic IPD adjustment (non-metric depth), primarily for aesthetic tuning. *Will be introduced into the UI at a later date; set to 1.0 for now.*
  * v3.2.6 – Add audio offset support to Audio Attacher tool
    - Added slider to GUI for setting custom audio offset (positive or negative).
    - Updated FFmpeg integration to shift audio stream relative to video duration.
    - Useful for syncing misaligned audio during reattachment process.
  * v3.2.6 – Add Japanese Language Support
  * v3.2.6 – Add missing code and update program version title

    Fully implemented Japanese language support; the new batch process in the 3D render tab was missing language-support elements.
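The `ipd_factor` entry in b8e7aea describes a uniform multiplier applied to the foreground, midground, and background shift values, with 1.0 as the backward-compatible default. A minimal sketch of that arithmetic follows. The `LayerShifts` container and `scale_shifts` function are hypothetical names chosen for illustration; only `ipd_factor` and its default come from the commit message.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LayerShifts:
    """Per-layer horizontal shifts in pixels (hypothetical container)."""
    foreground: float
    midground: float
    background: float


def scale_shifts(shifts: LayerShifts, ipd_factor: float = 1.0) -> LayerShifts:
    """Scale all three layer shifts by the same factor.

    ipd_factor=1.0 leaves the shifts unchanged, matching the
    backward-compatible default described in the commit message.
    """
    return LayerShifts(
        foreground=shifts.foreground * ipd_factor,
        midground=shifts.midground * ipd_factor,
        background=shifts.background * ipd_factor,
    )
```

Because the same factor multiplies every layer, the relative ordering of the layers is preserved and only the overall parallax strength changes.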
- 78eed9e
- 3df9af9

  * Update Changelog.md
  * v3.3

    ### 3D Pipeline Improvements – v3.3
    - **Black Bar Handling**
      - Detects and crops letterbox bars before depth estimation.
      - Re-adds bars post-processing with neutral depth to prevent stereo artifacts.
    - **Depth Stability & Quality**
      - Added temporal depth filtering (EMA) for smoother frame-to-frame depth.
      - Percentile-based normalization for consistent depth range across scenes.
      - Midtone shaping to enhance perceived depth and separation.
      - Optional curvature enhancement for more natural object roundness.
    - **Stereo & Parallax Control**
      - Dynamic IPD scaling based on scene depth variance.
      - Shift smoothing for foreground, midground, and background layers.
      - Edge-aware masking and feathering to reduce tearing/ghosting.
      - Subject-tracked zero-parallax plane with eased floating window adjustments.
      - Optional dynamic convergence bias tied to subject position.
    - **Image Quality**
      - GPU-accelerated depth-of-field with multi-level Gaussian blending.
      - Brightness-preserving sharpening with anti-clipping.
    - **Framing & Output**
      - Per-frame optional black bar cropping in the stereo stage.
      - Aspect-ratio-safe resizing and padding for per-eye frame alignment.
      - Multiple stereo formats supported (Full-SBS, Half-SBS, VR, anaglyph, interlaced).
    - **Encoding & Stability**
      - FFmpeg pipeline over stdin with correct CRF/CQ handling.
      - Graceful cleanup and resource release on completion or cancel.
      - Detailed logging for crop, skip, and encoder decisions.
  * v3.3

    ### Depth Pipeline Improvements – v3.3
    - **Letterbox Detection & Cropping**
      - Added `ignore_letterbox_bars` option to automatically detect top/bottom black bars.
      - Crops bars before sending frames to the depth model for cleaner predictions.
      - Stores bar metadata (top/bottom size, original resolution) in a `.letterbox.json` sidecar file.
    - **Preserve Original Resolution**
      - After depth prediction, resizes the depth map back to the cropped region size.
      - Re-adds original bars with a neutral depth value to avoid 3D distortion.
      - Ensures depth output matches original width/height exactly.
    - **Crash Prevention & Safety Checks**
      - Resets bar values to zero if detections are invalid (too large or empty frames).
      - Handles fade-ins and all-black first frames without breaking processing.
    - **Grayscale Conversion Refactor**
      - Unified `convert_depth_to_grayscale()` into a single reusable function.
      - Works with `PIL.Image`, `torch.Tensor`, and `numpy.ndarray`.
      - Includes NaN checks, shape handling, and safe fallbacks for bad frames.
    - **Output Consistency**
      - Standardized grayscale depth conversion across all depth output paths.
      - Maintains uniform depth range handling for downstream 3D processing.
  * v3.3 Version Tag Change
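Two of the "Depth Stability & Quality" items in 3df9af9, percentile-based normalization and temporal EMA filtering, are easy to show in a few lines. The sketch below is illustrative only: the 2nd/98th percentile bounds, the `alpha` value, and the names `percentile`, `normalize_depth`, and `DepthEMA` are assumptions, and flat Python lists stand in for depth maps to keep the example dependency-free. The project itself most likely operates on tensors or arrays.

```python
def percentile(values, pct):
    """Linearly interpolated percentile of a non-empty sequence (pct in 0..100)."""
    ordered = sorted(values)
    rank = (len(ordered) - 1) * pct / 100.0
    lo = int(rank)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (rank - lo)


def normalize_depth(depth, low_pct=2.0, high_pct=98.0):
    """Map depth to [0, 1] using percentile bounds so outliers do not set the range."""
    lo = percentile(depth, low_pct)
    hi = percentile(depth, high_pct)
    if hi - lo < 1e-6:
        # Flat frame (e.g. all black): fall back to a neutral mid value.
        return [0.5] * len(depth)
    return [min(1.0, max(0.0, (d - lo) / (hi - lo))) for d in depth]


class DepthEMA:
    """Exponential moving average over successive depth frames."""

    def __init__(self, alpha=0.5):
        self.alpha = alpha  # weight of the newest frame
        self.state = None

    def update(self, depth):
        if self.state is None:
            # First frame initializes the filter unchanged.
            self.state = list(depth)
        else:
            a = self.alpha
            self.state = [a * d + (1.0 - a) * s for d, s in zip(depth, self.state)]
        return list(self.state)
```

A lower `alpha` gives smoother but laggier depth. Resetting the filter at scene cuts would avoid blending depth across unrelated shots, though the commit message does not say whether the pipeline does this.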
Commits on Aug 11, 2025
- e90b8ba
Commits on Aug 12, 2025
- 9074068

  Removed the TensorRT installation instructions, since TensorRT is no longer used in the code due to issues compiling into an .exe
- 149e3af
You can try running this command locally to see the comparison on your machine:
git diff f3c90a0...149e3af