Comparing changes

* Update fr.json updated french json to include missing keys * Update de.json updated json to add missing keys * Update en.json updated json to add missing keys * Update es.json update json to add missing keys * Update render_depth.py # 🛠️ VisionDepth3D v3.2.X – Performance & Stability Update ## Changelog: ### Fixed UI Freezing During Folder Processing - Resolved a bug where the GUI would freeze when processing a folder of videos for depth generation. - Properly offloaded batch size parsing and folder selection to the main thread, with core logic in a background thread. ### Removed Redundant Batch Logic in Folder Processing - Cleaned up double handling of `batch_size` logic in `process_video_folder()` and `process_videos_in_folder()`. - Now batch size is handled only once, before launching the background thread, improving clarity and reliability. ### Refactored ONNX Model Loader - Rebuilt the `load_onnx_model()` function to support both rank-4 and rank-5 ONNX models. - Adds automatic padding for 32-frame models (e.g. Marigold) and returns predictions as `torch.tensor` objects. - Includes robust shape detection, better error handling, and future support for diverse ONNX pipelines. * Update VisionDepth3D.py 🎬 Integrated PySceneDetect into FrameTools GUI for Scene-Based Splitting - Added support for automatic scene detection using PySceneDetect. - Clips can now be segmented into manageable scenes directly from the FrameTools tab. - Enables smarter depth processing by allowing fine-tuned 3D rendering per scene. - Added user-adjustable threshold slider (NOTE: fixed logic to reflect correct cut behavior). * Update merged_pipeline.py FrameTools Tab: Improved Frame Extraction Workflow - Added directory selector to choose custom output location for extracted frames. - Switched to FFmpeg for frame extraction, significantly improving speed and reliability.

commented out inno setup snippet in GUI script

* Update fr.json updated french json to include missing keys * Update de.json updated json to add missing keys * Update en.json updated json to add missing keys * Update es.json update json to add missing keys * Update render_depth.py # 🛠️ VisionDepth3D v3.2.X – Performance & Stability Update ## Changelog: ### Fixed UI Freezing During Folder Processing - Resolved a bug where the GUI would freeze when processing a folder of videos for depth generation. - Properly offloaded batch size parsing and folder selection to the main thread, with core logic in a background thread. ### Removed Redundant Batch Logic in Folder Processing - Cleaned up double handling of `batch_size` logic in `process_video_folder()` and `process_videos_in_folder()`. - Now batch size is handled only once, before launching the background thread, improving clarity and reliability. ### Refactored ONNX Model Loader - Rebuilt the `load_onnx_model()` function to support both rank-4 and rank-5 ONNX models. - Adds automatic padding for 32-frame models (e.g. Marigold) and returns predictions as `torch.tensor` objects. - Includes robust shape detection, better error handling, and future support for diverse ONNX pipelines. * Update VisionDepth3D.py 🎬 Integrated PySceneDetect into FrameTools GUI for Scene-Based Splitting - Added support for automatic scene detection using PySceneDetect. - Clips can now be segmented into manageable scenes directly from the FrameTools tab. - Enables smarter depth processing by allowing fine-tuned 3D rendering per scene. - Added user-adjustable threshold slider (NOTE: fixed logic to reflect correct cut behavior). * Update merged_pipeline.py FrameTools Tab: Improved Frame Extraction Workflow - Added directory selector to choose custom output location for extracted frames. - Switched to FFmpeg for frame extraction, significantly improving speed and reliability. * Update VisionDepth3D.py commented out inno setup caller * Update VisionDepth3D.py changed title tag to 3.2.5

* v3.2.6 – Add support for batch and single video 3D rendering - Implemented dual input modes: render from a single video or from a list of video/depth map pairs. - Added mode selector dropdown (Single / Batch) with dynamic UI updates to show relevant input fields. - Updated label system to reflect selected inputs per mode. - Ensured compatibility with preview and 3D render workflows for both modes. - Refactored processing logic to call existing 3D render function per item in batch. - GUI layout improved for cleaner alignment of input controls. * v3.2.6 * v3.2.6 – Add IPD factor multiplier for parallax scaling - Introduced `ipd_factor` parameter to control stereo separation intensity. - Applies uniform scaling to foreground, midground, and background shift values. - Useful for testing different IPD settings or achieving exaggerated parallax effects. - Included default fallback (1.0) for backward compatibility. - Note: This is a synthetic IPD adjustment (non-metric depth), primarily for aesthetic tuning. *will introduce into UI at a later date, set to 1.0 atm* * v3.2.6 – Add audio offset support to Audio Attacher tool - Added slider to GUI for setting custom audio offset (positive or negative) - Updated FFmpeg integration to shift audio stream relative to video duration - Useful for syncing misaligned audio during reattachment process * v3.2.6 – Add Japanese Language Support * v3.2.6 - Add missing code and update program version title Fully implemented Japanese language to work, new batch process in 3D render tab had missing elements for the language support

v3.2.6 Changelog

* Update Changelog.md * v3.3 ### 3D Pipeline Improvements – v3.3 - **Black Bar Handling** - Detects and crops letterbox bars before depth estimation. - Re-adds bars post-processing with neutral depth to prevent stereo artifacts. - **Depth Stability & Quality** - Added temporal depth filtering (EMA) for smoother frame-to-frame depth. - Percentile-based normalization for consistent depth range across scenes. - Midtone shaping to enhance perceived depth and separation. - Optional curvature enhancement for more natural object roundness. - **Stereo & Parallax Control** - Dynamic IPD scaling based on scene depth variance. - Shift smoothing for foreground, midground, and background layers. - Edge-aware masking and feathering to reduce tearing/ghosting. - Subject-tracked zero-parallax plane with eased floating window adjustments. - Optional dynamic convergence bias tied to subject position. - **Image Quality** - GPU-accelerated depth-of-field with multi-level Gaussian blending. - Brightness-preserving sharpening with anti-clipping. - **Framing & Output** - Per-frame optional black bar cropping in the stereo stage. - Aspect-ratio-safe resizing and padding for per-eye frame alignment. - Multiple stereo formats supported (Full-SBS, Half-SBS, VR, anaglyph, interlaced). - **Encoding & Stability** - FFmpeg pipeline over stdin with correct CRF/CQ handling. - Graceful cleanup and resource release on completion or cancel. - Detailed logging for crop, skip, and encoder decisions. * v3.3 ### Depth Pipeline Improvements – v3.3 - **Letterbox Detection & Cropping** - Added `ignore_letterbox_bars` option to automatically detect top/bottom black bars. - Crops bars before sending frames to the depth model for cleaner predictions. - Stores bar metadata (top/bottom size, original resolution) in a `.letterbox.json` sidecar file. - **Preserve Original Resolution** - After depth prediction, resizes the depth map back to the cropped region size. - Re-adds original bars with a neutral depth value to avoid 3D distortion. - Ensures depth output matches original width/height exactly. - **Crash Prevention & Safety Checks** - Resets bar values to zero if detections are invalid (too large or empty frames). - Handles fade-ins and all-black first frames without breaking processing. - **Grayscale Conversion Refactor** - Unified `convert_depth_to_grayscale()` into a single reusable function. - Works with `PIL.Image`, `torch.Tensor`, and `numpy.ndarray`. - Includes NaN checks, shape handling, and safe fallbacks for bad frames. - **Output Consistency** - Standardized grayscale depth conversion across all depth output paths. - Maintains uniform depth range handling for downstream 3D processing. * v3.3 Version Tag Change

adding missing requirement

Took out TensorRT installation instructions since its no longer used in the code do to compiling into .exe issues

Updated install guide

Commits on May 31, 2025

Update requirements.txt

VisionDepth authored May 31, 2025

Configuration menu

View commit details

Copy full SHA for b7994f2

Browse repository at this point

Copy the full SHA

b7994f2 View commit details

Browse the repository at this point in the history

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comparing changes

Open a pull request

Commits on May 31, 2025

Commits on Jul 23, 2025

Commits on Aug 10, 2025

Commits on Aug 11, 2025

Commits on Aug 12, 2025

This comparison is taking too long to generate.

Uh oh!