zixinzhang02
  • HKUST, SJTU
  • Shanghai, China
  • UTC +08:00

Highlights

  • Pro

Pinned

  1. EnVision-Research/PhysToolBench (Public)

    PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

    Python · 27 stars · 3 forks

  2. EnVision-Research/A4-Agent (Public)

    A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning

    Python · 35 stars

  3. zhengxuJosh/Awesome-Multimodal-Spatial-Reasoning (Public)

    This repository collects and organises state‑of‑the‑art papers on spatial reasoning for Multimodal Vision–Language Models (MVLMs).

    291 stars · 15 forks

  4. EnVision-Research/PAP (Public)

    Panoramic Affordance Prediction (PAP)

    21 stars

  5. EnVision-Research/DVD (Public)

    DVD: Deterministic Video Depth Estimation with Generative Priors

    Python · 110 stars · 9 forks