


default search action
Jing Bi 0002
Person information
- affiliation: Univeristy of Rochester, Rochester, NY, USA
Other persons with the same name
- Jing Bi — disambiguation page
- Jing Bi 0001
— Beijing University of Technology, School of Software Engineering, Beijing, China (and 3 more) - Jing Bi 0003 — Shenyang Normal University, Software College, Shenyang, China
Other persons with a similar name
SPARQL queries 
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[j1]Yunlong Tang
, Jing Bi
, Siting Xu, Luchuan Song
, Susan Liang
, Teng Wang
, Daoan Zhang
, Jie An
, Jingyang Lin
, Rongyi Zhu, Ali Vosoughi
, Chao Huang
, Zeliang Zhang
, Pinxin Liu, Mingqian Feng, Feng Zheng
, Jianguo Zhang
, Ping Luo
, Jiebo Luo
, Chenliang Xu
:
Video Understanding With Large Language Models: A Survey. IEEE Trans. Circuits Syst. Video Technol. 36(2): 1355-1376 (2026)
[c9]Yolo Yunlong Tang, Jing Bi, Chao Huang, Susan Liang, Daiki Shimada, Hang Hua, Yunzhong Xiao, Yizhi Song, Pinxin Liu, Mingqian Feng, Junjia Guo, Zhuo Liu, Luchuan Song, Ali Vosoughi, Jinxi He, Liu He, Zeliang Zhang, Jiebo Luo, Chenliang Xu:
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting. AAAI 2026: 41697-41699
[i28]Susan Liang, Chao Huang, Filippos Bellos, Yolo Yunlong Tang, Qianxiang Shen, Jing Bi, Luchuan Song, Zeliang Zhang, Jason J. Corso, Chenliang Xu:
Omni-Judge: Can Omni-LLMs Serve as Human-Aligned Judges for Text-Conditioned Audio-Video Generation? CoRR abs/2602.01623 (2026)
[i27]Luchuan Song, Pinxin Liu, Haiyang Liu, Zhenchao Jin, Yolo Yunlong Tang, Zichong Xu, Susan Liang, Jing Bi, Jason J. Corso, Chenliang Xu:
TDMM-LM: Bridging Facial Understanding and Animation via Language Models. CoRR abs/2603.16936 (2026)- 2025
[c8]Yunlong Tang
, Daiki Shimada
, Jing Bi, Mingqian Feng, Hang Hua, Chenliang Xu:
Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding. AAAI 2025: 7293-7301
[c7]Jing Bi, Junjia Guo, Yunlong Tang
, Lianggong Bruce Wen, Zhang Liu, Bingjie Wang, Chenliang Xu:
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach. CVPR 2025: 4135-4144
[c6]Yunlong Tang
, Junjia Guo, Hang Hua, Susan Liang, Mingqian Feng, Xinyang Li, Rui Mao, Chao Huang, Jing Bi, Zeliang Zhang, Pooyan Fazli, Chenliang Xu:
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos? CVPR 2025: 8490-8500
[c5]Yunlong Tang, Junjia Guo, Pinxin Liu, Zhiyuan Wang, Hang Hua, Jia-Xing Zhong, Yunzhong Xiao, Chao Huang, Luchuan Song, Susan Liang, Yizhi Song, Liu He, Jing Bi, Mingqian Feng, Xinyang Li, Zeliang Zhang, Chenliang Xu:
Generative AI for Cel-Animation: A Survey. ICCVW 2025: 3837-3850
[i26]Yunlong Tang, Junjia Guo, Pinxin Liu, Zhiyuan Wang, Hang Hua, Jia-Xing Zhong, Yunzhong Xiao, Chao Huang, Luchuan Song, Susan Liang, Yizhi Song, Liu He, Jing Bi, Mingqian Feng, Xinyang Li, Zeliang Zhang, Chenliang Xu:
Generative AI for Cel-Animation: A Survey. CoRR abs/2501.06250 (2025)
[i25]Jing Bi, Junjia Guo, Susan Liang, Guangyu Sun, Luchuan Song, Yunlong Tang, Jinxi He, Jiarui Wu, Ali Vosoughi, Chen Chen, Chenliang Xu:
VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity. CoRR abs/2503.11557 (2025)
[i24]Jing Bi, Susan Liang, Xiaofei Zhou, Pinxin Liu, Junjia Guo, Yunlong Tang, Luchuan Song, Chao Huang, Guangyu Sun, Jinxi He, Jiarui Wu, Shu Yang, Daoan Zhang, Chen Chen, Lianggong Bruce Wen, Zhang Liu, Jiebo Luo
, Chenliang Xu:
Why Reasoning Matters? A Survey of Advancements in Multimodal Reasoning (v1). CoRR abs/2504.03151 (2025)
[i23]Yunlong Tang, Jing Bi, Chao Huang, Susan Liang, Daiki Shimada, Hang Hua, Yunzhong Xiao, Yizhi Song, Pinxin Liu, Mingqian Feng, Junjia Guo, Zhuo Liu, Luchuan Song, Ali Vosoughi, Jinxi He, Liu He, Zeliang Zhang, Jiebo Luo
, Chenliang Xu:
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting. CoRR abs/2504.05541 (2025)
[i22]Jing Bi, Pinxin Liu, Ali Vosoughi, Jiarui Wu, Jinxi He, Chenliang Xu:
I2G: Generating Instructional Illustrations via Text-Conditioned Diffusion. CoRR abs/2505.16425 (2025)
[i21]Yunlong Tang, Pinxin Liu, Mingqian Feng, Zhangyun Tan, Rui Mao, Chao Huang, Jing Bi, Yunzhong Xiao, Susan Liang, Hang Hua, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Chenliang Xu:
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness. CoRR abs/2505.20426 (2025)
[i20]Chao Huang, Yuesheng Ma, Junxuan Huang, Susan Liang, Yunlong Tang, Jing Bi, Wenqiang Liu, Nima Mesgarani, Chenliang Xu:
ZeroSep: Separate Anything in Audio with Zero Training. CoRR abs/2505.23625 (2025)
[i19]Ali Vosoughi, Jing Bi, Pinxin Liu, Yunlong Tang, Chenliang Xu:
Can Sound Replace Vision in LLaVA With Token Substitution? CoRR abs/2506.10416 (2025)
[i18]Jing Bi, Lianggong Bruce Wen, Zhang Liu, Chenliang Xu:
ACTLLM: Action Consistency Tuned Large Language Model. CoRR abs/2506.21250 (2025)
[i17]Jing Bi, Chenliang Xu:
What to Do Next? Memorizing skills from Egocentric Instructional Video. CoRR abs/2507.02997 (2025)
[i16]Yolo Yunlong Tang, Jing Bi, Pinxin Liu, Zhenyu Pan, Zhangyun Tan, Qianxiang Shen, Jiani Liu, Hang Hua, Junjia Guo, Yunzhong Xiao, Chao Huang, Zhiyuan Wang, Susan Liang, Xinyi Liu, Yizhi Song, Junhua Huang, Jia-Xing Zhong, Bozheng Li, Daiqing Qi, Ziyun Zeng, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Daiki Shimada
, Han Liu, Jiebo Luo
, Chenliang Xu:
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models. CoRR abs/2510.05034 (2025)
[i15]Jing Bi, Guangyu Sun, Ali Vosoughi, Chen Chen, Chenliang Xu:
Diagnosing Visual Reasoning: Challenges, Insights, and a Path Forward. CoRR abs/2510.20696 (2025)
[i14]Jing Bi, Filippos Bellos, Junjia Guo, Yayuan Li, Chao Huang, Yolo Yunlong Tang, Luchuan Song, Susan Liang, Zhongfei Mark Zhang, Jason J. Corso, Chenliang Xu:
When to Think and When to Look: Uncertainty-Guided Lookback. CoRR abs/2511.15613 (2025)
[i13]Yolo Yunlong Tang, Daiki Shimada, Hang Hua, Chao Huang, Jing Bi, Rogerio Feris, Chenliang Xu:
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination. CoRR abs/2511.17490 (2025)
[i12]Daoan Zhang, Pai Liu, Xiaofei Zhou, Yuan Ge, Guangchen Lan, Jing Bi, Christopher G. Brinton, Ehsan Hoque, Jiebo Luo
:
VisualActBench: Can VLMs See and Act like a Human? CoRR abs/2512.09907 (2025)- 2024
[c4]Jing Bi
, Yunlong Tang
, Luchuan Song
, Ali Vosoughi
, Nguyen Nguyen
, Chenliang Xu
:
EAGLE: Egocentric AGgregated Language-video Engine. ACM Multimedia 2024: 1682-1691
[c3]Nguyen Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu:
OSCaR: Object State Captioning and State Change Representation. NAACL-HLT (Findings) 2024: 3565-3576
[i11]Nguyen Manh Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu:
OSCaR: Object State Captioning and State Change Representation. CoRR abs/2402.17128 (2024)
[i10]Yunlong Tang, Daiki Shimada, Jing Bi, Chenliang Xu:
AVicuna: Audio-Visual LLM with Interleaver and Context-Boundary Alignment for Temporal Referential Dialogue. CoRR abs/2403.16276 (2024)
[i9]Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, Chenliang Xu:
EAGLE: Egocentric AGgregated Language-video Engine. CoRR abs/2409.17523 (2024)
[i8]Yunlong Tang, Junjia Guo, Hang Hua, Susan Liang, Mingqian Feng, Xinyang Li, Rui Mao, Chao Huang, Jing Bi, Zeliang Zhang, Pooyan Fazli, Chenliang Xu:
VidComposition: Can MLLMs Analyze Compositions in Compiled Videos? CoRR abs/2411.10979 (2024)
[i7]Jing Bi, Junjia Guo, Yunlong Tang, Lianggong Bruce Wen, Zhang Liu, Chenliang Xu:
Unveiling Visual Perception in Language Models: An Attention Head Analysis Approach. CoRR abs/2412.18108 (2024)- 2023
[i6]Jing Bi, Nguyen Manh Nguyen, Ali Vosoughi, Chenliang Xu:
MISAR: A Multimodal Instructional System with Augmented Reality. CoRR abs/2310.11699 (2023)
[i5]Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, Jianguo Zhang, Ping Luo, Jiebo Luo
, Chenliang Xu:
Video Understanding with Large Language Models: A Survey. CoRR abs/2312.17432 (2023)- 2021
[c2]Jing Bi, Jiebo Luo
, Chenliang Xu:
Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning. ICCV 2021: 15591-15600
[i4]Jing Bi, Jiebo Luo, Chenliang Xu:
Procedure Planning in Instructional Videos via Contextual Modeling and Model-based Policy Learning. CoRR abs/2110.01770 (2021)- 2020
[c1]Jing Bi, Vikas Dhiman
, Tianyou Xiao, Chenliang Xu:
Learning from Interventions Using Hierarchical Policies for Safe Learning. AAAI 2020: 10352-10360
[i3]Jing Shi, Jing Bi, Yingru Liu
, Chenliang Xu:
Cubic Spline Smoothing Compensation for Irregularly Sampled Sequences. CoRR abs/2010.01381 (2020)
2010 – 2019
- 2019
[i2]Jing Bi, Vikas Dhiman, Tianyou Xiao, Chenliang Xu:
Learning from Interventions using Hierarchical Policies for Safe Learning. CoRR abs/1912.02241 (2019)- 2018
[i1]Jing Bi, Tianyou Xiao, Qiuyue Sun, Chenliang Xu:
Navigation by Imitation in a Pedestrian-Rich Environment. CoRR abs/1811.00506 (2018)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-04-15 00:56 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







