Hi, I'm Zhanghan Wang

I am Zhanghan Wang, a PhD student at New York University. I am broadly interested in building high-performance and reliable systems.
Recently, I have been working on the correctness of distributed systems, advised by Prof. Aurojit Panda.

Research & Projects

My research experience and coursework projects. (* means equal contribution)

Research
Project

It Takes Two to Entangle

ASPLOS'26 2024-2025

Zhanghan Wang*, Ding Ding*, Hang Zhu, Haibin Lin, Aurojit Panda.

Preprint (Old version) Code

Understanding Stragglers in Large Model Training Using What-if Analysis

OSDI'25 2025

Jinkun Lin, Ziheng Jiang, Zuquan Song, Sida Zhao, Menghan Yu, Zhanghan Wang, Chenyuan Wang, Zuocheng Shi, Xiang Shi, Wei Jia, Zherui Liu, Shuguang Wang, Haibin Lin, Xin Liu, Aurojit Panda, and Jinyang Li.

Paper Code

Runtime Protocol Refinement Checking for Distributed Protocol Implementations

NSDI'25 2022-2024

Ding Ding, Zhanghan Wang, Jinyang Li, Aurojit Panda.

Paper

Incremental Specialization of Network Programs

HotNets'24 2024

Fabian Ruffy, Zhanghan Wang, Gianni Antichi, Aurojit Panda, Anirudh Sivaraman

Paper Code

Improve Load Balance for DLRM with Programmable Switch

Jan 2022 - Sept 2022

Advisors: Jialin Li (NUS) and Liang Luo (Meta, US)

Deep Learning Recommendation Model (DLRM) is a widely used recommendation model developed by Meta. DLRM uses embedding tables that encode sparse features such as movie genres. There are many such tables, and both the number of embeddings and the embedding sizes vary widely, so the tables are usually partitioned across different machines. However, the access pattern is skewed because some data is far more popular than the rest. We therefore tried to use a programmable switch to improve load balance by caching a number of embedding entries and routing lookup requests based on the caching status. The work was never completed, and the problem may already be obsolete.

Database Deadlock Diagnosis for Large-scale ORM-based Web Application

July 2021 - Jan 2022

Advisors: Jinyang Li (NYU) and Zhaoguo Wang (SJTU)

Database-backed web applications usually rely on the database to handle deadlocks. However, the commonly used detect-and-recover strategy can be costly. Although developers can sometimes reorganize their applications to remove deadlocks, the large number of LOC and the third-party ORM frameworks they use make this much more difficult. In this project, we use symbolic execution to extract the APIs' statement templates, with symbolic inputs and path conditions for the issued statements. Based on this information, we then analyze and report potential deadlocks. I am listed only in the paper's acknowledgements because I later left the team.

Paper

RocksDB with Disaggregated Block Cache

March 2022 - June 2022

Advisor: Cheng Li (USTC)

This is my undergraduate thesis. In this work, I explored using RDMA to disaggregate the block cache of RocksDB to improve the overall throughput of an LSM-tree-based KV store. The project helps alleviate the memory burden of the block cache in RocksDB, disaggregating more than 75% of the memory with only a 15% performance drop.

Code

SQL Query Plan Optimization

Oct 2020 - Dec 2020

Advisor: Cheng Li (USTC)

In this project, we developed a greedy algorithm to reorder left-deep join trees in SQL query plans, which achieved better performance.

News

Graph DataBase File System

March 2019 - July 2019

Zhanghan Wang, Chuqing Gao, Zhiyuan Huang, Xingmei Wang, Jiacheng Wan (in no particular order)

We used Neo4j (one of the best graph databases) to build a FUSE-based file system with a fantastic web UI. The files are connected in the graph based on their contents. We used some machine learning techniques to extract keywords and descriptions. Due to the limitations of early AI techniques, GDBFS can only process some simple files.

A note in 2024: we noticed that, with the emergence of LLMs, some new works follow a design philosophy similar to GDBFS. This again shows that new techniques always enable some old ideas...

Report Code

Experience

Education and Work Experience

Education

2022 - Present

New York University

PhD Student

I am now a PhD student at NYU, researching the correctness of distributed systems, advised by Prof. Aurojit Panda.

2018 - 2022

University of Science and Technology of China

Bachelor of Computer Science

USTC is where I started studying computer science in 2018. I met Prof. Cheng Li here, who offered me a lot of help in digging into systems work.

Work

Roblox

Research Intern

2025 Summer

Developed a prototype for script-based LOD with just-in-time compilation and instance caching.

Bytedance

Research Intern

2024 Summer

Developed a tool to verify the correctness of distributed model training and inference implementations.

Projects for Fun

IssueClear

Oct 2025

This is a tool that scrapes GitHub or JIRA issues and lets users apply an LLM to filter for the issues they are interested in. I expect it can help researchers in program testing, debugging, and verification find more interesting bugs and facilitate their work.
This is also my first project built with (nearly) pure vibe coding.
The tool is still under development.

(NOTE: logo drawn by Google Nano Banana Pro; let me know if it infringes your rights.)

Code Dataset

P4 language extension for VSCode

May 2021

A VSCode extension for the P4 language. This extension supports simple but incomplete syntax and semantic highlighting. The project was mainly for learning the p4lang specification and reviewing the visitor pattern. It is no longer maintained.

Code Try it out