Research faculty at Virginia Tech

Socially responsible AI for health, safety, and society.

I build and evaluate generative AI, LLM, vision-language, computer vision, and machine learning systems for real-world problems in public health, intelligent transportation, digital content understanding, and AI for social good.

2,000+ citations Across AI, health, NLP, vision, and transportation
LLM evaluation Benchmarks, reasoning, commonsense, multilingual understanding, and safety
Hands-on AI LLM/VLM systems, cloud pipelines, model evaluation, and deployment
Portrait of Surendrabikram Thapa

About me

Research across AI, health, transportation, and online safety.

U.S. permanent resident Received permanent residency under the EB-1A extraordinary ability category.

I am a research faculty member at Virginia Tech in Blacksburg, USA, building and evaluating generative AI, large language model, vision-language, and multimodal machine learning systems for high-impact real-world settings. My recent work focuses on LLM evaluation, agentic AI, RAG, multilingual and low-resource reasoning, benchmark construction, model bias, and responsible AI deployment.

I apply this work across health AI, online safety, and intelligent transportation, with projects in non-invasive health monitoring, misinformation and harmful-content detection, driver behavior analysis, ADS readiness, and interpretable multimodal perception.

Research areas

LLMs and machine learning with real-world stakes.

My research centers on large language models, multimodal learning, and deep learning systems that can be evaluated, explained, and deployed responsibly in health care, transportation, and online safety settings.

Large language models and agentic AI

I work on LLM evaluation, multilingual and low-resource reasoning, RAG and retrieval-augmented distillation, agentic AI frameworks, prompt-based learning, model bias, and responsible deployment of generative AI in high-impact domains.

LLM evaluation Agentic AI RAG Multilingual LLMs AI safety

Medical AI and health monitoring

Non-invasive cardiovascular signal recovery, biomedical vision-language models, LLM-assisted health applications, mental health surveillance, medical dialogue generation, and AI methods for neurodegenerative disease modeling.

Health AI BVP recovery Alzheimer's detection Biomedical LLMs

Computational social science

LLM- and VLM-powered systems for online harms, misinformation, hate speech, polarization, mental health, code-mixed content, low-resource languages, and interpretable multimodal content moderation.

NLP Vision-language models Misinformation Content moderation

Intelligent transportation

Large vision-language models for traffic-scene understanding, driver behavior analysis, naturalistic driving data, face de-identification, ADS readiness, 3D perception, and wearable sensing for stress and fatigue.

Transportation safety VLMs Driver monitoring ADS readiness

Tools and technologies

Hands-on AI systems from research to deployment.

I work across the full applied-AI stack: building models, designing evaluations, engineering data pipelines, containerizing workflows, running cloud experiments, and translating research ideas into usable systems.

LLMs and agentic AI

Hands-on work with pretraining, fine-tuning, RAG, LangChain, RLHF, DPO, evaluation, bias analysis, prompt-based learning, and agentic workflows.

RAG LangChain RLHF DPO Evaluation

ML, vision, and multimodal AI

Implementation-heavy work across deep learning, computer vision, vision-language models, signal processing, detection, segmentation, tracking, and explainability.

PyTorch TensorFlow OpenCV CUDA VLMs

Data, cloud, and engineering

Python-first research engineering with scalable pipelines for data processing, experimentation, visualization, cloud execution, and deployment.

Python SQL Docker AWS GCP Airflow Pandas

Selected publications

Selected publications across core research themes.

A selected view of my work across language, health, transportation, and multimodal AI. The full publication list is available on Google Scholar.

Language, LLMs, and social AI

Online safety, polarization, low-resource language understanding, and multimodal social content.

Selected papers, newest first
ACL 2026

Self-Explaining Hate Speech Detection with Moral Rationales

Vargas F., Trager J., Alves D., Guida M., Thapa S., Atil B., Dementieva D., Smart A.J., and Agrawal A.

ACL 2026

POLAR: A Benchmark for Multilingual, Multicultural, and Multi-Event Online Polarization

Naseem U., Thapa S., and collaborators.

ARR May 2026 submission

GAATHO: A Low-Resource Benchmark and Dataset for Culturally Grounded Riddle Reasoning

Shiwakoti S., Thapa S., Rauniyar K., Maharjan I., and Naseem U.

ICWSM 2025

A Multimodal Prompt-based Framework for Analyzing Code-Mixed and Low-Resource Memes

Thapa S., Veeramani H., Hu L., Zhang Q., Wang W., and Naseem U.

NAACL 2025

GameTox: A Comprehensive Dataset and Analysis for Enhanced Toxicity Detection in Online Gaming Communities

Naseem U., Shiwakoti S., Shah S., Thapa S., and Zhang Q.

EMNLP 2025

Probing the Limits of Multilingual Language Understanding: Low-Resource Language Proverbs as LLM Benchmark for AI Wisdom

Thapa S., Rauniyar K., Veeramani H., Adhikari S., Razzak I., and Naseem U.

IEEE BigData 2025

Agentic AI Framework for Low-Resource Essay Evaluation via Scoring, Explanation, and Debate

Thapa S., Rauniyar K., Shiwakoti S., Adhikari S., Rashid J., Kim J., and Naseem U.

ACL SustaiNLP 2023

ADEPT: Adapter-based Efficient Prompt Tuning Approach for Language Models

Shah S., Thapa S., Jain A., and Huang L.

Health AI and medical sensing

Biomedical NLP, non-contact physiological sensing, and AI for mental and neurological health.

Selected papers, newest first
WISE 2024

Did You Tell a Deadly Lie? Evaluating Large Language Models for Health Misinformation Identification

Thapa S., Rauniyar K., Veeramani H., Shah A., Razzak I., and Naseem U.

JMIR Medical Informatics 2024

Advancing Accuracy in Multimodal Medical Tasks Through Bootstrapped Language-Image Pretraining (BioMedBLIP)

Naseem U., Thapa S., and Masood A.

SIGIR 2023

MDKG: Graph-Based Medical Knowledge-Guided Dialogue Generation

Naseem U., Thapa S., Zhang Q., Hu L., and Nasim M.

CVPR 2023

Camera-based Recovery of Cardiovascular Signals from Unconstrained Face Videos using an Attention Network

Deshpande Y., Thapa S., Sarkar A., and Abbott A.L.

CVPR 2023

A Temporal Encoder-Decoder Approach to Extracting Blood Volume Pulse Signal Morphology from Face Videos

Li F., Thapa S., Bhat S., Sarkar A., and Abbott A.L.

IEEE Big Data 2024

SAFENet: Towards a Robust Suicide Assessment in Social Media Using Selective Prediction Framework

Thapa S., Salman M., Shah S., Zhang Q., Rashid J., Hu L., Razzak I., and Naseem U.

IEEE BigData 2024

THYMES: A Framework for Detecting Suicidal Ideation from Social Media Posts Using Hyperbolic Learning

Thapa S., Salman M., Shah S., Shiwakoti S., Zhang Q., Hu L., Razzak I., and Naseem U.

International Journal of Human-Computer Studies 2022

Exploiting Linguistic Information from Nepali Transcripts for Early Detection of Alzheimer's Disease

Adhikari S., Thapa S., Naseem U., Singh P., Huo H., Bharathy G., and Prasad M.

IEEE R10-HTC 2020Best Paper Award

Feature Selection Based Twin-Support Vector Machine for the Diagnosis of Parkinson's Disease

Thapa S., Adhikari S., Ghimire A., and Aditya A.

Transportation and vision

Traffic-scene understanding, driver privacy, naturalistic driving, 3D perception, and driver health.

Selected papers, newest first
IEEE ITSC 2026

Lane-Specific, Multi-Phase Surrogate Modeling of Lane-Change Risk for ADAS Safety Evaluation

Guduri B., Costa R., and Thapa S.

IEEE ITSC 2026

Concept-Grounded Vision-Language Models for Interpretable Driving Scene Condition Perception

Thapa S. and Sarkar A.

IEEE IV 2026

Characterizing Stress and Fatigue in Long-Haul Truck Drivers through Wearable Sensing and Physiological Measures

Thapa S. and Sarkar A.

IEEE IV 2024

Semantic Understanding of Traffic Scenes with Large Vision Language Models

Jain S., Thapa S., Chen K.T., Abbott A.L., and Sarkar A.

ECCV Workshops 2024

3D Object Detection and Tracking Refinement with Ensemble Methods and Spatiotemporal Filtering

Jain S., Thapa S., Bhardwaj S., Abbott A.L., Sarkar A., and Xuan J.

The Visual Computer 2025

A Deep Dive into Enhancing Sharing of Naturalistic Driving Data through Face Deidentification

Thapa S. and Sarkar A.

IEEE IV 2023

GAN-based Deidentification of Drivers' Face Videos: An Assessment of Human Factors Implications in NDS Data

Thapa S. and Sarkar A.

HFES Annual Meeting 2022

Deidentification of Drivers' Face Videos: Scope and Challenges in Human Factors Research

Thapa S., Cook J., and Sarkar A.

Experience

Applied research experience and education.

Experience

Aug 2023 - Present

Virginia Tech

Research Faculty

  • Collaborating with faculty across Virginia Tech departments and centers.
  • Serves as principal investigator (PI), Co-PI, and co-investigator on projects funded by federal and state agencies, research institutes, and industry partners.
  • Builds LLM/VLM evaluation systems, multimodal AI models, cloud/containerized workflows, and scalable data/model pipelines for real-world deployment.
LLM/VLM evaluation Multimodal perception Driver behavior modeling Transportation safety Health monitoring Online safety
Aug 2021 - Aug 2023

Virginia Tech Transportation Institute

Graduate Research Assistant

Worked on transportation AI, naturalistic driving data, DOT/National Academies research synthesis, and driver face de-identification.

Aug 2021 - Aug 2023

Bradley Department of Electrical and Computer Engineering, Virginia Tech

Graduate Research Assistant

Developed non-contact cardiovascular sensing methods for blood-volume-pulse recovery from face video.

May 2019 - May 2020

University of Technology Sydney

Visiting Scholar

Built AI models for early Alzheimer's disease detection using neuropsychological, brain-volume, and speech-transcript signals.

Jun 2020 - Oct 2020

European Organization for Nuclear Research (CERN)

Openlab Summer Intern

Worked on conditional progressive GANs for spectrally valid satellite image generation and validation with UNOSAT water-stream detection models.

Education

Aug 2021 - Aug 2023

Virginia Tech

M.S., Computer Science (Thesis); Graduate Certificate in Data Analytics

Blacksburg, Virginia.

Aug 2017 - May 2021

Delhi Technological University

B.Tech., Software Engineering

New Delhi, India.

Service

Academic service and research community leadership.

I serve the research community through area chair roles, program committees, reviewing, workshop organization, and shared-task leadership across AI, NLP, social computing, multimedia, vision, and intelligent systems venues.

Program committee and reviewing

I serve as a program committee member and reviewer for major AI, NLP, and computer vision venues, including NeurIPS, ICLR, CVPR, ECCV, ICCV, ACL, EMNLP, ARR, AAAI, SIGIR, WebConf, ICWSM, LREC, COLING, ECAI, IJCAI, ECML/PKDD, ACM CSCW, and multiple IEEE/ACM transactions and conferences.

Workshop organization

I have chaired and organized a broad set of workshops that bring together researchers working on low-resource NLP, multimodal social-good AI, biomedical vision-language modeling, online safety, and socio-political event extraction.

Selected roles, newest first

  • Workshop Chair, EEUCA at ACL 2026
  • Workshop Chair, WiNLP at EMNLP 2025 and 2026
  • Workshop Chair, MM4SG at WebConf 2024, 2025, and 2026, and ICDM 2024
  • Workshop Chair, CHiPSAL at COLING 2025 and LREC 2026
  • Workshop Chair, LFMBio at WACV 2026
  • Workshop Chair, VLM4Bio at ACM Multimedia 2024
  • Workshop Chair, CASE at EACL 2024 and RANLP 2025
  • Organizing Committee, CASE at RANLP 2023

Shared tasks and competitions

I help lead shared tasks that define benchmark datasets, evaluation protocols, and community research agendas for multilingual NLP, multimodal content understanding, online safety, polarization, misinformation, and AI for healthier digital spaces.

Selected tasks, newest first

  • SemEval 2026 Task 9 at ACL 2026: multilingual, multicultural, and multi-event online polarization
  • EEUCA at ACL 2026: VaxMeme shared task on multimodal vaccine-critical meme detection
  • EEUCA at ACL 2026: understanding toxic behavior in gaming communities using AI to promote healthier digital spaces
  • CHiPSAL at LREC 2026: multimodal hate and sentiment understanding in low-resource text-embedded images
  • CASE at RANLP 2025: multimodal hate, humor, and stance event detection in marginalized socio-political movements
  • CHiPSAL at COLING 2025: Devanagari language identification, hate speech, and target detection
  • CASE at EACL 2024: extended multimodal hate speech event detection during the Russia-Ukraine crisis
  • CASE at EACL 2024: stance and hate event detection in tweets related to climate activism
  • CASE at RANLP 2023: Multimodal Hate Speech Event Detection
  • CASE at RANLP 2023: Event Causality Identification

Contact

Let's connect.

I am happy to talk about research collaboration, AI for social good, transportation safety, health AI, and selected publication updates. Or just say hello.

Top ↑