Shubhashis Roy Dipta

PhD Researcher · UMBC

sroydip1@umbc.edu

Resume Google Scholar

Amazon Science (Alexa)

Seattle, WA

Applied Scientist Intern (Incoming)

Summer 2026

Amazon Science (Alexa)

Seattle, WA

Applied Scientist Intern

Summer 2025

Paper: PA3: Policy-Aware Agent Alignment

Scale AI

San Francisco, CA

Machine Learning Research Intern

Summer 2024

University of Maryland, Baltimore County

Ph.D. in Computer Science

Fall 2023 - Present

Grade: 4.00/4.00

Publications: See Here (From 2022)

University of Maryland, Baltimore County

M.Sc. in Computer Science

Spring 2021 - Spring 2023

Grade: 4.00/4.00

Morgan State University

Research Assistant

2017 - 2019

Publications: 4 Journal

UniShopr.com

Bangladesh

Founder

2017 - 2021

Upcoming Travel

ACL 2026 in San Diego, CA (Jul 3-7)

✅ NeurIPS 2025 in San Diego, CA (Dec 2-7)
❌ AACL 2025 in Mumbai, India (Dec 20-24) (canceled)

👋 I'm open to meet! Email me to schedule a chat!

Peer Review

Reviewed 28+ papers across top venues (2023–2025).

Conferences

ACLNeurIPSNAACLCOLING*SEM

Workshops

SemEvalTrustNLPSRWW-NUTELVM

Journals

Scientific ReportsBMC BioinformaticsPlant MethodsComputational and Structural Biotechnology

I am a final-year Ph.D. researcher in Computer Science at the University of Maryland, Baltimore County (UMBC), advised by Dr. Frank Ferraro. I’ve also interned at Amazon Science (Alexa AI; Summer 2025 + returning Summer 2026) and Scale AI (Summer 2024). My research focuses on three areas where modern LLMs fail predictably - (1) complex reasoning, (2) tool-use, and (3) modality conflict - to make them more reliable, efficient, and aligned.

Reasoning & Decomposition
- Atomic, presupposition-free decomposition for robust claim verification [De-Presuppose]
- Token-efficient math reasoning via distractor-aware computational graphs [DAGGER]
- Curriculum-driven GRPO for math reasoning in under-resourced languages [GanitLLM]
- Hierarchical event abstraction for compositional sequence modeling [SHEM]
Agentic LLMs & Reinforcement Learning
- Tool-calling alignment via policy-grounded deliberation [PA3]
- Multi-agent benchmarks for diagnosing collaboration failures [AgentCollabBench]
- Mechanistic analysis of token saliency in on-policy distillation [Rock Tokens]
- Metacognitive control in LLMs under resource constraints [TRIAGE]
Multimodal Learning & Evaluation
- Reference-free factuality metric for video captions [VC-Inspector]
- Calibrated abstention under modality conflict in omni-modal models [OMD]
- Zero-shot multilingual text-to-video retrieval via temporal event decomposition [Q2E]

Graduating Spring 2027 - actively seeking Research Scientist roles in NLP / Multimodal AI. Please reach out if you have an opening.

Recent News (See All)

May 14, 2026	🥳 OMD-Bench got accepted at 3 CVPR 2026 workshops (Any2Any MLLM, CVinWild, KnowledgeMR).
May 7, 2026	🥳 1 of my ACL papers (VC Inspector) also got accepted to MAGMaR 2026.
Apr 7, 2026	Successfully defended my PhD Proposal. Now I am officially ABD (All But Dissertation). Slides of the proposal can be found here.
Apr 6, 2026	🥳 3 of my papers got accepted to ACL 2026 - first-author VC-Inspector (Main) and GanitLLM (Findings), plus co-authored Survey on Multimodal Unlearning (Findings).
Jan 28, 2026	🥳 2 of my mentored papers got accepted at LoResLM Workshop at EACL 2026 [ 1, 2 ]
Dec 3, 2025	✈️ Heading to San Diego for NeurIPS 2025. Email me to schedule a chat!
Nov 21, 2025	🥳 3 of my mentored papers got accepted at BLP Workshop at AACL 2025 [ 1, 2, 3 ]

Beyond Research

I founded UniShopr (2017–2021), a cross-border e-commerce platform serving consumers in Bangladesh.

I’ve also competed internationally in robotics and algorithms - placing 9th at the University Rover Challenge 2015 (Utah, USA) and 22nd at the European Rover Challenge 2016 (Poland), ranking 8th out of 300+ teams at the 2018 ACM ICPC Asia Dhaka Regional with multiple regional and national placements, and reaching the top 70 on Kaggle 🥉 in the Birdcall Identification competition. Full list of awards →

Featured Publications

Check out Google Scholar for a full list of my publications.

Submitted

Cornerstones or Stumbling Blocks? Deciphering the Rock Tokens in On-Policy Distillation

Yuxuan Jiang*, Runchao Li*, Shubhashis Roy Dipta*, and 2 more authors

Preprint 2026

* Equal contribution

arXiv
Submitted

AgentCollabBench: Diagnosing When Good Agents Make Bad Collaborators

Aritra Mazumder, Shubhashis Roy Dipta, Nusrat Jahan Lia, and 10 more authors

Preprint 2026

arXiv Code Website
Preprint

TRIAGE: Evaluating Prospective Metacognitive Control in LLMs under Resource Constraints

Zabir Al Nazi, and Shubhashis Roy Dipta

Preprint 2026

arXiv
Submitted

PA3: Policy-Aware Agent Alignment through Chain-of-Thought

Shubhashis Roy Dipta, Daniel Bis, Kun Zhou, and 4 more authors

Preprint 2026

Work done during internship at Amazon Alexa AI

arXiv Video
Submitted

†DAGGER: Distractor-Aware Graph Generation for Executable Reasoning in Math Problems

Zabir Al Nazi, Shubhashis Roy Dipta, and Sudipta Kar

Preprint 2026

arXiv Code Website
Submitted

Omni-Modal Dissonance Benchmark: Systematically Breaking Modality Consensus to Probe Robustness and Calibrated Abstention

Zabir Al Nazi*, Shubhashis Roy Dipta*, and Md Rizwan Parvez

Preprint 2026

* Equal contribution

arXiv
ACL

GanitLLM: Difficulty-Aware Bengali Mathematical Reasoning through Curriculum-GRPO

Shubhashis Roy Dipta, Khairul Mahbub, and Nadia Najjar

ACL 2026

arXiv Code Website
ACL

Advancing Reference-free Evaluation of Video Captions with Factual Analysis

Shubhashis Roy Dipta, Tz-Ying Wu, and Subarna Tripathi

ACL 2026

arXiv Code Website
ACL

Multimodal Unlearning Across Vision, Language, Video, and Audio: Survey of Methods, Datasets, and Benchmarks

Nobin Sarwar, Shubhashis Roy Dipta, Zheyuan Liu, and 1 more author

ACL 2026

PDF Code Website
AACL

Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval

Shubhashis Roy Dipta, and Francis Ferraro

AACL 2025

arXiv Video Code Poster Slides Website
*SEM

If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition

Shubhashis Roy Dipta, and Francis Ferraro

*SEM 2025

arXiv Code Poster Website
MathAI @NeurIPS

Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning

Ningning Xu, Yuxuan Jiang, and Shubhashis Roy Dipta

MathAI @NeurIPS 2025

arXiv
*SEM

Semantically-informed Hierarchical Event Modeling

Shubhashis Roy Dipta, Mehdi Rezaee, and Francis Ferraro

*SEM 2023

arXiv Code Slides