Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
Shubhashis Roy Dipta
[go: Go Back, main page]

Shubhashis Roy Dipta

PhD Researcher Ā· UMBC

sroydip1@umbc.edu


Amazon Science (Alexa)
Seattle, WA
Applied Scientist Intern (Incoming)
Summer 2026
Manager: Dr. Lichao Wang
Amazon Science (Alexa)
Seattle, WA
Applied Scientist Intern
Summer 2025
Manager: Dr. Lichao Wang
Mentor: Dr. Daniel Bis
Paper: PA3: Policy-Aware Agent Alignment
Scale AI
San Francisco, CA
Machine Learning Research Intern
Summer 2024
Manager: Dr. Adrian Lam
Mentor: Vijay Kalmath
Blog: RLHF for Text-to-SQL
See more
University of Maryland, Baltimore County
Ph.D. in Computer Science
Fall 2023 - Present
Advisor: Dr. Frank Ferraro
Grade: 4.00/4.00
Publications: See Here (From 2022)
University of Maryland, Baltimore County
M.Sc. in Computer Science
Spring 2021 - Spring 2023
Awards: Phi Kappa Phi
Grade: 4.00/4.00
Morgan State University
Research Assistant
2017 - 2019
Advisor: Dr. Iman Dehzangi
Publications: 4 Journal
UniShopr.com
Bangladesh
Founder
2017 - 2021

Upcoming Travel

  • ACL 2026 in San Diego, CA (Jul 3-7)
Previous
  • āœ… NeurIPS 2025 in San Diego, CA (Dec 2-7)
  • āŒ AACL 2025 in Mumbai, India (Dec 20-24) (canceled)
šŸ‘‹ I'm open to meet! Email me to schedule a chat!

Peer Review

Reviewed 28+ papers across top venues (2023–2025).

Conferences
ACLNeurIPSNAACLCOLING*SEM
Workshops
SemEvalTrustNLPSRWW-NUTELVM
Journals
Scientific ReportsBMC BioinformaticsPlant MethodsComputational and Structural Biotechnology

I am a final-year Ph.D. researcher in Computer Science at the University of Maryland, Baltimore County (UMBC), advised by Dr. Frank Ferraro. I’ve also interned at Amazon Science (Alexa AI; Summer 2025 + returning Summer 2026) and Scale AI (Summer 2024). My research focuses on three areas where modern LLMs fail predictably - (1) complex reasoning, (2) tool-use, and (3) modality conflict - to make them more reliable, efficient, and aligned.

  • Reasoning & Decomposition
    • Atomic, presupposition-free decomposition for robust claim verification [De-Presuppose]
    • Token-efficient math reasoning via distractor-aware computational graphs [DAGGER]
    • Curriculum-driven GRPO for math reasoning in under-resourced languages [GanitLLM]
    • Hierarchical event abstraction for compositional sequence modeling [SHEM]
  • Agentic LLMs & Reinforcement Learning
    • Tool-calling alignment via policy-grounded deliberation [PA3]
    • Multi-agent benchmarks for diagnosing collaboration failures [AgentCollabBench]
    • Mechanistic analysis of token saliency in on-policy distillation [Rock Tokens]
    • Metacognitive control in LLMs under resource constraints [TRIAGE]
  • Multimodal Learning & Evaluation
    • Reference-free factuality metric for video captions [VC-Inspector]
    • Calibrated abstention under modality conflict in omni-modal models [OMD]
    • Zero-shot multilingual text-to-video retrieval via temporal event decomposition [Q2E]

Graduating Spring 2027 - actively seeking Research Scientist roles in NLP / Multimodal AI. Please reach out if you have an opening.

Recent News (See All)

May 14, 2026 🄳 OMD-Bench got accepted at 3 CVPR 2026 workshops (Any2Any MLLM, CVinWild, KnowledgeMR).
May 7, 2026 🄳 1 of my ACL papers (VC Inspector) also got accepted to MAGMaR 2026.
Apr 7, 2026 Successfully defended my PhD Proposal. Now I am officially ABD (All But Dissertation). Slides of the proposal can be found here.
Apr 6, 2026 🄳 3 of my papers got accepted to ACL 2026 - first-author VC-Inspector (Main) and GanitLLM (Findings), plus co-authored Survey on Multimodal Unlearning (Findings).
Jan 28, 2026 🄳 2 of my mentored papers got accepted at LoResLM Workshop at EACL 2026 [ 1, 2 ]
Dec 3, 2025 āœˆļø Heading to San Diego for NeurIPS 2025. Email me to schedule a chat!
Nov 21, 2025 🄳 3 of my mentored papers got accepted at BLP Workshop at AACL 2025 [ 1, 2, 3 ]

Beyond Research

I founded UniShopr (2017–2021), a cross-border e-commerce platform serving consumers in Bangladesh.

I’ve also competed internationally in robotics and algorithms - placing 9th at the University Rover Challenge 2015 (Utah, USA) and 22nd at the European Rover Challenge 2016 (Poland), ranking 8th out of 300+ teams at the 2018 ACM ICPC Asia Dhaka Regional with multiple regional and national placements, and reaching the top 70 on Kaggle šŸ„‰ in the Birdcall Identification competition. Full list of awards →

Featured Publications

Check out Google Scholar for a full list of my publications.

  1. Cornerstones or Stumbling Blocks? Deciphering the Rock Tokens in On-Policy Distillation
    Submitted
    Cornerstones or Stumbling Blocks? Deciphering the Rock Tokens in On-Policy Distillation
    Yuxuan Jiang*,Ā Runchao Li*,Ā Shubhashis Roy Dipta*, and 2 more authors
    Preprint 2026
    * Equal contribution
  2. AgentCollabBench: Diagnosing When Good Agents Make Bad Collaborators
    Submitted
    AgentCollabBench: Diagnosing When Good Agents Make Bad Collaborators
    Aritra Mazumder,Ā Shubhashis Roy Dipta,Ā Nusrat Jahan Lia, and 10 more authors
    Preprint 2026
  3. TRIAGE: Evaluating Prospective Metacognitive Control in LLMs under Resource Constraints
    Preprint
    TRIAGE: Evaluating Prospective Metacognitive Control in LLMs under Resource Constraints
    Zabir Al Nazi,Ā andĀ Shubhashis Roy Dipta
    Preprint 2026
  4. PA3: Policy-Aware Agent Alignment through Chain-of-Thought
    Submitted
    PA3: Policy-Aware Agent Alignment through Chain-of-Thought
    Shubhashis Roy Dipta,Ā Daniel Bis,Ā Kun Zhou, and 4 more authors
    Preprint 2026
    Work done during internship at Amazon Alexa AI
  5. †DAGGER: Distractor-Aware Graph Generation for Executable Reasoning in Math Problems
    Submitted
    †DAGGER: Distractor-Aware Graph Generation for Executable Reasoning in Math Problems
    Zabir Al Nazi,Ā Shubhashis Roy Dipta,Ā andĀ Sudipta Kar
    Preprint 2026
  6. Omni-Modal Dissonance Benchmark: Systematically Breaking Modality Consensus to Probe Robustness and Calibrated Abstention
    Submitted
    Omni-Modal Dissonance Benchmark: Systematically Breaking Modality Consensus to Probe Robustness and Calibrated Abstention
    Zabir Al Nazi*,Ā Shubhashis Roy Dipta*,Ā andĀ Md Rizwan Parvez
    Preprint 2026
    * Equal contribution
  7. GanitLLM: Difficulty-Aware Bengali Mathematical Reasoning through Curriculum-GRPO
    ACL
    GanitLLM: Difficulty-Aware Bengali Mathematical Reasoning through Curriculum-GRPO
    Shubhashis Roy Dipta,Ā Khairul Mahbub,Ā andĀ Nadia Najjar
    ACL 2026
  8. Advancing Reference-free Evaluation of Video Captions with Factual Analysis
    ACL
    Advancing Reference-free Evaluation of Video Captions with Factual Analysis
    Shubhashis Roy Dipta,Ā Tz-Ying Wu,Ā andĀ Subarna Tripathi
    ACL 2026
  9. Multimodal Unlearning Across Vision, Language, Video, and Audio: Survey of Methods, Datasets, and Benchmarks
    ACL
    Multimodal Unlearning Across Vision, Language, Video, and Audio: Survey of Methods, Datasets, and Benchmarks
    Nobin Sarwar,Ā Shubhashis Roy Dipta,Ā Zheyuan Liu, and 1 more author
    ACL 2026
  10. Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval
    AACL
    Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval
    Shubhashis Roy Dipta,Ā andĀ Francis Ferraro
    AACL 2025
  11. If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition
    *SEM
    If We May De-Presuppose: Robustly Verifying Claims through Presupposition-Free Question Decomposition
    Shubhashis Roy Dipta,Ā andĀ Francis Ferraro
    *SEM 2025
  12. Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning
    MathAI @NeurIPS
    Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning
    Ningning Xu,Ā Yuxuan Jiang,Ā andĀ Shubhashis Roy Dipta
    MathAI @NeurIPS 2025
  13. Semantically-informed Hierarchical Event Modeling
    *SEM
    Semantically-informed Hierarchical Event Modeling
    Shubhashis Roy Dipta,Ā Mehdi Rezaee,Ā andĀ Francis Ferraro
    *SEM 2023