Each theme combines recent work with long-running foundations from the group, from multimodal emotion and sentiment analysis to retrieval-grounded generation, scientific discovery, efficient learning, and vision-language-action models.

Safety

We study whether AI systems behave safely inside the situations they are built for, especially when LLM agents face off-topic, adversarial, or underspecified requests.

  • Operational safety for task-specific LLM agents
  • Red-teaming, jailbreak analysis, and refusal behavior
  • Test-time alignment and model steering for safer responses
OffTopicEval operational safety evaluation visual
Operational Safety

OffTopicEval

An evaluation suite for measuring whether LLM agents accept valid in-domain requests and refuse out-of-domain ones.

Team: Jingdi Lei, Varun Gumma, Rishabh Bhardwaj, Seok Min Lim, Chuan Li, Amir Zadeh, Soujanya Poria

Safety Arithmetic test-time steering visual
Alignment

Safety Arithmetic

A test-time framework for steering language models toward safer behavior through parameters and activations.

Team: Safety and alignment group

RESTA safety realignment figure
Safety Re-Alignment

RESTA

Restoring safety in fine-tuned language models through task arithmetic while retaining downstream task capability.

Team: Rishabh Bhardwaj, Do Duc Anh, Soujanya Poria

Chain of Utterances red-teaming figure
Red-Teaming

Chain of Utterances

RED-EVAL and Chain-of-Utterances prompting for probing harmful behavior and studying safety alignment in LLMs.

Team: Safety and dialogue systems group

Gender bias probing figure for BERT
Bias Analysis

Gender Bias in BERT

A highly cited analysis of gender bias encoded in contextualized language representations.

Team: Bias and responsible AI collaborators

Trustworthiness

We develop methods for AI systems that know when to answer, when to cite, when to refuse, and how to communicate uncertainty in grounded settings.

  • Trustworthy retrieval-augmented generation
  • Grounded attribution and citation-aware generation
  • Trust calibration in multi-agent and human-facing systems
Trust-Score and Trust-Align retrieval augmented generation visual
RAG

Trust-Score and Trust-Align

Trust-Score evaluates RAG trustworthiness, while Trust-Align improves grounded attribution, refusal, and citation quality.

Team: Maojia Song, Shang Hong Sim, Rishabh Bhardwaj, Hai Leong Chieu, Navonil Majumder, Soujanya Poria

Chain-of-Knowledge framework figure
Grounded Generation

Chain-of-Knowledge

A highly cited framework for grounding LLM generation through dynamic knowledge adaptation over heterogeneous sources.

Team: Xiang Lisa Li, Ruochen Zhao, Yew Ken Chia, Bosheng Ding, Shafiq Joty, Soujanya Poria, Lidong Bing

Epistemic Context Learning visual
Multi-Agent Trust

Epistemic Context Learning

Studying trust formation and calibrated reliance in LLM-based multi-agent systems.

Team: Trustworthy and interactive AI group

Multimodality

Multimodality is a long-running foundation of DeCLaRe Lab: we build models and benchmarks that integrate language, vision, audio, video, and social context.

  • Multimodal fusion and representation learning
  • Emotion, sentiment, sarcasm, and social signal understanding
  • Vision-language and audio-language reasoning benchmarks
MELD dataset visual
Conversational AI

MELD and DialogueRNN

Highly cited resources and models for multimodal, multi-party emotion recognition in conversations.

Team: Soujanya Poria, Devamanyu Hazarika, Navonil Majumder, collaborators

Tensor Fusion Network visual
Fusion

TFN, MISA, and Multimodal Fusion

Foundational work on modality interaction, invariant and specific representations, and robust multimodal sentiment analysis.

Team: Multimodal learning group

TangoFlux training pipeline figure
Audio Generation

Tango and TangoFlux

Text-to-audio generation research spanning diffusion-based audio synthesis and fast preference-optimized flow matching.

Team: Audio, speech, and generative AI group

AI for Science

We explore how language models and agentic systems can help scientific reasoning: retrieving inspirations, forming hypotheses, ranking ideas, and synthesizing evidence.

  • Scientific hypothesis discovery and rediscovery
  • Chemistry-focused benchmarks and multi-agent discovery pipelines
  • Open-domain scientific literature reasoning
MOOSE-Chem visual
Scientific Discovery

MOOSE-Chem

An ICLR 2025 benchmark and framework for testing whether LLMs can rediscover valid chemistry hypotheses from background questions and literature.

Team: Zonglin Yang, Wanhao Liu, Ben Gao, Tong Xie, Yuqiang Li, Wanli Ouyang, Soujanya Poria, Erik Cambria, Dongzhan Zhou

Scientific hypothesis discovery visual
Hypothesis Discovery

Open-Domain Scientific Hypotheses

LLM methods for automated scientific hypothesis discovery across broad literature collections.

Team: Scientific reasoning group

Efficiency

We build techniques that make training, adaptation, and inference cheaper without losing reliability or downstream performance.

  • Dynamic data selection and data-efficient training
  • Online memory, adapters, distillation, token retention, and long-context methods
  • Compact multimodal and embodied models for practical deployment
δ-mem online memory architecture
Online Memory

δ-mem

A lightweight online memory mechanism that compresses history into a compact state and uses it to modulate frozen Transformer attention.

Team: Jingdi Lei, Di Zhang, Junxian Li, Weida Wang, Kaixuan Fan, Xiang Liu, Qihan Liu, Xiaoteng Ma, Baian Chen, Soujanya Poria

Data Agent visual
Data Selection

Data Agent

An end-to-end dynamic data selection framework that learns sample-wise policies to accelerate training while preserving performance.

Team: Suorong Yang, Fangjian Su, Hai Gan, Ziqi Ye, Jie Li, Baile Xu, Furao Shen, Soujanya Poria

DELLA-Merging method figure
Model Merging

DELLA-Merging

Magnitude-based sampling for reducing interference when merging task-specialized language models.

Team: Prateek Yadav Deep, Rishabh Bhardwaj, Soujanya Poria

EFLA linear attention figure
Linear Attention

EFLA

Error-Free Linear Attention derives an exact continuous-time solution for robust long-context computation.

Team: Jingdi Lei, Dong Zhang, Soujanya Poria

Efficient LLM visual
Efficient LLMs

PromptDistill, LLM-Adapters, UDApter

Efficient inference, parameter-efficient fine-tuning, and adapter-based transfer for language and speech models.

Team: Efficient learning group

Embodied AI

We develop embodied agents that perceive, reason, and act, with emphasis on compact VLA models, action grounding, and interactive benchmarks.

  • Vision-language-action models for robotic tasks
  • World-model and action-based preference rewards
  • Interactive reasoning benchmarks from perception to action
NORA teaser
VLA

NORA and NORA 1.5

Small generalist vision-language-action models designed for efficient action grounding and dependable embodied behavior.

Team: Chia-Yu Hung, Qi Sun, Pengfei Hong, Amir Zadeh, Chuan Li, U-Xuan Tan, Navonil Majumder, Soujanya Poria

Embodied AI benchmark visual
Interactive Evaluation

Emma-X and Perception-to-Action

Embodied foundation models and benchmarks that evaluate how agents move from visual reasoning to action.

Team: Embodied AI group