Udari Madhushani Sehwag

Recent Projects

Work in Progress

➟

I am co-organizing a Proceedings of National Academy of Science (PNAS) special issue on "Collective Artificial Intelligence"

with Karl Tuyls, Jakob Foerster, Joshua Plotkin and Arne Traulsen

➟

Agentyx: A Framework for Building LLM Based Multi-agent Systems

with Leo Ardon, Jared Vann, Sivapriya Vellaichamy, Mani Ganapathy, Sumitra Ganesh

➟

A Holistic Approach for Evaluating Agentic Systems

with Francesca Mosca, Deepeka Garg, Leo Ardon, Sumitra Ganesh

➟

Weak to Strong Generalization

with Aakriti Agrawal, Mucong Ding, Furong Huang

➟

Enhanced Embodied Intelligence Through Multimodal Foundation Model Based Agents.

with Arjun Karanam, Rika Antonova, Shuran Song, Jeannette Bohg

Generative AI: Social and resposible generative AI (2022-present)

☞

GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-Time Alignment

Yuancheng Xu, Udari Madhushani Sehwag, Alec Koppel, Sicheng Zhu, Bang An, Furong Huang, Sumitra Ganesh

under review at ICLR 2025

Project-page / Pdf/ Bibtex

☞

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Souradip Chakraborty, Sujay Batt, Udari Madhushani Sehwag, Soumya Suvra Ghosal, Jiahao Qiu, Mengdi Wang, Dinesh Manocha, Furong Huang, Alec Koppel, Sumitra Ganesh

under review at ICLR 2025

☞

AdvBDGen: Adversarially Fortified Prompt-Specific Fuzzy Backdoor Generator Against LLM Alignment

Pankayaraj Pathmanathan, Udari Madhushani Sehwag, Michael-Andrei Panaitescu-Liess, Furong Huang

Safe Generative AI workshop at NeurIPS 2024

under review at ICLR 2025

Project-page / Pdf/ Bibtex

☞

Policy Dreamer: Diverse Public Policy Generation Via Elicitation and Simulation of Human Preferences

Arjun Karanam, Jose Ramon Enriquez, Udari Madhushani Sehwag, Kanishk Gandhi, Micheal Elabd, Noah Goodman, Sanmi Kyejo

Socially Responsible Language Modelling Research workshop at NeurIPS 2024

Project-page / Pdf/ Bibtex

☞

Generative AI Agents for Knowledge Work Augmentation in Finance

Sumitra Ganesh, Leo Ardon, Daniel Borrajo, Deepeka Garg, Udari Madhushani Sehwag, Annapoorni Narayanan, Giuseppe Canonaco, Manuela Veloso

accepted at Annual Reviews 2024

☞

Can LLMs be Scammed? A Baseline Measurement Study

Udari Madhushani Sehwag*, Kelly Patel*, Francesca Mosca*, Vineeth Ravi, Jessica Staddon

Project-page / Pdf / Bibtex

☞

In-Context Learning with Topological Information for LLM-Based Knowledge Graph Completion

Udari Madhushani Sehwag*, Kassiani Papasotiriou*, Jared Vann, Sumitra Ganesh

SPIGM workshop at ICML 2024

Project-page / Pdf/ Bibtex

☞

O3D: Offline Data-Driven Discovery and Distillation for Sequential Decision Making with Large Language Models

Yuchen Xiao, Yanchao Sun, Mengda Xu, Udari Madhushani Sehwag, Jared Vann, Deepeka Garg, Sumitra Ganesh

COLM 2024

FMDM workshop at NeurIPS 2023

Project-page / Pdf / Bibtex

☞

SORRY-Bench: A Systematic Evaluation on Large Language Model Safety Refusal Behaviors

Tinghao Xie, Xiangyu Qi, Yi Zeng, Yangsibo Huang, Udari Madhushani Sehwag, Boyi Wei, Luxi He, Kaixuan Huang, Dacheng Li, Ying Sheng, Bo Li, Danqi Chen, Kai Li, Peter Henderson, Prateek Mittal

under review at NeurIPS 2024

Project-page / Pdf / Bibtex

☞

AI Risk Management Should Understand and Account for Both Safety and Security

Xiangyu Qi, ....., Udari Madhushani Sehwag, ....., Prateek Mittal

under review 2024

Project-page / Pdf / Bibtex

Multi-agent Learning: Social intelligence in multi-agent RL (2015-present)

☞

Autocratic Learning and Unilateral Incentive Alignment in Two-player Stochastic Games

Alex McAvoy, Udari Madhushani, Christian Hilbe, Wolfram Barfuss, Krishnendu Chatterjee, Qi Su, Naomi Ehrich Leonard, Joshua B. Plotkin

Accepted at PNAS 2024

Project-page / Pdf / Bibtex

☞

Collective Cooperative Intelligence

Wolfram Barfuss, Jessica Flack, Chaitanya S. Gokhale, Lewis Hammond, Christian Hilbe, Joel Leibo, Tom Lenaerts, Naomi Leonard, Simon Levin, Udari Madhushani, Alex McAvoy, Janusz M. Meylahn, Fernando P. Santos

Accepted at PNAS 2024

Project-page / Pdf / Bibtex

Zero-shot generalization: We develop methods that allow agents to successfully interact with novel partners during test time in mixed motive games.

☞

Heterogeneous Social Value Orientation Leads to Meaningful Diversity in Sequential Social Dilemmas

Udari Madhushani, Kevin McKee, John Agapiou, Joel Z Leibo, Thomas Anthony, Richard Everett Edward Hughes, Karl Tuyls, and Edgar Duéñez-Guzmán

AAMAS 2023

Project-page / Pdf / Bibtex

Effective communication: We study how agents can minimize communication cost by deciding when and what to communicate depending on the sequence of options they chose.

☞

One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

Udari Madhushani, Abhimanyu Dubey, Naomi Leonard, Alex Pentland

NuerIPS 2021

Project-page / Pdf / Bibtex

☞

Distributed Bandits: Probabilistic Communication on \(d\)-regular Graphs

Udari Madhushani, Naomi Ehrich Leonard

ECC 2021

Project-page / Pdf / Bibtex

We analyze how agent-based strategies contribute to minimizing group regret under communication failures

☞

Distributed Learning: Sequential Decision Making in Resource-Constrained Environments

Udari Madhushani, Naomi Ehrich Leonard

PML4DC workshop, ICLR 2020

Project-page / Pdf / Bibtex/ Slides

We design a partial communication protocol that obtains the same order of performance as full communication for a significantly smaller communication cost.

☞

A Dynamic Observation Strategy for Multi-agent Multi-armed Bandit Problem

Udari Madhushani, Naomi Ehrich Leonard

ECC, 2020

Project-page / Pdf / Bibtex / Slides / Video

We propose a new communication protocol for multi-agent multi-armed bandit problem that improves group performance with only a logarithmic communication cost.

☞

Heterogeneous Explore-Exploit Strategies on Multi-Star Networks

Udari Madhushani, Naomi Ehrich Leonard

IEEE Control Systems Letters, 2020

Project-page / Pdf / Bibtex

For distributed bandits with a multi-star communication graph, we show how sampling rules for center agents that favor exploring over exploiting make the information that center agents broadcast to their neighbors more useful and improve group performance.

☞

Heterogeneous Stochastic Interactions for Multiple Agents in a Multi-armed Bandit Problem

Udari Madhushani, Naomi Ehrich Leonard

ECC, 2019

Project-page / Pdf / Bibtex / Slides

We consider the case where each agent observes all its neighbors independently with the same probability. We show that the performance of each agent depends on observation probabilities of its own and its neighbors.

Embodied AI: Human-robot coordination, control and planning (2015-present)

☞

Multi-robot Learning and Coverage of Unknown Spatial Fields

Maria Santos, Udari Madhushani, Alessia Benevento, Naomi Leonard

MRS, 2021

Project-page / Pdf / Bibtex / Slides / Video

We propose a novel explore-exploit based method for coverage in unknown special fields.

☞

Feedback Regularization and Geometric PID Control for Robust Stabilization of a Planar Three-link Hybrid Bipedal Walking Model

Lasitha Weerakoon, Udari Madhushani, Sanjeeva Maithripala, Jordan Berg

ACC, 2018

Project-page / Pdf / Bibtex / Slides / Video

We propose a geometric PID controller to stabilize a three-link planar bipedal hybrid dynamic walking robot.

☞

Semi-globally Exponential Trajectory Tracking for a Class of Spherical Robots

Udari Madhushani, Sanjeeva Maithripala, Janaka Wijayakulasooriya, Jordan Berg

Automatica, 2017

Project-page / Pdf / Bibtex

We propose a geometric feedback controller for spherical robots capable of tracking a desired position on an inclined plane, in the presence of parameter uncertainty and uncertainty of the inclination of the rolling surface.

☞

Feedback Regularization and Geometric PID Control for Trajectory Tracking of Mechanical Systems: Hoop Robots on an Inclined Plane

Udari Madhushani, Sanjeeva Maithripala, Jordan Berg

ACC, 2017

Project-page / Pdf / Bibtex / Slides

We propose a geometric control strategy for semi-almost global output tracking for a class of interconnected under actuated mechanical systems.

Udari Madhushani Sehwag

AI Research Scientist, Scale AI

udari.madhu.703 [at] gmail [dot] com