MEERKAT: Audio-Visual Large Language Model for Grounding in Space and Time

Sanjoy Chowdhury*, Sayan Nag*, Subhrajyoti Dasgupta*, Jun Chen, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review
