S³ Agent: Unlocking the Power of VLLM for Zero-Shot Multi-modal Sarcasm Detection
ACM Transactions on Multimedia Computing, Communications and Applications (2024)
Keywords
Natural language processing, Multi-modal sarcasm detection, Vision large language model