MLLM-Tool: A Multimodal Large Language Model for Tool Agent Learning
IEEE/CVF Winter Conference on Applications of Computer Vision (2025)
Key words
Large Language Models, Functional Identification, Real Purpose, External Tools, Multimodal Input, Training Set, Image Quality, Computational Resources, Hallucinations, Selection Tool, Inference Time, Mapping Relationship, Performance In Areas, Test Subset, Unique Treatment, Single Instruction, API Calls, Types Of Ambiguity