Introducing a strong VLM FIRE with a powerful feedback-refinement capability.
Can VLMs refine their responses based on user feedback spontaneously❓
Introducing 🔥FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models.
- 👉 Project: https://mm-fire.github.io
Checkout our 🚀 Gradio Demo
Feel free to leave a message to me, if you have issues about relevant demo and code.