Introducing a strong VLM FIRE with a powerful feedback-refinement capability.

Can VLMs refine their responses based on user feedback spontaneously❓

Introducing 🔥FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models.

  • 👉 Project: https://mm-fire.github.io

Checkout our 🚀 Gradio Demo

Feel free to leave a message to me, if you have issues about relevant demo and code.