Introducing a strong VLM FIRE with a powerful feedback-refinement capability.
Can VLMs refine their responses based on user feedback spontaneously❓
Introducing 🔥FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models.
- 👉 Project: https://mm-fire.github.io Feel free to leave a message to me, if you have issues about the code.