Official implementation of the paper "Targeted Visual Prompting for Medical Visual Question Answering," presented at the AMAI 2024 Workshop of the MICCAI 2024 conference. For more details, please refer to our paper.
This repo is undergoing a cleaning and organization process.
After cloning the repo, create a new environment, activate it, and then install the required packages by running:
pip install -r requirements.txt
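A minimal sketch using conda (you could equally use venv; the environment name and Python version below are placeholders, not requirements of this repo):
conda create -n targeted-vp python=3.10
conda activate targeted-vp
pip install -r requirements.txt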
Download the original data from here and the processed annotation files from here. Alternatively, run the prepare_data.py script in the folder corresponding to each dataset (ris, insegcat, or dme) to prepare the data.
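For example, for the RIS dataset (assuming the dataset folders sit at the repository root; check the script itself for any required arguments):
python ris/prepare_data.py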
Depending on where you place the downloaded data, you will need to configure the paths in the subsequent steps.
To run the code, use the bash scripts in the folder scripts_vqa. For example, to run the crop region baseline on the RIS dataset, run:
bash scripts_vqa/ris/crop_region.sh
Note that the paths to the datasets have to be configured in the scripts in advance.
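The exact variable names differ per script; a hypothetical sketch of the kind of edit involved (these names are illustrative, not the actual ones in the repo):
# at the top of scripts_vqa/ris/crop_region.sh (illustrative variable names)
DATA_PATH=/path/to/ris/data        # where the downloaded/prepared data lives
OUTPUT_PATH=/path/to/experiments   # where checkpoints and logs are written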
The test scripts are located in the same folders as the fine-tuning scripts, and the same command format is used to evaluate performance. For example, to evaluate the draw region baseline on the DME dataset, use:
bash scripts_vqa/dme/draw_region_test.sh
As with fine-tuning, the paths have to be configured in advance.
The metrics will be printed automatically at the end of the inference process.
This work was carried out at the AIMI Lab of the ARTORG Center for Biomedical Engineering Research of the University of Bern. Please cite this work as:
@article{tascon2024targeted,
  title={Targeted Visual Prompting for Medical Visual Question Answering},
  author={Tascon-Morales, Sergio and M{\'a}rquez-Neila, Pablo and Sznitman, Raphael},
  journal={arXiv preprint arXiv:2408.03043},
  year={2024}
}