Fig. 6From: Dual modality prompt learning for visual question-grounded answering in robotic surgeryExamples of the proposed and other models failing on the EndoVis-17 [25] dataset. Text in red denotes the wrong answer. Examples 1, 2, 3, 4 denote these four examples from left to rightBack to article page