Fig. 4From: Dual modality prompt learning for visual question-grounded answering in robotic surgeryExamples of localization and classification prediction results generated by the proposed model and other advanced models on the EndoVis-18 [25] dataset. Text in red denotes the wrong answer. Examples 1, 2, 3, 4 refer to these four examples from left to rightBack to article page