Presented “Can Vision-Language models generate scene descriptions?” @ NLG in the Lowlands workshop, Tilburg</a>, 🇳🇱