Here are all the actual test exam dumps for IT exams. Most people prepare for the actual exams with our test dumps to pass their exams. So it's critical to choose and actual test pdf to succeed.

Exam NCA-GENM Topic 1 Question 277 Discussion

Actual exam question for NVIDIA's NCA-GENM exam
Question #: 277
Topic #: 1
You are building a system that takes an image of a scene and a short audio clip as input and generates a descriptive text. You want to evaluate the system's performance. Which of the following evaluation metrics are MOST suitable for assessing both the accuracy and the coherence of the generated descriptions in relation to the input image and audio?

Suggested Answer: E Vote an answer

BLEU, CIDEr, and SPICE are all suitable for evaluating image captioning and similar generative tasks. BLEU measures the n-gram overlap between the generated text and reference texts. CIDEr specifically focuses on consensus-based image description evaluation, weighting n-grams that are more common among human-generated captions. SPICE focuses on semantic propositional content and captures object, attribute, and relationship triples. ROUGE focuses on recall, but the other 3 provide the best overall picture. Perplexity and WER are more suitable for language models, and Inception Score and FID are used for evaluating the quality of generated images.

by Renee at Sep 22, 2025, 07:57 PM

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
Nick name: Submit Cancel
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.