Here are all the actual test exam dumps for IT exams. Most people prepare for the actual exams with our test dumps to pass their exams. So it's critical to choose and actual test pdf to succeed.

Exam NCA-GENM Topic 1 Question 274 Discussion

Actual exam question for NVIDIA's NCA-GENM exam
Question #: 274
Topic #: 1
You are building a multimodal AI system that generates 3D models of furniture from text descriptions and a few 2D images of similar furniture pieces. The system uses separate encoders for text and images. You want to fuse the information from both modalities effectively. Which TWO of the following fusion techniques would be the most appropriate for this task, considering the different nature of the text and image data?

Suggested Answer: C,D Vote an answer

Cross-attention (C) allows the model to dynamically learn the relationships between the text and image, highlighting relevant features from each modality- A gating mechanism (D) provides a learned way to control the contribution of each modality allowing the model to prioritize the more informative input Simple concatenation (A), addition (B), or averaging (E) are less sophisticated and might not effectively capture the complex interactions between the modalities.

by Bridget at Aug 26, 2025, 02:54 AM

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
Nick name: Submit Cancel
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.