Actual exam question for NVIDIA's NCP-AII exam Question #: 294 Topic #: 1
You are deploying a new AI inference service using Triton Inference Server on a multi-GPU system. After deploying the models, you observe that only one GPU is being utilized, even though the models are configured to use multiple GPUs. What could be the possible causes for this?
The 'instance_group' parameter in the model configuration controls how many instances of a model Triton creates and which GPUs they run on. If it is misconfigured, for example with instances pinned to one device, the model may run on a single GPU. CUDA MPS allows multiple CUDA applications (in this case, Triton inference processes) to share a single GPU, improving utilization. Insufficient CPU cores or non-optimized models could limit performance, but wouldn't necessarily restrict usage to a single GPU. While dissimilar GPUs can affect performance, Triton will attempt to schedule across them if configured correctly.
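To make the 'instance_group' point concrete, here is a minimal Python sketch that renders an instance_group stanza for a model's config.pbtxt. The helper name render_instance_group and the GPU indices are hypothetical, chosen only for illustration. The count, kind and gpus fields are the ones Triton's model configuration uses, but the values suited to a real deployment depend on the model and hardware.

```python
def render_instance_group(gpu_ids, count=1):
    """Return an instance_group stanza for a Triton config.pbtxt.

    gpu_ids: GPU device indices the model instances should run on.
    count:   number of model instances to create on each listed GPU.
    (Hypothetical helper, not part of Triton itself.)
    """
    if not gpu_ids:
        raise ValueError("gpu_ids must list at least one GPU index")
    gpus = ", ".join(str(g) for g in gpu_ids)
    return (
        "instance_group [\n"
        "  {\n"
        f"    count: {count}\n"
        "    kind: KIND_GPU\n"
        f"    gpus: [ {gpus} ]\n"
        "  }\n"
        "]\n"
    )


# Example: one instance on each of GPUs 0 through 3 (assumed indices).
stanza = render_instance_group([0, 1, 2, 3])
print(stanza)
```

If a stanza like this lists only one index under gpus, the model is restricted to that device, which is one way to end up with the single-GPU symptom described in the question.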