Free · 7 practice questions · Applied AI

NVIDIA-Certified Associate: Generative AI LLMs (NCA-GENL) practice questions.

7 scenario-based questions covering every domain on the exam blueprint. Original DecipherU writing with primary-source citations, not exam-question mimicry. Free to read. Pair with the $147 cert-prep add-on for domain reviews and exam-day strategy.


Layered on AI Engineering Mastery

NVIDIA-Certified Associate: Generative AI LLMs (NCA-GENL) Exam Prep Add-On

NVIDIA NCA-GENL exam-ready ramp on top of AI Engineering Mastery. Domain reviews aligned to the official NVIDIA exam outline, three 50-question mock exams, and a focused review of the NVIDIA LLM software stack including NeMo, Triton Inference Server, and TensorRT-LLM.


Question 1 of 7 · Domain: LLM Fundamentals

A team is generating customer-service responses with an LLM and wants more consistent output for factual questions. Which sampling parameter set is most appropriate?
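As background for this scenario, here is a minimal sketch of how the temperature parameter reshapes a next-token distribution. The logits are hypothetical values chosen for illustration, and `softmax_with_temperature` is an illustrative helper, not part of any NVIDIA library.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw logits into probabilities after dividing by temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical logits for four candidate next tokens (assumption).
logits = [2.0, 1.0, 0.5, 0.1]

low_t = softmax_with_temperature(logits, 0.2)   # sharper distribution
high_t = softmax_with_temperature(logits, 1.5)  # flatter distribution

print("T=0.2:", [round(p, 3) for p in low_t])
print("T=1.5:", [round(p, 3) for p in high_t])
```

With these inputs, the low-temperature run concentrates nearly all probability on the top-scoring token, while the high-temperature run spreads it across the alternatives, which is why temperature is one of the first parameters to look at when output consistency matters.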


Question 2 of 7 · Domain: Prompt Engineering and RAG

A RAG system retrieves five chunks per query and stuffs them into the prompt. Users report that the model often ignores the retrieved context and falls back to its training-data knowledge. Which mitigation is most likely to help?


Question 3 of 7 · Domain: Fine-Tuning and Customization

A team needs to fine-tune a 70B-parameter base model to follow a new product's instruction style. They have a single 8x H100 node available. Which fine-tuning approach is most feasible on that hardware?
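The feasibility question here comes down to memory arithmetic, which can be roughed out as follows. This is a back-of-the-envelope sketch under stated assumptions: 80 GB per H100, a common rule of thumb of about 16 bytes per trainable parameter for mixed-precision Adam (weights, gradients, and optimizer state), 2 bytes per frozen 16-bit weight, and a hypothetical adapter size of 1% of the base parameters. Activations and framework overhead are ignored.

```python
GB = 1e9

PARAMS = 70e9             # 70B-parameter base model (from the scenario)
NODE_MEMORY_GB = 8 * 80   # assumes the 80 GB H100 variant


def full_finetune_gb(n_params, bytes_per_param=16):
    """Weights + gradients + Adam state for every parameter (rule of thumb)."""
    return n_params * bytes_per_param / GB


def adapter_finetune_gb(n_params, adapter_fraction=0.01,
                        frozen_bytes=2, trainable_bytes=16):
    """Frozen 16-bit base weights plus a small set of trainable adapter weights.

    adapter_fraction is a hypothetical value chosen for illustration.
    """
    frozen = n_params * frozen_bytes
    adapters = n_params * adapter_fraction * trainable_bytes
    return (frozen + adapters) / GB


full_gb = full_finetune_gb(PARAMS)
adapter_gb = adapter_finetune_gb(PARAMS)

print(f"node memory:            {NODE_MEMORY_GB} GB")
print(f"full fine-tuning:       {full_gb:.0f} GB")
print(f"adapter-based approach: {adapter_gb:.0f} GB")
```

Under these assumptions, updating every parameter needs more memory than the node has before activations are even counted, while freezing the base weights and training a small adapter fits with room to spare.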


Question 4 of 7 · Domain: NVIDIA Inference Stack

An ML engineer needs to serve multiple LLMs from a single inference endpoint with automatic batching across concurrent requests. Which NVIDIA software is the canonical fit?


Question 5 of 7 · Domain: NVIDIA Inference Stack

TensorRT-LLM applies several inference optimizations to LLM serving on NVIDIA GPUs. Which optimization keeps the GPU continuously busy by adding new requests to a running batch as old requests complete?
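The behavior described in this question can be illustrated with a toy scheduler. The sketch below is a simplified simulation, not TensorRT-LLM code: it assumes each request needs a fixed, positive number of decode steps, that one step advances every running request by one token, and it compares a batch that waits for its slowest member against one that refills freed slots immediately. The function names and request lengths are hypothetical.

```python
from collections import deque


def static_batching_steps(lengths, batch_size):
    """Each batch runs until its longest request finishes."""
    steps = 0
    for i in range(0, len(lengths), batch_size):
        steps += max(lengths[i:i + batch_size])
    return steps


def refill_batching_steps(lengths, batch_size):
    """Freed slots are refilled from the queue before every decode step."""
    queue = deque(lengths)
    running = []  # remaining decode steps for each in-flight request
    steps = 0
    while queue or running:
        # Admit waiting requests into any free slots.
        while queue and len(running) < batch_size:
            running.append(queue.popleft())
        steps += 1
        # Advance every request by one token and drop the finished ones.
        running = [r - 1 for r in running if r - 1 > 0]
    return steps


# Hypothetical output lengths (in tokens) for eight queued requests.
lengths = [8, 2, 2, 2, 8, 2, 2, 2]

static_steps = static_batching_steps(lengths, batch_size=2)
refill_steps = refill_batching_steps(lengths, batch_size=2)

print("static batching decode steps:", static_steps)
print("refill-as-you-go decode steps:", refill_steps)
```

In the static case, short requests leave their slot idle while the long request in the same batch keeps decoding. Refilling slots as requests complete removes that idle time, so the same work finishes in fewer steps.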


Question 6 of 7 · Domain: Operational Considerations and Responsible AI

A production LLM serving cluster is constrained by KV-cache memory at high concurrency. Which technique most directly addresses the KV-cache bottleneck?
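To see why concurrency puts pressure on the KV cache, it helps to estimate its size. The sketch below uses the standard accounting of one key vector and one value vector per layer, per KV head, per token. The model dimensions are hypothetical values chosen for illustration, not the configuration of any specific model.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim,
                   seq_len, num_sequences, bytes_per_value=2):
    """Estimate KV-cache size; the leading 2 covers keys and values."""
    return (2 * num_layers * num_kv_heads * head_dim
            * seq_len * num_sequences * bytes_per_value)


# Hypothetical model shape and context length (assumptions).
LAYERS, KV_HEADS, HEAD_DIM, SEQ_LEN = 80, 8, 128, 4096

one_seq = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, SEQ_LEN, num_sequences=1)
many_seq = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, SEQ_LEN, num_sequences=64)

GIB = 2 ** 30
print(f"1 sequence:   {one_seq / GIB:.2f} GiB")
print(f"64 sequences: {many_seq / GIB:.2f} GiB")
```

The estimate grows linearly with the number of concurrent sequences and with context length, so at high concurrency the cache can rival or exceed the memory taken by the model weights themselves. Any mitigation has to reduce either the bytes stored per token or the memory wasted on reserved but unused cache space.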


Question 7 of 7 · Domain: Operational Considerations and Responsible AI

An ML engineer needs to add safety guardrails to a customer-facing LLM application: content moderation, off-topic refusal, and jailbreak detection. NVIDIA ships an open-source toolkit specifically for this use case. Which one is the canonical fit?


Liked these 7? Get the full prep.

NVIDIA-Certified Associate: Generative AI LLMs (NCA-GENL) Exam Prep Add-On

Adds exam-blueprint domain reviews, exam-day strategy, the authorized study resources, and the practice scenarios gated behind purchase. $147 on top of the parent course. Verified against the official blueprint 2026-05-22.


