NVIDIA interview question

Questions around Quantization, inference optimization , LLM system design