diff --git a/docs/source/blogs/tech_blog/blog13_Inference_Time_Compute_Implementation_in_TensorRT-LLM.md b/docs/source/blogs/tech_blog/blog13_Inference_Time_Compute_Implementation_in_TensorRT-LLM.md index 869a6d5196..94070f280f 100644 --- a/docs/source/blogs/tech_blog/blog13_Inference_Time_Compute_Implementation_in_TensorRT-LLM.md +++ b/docs/source/blogs/tech_blog/blog13_Inference_Time_Compute_Implementation_in_TensorRT-LLM.md @@ -166,7 +166,7 @@ prototype_controller = NativeGenerationController(sampling_params={ llm = ScaffoldingLlm( prototype_controller, - {NativeGenerationController.WorkerTag.GENERATION: proposer_worker}, + {NativeGenerationController.WorkerTag.GENERATION: llm_worker}, ) results = llm.generate(prompts) ```