mirror of
https://github.com/NVIDIA/TensorRT-LLM.git
synced 2026-01-14 06:27:45 +08:00
* v1.5
  Signed-off-by: wili-65535 <wili-65535@users.noreply.github.com>
  v1.5.4 Add back draft_overhead to spec dec stats
  Signed-off-by: Thor Johnsen <41591019+thorjohnsen@users.noreply.github.com>
* v1.5.5: fix CI error
  Signed-off-by: wili-65535 <wili-65535@users.noreply.github.com>
* v1.6: fix CI error 8196 > 8192
  Signed-off-by: wili-65535 <wili-65535@users.noreply.github.com>
* Address reviewer concerns
  Signed-off-by: Thor Johnsen <41591019+thorjohnsen@users.noreply.github.com>
* Address reviewer concerns
  Signed-off-by: Thor Johnsen <41591019+thorjohnsen@users.noreply.github.com>
* precommit run
  Signed-off-by: Thor Johnsen <41591019+thorjohnsen@users.noreply.github.com>
* v2.0: Address reviewer concerns
  Signed-off-by: wili-65535 <wili-65535@users.noreply.github.com>
* v2.1: add fix from wili
  Signed-off-by: wili-65535 <wili-65535@users.noreply.github.com>
* Revert changes that require use of TypeAlias because that requires python version >= 3.10
  Signed-off-by: Thor Johnsen <41591019+thorjohnsen@users.noreply.github.com>

---------

Signed-off-by: Thor Johnsen <41591019+thorjohnsen@users.noreply.github.com>
Signed-off-by: wili-65535 <wili-65535@users.noreply.github.com>
Co-authored-by: wili-65535 <wili-65535@users.noreply.github.com>

| Name | | |
|---|---|---|
| baichuan | ||
| bert | ||
| bloom | ||
| chatglm | ||
| clip | ||
| cogvlm | ||
| commandr | ||
| dbrx | ||
| deepseek_v1 | ||
| deepseek_v2 | ||
| dit | ||
| eagle | ||
| enc_dec | ||
| falcon | ||
| gemma | ||
| gpt | ||
| gptj | ||
| gptneox | ||
| grok | ||
| llama | ||
| mamba | ||
| medusa | ||
| mllama | ||
| mmdit_sd3 | ||
| mpt | ||
| multimodal_encoders | ||
| nemotron_nas | ||
| opt | ||
| phi | ||
| phi3 | ||
| qwen | ||
| recurrentgemma | ||
| redrafter | ||
| stdit | ||
| unet | ||
| __init__.py | ||
| automodel.py | ||
| convert_utils.py | ||
| generation_mixin.py | ||
| model_weights_loader.py | ||
| modeling_utils.py | ||