TensorRT-LLMs/ppl.py at 5aec7af45fc0abd876fa68a9ae8c8cae084f3af3 - TensorRT-LLMs - Gitea: Git with a cup of tea

kanshan/TensorRT-LLMs

mirror of https://github.com/NVIDIA/TensorRT-LLM.git synced 2026-01-14 06:27:45 +08:00

Kaiyu Xie 587d063e6d

Update TensorRT-LLM (#506 )

* Update TensorRT-LLM

---------

Co-authored-by: Shixiaowei02 <39303645+Shixiaowei02@users.noreply.github.com>

2023-11-30 16:46:22 +08:00

8 lines

216 B

Python

Raw Blame History

 def ppl(logits, output_ids):
     """
     Calculate per-token perplexity.
     """
     nlls = -logits.log_softmax(dim=-1)
     ppls = nlls.gather(-1, output_ids.long().unsqueeze(-1))
     return ppls.mean().exp().item()