TensorRT-LLMs/tensorrt_llm/metrics/enums.py
zhanghaotong 1026069a2b
[None][feat] Add opentelemetry tracing (#5897)
Signed-off-by: Zhang Haotong <zhanghaotong.zht@antgroup.com>
Signed-off-by: zhanghaotong <zhanghaotong.zht@antgroup.com>
Signed-off-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
Co-authored-by: Zhang Haotong <zhanghaotong.zht@alibaba-inc.com>
Co-authored-by: Shunkang <182541032+Shunkangz@users.noreply.github.co>
2025-10-27 18:51:07 +08:00


from enum import Enum


class MetricNames(Enum):
    """Names of the per-request metrics."""
    TTFT = "ttft"  # time to first token
    TPOT = "tpot"  # time per output token
    E2E = "e2e"  # end-to-end latency
    REQUEST_QUEUE_TIME = "request_queue_time"
    ARRIVAL_TIMESTAMP = "arrival_timestamp"


class RequestEventTiming(Enum):
    """Keys for per-request event timestamps and related values."""
    ARRIVAL_TIME = "arrival_time"
    FIRST_TOKEN_TIME = "first_token_time"  # nosec: B105
    FIRST_SCHEDULED_TIME = "first_scheduled_time"
    LAST_TOKEN_TIME = "last_token_time"  # nosec: B105
    KV_CACHE_TRANSFER_START = "kv_cache_transfer_start"
    KV_CACHE_TRANSFER_END = "kv_cache_transfer_end"
    KV_CACHE_SIZE = "kv_cache_size"
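
The file only defines names, so a minimal sketch of how the two enums might relate can help a reader. Everything below is an assumption for illustration: `derive_latency_metrics` is a hypothetical helper that is not part of this file, the formulas are the conventional definitions of these metrics rather than anything confirmed by this source, and the enums are redefined locally (a subset of the members above) so the sketch runs without `tensorrt_llm` installed.

```python
from enum import Enum
from typing import Dict


# Local mirrors of a subset of the enum members above (assumption: copied
# here only so the sketch is self-contained).
class MetricNames(Enum):
    TTFT = "ttft"
    TPOT = "tpot"
    E2E = "e2e"
    REQUEST_QUEUE_TIME = "request_queue_time"


class RequestEventTiming(Enum):
    ARRIVAL_TIME = "arrival_time"
    FIRST_TOKEN_TIME = "first_token_time"
    FIRST_SCHEDULED_TIME = "first_scheduled_time"
    LAST_TOKEN_TIME = "last_token_time"


def derive_latency_metrics(
    timings: Dict[RequestEventTiming, float],
    num_output_tokens: int,
) -> Dict[MetricNames, float]:
    """Hypothetical helper: derive latency metrics from event timestamps.

    Assumes all timestamps share one clock and one unit (for example seconds).
    """
    arrival = timings[RequestEventTiming.ARRIVAL_TIME]
    first_scheduled = timings[RequestEventTiming.FIRST_SCHEDULED_TIME]
    first_token = timings[RequestEventTiming.FIRST_TOKEN_TIME]
    last_token = timings[RequestEventTiming.LAST_TOKEN_TIME]

    metrics = {
        # Time from request arrival to the first generated token.
        MetricNames.TTFT: first_token - arrival,
        # Time from request arrival to the last generated token.
        MetricNames.E2E: last_token - arrival,
        # Time the request waited before it was first scheduled.
        MetricNames.REQUEST_QUEUE_TIME: first_scheduled - arrival,
    }
    # Average gap between output tokens after the first one; it is
    # undefined when only a single token was generated, so it is skipped.
    if num_output_tokens > 1:
        metrics[MetricNames.TPOT] = (last_token - first_token) / (
            num_output_tokens - 1
        )
    return metrics


example_timings = {
    RequestEventTiming.ARRIVAL_TIME: 10.0,
    RequestEventTiming.FIRST_SCHEDULED_TIME: 10.5,
    RequestEventTiming.FIRST_TOKEN_TIME: 11.0,
    RequestEventTiming.LAST_TOKEN_TIME: 13.0,
}
example_metrics = derive_latency_metrics(example_timings, num_output_tokens=5)
```

Keying both dictionaries by enum member rather than by raw string keeps typos from silently creating new keys; use `.value` only where a plain string name is required, such as when exporting a metric.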