input_tokens | Number of tokens in the prompt/input to the model. |
output_tokens | Number of tokens generated by the model as output. |
total_tokens | Total tokens used (input + output). |
audio_input_tokens | Audio tokens in the input. |
audio_output_tokens | Audio tokens in the output. |
audio_total_tokens | Total audio tokens (if using audio input/output). |
cache_read_tokens | Tokens served from cache (if caching is used). |
cache_write_tokens | Tokens written to cache. |
reasoning_tokens | Tokens used for reasoning steps (if enabled). |
time_to_first_token | Time until the first token is generated (in seconds). |
duration | Total run time (in seconds). |
provider_metrics | Provider-specific metrics (dict). |
additional_metrics | Any extra metrics provided by the model/tool (e.g., latency, cost, etc.). |