Skip to main content
Model Traces provides run-level analysis of any model deployed or integrated in your account. Use it to review request inputs, generated outputs, response times, and run metadata for troubleshooting and performance investigation. Navigation: Go to SettingsMonitoringModel Traces.

Performance Metrics Summary

The metrics summary at the top of the page shows key performance data for the selected model and period.
MetricDescription
Total RequestsTotal runs serviced by the model since deployment. Reflects processing speed and efficiency.
Response TimeP90 and P99 thresholds — the response times below which 90% and 99% of requests fall. Lower values indicate consistent speed; higher values indicate potential issues.
Failure RatePercentage of requests that failed out of total requests. For example, 5 failed out of 100 equals a 5% failure rate.
Hosting CreditsCredits consumed by the deployed model based on usage, for comparison against actual utilization.
Hosting Credits apply only to open-source and fine-tuned models. This metric is not shown for external models.

Model Traces Table

The table displays all runs for the selected model, sorted from the latest to the oldest execution date. It includes data from the first execution onward — whether from deployment (open-source and fine-tuned models) or integration (external models). Successful requests are marked with a green check icon; failed requests with a red alert icon.
ColumnDescription
StatusSuccess or failure icon for the run.
Request IDUnique identifier for the run record.
Response TimeTime taken by the model to respond.
Deployment VersionModel version deployed in your account.
Source TypeType of source that initiated the request.
SourceSpecific source name that sent the request.
Executed onRun execution timestamp. Click the Sort icon to reorder records by ascending or descending date.
InputInput text provided for the run.
OutputOutput text or response generated by the model.
To view the detailed trace for a run, click its row in the table.

Export Data

Click Export to generate a CSV file of model traces records based on the selected date range and filters.

Trace Details

Select a trace record to open the Traces detail view, which displays the input, output, and metadata for that run.
  • Input: The request text provided to the model using input tokens.
  • Output: The text output or response generated by the model using output tokens.
Plain text is the default display format. Enable JSON to view the complete request or response payload, including additional keys not visible in plain text. Content is read-only — select the Copy icon to copy to your clipboard.
In JSON mode, the model name includes the deployment version for open-source and fine-tuned models, and the connection name for external models.
Metadata
FieldDescription
Request IDUnique identifier for the trace.
Base ModelThe model used for the run.
Connection NameThe connection used to invoke the model.
Response TimeTime taken to generate the response.
Input TokensNumber of tokens in the input.
Output TokensNumber of tokens in the output.
Executed OnTimestamp of when the run occurred.
Source TypeThe type of source that triggered the run.
SourceThe specific source that triggered the run.