Prometheus metrics export
Metarank exports a set of internal metrics you can use to monitor its health. See the /metrics
endpoint description for details on how to access them.
Application metrics
All application metrics have a common metarank_
prefix:
metarank_rank_requests
: counter, number of requests received by the/rank
endpoint. This metric also counts requests by model name.
metarank_feedback_events
: counter, number of feedback events received both from API and any other connector (like kafka/pulsar/kinesis).
metarank_rank_latency_seconds
, histogram, latency distribution for/rank
requests, scoped by a model. Percentiles tracked: 50%, 80%, 90%, 95%, 98%, 99%.
JVM metrics
Metarank also exports a set of default JVM metrics, related to buffers, classloaders, GC, allocation and threadpools.
All the JVM metrics have common jvm_
prefix.
Grafana dashboard
coming soon.
Last updated