rate(). Gauge goes up/down - memory, queue depth. Histogram buckets latency/size - compute quantiles with recording rules or Grafana; beware histogram bucket cardinality.
rate() needs range ≥2 scrape intervals; too-short windows are noisy. Counter resets (pod restart) handled by Prometheus. irate is more volatile for dashboards; prefer rate for alerting stability. Alignment of scrape and eval windows matters.
trace_id for drill-down. Unified query UI (Grafana) or jump links from alert to trace. Reduces mean time to resolution.