Metrics
Returns Prometheus-compatible metrics in text format for integration into your observability stack.
Authentication: Required - Uses API key authentication
Format: Returns text/plain in Prometheus exposition format
Common Use Cases:
- Export app stats to your observability provider (Grafana, Datadog, etc.)
- Track runner health and performance
- Set up alerts and monitoring
See Prometheus documentation for format details.
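As an illustration of consuming this format, here is a minimal sketch of a parser for simple sample lines. The sample input is the example response documented for this endpoint. The function name `parse_metrics` is hypothetical, and the sketch skips comment lines, ignores lines with timestamps, and does not unescape label values. For production use, an established Prometheus client library is likely a better fit.

```python
import re

# Sample text copied from the documented example response.
SAMPLE = (
    "# HELP fal_app_queue_size Current size of the fal app queue\n"
    "# TYPE fal_app_queue_size gauge\n"
    'fal_requests_total{application="my/app"} 10'
)

# Metric name, optional {labels}, then a single value token.
SAMPLE_LINE = re.compile(r'^([A-Za-z_:][A-Za-z0-9_:]*)(?:\{(.*)\})?\s+(\S+)$')
LABEL = re.compile(r'([A-Za-z_][A-Za-z0-9_]*)="((?:[^"\\]|\\.)*)"')


def parse_metrics(text: str) -> list[tuple[str, dict[str, str], float]]:
    """Parse sample lines into (metric name, labels, value) tuples."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # blank line, or a "# HELP" / "# TYPE" comment
        match = SAMPLE_LINE.match(line)
        if match is None:
            continue  # not a simple sample line; ignored in this sketch
        name, raw_labels, value = match.groups()
        labels = dict(LABEL.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


print(parse_metrics(SAMPLE))
```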
GET
Metrics
Authorizations
API key must be prefixed with "Key ", e.g. Authorization: Key YOUR_API_KEY
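A minimal sketch of building an authenticated request with Python's standard library. The URL is a placeholder, not the real endpoint path, and `build_metrics_request` is a hypothetical helper. The request is constructed here but not sent.

```python
import urllib.request

# Placeholder URL (assumption): replace with the actual metrics endpoint.
METRICS_URL = "https://example.invalid/metrics"


def build_metrics_request(api_key: str, url: str = METRICS_URL) -> urllib.request.Request:
    """Build a GET request carrying the API key in the documented header format."""
    # The key must be prefixed with "Key " (including the trailing space).
    headers = {"Authorization": f"Key {api_key}"}
    return urllib.request.Request(url, headers=headers, method="GET")


req = build_metrics_request("YOUR_API_KEY")
print(req.get_method(), req.get_header("Authorization"))
```

To send it, pass `req` to `urllib.request.urlopen` and decode the response body as text.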
Response
Prometheus-compatible metrics retrieved successfully
Prometheus-compatible metrics in text format
Example:
"# HELP fal_app_queue_size Current size of the fal app queue\n# TYPE fal_app_queue_size gauge\nfal_requests_total{application=\"my/app\"} 10"
Analytics
Time-bucketed metrics for your serverless app endpoints, including request counts,
success/error rates, and latency percentiles across all inbound traffic.
`prepare_duration` reflects queue/prepare time before execution;
`duration` is request execution time.
This endpoint shows all inbound requests to endpoints you own, not just
your own calls. This makes it well suited to monitoring your deployed apps,
tracking SLAs, and exporting data to tools like BigQuery or Grafana. You must
own all requested endpoints; otherwise the request returns 403.
**Metric Selection:**
You must specify which metrics to include using the `expand` query
parameter. Only requested metrics will be populated in the response,
allowing you to optimize query performance and data transfer.
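A minimal sketch of selecting metrics through `expand`. The URL is a placeholder, `build_analytics_url` is a hypothetical helper, and repeating `expand` once per metric is an assumption: the API may instead expect a single comma-separated value.

```python
from urllib.parse import urlencode

# Placeholder URL (assumption): replace with the actual analytics endpoint.
ANALYTICS_URL = "https://example.invalid/analytics"


def build_analytics_url(metrics: list[str], base_url: str = ANALYTICS_URL) -> str:
    """Append one `expand` query parameter per requested metric."""
    # Assumption: `expand` is repeated per value rather than comma-separated.
    query = urlencode({"expand": metrics}, doseq=True)
    return f"{base_url}?{query}"


print(build_analytics_url(["request_count", "p90_duration"]))
```

Requesting only the metrics you need keeps the response small, since unrequested metrics are not populated.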
**Available Metrics:**
- `request_count`: Total number of requests in the time bucket
- `success_count`: Number of successful requests (2xx responses)
- `user_error_count`: Number of user errors (4xx responses)
- `error_count`: Number of server errors (5xx responses)
- `p50_prepare_duration`: 50th percentile queue/prepare time
- `p75_prepare_duration`: 75th percentile queue/prepare time
- `p90_prepare_duration`: 90th percentile queue/prepare time
- `p50_duration`: 50th percentile request execution duration
- `p75_duration`: 75th percentile request execution duration
- `p90_duration`: 90th percentile request execution duration
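The count metrics above can be combined into rates. The following is a minimal sketch that assumes each time bucket arrives as a dictionary keyed by metric name; that response shape and the helper name `bucket_rates` are assumptions. A rate is reported as `None` when its count was not requested via `expand` or the bucket has no requests.

```python
from typing import Optional


def bucket_rates(bucket: dict) -> dict[str, Optional[float]]:
    """Derive success and error rates from the count metrics of one bucket."""

    def rate(key: str) -> Optional[float]:
        total = bucket.get("request_count")
        count = bucket.get(key)
        if not total or count is None:
            return None  # metric not requested, or an empty bucket
        return count / total

    return {
        "success_rate": rate("success_count"),        # 2xx responses
        "user_error_rate": rate("user_error_count"),  # 4xx responses
        "error_rate": rate("error_count"),            # 5xx responses
    }


# Illustrative numbers, not real data.
example_bucket = {
    "request_count": 200,
    "success_count": 190,
    "user_error_count": 6,
    "error_count": 4,
}
print(bucket_rates(example_bucket))
```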
**Key Features:**
- See all traffic to your apps across all callers
- Selective metric inclusion via expand parameter
- Performance metrics (latency percentiles, duration stats)
- Reliability metrics (success/error rates, request counts)
- Time-bucketed data for trend analysis
- Flexible date range and timeframe options
**Common Use Cases:**
- Monitor your serverless app performance and reliability
- Export analytics to your own observability tools
- Analyze latency trends across all callers
- Track error rates and SLA compliance
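For the export use case, one option is to flatten the buckets into CSV, which many tools can load. This is a minimal sketch: the `bucket_start` field name and the list-of-dictionaries response shape are assumptions, and `buckets_to_csv` is a hypothetical helper. Metrics that were not requested are written as empty cells.

```python
import csv
import io

# "bucket_start" is an assumed field name for the bucket timestamp.
COLUMNS = ["bucket_start", "request_count", "error_count", "p90_duration"]


def buckets_to_csv(buckets: list[dict]) -> str:
    """Serialize time buckets to CSV text, one row per bucket."""
    out = io.StringIO()
    writer = csv.DictWriter(
        out,
        fieldnames=COLUMNS,
        restval="",             # missing metrics become empty cells
        extrasaction="ignore",  # drop fields not listed in COLUMNS
        lineterminator="\n",
    )
    writer.writeheader()
    writer.writerows(buckets)
    return out.getvalue()


# Illustrative values, not real data.
example_buckets = [
    {"bucket_start": "2024-01-01T00:00:00Z", "request_count": 120,
     "error_count": 2, "p90_duration": 1.8},
    {"bucket_start": "2024-01-01T01:00:00Z", "request_count": 80},
]
print(buckets_to_csv(example_buckets))
```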