How-to: Verify InferenceServices and GPU usage
This documentation covers the current Innovation Release of EDB Postgres AI. See also:
- Hybrid Manager dual release strategy
- Documentation for the current Long-term support release
Prerequisite: Access to the Hybrid Manager UI with AI Factory enabled. See AI Factory in Hybrid Manager.
Use this guide to confirm the correct deployment and operational status of InferenceServices and GPU resource usage.
Goal
Ensure your deployed InferenceServices are correctly utilizing GPU resources.
Estimated time
5–10 minutes.
Steps
Check InferenceService status
kubectl get inferenceservice -n <namespace>
Check that the READY column reports True for each InferenceService before sending traffic to it.
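The readiness check above can also be scripted. This is a minimal sketch, assuming the JSON layout returned by `kubectl get inferenceservice -o json`, where KServe records a `Ready` condition under `.status.conditions`; the sample object and service name here are illustrative, not from a real cluster.

```python
def is_ready(inference_service: dict) -> bool:
    """Return True if the InferenceService reports a Ready=True condition."""
    conditions = inference_service.get("status", {}).get("conditions", [])
    return any(
        c.get("type") == "Ready" and c.get("status") == "True"
        for c in conditions
    )

# Hypothetical InferenceService object, shaped like kubectl's JSON output.
svc = {
    "metadata": {"name": "sklearn-iris"},
    "status": {"conditions": [{"type": "Ready", "status": "True"}]},
}
print(is_ready(svc))  # True
```

In practice you would feed this function each entry of the `items` array from the `kubectl get inferenceservice -n <namespace> -o json` output.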
Confirm GPU resource usage
Check GPU capacity and allocation on the cluster nodes:
kubectl describe nodes | grep nvidia.com/gpu
Then confirm the GPU is visible from inside the serving pod:
kubectl exec -n <namespace> -it <pod-name> -- nvidia-smi
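If you prefer a scripted check of node GPU capacity, the following is a small sketch, assuming the node JSON returned by `kubectl get nodes -o json`, where GPU counts appear as the `nvidia.com/gpu` key under `.status.allocatable`; the node object below is a hypothetical example.

```python
def gpu_allocatable(node: dict) -> int:
    """Return the number of allocatable nvidia.com/gpu on a node (0 if none)."""
    allocatable = node.get("status", {}).get("allocatable", {})
    return int(allocatable.get("nvidia.com/gpu", 0))

# Hypothetical node object, shaped like kubectl's JSON output.
node = {
    "metadata": {"name": "gpu-node-1"},
    "status": {"allocatable": {"cpu": "16", "nvidia.com/gpu": "2"}},
}
print(gpu_allocatable(node))  # 2
```

A node that reports 0 here either has no GPUs or is missing the NVIDIA device plugin, which the troubleshooting step below checks.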
Troubleshoot common issues
Verify the NVIDIA device plugin DaemonSet is running, since nodes only advertise nvidia.com/gpu when it is:
kubectl get ds -n kube-system nvidia-device-plugin-daemonset
Inspect pod events for scheduling or image-pull errors:
kubectl describe pods -n <namespace>
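A common failure mode is a pod stuck in Pending with a FailedScheduling event such as "Insufficient nvidia.com/gpu". As a sketch, the helper below filters pod events (assuming the event list shape from `kubectl get events -o json`) for that pattern; the sample events are illustrative.

```python
def gpu_scheduling_failures(events: list) -> list:
    """Return messages of FailedScheduling events that mention a GPU shortage."""
    return [
        e.get("message", "")
        for e in events
        if e.get("reason") == "FailedScheduling"
        and "nvidia.com/gpu" in e.get("message", "")
    ]

# Hypothetical events, shaped like kubectl's JSON output.
events = [
    {"reason": "FailedScheduling",
     "message": "0/3 nodes are available: 3 Insufficient nvidia.com/gpu."},
    {"reason": "Pulled", "message": "Container image already present"},
]
print(gpu_scheduling_failures(events))
```

If this returns any messages, the pod is requesting more GPUs than any node can offer, or the device plugin DaemonSet is not advertising them.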