Category: I

Inference

Inference is the process of using a trained AI model to make predictions or outputs like: a chatbot generating a response, or a model labeling an image. It is the opposite of training, which is when the model is learning. Inference still needs GPUs because they can process data in parallel, making responses much faster.

Last updated: October 31, 2025

// Related Terms (I)

Instance Family

Grouping of instance types with similar characteristics. For GPUs: p-series (AWS performance), g-series (AWS graphics/ML), a2-series (GCP Accelerator-optimized).

Interruptible

A low-priority instance type where users set a bid price instead of paying a fixed rate. The instance may be …

← Back to Full Glossary