Managing Node Groups¶
What is a Node Group?¶
A Node Group is a logical collection of Kubernetes nodes that share common hardware characteristics, such as CPU and GPU manufacturer/model. Node groups enable administrators to:
- Target Workloads: Schedule specific applications (e.g., GPU-intensive AI jobs) to nodes with compatible hardware.
- Simplify Management: Apply policies, quotas, or updates to groups of nodes collectively.
- Optimize Costs: Isolate workloads to nodes with cost-effective hardware for their requirements.
Key Features¶
- Resource Allocation: Reserve nodes for critical workloads or teams.
- Scalability: Dynamically add/remove nodes as demand changes.
Node Group Operations¶
- Create a Node Group:
- Create a new node group to organize nodes with shared attributes (e.g., GPU type, memory capacity).
- Delete a Node Group:
- Remove a node group when it’s no longer needed. This does not delete the nodes themselves.
- Edit a Node Group:
- Modify group properties, such as adding or removing nodes from the node group, etc.