< img height="1" width="1" style="display:none;" alt="" src="https://px.ads.linkedin.com/collect/?pid=3131724&fmt=gif" />
Last updated:

    Cluster Administration

    In Kube AI Hub, you set a cluster's configurations and configure its features using the interactive web console or the built-in native command-line tool kubectl. As a cluster administrator, you are responsible for a series of tasks, including cordoning and adding labels to nodes, controlling cluster visibility, monitoring cluster status, setting cluster-wide alerting and notification rules, as well as configuring storage and log collection solutions.

    Note

    Multi-cluster management is not covered in this chapter. For more information about this feature, see Multi-cluster Management.

    Kube AI Hub Log Dashboard

    Learn how to enable log dashboard, a graphical interface tool similar to ElasticSearch Kibana.

    Node Management

    Monitor node status and learn how to add node labels or taints.

    GPU Card Management

    Learn how to view and manage GPU card resources in Kube AI Hub clusters.

    GPU Virtualization Mode

    Learn how to configure GPU virtualization modes in Kube AI Hub for GPU sharing and slicing.

    Cluster Overview

    Learn about the cards, quick actions, and initialization states shown on the cluster overview page.

    Application Resources Monitoring

    Monitor application resources across the cluster, such as the number of Deployments and CPU usage of different projects.

    Cluster-wide Alerting and Notification

    Alertmanager in Kube AI Hub

    Learn how to manage alerts with Alertmanager in Kube AI Hub.

    Alerting Rule Groups (Node Level)

    Learn how to set alerting rule groups for nodes.

    Alerting Messages (Node Level)

    Learn how to view alerting messages for nodes.

    Cluster Settings

    Cluster Visibility and Authorization

    Learn how to configure cluster visibility and authorization, and how to open the settings from the cluster overview page.

    Log Receivers

    Introduction

    Learn the basics of cluster log receivers, including tools, and general steps.

    Add Elasticsearch as a Receiver

    Learn how to add Elasticsearch to receive container logs, resource events, or audit logs.

    Add Kafka as a Receiver

    Learn how to add Kafka to receive container logs, resource events, or audit logs.

    Add Fluentd as a Receiver

    Learn how to add Fluentd to receive logs, events or audit logs.

    Add OpenSearch as a Log Receiver

    Learn how to add OpenSearch to receive container logs, resource events, or audit logs.

    Cluster Gateway

    Learn how to create a cluster-scope gateway on Kube AI Hub.

    Notification Management

    Customize Cluster Name in Notification Messages

    Learn how to customize cluster name in notification messages sent by Kube AI Hub.

    Configure Email Notifications

    Configure a email server and add recipients to receive email notifications.

    Configure DingTalk Notifications

    Learn how to configure a Dingtalk conversation or chatbot to receive platform notifications sent by Kube AI Hub.

    Configure WeCom Notifications

    Learn how to configure a WeCom server to receive platform notifications sent by Kube AI Hub.

    Configure Slack Notifications

    Configure Slack notifications and add channels to receive notifications from alerting policies, kube-events, and kube-auditing.

    Configure Webhook Notifications

    Configure a webhook server to receive platform notifications through the webhook.

    Cluster Shutdown and Restart

    Learn how to gracefully shut down your cluster and restart it.

    Storage Classes

    Learn basic concepts of PVs, PVCs,and storage classes, and demonstrate how to manage storage classes on Kube AI Hub.

    Volume Snapshot Classes

    Learn how to manage snapshot classes on Kube AI Hub.