Skip to content

mazzei-dataiku/plugin-k8s_monitoring

Repository files navigation

Kubernetes Monitoring Plugin

  • Author: Stephen Mazzei
  • Organization: Dataiku
  • Last Update: 2024-08-22

Description

  • This plugin contains 2 key components:
    • K8S Poll-Data: This is the macro that runs all the Cloud Provider utilities and Kubernetes API to gather all the raw metrics (Required)
    • Cleanse K8S Data: This is the python recipe that cleans the base data into a cleanse state (Not Required)

How To

Poll Data

  1. Download/Install the plugin

  2. Create a new Dataiku Project (Code is written for UIF enabled or disabled)

  3. Create a new scenario

    1. Name = "Poll K8S Data"
    2. Trigger = Time-based, every 5 minutes
    3. Steps = Execute Macro, "Poll K8S Data"
      • Cluster Name
      • Cluster Type
      • Cloud Provider Information
        • This information will vary per AWS | Azure | GCP
      • Folder Connection Name (Example S3 Connection: my-bucket-value) (Local/Cloud)
      • Folder Name
    4. Run Scenario
  4. Update the new folder in the flow for partitioning

    1. Add 2 "Dimensions" partions

    Example

Cleanse Data

  1. From Recipe dropdown in flow, select "Kubernetes Monitoring"
  2. Select Cleanse K8S Data
  3. Select the Raw folder for input, and create a new folder for output
  4. NOTE Under the "Advanced Tab" you may need to disable "Container Configuration" depending on the DSS Setup
  5. Run
  6. Create a new scenario
    1. Name = "Cleanse K8S Data"
    2. Trigger = Time-based, every hour
    3. Steps:
      1. Build "Cleanse Folder"
      2. Macro -- "Clear Scenario Run Logs" -- keep only the last 2/3 days (Project creates a lot of logs)

Review

  • From either folder, you can create datasets based off the folder paths.

Example

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages