Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

Lakehouse Optimizer empowers users to monitor and improve their Lakehouse infrastructure by configuring incidents of interest that identify inefficiencies in cost, performance, and operational metrics. Incidents are displayed in the Incidents section of the app, providing actionable insights to optimize resource utilization and expenditures.

image-20250116-211206.png

Steps for Incident Configuration

  1. Under the Settings menu, select Settings / Incident Policies

    Incident Policies.png
    1. Incidents can be defined for Subscriptions, Workspaces, Workflows, All Purpose Compute, Delta Live Tables, SQL Warehouses, Pools, and Job Compute.

    2. Each incident created has its own incident policy

  2. Select the area of interest to create an incident under (ex. Workflow).

  3. Select category - Cost Control or Performance

  4. Select sub-category from dropdown menu (ex. Over Provisioning)

    Category and Sub Category.png


The following incidents are configurable in LHO:

Entity

Incident

Subscriptions

Monthly Cost above Threshold

Workspaces

Monthly Cost above Threshold

Workflows

Monthly Cost above Threshold

Workflows

  • Over-Provisioning

    • Cluster CPU

    • Driver Memory

    • Driver CPU

    • Driver Memory

Workflows

  • Under-Provisioning

    • Cluster CPU

    • Driver Memory

    • Driver CPU

    • Driver Memory

Workflows

  • Imbalanced-Provisioning

    • CPU

    • Memory

Workflows

Bad Skew

Workflows

Disk Spillage

Workflows

Run Failure

Workflows

Job with All Purpose Clusters

Delta Live Tables

Over Provisioning

  • Cluster CPU

  • Cluster Memory

Delta Live Tables

Under Provisioning

  • Cluster CPU

  • Cluster Memory

Delta Live Tables

Update Failure

Delta Live Tables

Monthly Cost above Threshold

All Purpose Clusters

Monthly Cost above Threshold

All Purpose Clusters

Auto Shutdown Timeout

  • Shutdown Timeout above Threshold

  • Shutdown Timeout Missing

All Purpose Clusters

Total Idle Time above Threshold

All Purpose Clusters

  • Over-Provisioning

    • Cluster CPU

    • Driver Memory

    • Driver CPU

    • Driver Memory

All Purpose Clusters

  • Under-Provisioning

    • Cluster CPU

    • Driver Memory

    • Driver CPU

    • Driver Memory

Pools

Auto Shutdown Timeout

  • Shutdown Timeout above Threshold

Related articles

  • No labels