Incidents (AWS)

 

Incidents Dashboard

Incidents view provides critical information regarding the state of your Lakehouse infrastructure with regard to cost, performance and operational data.

Incidents are computed daily or on a different schedule basis.

Incidents main view

 

Incidents Taxonomy

Incidents are grouped into two major categories: Cost and Performance.

 

Performance Incidents

  • All Purpose Clusters

    • Auto Shutdown Timeout

 

 

Incidents per Workload

Incidents are also exposed for each job run id and also for the entire reporting period.

Incidents per Run or per Period

Workload Incidents per Task Run

 

Reported Incidents per Period for a selected Workflow

 

Incidents Policies

Policies enables the user to customize multiple rules used to generate Incidents for different assets (workloads, clusters, notebooks, pools).

Policies are at the tenant level.