Use Azure Batch to run large-scale parallel and high-performance computing (HPC) batch jobs efficiently in Azure.
Azure Batch does the following:
- Creates and manages a pool of compute nodes (virtual machines).
- Installs the applications you want to run.
- Schedules jobs to run on the nodes.
There is no cluster or job scheduler software to install, manage, or scale.
Use the Azure public cloud integration to discover the Azure Batch service and collect metrics from it.
Setup
To set up the Azure integration and discover the Azure service, go to Azure Integration Discovery Profile and select Batch Accounts.
Event support
- Supported
- Configurable in OpsRamp Azure Integration Discovery Profile.
Supported metrics
| OpsRamp Metric | Azure Metric | Metric Display Name | Unit | Aggregation Type | Description |
|---|---|---|---|---|---|
| azure_batch_accounts_CoreCount | CoreCount | Dedicated Core Count | Count | Total | Total number of dedicated cores in the batch account |
| azure_batch_accounts_CreatingNodeCount | CreatingNodeCount | Creating Node Count | Count | Total | Number of nodes being created |
| azure_batch_accounts_IdleNodeCount | IdleNodeCount | Idle Node Count | Count | Total | Number of idle nodes |
| azure_batch_accounts_JobDeleteCompleteEvent | JobDeleteCompleteEvent | Job Delete Complete Events | Count | Total | Total number of jobs that have been successfully deleted |
| azure_batch_accounts_JobDeleteStartEvent | JobDeleteStartEvent | Job Delete Start Events | Count | Total | Total number of jobs that have been requested to be deleted |
| azure_batch_accounts_JobDisableCompleteEvent | JobDisableCompleteEvent | Job Disable Complete Events | Count | Total | Total number of jobs that have been successfully disabled |
| azure_batch_accounts_JobDisableStartEvent | JobDisableStartEvent | Job Disable Start Events | Count | Total | Total number of jobs that have been requested to be disabled |
| azure_batch_accounts_JobStartEvent | JobStartEvent | Job Start Events | Count | Total | Total number of jobs that have been successfully started |
| azure_batch_accounts_JobTerminateCompleteEvent | JobTerminateCompleteEvent | Job Terminate Complete Events | Count | Total | Total number of jobs that have been successfully terminated |
| azure_batch_accounts_JobTerminateStartEvent | JobTerminateStartEvent | Job Terminate Start Events | Count | Total | Total number of jobs that have been requested to be terminated |
| azure_batch_accounts_LeavingPoolNodeCount | LeavingPoolNodeCount | Leaving Pool Node Count | Count | Total | Number of nodes leaving the pool |
| azure_batch_accounts_LowPriorityCoreCount | LowPriorityCoreCount | LowPriority Core Count | Count | Total | Total number of low-priority cores in the batch account |
| azure_batch_accounts_OfflineNodeCount | OfflineNodeCount | Offline Node Count | Count | Total | Number of offline nodes |
| azure_batch_accounts_PoolCreateEvent | PoolCreateEvent | Pool Create Events | Count | Total | Total number of pools that have been created |
| azure_batch_accounts_PoolDeleteCompleteEvent | PoolDeleteCompleteEvent | Pool Delete Complete Events | Count | Total | Total number of pool deletes that have completed |
| azure_batch_accounts_PoolDeleteStartEvent | PoolDeleteStartEvent | Pool Delete Start Events | Count | Total | Total number of pool deletes that have started |
| azure_batch_accounts_PoolResizeCompleteEvent | PoolResizeCompleteEvent | Pool Resize Complete Events | Count | Total | Total number of pool resizes that have completed |
| azure_batch_accounts_PoolResizeStartEvent | PoolResizeStartEvent | Pool Resize Start Events | Count | Total | Total number of pool resizes that have started |
| azure_batch_accounts_PreemptedNodeCount | PreemptedNodeCount | Preempted Node Count | Count | Total | Number of preempted nodes |
| azure_batch_accounts_RebootingNodeCount | RebootingNodeCount | Rebooting Node Count | Count | Total | Number of rebooting nodes |
| azure_batch_accounts_ReimagingNodeCount | ReimagingNodeCount | Reimaging Node Count | Count | Total | Number of reimaging nodes |
| azure_batch_accounts_RunningNodeCount | RunningNodeCount | Running Node Count | Count | Total | Number of running nodes |
| azure_batch_accounts_StartingNodeCount | StartingNodeCount | Starting Node Count | Count | Total | Number of nodes starting |
| azure_batch_accounts_StartTaskFailedNodeCount | StartTaskFailedNodeCount | Start Task Failed Node Count | Count | Total | Number of nodes where the Start Task has failed |
| azure_batch_accounts_TaskCompleteEvent | TaskCompleteEvent | Task Complete Events | Count | Total | Total number of tasks that have completed |
| azure_batch_accounts_TaskFailEvent | TaskFailEvent | Task Fail Events | Count | Total | Total number of tasks that have completed in a failed state |
| azure_batch_accounts_TaskStartEvent | TaskStartEvent | Task Start Events | Count | Total | Total number of tasks that have started |
| azure_batch_accounts_TotalLowPriorityNodeCount | TotalLowPriorityNodeCount | Low-Priority Node Count | Count | Total | Total number of low-priority nodes in the batch account |
| azure_batch_accounts_TotalNodeCount | TotalNodeCount | Dedicated Node Count | Count | Total | Total number of dedicated nodes in the batch account |
| azure_batch_accounts_UnusableNodeCount | UnusableNodeCount | Unusable Node Count | Count | Total | Number of unusable nodes |
| azure_batch_accounts_WaitingForStartTaskNodeCount | WaitingForStartTaskNodeCount | Waiting For Start Task Node Count | Count | Total | Number of nodes waiting for the Start Task to complete |
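
In every row of the table above, the OpsRamp metric name is the Azure metric name prefixed with `azure_batch_accounts_`. The following is a minimal sketch of that mapping, assuming the pattern holds for all Batch account metrics. The helper names `to_opsramp_metric` and `to_azure_metric` are hypothetical and are not part of any OpsRamp or Azure API.

```python
# Prefix observed on every OpsRamp metric name in the table above.
# Assumption: the convention applies to all Batch account metrics.
OPSRAMP_PREFIX = "azure_batch_accounts_"


def to_opsramp_metric(azure_metric: str) -> str:
    """Build the OpsRamp metric name from an Azure metric name (hypothetical helper)."""
    return OPSRAMP_PREFIX + azure_metric


def to_azure_metric(opsramp_metric: str) -> str:
    """Recover the Azure metric name from an OpsRamp metric name (hypothetical helper)."""
    if not opsramp_metric.startswith(OPSRAMP_PREFIX):
        raise ValueError(f"not a Batch account metric: {opsramp_metric!r}")
    return opsramp_metric[len(OPSRAMP_PREFIX):]


if __name__ == "__main__":
    print(to_opsramp_metric("TaskFailEvent"))
    print(to_azure_metric("azure_batch_accounts_IdleNodeCount"))
```

A helper like this can be handy when cross-referencing an alert on an OpsRamp metric with the corresponding metric in Azure Monitor.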