Jobs View Overview
The Jobs view provides a comprehensive list of all batch jobs processed by Memory Machine Batch, allowing you to monitor their status, track details, and filter records for specific inquiries. This view is essential for detailed operational oversight and troubleshooting.
The sections below describe what each part of the view shows and what you can do with it.
Action Bar
Located above the filters, this bar provides quick actions:
- Export to CSV: Click this button to download the currently filtered and displayed job data as a .csv (Comma Separated Values) file. This allows for offline analysis in spreadsheet programs.
- Refresh: Click to update the Jobs table with the latest available data.
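As an illustration of the offline analysis an export makes possible, the following minimal sketch counts jobs per status using only the Python standard library. The header names (ID, Queue Name, Status, Created), the timestamp format, and the sample rows are assumptions made for illustration; check them against the header row of your actual export.

```python
import csv
import io
from collections import Counter

# Hypothetical sample of an exported file. The header names are ASSUMED to
# match the Jobs table columns; verify them against your real export.
SAMPLE_EXPORT = """\
ID,Queue Name,Status,Created
job-001,default,Succeeded,2024-05-01T09:15:00
job-002,default,Failed,2024-05-01T10:40:00
job-003,gpu,Running,2024-05-02T08:05:00
job-004,gpu,Succeeded,2024-05-03T14:30:00
"""


def count_by_status(csv_text, status_column="Status"):
    """Return a Counter mapping each status value to the number of jobs."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row[status_column] for row in reader)


if __name__ == "__main__":
    # Print statuses from most to least frequent.
    for status, count in count_by_status(SAMPLE_EXPORT).most_common():
        print(f"{status}: {count}")
```

To run this against a real export, read the downloaded file into a string (for example with `open(path, newline="").read()`) and pass it to `count_by_status`, adjusting `status_column` if the header differs.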
Filtering and Search Options
This section enables you to narrow down the list of jobs displayed in the table:
- Select Queue: A dropdown menu that allows you to filter jobs by their assigned queue name.
- Select Status: A dropdown menu to filter jobs based on their current status:
  - Creating: The job is in the process of being created.
  - Created: The job has been created.
  - Running: The job is currently being run.
  - Failed: The job has failed.
  - Succeeded: The job has successfully run and is now complete.
  - Restoring: The job is currently restoring from a checkpoint.
  - Restore Succeeded: The restore for the job has succeeded and the job continues to run.
  - Restore Failed: The restore for this job has failed.
  - Stopped: The job has stopped running.
  - Checkpointing: The job is currently having a checkpoint generated for it.
  - Checkpoint Succeeded: The checkpoint for the job has succeeded.
  - Checkpoint Failed: The checkpoint for the job has failed.
  - Volume Unready: Managed EBS is enabled for the job, but the volume was not available when the job attempted to start.
  - Restore Volume Unready: Managed EBS is enabled for the job, but the volume could not be found during restore.
- Select Spot Protection Failures: A dropdown filter specifically for identifying jobs that experienced failures related to spot instance protection.
- Created From / Created To: Date selectors that enable you to filter jobs based on their creation date within a specified range.
- Search Jobs: An input field where you can type keywords (e.g., a Job ID, a portion of a name) to quickly find matching jobs. Click the magnifying glass icon or press Enter to initiate the search.
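The same filters can be approximated on exported data. The sketch below is not the product's implementation: the field names, the ISO 8601 timestamp format, the inclusive date range, and the case-insensitive substring search are all assumptions made for illustration, and `filter_jobs` is a hypothetical helper.

```python
from datetime import date, datetime


def filter_jobs(rows, queue=None, status=None, created_from=None,
                created_to=None, keyword=None):
    """Return the rows that match every filter given; None means no filter."""
    matches = []
    for row in rows:
        if queue is not None and row["Queue Name"] != queue:
            continue
        if status is not None and row["Status"] != status:
            continue
        # ASSUMPTION: "Created" is an ISO 8601 timestamp; range is inclusive.
        created = datetime.fromisoformat(row["Created"]).date()
        if created_from is not None and created < created_from:
            continue
        if created_to is not None and created > created_to:
            continue
        if keyword is not None:
            # ASSUMPTION: search is a case-insensitive substring match
            # across all fields of the row.
            haystack = " ".join(str(value) for value in row.values()).lower()
            if keyword.lower() not in haystack:
                continue
        matches.append(row)
    return matches


# Hypothetical rows, shaped like csv.DictReader output for an export.
JOBS = [
    {"ID": "job-001", "Queue Name": "default", "Status": "Succeeded",
     "Created": "2024-05-01T09:15:00"},
    {"ID": "job-002", "Queue Name": "default", "Status": "Failed",
     "Created": "2024-05-01T10:40:00"},
    {"ID": "job-003", "Queue Name": "gpu", "Status": "Running",
     "Created": "2024-05-02T08:05:00"},
    {"ID": "job-004", "Queue Name": "gpu", "Status": "Succeeded",
     "Created": "2024-05-03T14:30:00"},
]

if __name__ == "__main__":
    failed = filter_jobs(JOBS, queue="default", status="Failed")
    print([job["ID"] for job in failed])
```

Filters combine with AND semantics here, so passing both `queue` and `status` returns only jobs that satisfy both conditions.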
Jobs Table
This is the main area where job details are displayed. Each row represents a single job, and the columns provide various pieces of information:
- ID: The unique identifier for each job.
- Queue Name: The name of the queue to which the job belongs.
- Status: The current operational status of the job (e.g., Running, Succeeded, Failed).
- Node ID: The identifier of the compute node where the job is or was running.
- Container ID: The identifier of the container in which the job's workload is executed.
- Spot Protections: Indicates whether spot protection was applied to the job. For a job where spot protection was engaged, click the Spot Protections area to open another screen that shows more detailed information.
- Batch Job IDs: Identifiers for underlying batch jobs, if applicable.
- Max Disk Used: The maximum amount of disk space consumed by the job during its execution.
- Created: The date and time when the job was initially created.
- Updated: The date and time when the job's status or details were last updated.
- Events: Click the view icon to see a report of the job's events, showing each job status and the timestamp of the corresponding status change, as in the image below: