AWS Batch

  • Run batch jobs as Docker images (which you provide)
  • For any non-ETL work, Batch is probably a better fit than AWS Glue
  • Dynamic provisioning of the instances (EC2 & Spot Instances)
  • Batch chooses the optimal quantity and type of instances based on job volume and resource requirements
  • No need to manage clusters; Batch provisions and scales the compute for you
  • You pay only for the underlying EC2 instances
  • Schedule Batch jobs using CloudWatch Events (now Amazon EventBridge)
  • Orchestrate Batch Jobs using AWS Step Functions
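
To make the Step Functions bullet concrete, here is a minimal sketch of a state machine definition that submits one Batch job and waits for it to finish. It only builds the JSON and does not call AWS. The function name build_batch_state_machine, the job name, and the ARNs (account ID, region, queue, and job definition) are hypothetical placeholders, not values from these notes.

```python
import json


def build_batch_state_machine(job_name: str, job_queue_arn: str, job_definition_arn: str) -> dict:
    """Return an Amazon States Language definition that runs one Batch job."""
    return {
        "Comment": "Run a single AWS Batch job and wait for completion",
        "StartAt": "RunBatchJob",
        "States": {
            "RunBatchJob": {
                "Type": "Task",
                # The .sync suffix asks Step Functions to wait until the job finishes.
                "Resource": "arn:aws:states:::batch:submitJob.sync",
                "Parameters": {
                    "JobName": job_name,
                    "JobQueue": job_queue_arn,
                    "JobDefinition": job_definition_arn,
                },
                "End": True,
            }
        },
    }


# Placeholder values: the account ID, region, and resource names are made up.
definition = build_batch_state_machine(
    job_name="nightly-report",
    job_queue_arn="arn:aws:batch:us-east-1:123456789012:job-queue/example-queue",
    job_definition_arn="arn:aws:batch:us-east-1:123456789012:job-definition/example-job:1",
)
definition_json = json.dumps(definition, indent=2)
print(definition_json)
```

The resulting JSON could be pasted into a state machine definition. For the scheduling bullet, an EventBridge rule with a schedule expression such as rate(1 hour) could target either the Batch job queue directly or a state machine like this one.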