Senior Software Engineer, Workflow Platforms
Bot Auto
Posted about 17 hours ago
Company Introduction
At Bot Auto, we are revolutionizing the transportation of goods with our cutting-edge autonomous trucks, enhancing the quality of life for communities around the globe. With the agility of a start-up and the wisdom of seasoned experts, Bot Auto boasts a team that has achieved numerous world-firsts and unparalleled innovations. United by a shared vision, we create miracles and propel the future of transportation. Join us and transform your dreams into reality.
We are seeking a highly skilled and motivated Senior Software Engineer, Workflow Platforms to architect, build, and operate the workflow orchestration platforms that power Bot Auto's engineering and autonomy workloads. From simulation and machine learning training to data pipelines and CI/CD, our teams depend on reliable, scalable workflow systems to move fast. In this role, you will own one or a hybrid of orchestration platforms (such as Argo Workflows and Airflow), operate them at scale, and develop the internal platforms and abstractions built on top of them that make running complex workloads simple, observable, and cost-efficient.
Key Responsibilities
- Architect, deploy, and operate workflow orchestration platforms (e.g., Argo Workflows, Airflow, or a hybrid) supporting simulation, machine learning and model training, data pipelines, CI/CD, and other general-purpose workloads.
- Build internal platforms, abstractions, SDKs, and self-service tooling on top of orchestration engines to make authoring, running, and monitoring workflows simple and reliable for engineers.
- Operate workflow platforms at scale on Kubernetes across cloud (AWS) and on-prem data center environments, handling scheduling, autoscaling, GPU and heterogeneous resources, and cross-cluster orchestration.
- Ensure reliability, performance, and cost efficiency of workloads through observability, queuing and prioritization, retries, and resource optimization.
- Partner with ML, simulation, data, and infrastructure teams to understand workload requirements and deliver fit-for-purpose pipelines.
- Integrate workflow platforms with storage, data streaming and event systems, artifact and model registries, and CI/CD tooling.
- Establish best practices, templates, and documentation for workflow authoring and operations; mentor engineers across the company.
- Handle user-impacting issues promptly with clear communication — mitigate in the short term and follow up with durable long-term solutions.
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience
- 5+ years of hands-on experience in platform engineering, infrastructure, DevOps, or SRE roles
- Significant experience with workflow orchestration platforms such as Argo Workflows, Airflow, or comparable systems
- Strong software development skills in one or more languages: Python, Go, Java, or JavaScript/TypeScript
- Solid understanding of Kubernetes and distributed systems
Preferred Qualifications
- Expert-level experience operating Argo Workflows, Airflow, and/or other engines (e.g., Prefect, Dagster, Temporal, Kubeflow Pipelines, Flyte)
- Experience orchestrating ML training, simulation, or large-scale data and batch workloads, including GPU scheduling
- In-depth Kubernetes experience (EKS, GKE, AKS, RKE2/Rancher) and cross-cluster orchestration
- IaC tools proficiency, including Terraform, Pulumi, OpenTofu, or Ansible
- Experience with data streaming and event platforms, including NATS JetStream, Kafka, Pulsar, or RabbitMQ
Job details
Jobr Assistant extension
Get the extension →