Crusoe logo

Senior Staff Network Engineer, Automation

Crusoe

Posted about 6 hours ago

Crusoe is on a mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company built from the ground up, we own and operate each layer of the stack — from electrons to tokens — to power the world's most ambitious AI workloads. When you join Crusoe, you join a team that is building the future, faster.

We're in the midst of the greatest industrial revolution of our time. The demand for AI compute is boundless, and power is a bottleneck. We're solving that — with an energy-first approach that makes AI infrastructure better for the world and faster for the people innovating with AI.

We're looking for problem-solving, opportunity-finding teammates with a sense of urgency, who believe in the scale of our ambition and thrive on a path not fully paved — people who want to grow their careers alongside a team of experts across energy, manufacturing, data center construction, and cloud services.

If you want to do the most meaningful work of your career, help our customers and partners advance their AI strategies, and be part of a high-performing team that believes in each other, come build with us at Crusoe.

About this Role

Crusoe is seeking a Senior Staff Network Automation Engineer to own how our network monitors, configures, and heals itself without human intervention. You will design and ship the automation frameworks, self-healing workflows, and observability platforms that allow our global network fleet to scale sublinearly with headcount.

This is a senior technical leadership role at the intersection of Network Engineering, Software Engineering, and Infrastructure Reliability. You will partner with Network Architecture to translate intent-based designs into production automation, mentor a team of Staff and Senior engineers, and represent the automation org in cross-functional planning with Deployment, Operations, and Site Reliability. You're not writing scripts to replace manual tasks. You're building the control plane that makes the network intelligent.

What You'll Be Working On

  • Own the Network Automation Platform: Define the technical roadmap for Crusoe's automation stack, from source of truth and config generation through day-2 operations, telemetry, and closed-loop remediation across our global fleet.

  • Build the Source of Truth: Design and own the authoritative data model (NetBox, Nautobot, or equivalent) that drives all network configuration, validation, and operational state across teams.

  • Architect Intent-Based Configuration Systems: Lead the design and delivery of declarative, model-driven configuration pipelines using Python, Nornir, Ansible, Jinja2, and CI/CD, treating the network as code and making configuration drift impossible.

  • Drive Model-Driven Automation: Own Crusoe's gNMI, OpenConfig, and NETCONF/YANG strategy for telemetry collection, configuration management, and state validation across multi-vendor fabrics (Arista, Juniper, NVIDIA/Mellanox).

  • Build Self-Healing Workflows: Design and ship event-driven, auto-remediation systems that detect faults, correlate telemetry, and resolve known failure modes without human escalation.

  • Define the Observability Platform: Set the technical direction for Crusoe's telemetry, metrics, alerting, and dashboarding stack including Prometheus, Grafana, and streaming gNMI.

  • Influence Architecture for Automability: Partner with Network Architecture to ensure designs are automation-first from day one, deployable, validatable, and operable programmatically at scale.

  • Mentor and Multiply: Provide technical guidance to Staff and Senior engineers. Drive code reviews, design reviews, and platform architecture decisions that raise the engineering bar across the org.

What You'll Bring to the Team

  • 12+ years of network engineering experience with a demonstrated focus on production network automation, platform engineering, and infrastructure as code in hyperscale or internet-scale environments.

  • Demonstrated Technical Leadership: Proven track record of designing and shipping network automation platforms used by a broader engineering organization, not scripts, but systems others build on.

  • Production-Quality Software Engineering: Mastery of Python or Go at a platform level, testable, CI/CD-integrated, and production-owned. You think like a software engineer who deeply understands networking.

  • Model-Driven Automation Fluency: Deep hands-on experience with gNMI, OpenConfig, NETCONF, and YANG-modeled configuration and telemetry. You understand why model-driven automation is the only path to hyperscale.

  • Source of Truth Ownership: You have designed or owned a network source of truth platform (NetBox, Nautobot, or equivalent) end to end, including the data model, integrations, and CI/CD pipelines that consume it.

  • Network Domain Depth: Strong hands-on expertise with Arista (EOS), Juniper (Junos), and NVIDIA/Mellanox platforms in leaf-spine architectures. Solid knowledge of BGP, EVPN-VXLAN, and LLDP at DC fabric scale.

  • Event-Driven and Self-Healing Systems: Track record of building auto-remediation and closed-loop automation that detects, correlates, and resolves faults without human intervention.

  • Observability Expertise: Hands-on experience building streaming telemetry and observability platforms using gNMI collectors, Prometheus, Grafana, and equivalent tooling at fleet scale.

  • Hyperscale Operational Context: Comfort operating at scale across 10K+ network devices and multi-region fabric fleets, where the blast radius of a bug is measured in racks, not ports.

  • Education: Bachelor's degree in Computer Science, Electrical Engineering, or a related field, or equivalent practical experience in hyperscale or internet-scale environments

Benefits:

  • Competitive compensation and equity packages

  • Restricted Stock Units

  • Paid time off, paid holidays & leave of absence programs

  • Comprehensive health, dental & vision insurance

  • Employer contributions to HSA account

  • Paid parental leave

  • Paid life insurance, short-term and long-term disability

  • Professional development & tuition reimbursement

  • Mental health & wellness support

  • Commuter benefits (parking & transit)

  • Cell phone stipend

  • 401(k) Retirement plan with company match up to 4% of salary

  • Volunteer time off

  • Global travel insurance & emergency assistance

  • Daily meals allowance

  • Additional perks & programs specific to location

Compensation Range

Compensation will be paid in the range of up to $245,000 -$295,000 + Bonus. Restricted Stock Units are included in all offers.

Want to see the full job description?

Sign in to view the complete details and apply to this position.

Job details

Workplace

Office

Location

US

Experience

SE

Salary

245k - 295k USD

per year

Similar

Jobr Assistant extension

Get the extension →