As a Senior Infrastructure Engineer, you will help design and scale the core systems that power Vast.ai’s global GPU marketplace. You’ll work closely with our founders and core engineering team to extend the underlying compute infrastructure — from GPU provisioning and scheduling to billing, orchestration, and marketplace dynamics. We’re looking for someone who has previously built large-scale infrastructure platforms — systems with similarities to Vast.ai, or distributed compute orchestration frameworks. Tech stack: Python, C++, PostgreSQL, Linux, Docker, KVM, Redis, Terraform, AWS, REST/gRPC APIs.
What You'll Do
Improve the backend systems that power Vast.ai’s compute marketplace
Integrate GPU provider onboarding, usage tracking, billing, and orchestration APIs
Develop scalable infrastructure for workload scheduling and resource management
Optimize pricing and marketplace logic for efficiency and transparency
Benchmark, profile, and harden systems for performance, reliability, and fault tolerance
Collaborate with product and infrastructure teams to shape the future of decentralized compute
What We're Looking For
Distributed Systems: Experience building high-throughput backend systems or compute clouds
Compute Orchestration: Familiarity with Docker, or custom scheduling frameworks
GPU Infrastructure: Understanding of GPU provisioning, driver management, and workload scheduling
Billing & Metering: Implemented or integrated usage-based billing and account credit systems
Marketplace Dynamics: Knowledge of dynamic pricing, spot instances, or supply-demand balancing mechanisms
Security & Multi-Tenancy: Experience designing secure, multi-tenant systems in cloud environments
Programming: Strong programming skills in Python and C++; ability to write performant, maintainable, well-architected code
Database Expertise: Comfortable designing schemas and queries for large-scale data systems (PostgreSQL preferred)
Nice to Have
Experience with GPU security, virtualization, or zero-trust compute isolation
Prior startup experience or end-to-end product ownership
Benefits
Comprehensive health, dental, vision, and life insurance
401(k) with company match
Meaningful early-stage equity
Onsite meals, snacks, and close collaboration with founders and tech leads
Ambitious, fast-paced startup culture where initiative is rewarded
AI compute platform connecting developers with high-performance GPU cloud resources.
Jobr aggregates jobs directly from company career portals — no middlemen. Our team applies on your behalf with AI-tailored resumes, reviewed by a human before submission.