Platform Engineering - Managed Services & Infrastructure
- Own and operate customer-facing managed infrastructure including Refinery as a Service (RaaS) and Honeycomb Private Cloud (HnyPC) deployments across multiple AWS accounts and regions.
- Build and maintain Terraform modules, Helm charts, and deployment automation for provisioning and managing customer EKS clusters, collector pools, and Refinery instances.
- Design and implement monitoring, alerting, and observability for managed service infrastructure - using Honeycomb to monitor Honeycomb.
- Manage scaling, upgrades, and incident response for customer deployments, including capacity planning and cost optimization across AWS infrastructure.
- Building autonomous deployment and management tooling for field-operated managed services.
Technical Escalation & Unblocking
- Serve as the senior technical escalation point for our most challenging customer situations - production incidents, complex collector configurations, Refinery tuning, and architecture reviews that exceed the scope of standard technical roles.
- Diagnose and resolve deep infrastructure and observability issues spanning distributed systems, Kubernetes clusters, AWS networking (ALBs, PrivateLink, NLBs, VPCs), and polyglot service meshes.
- Partner directly with customer SRE, platform, and engineering teams to troubleshoot real-time production issues, often under time pressure and with direct revenue impact.
- Participate in an on-call rotation for managed services (Refinery as a Service, Honeycomb Private Cloud), providing Tier 2 escalation support for customer-facing infrastructure issues.
- Build and maintain SOPs, runbooks, and diagnostic frameworks that accelerate resolution for the broader field and support teams.
Open Source & Ecosystem
- Contribute to and maintain OpenTelemetry distributions, collectors, exporters, and instrumentation libraries that our customers depend on.
- Represent Honeycomb in the OpenTelemetry community - participating in SIGs, reviewing PRs, triaging issues, and driving adoption of best practices.
- Build reference architectures, sample collector configurations, and integration guides that demonstrate effective instrumentation patterns for common customer environments (Kubernetes, ECS, serverless).
- Identify gaps in the open source ecosystem that create friction for customers and either contribute fixes upstream or build bridging solutions.
- Contribute features and improvements to Honeycomb’s own open source projects (Refinery, Honeycomb Collector Distro) to support managed service capabilities.
Technical Backstop for the Field
- Be the person Solutions Architects call when a deal goes deeper than demo and design - you join calls to troubleshoot live production environments, validate architecture decisions, and provide the infrastructure credibility that closes technical evaluations.
- Tag-team with SAs on strategic accounts, owning the infrastructure and data pipeline conversations while they own the product narrative.
- Lead architecture reviews, SLO workshops, and instrumentation deep-dives for customers evaluating or expanding Honeycomb - especially in complex environments (multi-cluster Kubernetes, hybrid cloud, high-cardinality workloads).
- Step into customer-facing POCs and pilots as the hands-on technical lead, standing up collector pools, configuring Refinery pipelines, and proving out integrations in the customer’s actual environment.
- Create feedback loops between the field and product/engineering, surfacing patterns from customer environments that inform roadmap priorities.
Internal Tooling & Cross-Functional Partnership
Build internal tools and UIs that improve the operational efficiency of managed services - deployment dashboards, rule management interfaces, monitoring tooling.
Partner with Solutions Architecture, Customer Success, and Support to provide technical depth on complex accounts.
Collaborate with Product and Engineering on customer-impacting bugs, feature gaps, and integration challenges - bringing real-world production context.
Contribute to field enablement by training internal teams on advanced troubleshooting, collector configuration, Refinery internals, and emerging reliability patterns.
What you'll get when you join the Hive:
- A stake in our success - generous equity with employee-friendly stock program
- It’s not about how strong of a negotiator you are - our pay is based on transparent levels relative to experience
- Time to recharge with unlimited PTO
- A distributed-first mindset and culture (really!)
- Home office, co-working, and internet stipend
- Full benefits coverage for employees, with additional coverage available for dependents
- Up to 16 weeks of paid parental leave, regardless of path to parenthood
- Annual development allowance
- And much more...
- All communications will come from an @honeycomb.io email address
- We occasionally work with external recruiting agencies. These partners will use legitimate business email addresses—never personal accounts like Gmail or Yahoo.
- Our recruiting process will never ask you to provide financial or sensitive personal information, including but not limited to:
- Social security or tax identification numbers
- Credit card numbers
- Bank account information
Other open roles at Honeycomb.io(6)
Honeycomb is the observability platform built for AI-era software. Fast queries, unified telemetry, and LLM observability. Used by Slack, Intercom, and Dropbox.
Key team members

Yoz Grahame

Erwin van der Koogh

Wendy Amazon Smith

Kent Quirk
Jobr aggregates jobs directly from company career portals — no middlemen. Our team applies on your behalf with AI-tailored resumes, reviewed by a human before submission.