
Location
New York City or Remote
Employment Type
Full-time
Location Type
On-site, Hybrid, or Remote
Department
Product & Engineering
Compensation
• Competitive base salary
• Equity
About Evergrid
Evergrid builds the infrastructure that powers advanced artificial intelligence at scale, with the Most Advanced Neocloud. We design and operate critical GPU and power infrastructure for frontier AI workloads — environments where performance, reliability, and execution are non-negotiable. Our customers are building systems at the edge of what’s possible, and they depend on infrastructure that scales quickly and works under sustained pressure. We care deeply about outcomes, ownership, and building durable systems and long-term partnerships.
The Role
As a Senior Frontend Engineer (AI Cloud Platform), you will architect the intelligent control plane for Evergrid’s Neocloud. You will be responsible for the "Service Portal" and "Platform Admin Console," transforming complex infrastructure data into a seamless, intuitive user experience. This role sits at the intersection of high-performance computing and modern product design. You will build the interface that allows researchers and engineers to provision massive GPU clusters, manage distributed training jobs, and visualize the real-time pulse of their AI infrastructure. We are looking for a product-minded engineer who can translate kernel-level telemetry and massive-scale billing data into a clean, "AI-Native" console that feels as responsive as a consumer app.
What You’ll Do
Core Console Architecture
Architect and build the unified web console that serves as the "Mission Control" for our users, enabling them to manage tenancy, identity, and compute resources from a single pane of glass.
Develop complex provisioning workflows that simplify the deployment of high-density resources (e.g., bare-metal MI300X clusters, H100 nodes) into single-click experiences.
Implement real-time state management to handle asynchronous infrastructure events, ensuring the UI always reflects the true state of provisioning, scaling, and recovery.
Data Visualization & Analytics
Build high-performance dashboards to visualize granular GPU usage metrics, thermal health, and cluster load without performance lag.
Create interactive financial visualizations for the "Revenue Center," allowing users to drill down into compute, storage, and network costs across different zones and regions.
Design visual topologies that show users exactly how their workloads are distributed across nodes, racks, and high-performance interconnects.
Interactive Compute Environments
Engineer the frontend experience for "Notebooks as a Service," creating a robust wrapper for launching and managing data science environments directly in the browser.
Build low-latency, web-based interfaces for "Deployment as a Service," simplifying the complexity of container image selection, replica scaling, and inference configuration.
Optimize the UI for high-frequency updates, ensuring that "AI Services" metrics (like model training progress or inference latency) are displayed in real time.
What We’re Looking For
5+ years of experience building complex, data-heavy single-page applications (SPAs) using React and TypeScript.
Deep proficiency with Next.js for handling hybrid rendering strategies (SSR/CSR) and performance optimization.
Strong skills in data visualization using libraries like D3.js, Recharts, or Visx to render complex time-series data and resource topologies.
Experience building "Developer Tools" or infrastructure dashboards—you understand the difference between a marketing site and a mission-critical console.
Experience using AI tools in your workflow to accelerate prototyping and development.
A "User-First" approach to complexity: You can take intricate concepts like "Virtual GPUs" and "Kubernetes Schedulers" and abstract them into simple, elegant UI components.
Nice to Have
Experience designing UIs for cloud platforms (AWS, GCP, Azure) or MLOps platforms (Weights & Biases, Databricks).
Familiarity with the Jupyter ecosystem or building IDE-like experiences in the browser.
Basic understanding of Linux and networking concepts, which helps you empathize with the workflows of our HPC and AI research user base.
Job details
Evergrid
Technology, Information and Internet
About
The most advanced AI Infra platform