Engram Lab logo

Founding Engineer, ML Systems & Performance

Posted about 13 hours ago

OfficeSan Francisco

About Engram

Today’s AI is a brilliant stranger: it can solve the world’s hardest math problems, but it knows next to nothing about you and your work. It rereads your files to answer even basic questions, burns an enormous amount of tokens when sifting through large corpuses, and between sessions, it retains scraps at best.

We train models to study your world and anticipate your questions in advance, forming engrams: compact memories that capture your knowledge and history. Our approach opens a new axis of scaling. The more we study your context at training time, the better we become at inference time.

We're already working with leaders in AI like Microsoft, Notion, and Harvey, and just raised $98M from General Catalyst, Kleiner Perkins, Sequoia, Factory, Modern, Amplify, Neo and others. Our investors and advisors include Assaf Rappaport, Andrej Karpathy, and Pieter Abbeel.

AI has spent years learning everything about the world. Now it should learn something about yours.

About this role

You’ll be among the first ML Systems Engineer, joining a team of machine learning researchers and performance engineers in building our personalization and continual learning API, which powers models and agents that learn from user context.

This role is focused on designing, optimizing, and scaling training and inference workloads —bridging the gap between cutting-edge AI research and production. This includes:

  • Designing and executing new frameworks, techniques, and systems to improve performance, reliability, latency, and efficiency.

  • Partner closely with researchers, turning prototypes into systems that run at scale and feeding systems constraints back into research decisions.

  • Optimize serving paths for personalization and memory retrieval, where per-user state and low latency both matter.

  • Work on distributed training — data and model parallelism, communication scheduling, and scaling efficiency across multiple GPUs and nodes.

You'll be the bridge between researchers and platform engineering, while working in deep collaboration with our customers (AI-native application-layer companies like Notion and Harvey). The architecture will be shaped by the constraints and requirements of our R&D work. There are no walls between product, research, and engineering here; delivering on our mission requires a multidisciplinary approach.

This is a founding hire in the truest sense. You’ll set the bar for engineering at Engram: code review, testing, on-call, and a security posture that gives customers confidence in entrusting us with their most sensitive data. You’ll also help build the engineering team around you and influence our engineering culture as we scale.

Your background looks like

  • Bachelor’s degree or equivalent experience in computer science, engineering, or similar.

  • 5+ years of experience with training or inference systems, optimized workloads with measurable results.

  • Strong engineering foundation, with demonstrated excellence navigating complex technical environments and shipping high-quality code in a fast-paced environment.

  • Deep understanding of ML framework (eg. PyTorch, JAX), GPUs, distributed systems, and infrastructure.

  • Operate well in ambiguous environments — you will have real ownership and be responsible for steering the ship in a novel sector of the industry.

  • You have a bias toward action and a knack for turning research concepts into concrete, executable plans.

Bonus points if you have

  • Prior early-stage experience.

  • Experience in open-source ML or systems infrastructure projects.

Engram is based in San Francisco. This role is in-person in our SF office. We offer competitive cash compensation and startup equity.

Engram is an equal opportunity employer. We’re building a team that reflects a range of backgrounds and perspectives, and we welcome applicants regardless of race, color, religion, national origin, gender, gender identity, sexual orientation, age, disability, or veteran status.

Job details
Workplace
Office
Location
San Francisco

Atelier ENGRAM développe et propose des expériences sensorielles uniques en suggérant une relation affective différente du monde numérique. La relation de l’usager avec l'activation devient partie intégrante de l’événement. On l’invite à intervenir et à générer le contenu, numérique ou physique, des installations. L’ambition de l’atelier est de marquer l’imaginaire collectif en touchant les limites technologiques et en proposant une expérience affective différente à l’usager. __ ENGRAM def. Trace organique laissée par un évènement dans le fonctionnement bioélectrique du cerveau, qui constituerait la base matérielle du souvenir. __

Industry
Design Services
Headquarters
Montréal, Québec
Founded
2014
Apply smarter with Jobr

Jobr aggregates jobs directly from company career portals — no middlemen. Our team applies on your behalf with AI-tailored resumes, reviewed by a human before submission.

Direct from company career pages
AI-personalised cover letters
Human review before every submit
Application tracking & follow-ups