This job was posted more than 40 days ago and might be expired.
Sarvam logo

Backend Intern - Inference Pipelines & Diagnostics

Posted about 1 month ago

OfficeBengaluruEN

About Sarvam

Sarvam is building the bedrock of Sovereign AI for India. The company is developing India's full-stack sovereign AI platform, building across research, models, infrastructure and applications with a singular focus on making AI genuinely work for India. Sarvam works with leading enterprises and public institutions and is backed by Lightspeed, Peak XV, and Khosla Ventures. Sarvam partners with India's leading brands, including Tata Capital, SBI Life, CRED, IDFC, and LIC.

About the Role

We're looking for a Backend Intern to join Sarvam's engineering team and own meaningful workstreams in two critical areas: our inference pipeline infrastructure and diagnostics API services. You'll work on the systems that serve AI models at scale, help build robust APIs for diagnostics and observability, and contribute to the data pipelines that keep everything running reliably. Strong performers will be fast-tracked to a full-time offer at the end of the internship. Preferred background: AI/ML or Computer Science.

What You'll Do

• Build and optimise backend services for LLM inference pipelines in Python or Node.js

• Develop and maintain diagnostics API services for model observability and health monitoring

• Integrate LLM APIs and manage request routing, latency, and error handling across inference flows

• Design and query SQL and NoSQL databases to support pipeline state management and diagnostics data

• Build and maintain data pipelines to support inference workloads and operational metrics

• Deploy and manage services on cloud infrastructure (AWS or GCP) using version-controlled codebases on Git

• Collaborate with ML engineers and platform teams to debug, profile, and improve system performance

What We're Looking For

• Proficiency in Python or Node.js; comfortable writing clean, production-quality backend code

• Solid understanding of REST API design, including diagnostics and observability endpoints

• Familiarity with SQL and at least one NoSQL database (e.g. MongoDB, Redis, or DynamoDB)

• Working knowledge of Git for version control and collaborative development

• Basic exposure to cloud platforms — AWS or GCP

• Interest in LLMs and familiarity with LLM API integration patterns

• Background in Computer Science, AI, or Machine Learning preferred

Bonus Points

• Prior exposure to inference serving frameworks (e.g. vLLM, TGI, Triton, or similar)

• Experience with monitoring and observability tooling (e.g. Prometheus, Grafana, or OpenTelemetry)

• Familiarity with containerisation and orchestration (Docker, Kubernetes)

• Contributions to open-source projects in backend or ML infrastructure

Why Sarvam?

Sarvam is a fast-moving, high talent-density team building full-stack AI for India, working on problems that push the frontiers of AI with real population-scale impact.

• Work alongside researchers, engineers, builders, and business leaders who move fast and hold each other to a very high bar

• High ownership and high impact, from day one

• Everything we do is AI-first, from the way we build and ship to the way we think about problems

• You can work on problems that could change how an entire country learns, works, and communicates

If you want to work on problems at the frontier of AI in India, Sarvam is the place to be.

Job details
Workplace
Office
Location
Bengaluru
Experience
EN

Sarvam is India's full-stack sovereign AI platform, with speech-to-text, text-to-speech, translation, and conversational agents across 22 Indian languages. Start free.

Employees
325
Industry
Software Development

Key team members

Hemant Mohapatra

Hemant Mohapatra

Urvi Shah

Urvi Shah

Dr. Ganesh Ghangale

Dr. Ganesh Ghangale

Agathe PADOVANI

Agathe PADOVANI

Apply smarter with Jobr

Jobr aggregates jobs directly from company career portals — no middlemen. Our team applies on your behalf with AI-tailored resumes, reviewed by a human before submission.

Direct from company career pages
AI-personalised cover letters
Human review before every submit
Application tracking & follow-ups