Triomics

MLOps & Data Engineer

Posted about 9 hours ago

Office - India Office

GROWTH PATH

This is an individual contributor role with strong ownership expectations. High performers may be considered for workstream lead or functional lead responsibilities after approximately 12 months, based on demonstrated ownership, delivery, technical judgment, mentoring, cross-functional influence, and ability to reduce dependency on the Director of ML.

ABOUT THE ROLE

We are looking for an MLOps & Data Engineer to build the infrastructure that allows our ML team to process clinical documents, run experiments, deploy models, monitor systems, and support annotation/evaluation workflows.

You will work closely with Research Engineers, ML Evaluation Engineers, Clinical AI Data Specialists, Engineering DevOps, and backend teams. This role requires both ML infrastructure and practical data engineering skills.

WHAT YOU WILL DO

  • Build and maintain data pipelines for clinical document processing, OCR outputs, text extraction, metadata normalization, and dataset preparation.

  • Support deployment cycles for ML/LLM systems in collaboration with Engineering DevOps.

  • Build and maintain training, inference, and evaluation infrastructure.

  • Improve experiment tracking, model versioning, dataset versioning, CI/CD, monitoring, observability, and reproducibility.

  • Build internal tools and lightweight Streamlit apps for annotation, clinical review, evaluation, QA, data inspection, and project operations.

  • Automate recurring ML workflows and reduce manual operational burden on Research Engineers.

  • Work with Research Engineers to productionize reliable prototypes.

  • Work with ML Evaluation Engineers to support evaluation pipelines, hidden test set runs, regression automation, and production monitoring.

  • Ensure systems are secure, reproducible, maintainable, and production-friendly.

WHAT WE EXPECT

  • 3–6+ years of experience in data engineering, MLOps, backend engineering for ML systems, ML platform work, or production data workflows.

  • Strong Python skills and comfort with data processing, APIs, scripts, and internal tools.

  • Experience with Docker, Git, CI/CD, APIs, cloud infrastructure, and production monitoring.

  • Experience with data pipelines, workflow orchestration, object storage, databases, and batch/stream processing.

  • Familiarity with ML workflows such as experiment tracking, model registry, inference deployment, and evaluation pipelines.

  • Ability to build practical internal tools quickly, including Streamlit or similar lightweight apps.

  • Strong engineering discipline: logging, tests, reproducibility, documentation, reliability, and security-aware data handling.

NICE TO HAVE

  • Experience with LLM serving, vLLM, Ray, Triton, Kubernetes, Terraform, Airflow, Prefect, MLflow, Weights & Biases, FastAPI, Streamlit, or similar tools.

  • Experience with OCR/document pipelines, PDFs, TIFF/JPEG processing, EHR data, or healthcare data systems.

  • Experience working with DevOps/SRE teams and understanding where ML platform ownership should sit versus engineering DevOps ownership.

  • Familiarity with PHI/PII-aware data handling and secure data workflows.

SUCCESS IN 6 MONTHS

  • Establishes reliable ML deployment and data-processing workflows.

  • Reduces the time Research Engineers spend on infrastructure, manual data preparation, and ad hoc tooling.

  • Builds useful internal tools for annotation, evaluation, review, and data inspection.

  • Improves reproducibility of experiments and releases.

  • Works effectively with Engineering DevOps without requiring the engineering team to own all ML-specific infra.

About Triomics

Triomics is building the agentic AI layer for oncology EHRs. Cancer hospitals spend billions on highly trained staff manually reading unstructured patient records - pathology reports, clinical notes, genomic panels - to power workflows like trial matching, registry curation, visit prep, and quality reporting. We replace that manual work with task-driven AI agents that sit inside the EMR and process records at scale, in real time.

Our platform is trusted by leading cancer centers including Memorial Sloan Kettering, Mount Sinai, and Yale Cancer Center. We have grown 10x in the last year and process millions of oncology medical documents monthly.

Our investors include Battery Ventures, Lightspeed, General Catalyst, Nexus Venture Partners, and Y Combinator.

Why Join Triomics

  • Impact at scale. The systems you build directly power AI workflows that accelerate cancer research and improve patient outcomes.

  • Cutting-edge problems. Hard, data-intensive systems at the intersection of AI, healthcare, and scale - in a highly regulated industry where reliability is non-negotiable.

  • World-class team. Work alongside top talent across AI, engineering, and product, with best-in-industry compensation.

  • Culture that ships. Fast-paced, ownership-driven, with company-sponsored workations.

Perks & Benefits

  • Lunch provided at the office - one less daily decision.

  • Flexible working hours - we care about output, not clock-ins.

  • Comprehensive health insurance for you and your family.

  • Zomato meal benefits for early starts and late nights.

Job details

Workplace: Office
Location: India Office
Employees: 71
Industry: Software Development
Headquarters: New York, NY
Specialties: oncology, generative AI, LLM, health technology, digital health, precision health, clinical trials, and clinical research

Key team members

Sameer Brij Verma
