Senior Data Engineer
DeepIP.com
Office
Paris, France
Full Time
About Us
At DeepIP, our vision is to build the AI operating system for IP practitioners. Intellectual Property is not just legal paperwork—it’s a company’s strategic DNA. Yet today’s patent professionals are drowning in inefficient processes, outdated tools, and mountains of prior art.
We are building AI-powered assistants to radically transform the practice of IP: from drafting patents, to prosecution, and to searching prior art. Our products empower IP professionals to focus on strategy and creativity while AI handles the complexity.
2025 is just the beginning — we’re growing faster than ever and looking for bold, passionate people to join the ride. If you thrive in a fast-paced, flexible environment where every challenge sparks growth, come build the future with us. 🌟
About The Role
We are looking for a Senior Data Engineer specialized in large-scale systems to design and operate our next-generation agentic pipelines, sometimes across 1B+ records.
You will be responsible for data ingestion, indexing, and serving at scale. Your mission is to ensure low-latency, highly relevant results, to serve as the world’s fastest and most accurate IP research assistant.
Key Responsibilities
- Ingest & Normalize Global Patent Data: Design pipelines to ingest and preprocess heterogeneous and multimodal data sources.
- Pipeline Infrastructure: Build and tune databases (ElasticSearch, OpenSearch, Milvus, Weaviate, etc.) to support both keyword and vector-based retrieval.
- Performance & Relevance: Optimize latency, recall, and precision across billions of records. Work with AI engineers on data chunking and embedding strategies.
- Monitoring & Reliability: Ensure observability, resilience, cost-efficiency and performance of large-scale clusters in production.
About You
- 5+ years of experience in Data Engineering with a focus on large-scale retrieval systems.
- Strong expertise with ElasticSearch / OpenSearch and vector DBs (Milvus, Weaviate, Pinecone, FAISS, etc.).
- Proven experience in scaling ingestion pipelines and processing queries at scale (100M+ records).
- Experience with Python (preferred) or another major language used in large-scale data systems (e.g. Java, Scala, Go) and query languages (SQL, ES|QL); experience with streaming frameworks (Kafka, Spark, Flink) is a plus.
- Comfortable with observability, monitoring, and performance tuning of distributed systems.
- Experience with Kubernetes and infrastructure as code.
- Bonus if you have experience in product / frontend development (Typescript).
- Interest in Intellectual Property and willingness to learn about the domain.
- Interest or experience in Agentic Pipelines.
What We Offer
- Join an AI-first team tackling one of the world’s biggest data challenges: patent knowledge.
- Impact from day one: your work will power thousands of IP professionals worldwide.
- A culture of collaboration, ownership, and rapid iteration.
- Flexible working hours and work location, competitive compensation.
Recruitment Process
- HR call : get to know you!
- Presentation interview with our VP Engineer
- Technical interview: system design & live coding (1h30)
- Co-founder interview
If you thrive in a fast-paced, ambitious, and super flexible environment, where collaboration, ownership, and a hunger for growth are celebrated, join us on this exciting journey.
DeepIP is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, sex, gender, sexual orientation, age, colour, religion, national origin, protected veteran status or on the basis of disability.
Applying at DeepIP is also the opportunity to access a broader network. Should we not proceed with a job offer, we would be pleased to refer you to the Talent Club. The talent club was created by Serena and aims at offering talents great opportunities in innovative companies (Dataiku, Malt, Libeo…)
Senior Data Engineer
Office
Paris, France
Full Time
September 19, 2025