Data Scientist / AI Engineer
Fulcrum Digital.com
Office
Pune City, India
Full Time
Who Are We
Fulcrum Digital is an agile and next-generation digital accelerating company providing digital transformation and technology services right from ideation to implementation. These services have applicability across a variety of industries, including banking & financial services, insurance, retail, higher education, food, healthcare, and manufacturing.
About The Role
We are looking for a skilled
and hands-on Data Scientist with 4–5 years of experience in developing
and deploying machine learning models—ranging from traditional ML algorithms to
advanced deep learning and Generative AI systems. The ideal candidate brings a
strong foundation in classification, anomaly detection, and time-series
modeling, along with hands-on experience in deploying and optimizing Transformer-based
models. Familiarity with quantization, fine-tuning, and RAG
(Retrieval-Augmented Generation) is highly desirable.
Responsibilities
- Design, train, and evaluate ML models for tasks such as classification, anomaly detection, forecasting, and natural language understanding.
- Build and fine-tune deep learning models, including RNNs, GRUs, LSTMs, and Transformer architectures (e.g., BERT, T5, GPT).
- Develop and deploy Generative AI solutions, including RAG pipelines for use cases such as document search, Q&A, and summarization.
- Perform model optimization techniques such as quantization for improving latency and reducing memory/compute overhead in production.
- Optionally fine-tune LLMs using Supervised Fine-Tuning (SFT) and Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA or QLoRA.
- Define and track relevant evaluation metrics; continuously monitor model drift and retrain models as needed.
- Collaborate with cross-functional teams (data engineering, backend, DevOps) to productionize models using CI/CD pipelines.
- Write clean, reproducible code and maintain proper versioning and documentation of experiments.
Requirements
Required Skills
- 4–5 years of hands-on experience in machine learning or data science roles.
- Proficient in Python and ML/DL libraries: scikit-learn, pandas, PyTorch, TensorFlow.
- Strong knowledge of traditional ML and deep learning, especially for sequence and NLP tasks.
- Experience with Transformer models and open-source LLMs (e.g., Hugging Face Transformers).
- Familiarity with Generative AI tools and RAG frameworks (e.g., LangChain, LlamaIndex).
- Experience in model quantization (e.g., dynamic/static quantization, INT8) and deployment on constrained environments.
- Knowledge of vector stores (e.g., FAISS, Pinecone, Azure AI Search), embeddings, and retrieval techniques.
- Proficiency in evaluating models using statistical and business metrics.
- Experience with model deployment, monitoring, and performance tuning in production environments.
- Familiarity with Docker, MLflow, and CI/CD practices.
Preferred Qualifications
- Experience fine-tuning LLMs (SFT, LoRA, QLoRA) on domain-specific datasets.
- Exposure to MLOps platforms (e.g., SageMaker, Vertex AI, Kubeflow).
- Familiarity with distributed data processing (e.g., Spark) and orchestration tools (e.g., Airflow).
- Contributions to research papers, blog posts, or open-source projects in ML/NLP/GenAI.