Data Engineer - Hadoop
Virtusa.com
Office
TN
Full Time
Data Engineer - Hadoop - (CREQ229546)
Description
- 7+ years of experience designing and building data pipelines using Apache Spark, Databricks or equivalent bigdata frameworks.
- Handson expertise with streaming and messaging systems such as Apache Kafka (publish subscribe architecture), Confluent Cloud, RabbitMQ or Azure Event Hub. Experience creating producers, consumers and topics and integrating them into downstream processing.
- Deep understanding of relational databases and CDC. Proficiency in SQL Server, Oracle or other RDBMSs; experience capturing change events using Debezium or native CDC tools and transforming them for downstream consumption.
- Proficiency in programming languages such as Python, Scala or Java and solid knowledge of SQL for data manipulation and transformation.
- Cloud platform expertise. Experience with Azure or AWS services for data storage, compute and orchestration (e.g., ADLS, S3, Azure Data Factory, AWS Glue, Airflow, DBX, DLT).
- Data modelling and warehousing. Knowledge of data Lakehouse architectures, Delta Lake, partitioning strategies and performance
- Version control and DevOps. Familiarity with Git and CI/CD pipelines; ability to automate deployment and manage infrastructure as code.
- Strong problem solving and communication skills. Ability to work with cross functional teams and articulate complex technical concepts to nontechnical stakeholders..
Primary Location
: IN-TN-ChennaiSchedule
: Full TimeEmployee Status
: Individual ContributorJob Type
: ExperiencedTravel
: NoJob Posting
: 17/09/2025, 10:54:05 AMData Engineer - Hadoop
Office
TN
Full Time
September 17, 2025