Be part of a dynamic team where your distinctive skills will contribute to a winning culture and team.

As a Data Engineer III - Python / Spark / Data Lake at JPMorgan Chase within the Consumer and Community Bank - Connected Commerce Technology, you will be a seasoned member of an agile team, tasked with designing and delivering reliable data collection, storage, access, and analytics solutions that are secure, stable, and scalable. Your responsibilities will include developing, testing, and maintaining essential data pipelines and architectures across diverse technical areas, supporting various business functions to achieve the firm's business objectives.

Job responsibilities

Supports review of controls to ensure sufficient protection of enterprise data
Advises and makes custom configuration changes in one to two tools to generate a product at the business or customer request
Updates logical or physical data models based on new use cases
Frequently uses SQL and understands NoSQL databases and their niche in the marketplace
Adds to team culture of diversity, opportunity, inclusion, and respect
Develop enterprise data models, Design/ develop/ maintain large-scale data processing pipelines (and infrastructure), Lead code reviews and provide mentoring thru the process, Drive data quality, Ensure data accessibility (to analysts and data scientists), Ensure compliance with data governance requirements, and Ensure business alignment (ensure data engineering practices align with business goals)

Required qualifications, capabilities, and skills

Formal training or certification on data engineering concepts and 3+ years applied experience
Experience across the data lifecycle, advanced experience with SQL (e.g., joins and aggregations), and working understanding of NoSQL databases
Experience with statistical data analysis and ability to determine appropriate tools and data patterns to perform analysis
Advanced proficiency in at least one programming language including Python, Java or Scala
Advanced proficiency in at least one cluster computing frameworks including Spark, Flink or Storm
Advanced proficiency in leveraging Gen AI models from Anthropic (or OpenAI, or Google) using APIs/SDKs
Advanced proficiency in Gen AI SDKs such as LangChain, LangGraph, LangSmith
Advanced proficiency in at least one cloud data lakehouse platform such as AWS data lake services, Databricks or Hadoop, at least one relational data store such as Postgres, Oracle or similar, and at least one NOSQL data store such as Cassandra, Dynamo, MongoDB or similar
Advanced proficiency in at least one scheduling/orchestration tool such as Airflow, AWS Step Functions or similar
Proficiency in Unix scripting, data structures, data serialization formats such as JSON, AVRO, Protobuf, or similar, big-data storage formats such as Parquet, Iceberg, or similar, data processing methodologies such as batch, micro-batching, or stream, one or more data modelling techniques such as Dimensional, Data Vault, Kimball, Inmon, etc., Agile methodology, TDD or BDD and CI/CD tools

Preferred qualifications, capabilities, and skills

Proficiency in IaC (preferably Terraform, alternatively AWS cloud formation)
Proficiency in cloud based data pipeline technologies such as- Fivetran, DBT, Prophecy.io, etc.
Strong Python and Spark

JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.

We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.

We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants’ and employees’ religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.

JPMorgan Chase & Co. is an Equal Opportunity Employer, including Disability/Veterans

Data Engineer III- Python / Spark / Data Lake

JPMorgan Chase & Co.

Data Engineer III- Python / Spark / Data Lake

JPMorgan Chase & Co.