Markets Application Production Services
Bank of America.com
Office
Plano, United States
Full Time
Job Description:
At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day.
Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates’ physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve.
Bank of America is committed to an in-office culture with specific requirements for office-based attendance and which allows for an appropriate level of flexibility for our teammates and businesses based on role-specific considerations.
At Bank of America, you can build a successful career with opportunities to learn, grow, and make an impact. Join us!
Job Description:
This job is responsible for providing front-line support to end users, responding to issues related to incidents and problem management governance for multiple applications, and leading triage activities on all business impacting incidents. Key responsibilities include ensuring compliance with incident management and problem management policies and procedures, serving as a focal point for the customer, client, and associate experience, restoring complex production incidents under tight Service Level Agreements, and pursuing root cause and problem resolution follow ups.
Responsibilities:
- Leads production support triage efforts, manages bridge line troubleshooting, engages in technical research, and escalates issues to leadership as needed
- Ensures all impacts are accurately recorded and documented in the system of record, oversees that documents and wikis are updated and available for use during triage, and supports the documentation of application flows, upstream/downstream impacts during outages, the customer experience, and contacts for support needs
- Identifies and/or validates business impacts through interpretation of monitors, dashboards, and logs to communicate with leadership and vendors
- Manages activities to identify incident root cause, resolution, preventative actions, and change requests, and reports on incident data quality
- Promotes and enforces production governance during triage/testing and identifies production failure scenarios, vulnerabilities, and opportunities for improvement
- Serves as a subject matter expert for applications within a portfolio, leveraging extensive knowledge of application functionalities and application flows
- Assesses and prioritizes research requests, ad hoc reports, and offline incidents at the direction of senior team members and delegates work as needed to team members and peers
Position Summary:
We are looking to hire a team member who will be part of Market Application Production Services (MAPS) Quartz team working in 2nd shift. This role is involved in the monitoring & supporting of Quartz Core environment and components including in-flight strategic projects. The team works on automation, tool development, fixing Infrastructure issues, incident management and provide innovative solutions to ensure that the application infrastructure remains optimized and stable.
Required Qualifications:
- Eight years strong experience in Linux administration, programming experience in at least one language (Python, Shell scripting, etc.)
- Excellent Operational skills, troubleshooting Production and Network issues and identifying Root Cause Prevention.
- Candidate should be able to connect the dots and conclude complex Infrastructure issues quickly with minimal supervision.
- Exposure to Core troubleshooting Tools and Automation.
- Strong experience in Infrastructure automation using either of Ansible or Python,
- Self-motivated and results oriented with excellent analytical, problem solving, interpersonal, presentation and communication skills.
- Operate in a fast-paced environment with multiple concurrent priorities.
- Exposure and working experience with large scale environments.
- Background in large Enterprise experience.
- Process adherence and efficiency in handling Incident and problem management
- Adherence to written procedures and policies for Change Management
Desired Qualifications:
- Experience in monitoring, and observability tools such as ELK, Prometheus, Splunk, Dynatrace, etc.
- Experience with supporting Cloud technologies (Azure ,etc.) will be a plus
Skills:
- Adaptability
- Analytical Thinking
- Influence
- Production Support
- Risk Management
- Automation
- Collaboration
- Innovative Thinking
- Result Orientation
- Solution Design
- Business Acumen
- DevOps Practices
- Project Management
- Solution Delivery Process
- Stakeholder Management
Shift:
1st shift (United States of America)Hours Per Week:
40Markets Application Production Services
Office
Plano, United States
Full Time
September 23, 2025