Key Responsibilities

Incident & Problem Management

Lead major incident (MI) bridges and restore service with minimum business impact.
Handle all L3 escalations, perform deep diagnostics across Java, JVM, middleware, OS, and infra.
Own technical RCAs, drive long‑term and systemic remediation.
Identify recurring failure patterns and risks.

Reliability Engineering

Apply SRE principles: SLIs/SLOs, error budgets, resilience patterns.
Tune JVM parameters, analyze thread/heap dumps, and improve performance.
Influence application architecture for fault tolerance, scalability, and recoverability.
Validate DR readiness, failover behavior, and resilience testing outcomes.

Change, Release & Risk

Automation, Monitoring & Observability

Build advanced automation using Shell/Python/PowerShell.
Develop frameworks for health validation, automated recovery, and compliance checks.
Define observability standards; optimize alerts and improve MTTR.

Leadership & Mentorship

Skills & Qualifications

Technical (Mandatory)

Strong knowledge of application architecture, distributed systems, and middleware.
Java expertise: JVM internals, GC, memory management, thread/heap dump analysis, performance tuning.
.Net -- CLR internals, garbage collection, memory management, thread/dump analysis, and application performance tuning.
Strong Unix/Linux, networking basics, and advanced scripting (Shell/Python/PowerShell/VBS).
Advanced SQL and understanding of databases; Autosys (or equivalent scheduler).
Handson with observability tools: Splunk, AppDynamics/Dynatrace, ELK, Grafana, Prometheus.

Reliability & Operations

Major incident leadership, deep RCA, change/release readiness, DR & resilience engineering.
Experience in regulated production environments.

Soft Skills

Experience & Education

7–12+ years in Application Reliability, Production Support, SRE, or platform operations.
Bachelor’s degree in Computer Science/Engineering or equivalent.
ITIL, cloud, or industry certifications (preferred).
Banking/financial domain experience (preferred).

Working Conditions

Senior Applications Support Specialist