Principal Software Engineer – Performance Engineering
Jersey, NJ - USA
Job Summary
If you are looking for a game-changing career working for one of the worlds leading financial institutions youve come to the right place.
As aPrincipal Software Engineer atJPMorganChase within the Commercial and Investment Bank Payment Performance Engineering team you will serve as the technical authority and execution lead for performance engineering across our platforms. You will be hands-on and strategicdefining non-functional requirements (NFRs) and service level objectives (SLOs) embedding performance into CI/CD scaling automated performance testing and driving resiliency capacity and user experience outcomes across distributed systems. You will set the bar for performance-first design partner closely with architecture SRE and application teams and ensure performance risks are identified and eliminated before production.
Job responsibilities
- Define and institutionalize application- and endpoint-level NFRs and SLOs including p95/p99 latency throughput ramp profiles and error budgets
- Drive proactive performance engineering through early bottleneck detection architectural guidance and capacity modeling
- Serve as the final technical authority for performance sign-offs across platform releases
- Design build and maintain automated test suites for load stress soak spike and capacity scenarios
- Virtualize partner dependencies and inject faults to validate components when upstream systems are unavailable
- Expand fully automated environment-aware performance test execution (on-commit/overnight) with health checks and actionable sanity tests
- Build dashboards and alerts correlating performance test signals with production telemetry against defined SLOs
- Provide actionable reporting on SLO variance drift and per-endpoint hotspots using RUM synthetic and server-side metrics
- Embed performance gates into CI/CD pipelines (pre-deploy smoke post-deploy validation regression detection with auto-fail/notify)
- Lead chaos and resiliency experiments (CPU memory network latency dependency failures) and validate autoscaling under extreme load
- Apply AI/LLMs to workload and scenario generation metrics interpretation and automated reporting with measurable success guardrails
Required qualifications capabilities and skills
- 15 years of overall engineering experience with 10 years in performance engineering for high-traffic distributed systems (web APIs microservices event-driven data-centric)
- Hands-on software engineering experience with Java/Spring Boot and Kubernetes (self-managed and EKS)
- Deep expertise in workload modeling queuing theory and statistical analysis of latency/throughput; fluent with percentile-based SLOs and error budgets
- Proficiency with load and protocol testing tools such as JMeter and BlazeMeter
- Scripting/orchestration skills in Java Python or TypeScript for performance automation and execution control
- Experience with service virtualization and fault injection (e.g. WireMock Mountebank Toxiproxy) including record-replay and dynamic templating
- Strong observability/APM capabilities using Dynatrace and/or OpenTelemetry plus RUM and synthetic monitoring approaches
- Experience building dashboards and analysis workflows in Kibana and/or Grafana to drive actionable decisions
- Strong CI/CD and DevOps experience (e.g. Jenkins GitLab GitHub Actions) including repeatable sign-offs artifact/version alignment and environment promotion
- Infrastructure-as-code and platform delivery experience (e.g. Terraform CloudFormation) including autoscaling strategies
- Ability to partner across architecture SRE and application teams to coach standards adoption and drive release readiness
Preferred qualifications capabilities and skills
- Experience with data-platform performance optimization (e.g. Oracle tuning JDBC pool tuning Kafka throughput/partitioning caching strategies)
- Strong systems and cloud performance background (Linux tooling JVM tuning containers AWS primitives such as compute ALB/NLB EKS networking)
- Experience with k6 and other modern cloud-native load testing frameworks
- Familiarity with service mesh technologies (Istio/Linkerd) and traffic-control patterns (rate limiting backpressure)
- Practical application of LLMs for test generation anomaly detection or automated reporting in engineering workflows
- Experience operating in financial-services scale low-latency systems and/or regulated environments
- Knowledge of advanced performance tooling (e.g. perf eBPF) and production-grade troubleshooting practices
This position is subject to Section 19 of the Federal Deposit Insurance Act. As such an employment offer for this position is contingent on JPMorgan Chases review of criminal conviction history including pretrial diversions or program entries.
Required Experience:
Staff IC
About Company
JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world’s most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans ov ... View more