SRE Infrastructure & Team Lead VP - Glasgow

Location: Glasgow City
Job Type: Full Time

Big Data (Everest) SRE Infrastructure & Team Lead – VP

JPMC are looking to develop a core set of data management capabilities to drive consistency across each line of business. This data platform will be deployed on premises and, in the longer term, in the public cloud. The initial focus is on sourcing, storing, enriching and making information available to support internal management reporting, external regulatory reporting, as well as machine learning and other data analysis applications.

We are seeking an experienced software engineering lead in our global Site Reliability Engineering (SRE) team supporting our Big Data platform. This individual will be expected to lead a team of software engineers who will grow into subject matter experts, work with functional application development teams, and partner with infrastructure engineers and production support analysts to determine requirements for designing and developing automation, SDLC and development environment testing & integration tools. The toolsets developed must pass the rigor of JPMC’s cyber security standards.

The SRE team runs, maintains and improves the Big Data Platform against established Service Level Objectives by applying software engineering practices. It is responsible for the availability, performance, change management, monitoring, and capacity management of its services, with special emphasis on automating the processes and workload that support them. The SRE team is also responsible for the operational support of the Big Data infrastructure, with emphasis on feeding outage, issue and incident data into a design and SDLC feedback loop to ensure maximum automation and outage avoidance.

Key responsibilities of this role include:

  • Design, develop, test and deliver software to automate manual operational work and ensure application performance and resiliency
  • Act as a key contributor to SRE, core infrastructure and functional development teams throughout the life cycle to help create software for reliability and scale, ensuring minimal refactoring or changes
  • Troubleshoot priority incidents, conduct blameless post-mortems and ensure permanent closure of the incidents
  • Identify application patterns and analytics in support of better service level objectives
  • Design self-healing and resiliency patterns
  • Design and conduct performance tests, identifying bottlenecks, opportunities for optimization and capacity demand
  • Contribute to the strategy for a best-in-class monitoring framework to achieve end-to-end flow monitoring and noiseless alerting
  • Design automated software and product upgrades, change management and release management solutions
  • Partner with SRE, Operate and development teams to monitor and correct the effort split between manual operational work and engineering work
  • Engage with the Technology Controls organization to ensure tooling and ecosystem meet the Firm’s rigorous cyber policies
  • Contribute to the Firm-level SRE community via engineering projects and/or intellectual capital
  • Be part of the 24x7 support coverage as needed
  • Manage team members

Key qualifications:

This role requires a wide variety of strengths and capabilities, including:

  • Bachelor’s Degree in Computer Science, Engineering or Business

  • Prior experience in leading DevOps and/or application development teams

  • Excellent debugging and troubleshooting skills

  • Hands-on experience in large-scale software development, preferably in one of these languages: Java, Python or scripting languages

  • Hands-on experience with Git, Bitbucket, Jenkins, Sonar, Splunk, Maven, AIM and/or Continuous Delivery tools

  • Hands-on experience with Unix (Linux and Solaris), relational databases (Oracle, MS SQL DB, Sybase, etc.) and non-DB technologies

  • Knowledge of load balancing, IP and DNS

  • Exposure to new and emerging technologies such as cloud and virtualization

  • Exposure to messaging technologies, e.g. Kafka
  • Exposure to orchestration and configuration management tools for infrastructure
  • Familiarity with Agile Methodologies
  • Hands-on experience building out and maintaining data management platforms/workbenches, either in-house or as part of a commercial offering
  • Experience with infrastructure components utilized in data warehousing or big data environments
  • Excellent communication skills, both written and oral, appropriately pitched for senior technical and senior business audiences

  • Ability to work and effectively prioritize in a highly dynamic work environment that includes a global focus