Site Reliability Engineer(Vice President)-Houston(Hybrid)

September 18, 2023

Job Description


  • Site Reliability Engineer for Global Wholesale Lending and Banking Technology (WLBT) is a critical role that will help drive the end-to-end deliverables to ensure a stable, efficient, observable, and resilient technology environment. The successful candidate will deep-dive into current production incidents, understand current design and architectural issues, develop innovative and technical tooling to improving production stability and enabling faster recovery. The candidate will be required to partner closely with the technology leads of each functional streams across WLBT and the wider organization to achieve their goals to improve supportability and reduce toil.
  • The partnership will include interaction with Architects, Engineers, Developers, Production Management and Infrastructure partners (DBA’s, Network, SA’s) to understand the technology stacks, process flows, transaction processing in product processors, data flows, web/app tier performance, database performance, middleware performance and the inputs/outputs to ensure optimal engineering of stacks for application performance.
  • Contribute to define and implement best practices and processes including driving compliance. Ensure transparency and consistency across teams in the region and globally.
  • Develop strong relationship with development, testing and infrastructure teams to drive efficiency.
  • Work closely with senior stakeholders across the WLCR business to understand and drive tooling program to improve supportability.
  • Previous experience of Site Reliability Engineering concepts and demonstrated practical execution of these concepts is a requirement.

Functional Responsibilities:

  • A high performing technologist with the experience and background that will allow them to excel in a fast paced, rapidly changing, and highly technical environment. The candidate will also be responsible to provide Best in Class Production Availability, Resiliency and Predictability to the WLBT business and lending functions by standardizing and improving application logging for better proactive monitoring, designing solutions that are fault tolerant, scalable, reliable and building always-on systems for high availability. In addition, the candidate will partner closely with each area to perform diagnostic and forensic investigation for outages caused by scale issues in applications
  • Engage Product/Technology Owners to establish business service level objectives, Service level Indicators to measure performance and then identify hotspots and architectural refinements required.
  • The candidate will spend 50% of their time on development tasks such as new features, scaling, recovery, and automation of manual tasks, continuous integration and continuous delivery. The successful candidate will have the technical skills and aptitude that can bring the latest technology ideas to fruition and support an excellent client experience.
  • Single Point of contact to ensure that all the work stream deliverables under the global Program are on track as planned. Key person for planning, execution and delivery End to End, facing the senior management for risks/escalations when plans fall through the cracks. Strategist who can bring the status from Red to Green.
  • Confidently communicate and escalate issues and review/present exceptions on a monthly basis to the Production Resiliency Senior management & Business Exco sponsors and stakeholders.
  • The successful technologist will need strong influential and social skills to help negotiate and drive all the teams involved in building production resiliency.
  • Working closely with technology, infrastructure, regulatory and service operations organizations across the globe, the person will have unparalleled opportunities to work across boundaries to solve some of the most difficult problems in the financial industry today.


  • 6-10 years of relevant experience
  • Experience with financial products, knowledge of lending is a plus
  • Ability to adjust priorities quickly as circumstances dictate
  • Consistently demonstrates clear and concise written and verbal communication
  • Working knowledge of the industry and industry products and services
  • Demonstrated interpersonal and development skills
  • Strong scripting / programming experience with a focus on production and operational improvement
  • Knowledge of support tools such as Splunk / ELK / App-Dynamics
  • AWS or similar cloud technologies
  • Python scripting
  • Knowledge of Kubernetes/OpenShift/Docker


  • Bachelor’s degree/University degree or equivalent experience


Job Family Group:



Job Family:

Applications Development


Time Type:

Full time


Primary Location:

Houston Texas United States


Primary Location Salary Range:

$121,560.00 – $182,340.00


Citi is an equal opportunity and affirmative action employer.

Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Citigroup Inc. and its subsidiaries (“Citi”) invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.

View the “EEO is the Law” poster. View the EEO is the Law Supplement.

View the EEO Policy Statement.

View the Pay Transparency Posting