Talent.com
This job offer is not available in your country.
Intermediate Site Reliability Engineer, Database Operations

Intermediate Site Reliability Engineer, Database Operations

GitLabWorkFromHome, Southland, New Zealand
30+ days ago
Job description

Overview

Intermediate Site Reliability Engineer, Database Operations. Join to apply for the Intermediate Site Reliability Engineer, Database Operations role at GitLab.

GitLab is an open-core software company that develops an AI-powered DevSecOps platform used by more than 100,000 organizations. The Database Operations team owns the lifecycle of the PostgreSQL database engine for GitLab.com, focusing on reliability, scalability, performance, and security of the database and its supporting services.

Responsibilities

  • Automating every operational task as a core requirement (e.g., package updates, configuration changes across environments, tooling for automatic provisioning of user-facing services).
  • Responding to platform emergencies, alerts, and escalations from Customer Support.
  • Ensure systems exist to manage software life-cycles with minimal manual effort.
  • Develop an automated, multi-environment observability stack based on the existing SaaS system and extend it to predict capacity needs based on usage patterns.
  • Plan for new service roll-outs, expansion and capacity management of existing services, and collaborate with users to optimize resource consumption.

As An SRE You Will

  • Work on database reliability and performance for GitLab.com from within the SRE team and collaborate with product teams to ship solutions.
  • Analyze solutions and implement best practices for PostgreSQL clusters and components.
  • Develop observability for relevant database metrics and achieve database objectives.
  • Roll out changes to production with peers and help mitigate database-related production incidents.
  • Provide on-call support on rotation and lend database expertise to engineering teams (e.g., migrations, queries, performance optimizations).
  • Automate database infrastructure and provide self-service tools to enable engineering success.
  • Use the GitLab product to operate GitLab.com efficiently and plan the growth of the database infrastructure.
  • Design, build, and maintain core database infrastructure components to scale for hundreds of thousands of concurrent users.
  • Support and debug production issues across services and stack layers.
  • Design monitoring and alerting to focus on symptoms and proactive indicators rather than outages.
  • Document actions to turn learnings into repeatable processes and automation.
  • Projects You Could Work On

  • Review and implement database administration solutions (e.g., backups, performance tuning).
  • Automate setup of replicas and backups using Ansible, Terraform, Chef and other tools.
  • Develop self-service tooling for engineers via GitLab ChatOps.
  • Provide technical guidance on database-related design methodologies and tuning.
  • Review database migrations and optimize queries and schemas for performance.
  • Respond to production incidents with focus on mitigating database issues.
  • Contribute to infrastructure design and scalability considerations for data storage.
  • Plan scaling steps for the database and evaluate capacity requirements.
  • Qualifications

  • Experience running PostgreSQL in high-growth, large production environments using self-managed (VM, Kubernetes with PostgreSQL Operators) or DBaaS services.
  • Hands-on experience using PostgreSQL internals data to design, build, and troubleshoot systems.
  • Infrastructure automation, orchestration, and configuration management (e.g., Chef, Ansible, Puppet, Terraform).
  • Solid understanding of SQL and PL / pgSQL.
  • Significant experience in a large SaaS distributed systems production environment.
  • Strong written and verbal English communication skills and collaboration abilities.
  • Documentation mindset with a bias for delivering quickly and iterating.
  • Proactive, go-for-it attitude and willingness to fix issues when you see something broken.
  • Solid data modeling and data structure design skills.
  • Bonus : programming skills (Ruby and / or Go) and experience with ClickHouse or similar OLAP databases.
  • Additional Information

  • GitLab is an equal opportunity employer; policies are merit-based and apply without regard to race, color, religion, sex, national origin, age, disability, or other protected status. If accommodation is needed during the recruiting process, please notify us.
  • Country Hiring Guidelines : GitLab hires globally. All roles are remote, though some may have location-based eligibility requirements.

    Privacy Policy : Review our Recruitment Privacy Policy. This job posting does not include extraneous site notices or tracking links.

    #J-18808-Ljbffr

    Create a job alert for this search

    Reliability Engineer • WorkFromHome, Southland, New Zealand

    Related jobs
    • Promoted
    Engineering Senior Software Engineer New Zealand (Remote) FullTime

    Engineering Senior Software Engineer New Zealand (Remote) FullTime

    Leonardo Interactive PtyWorkFromHome, Southland, New Zealand
    Join the Revolution at Leonardo.Ai is an Australian tech startup.Our mission is to unleash the world's creativity with its groundbreaking AI-powered platform. We're seeking highly skilled Senior Sof...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Software Engineer, Data Platform

    Senior Software Engineer, Data Platform

    SmarterDxWorkFromHome, Southland, New Zealand
    We are looking for a data and backend-oriented Senior Software Engineer to help us advance our clinical AI by designing and building core systems that handle, process, and analyze clinical data at ...Show moreLast updated: 14 days ago
    • Promoted
    Senior Software Engineer

    Senior Software Engineer

    FederatoWorkFromHome, Southland, New Zealand
    Federato is on a mission to defend the right to efficient, equitable insurance for all.We enable insurers to provide affordable coverage to people and organizations facing the issues of today - the...Show moreLast updated: 14 days ago
    • Promoted
    Division CFO, Trilogy (Remote) - $400,000 / year USD

    Division CFO, Trilogy (Remote) - $400,000 / year USD

    TrilogyWorkFromHome, Southland, New Zealand
    Division CFO, Trilogy (Remote) - $400,000 / year USD.Division CFO, Trilogy (Remote) - $400,000 / year USD.This range is provided by Trilogy. Your actual pay will be based on your skills and experience —...Show moreLast updated: 3 days ago
    • Promoted
    Site Reliability / Gitops Engineer

    Site Reliability / Gitops Engineer

    CanonicalWorkFromHome, Southland, New Zealand
    Site Reliability / Gitops Engineer.Be among the first 25 applicants.Site Reliability / Gitops Engineer.Canonical is a leading provider of open source software and operating systems to the global en...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CanonicalWorkFromHome, Southland, New Zealand
    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...Show moreLast updated: 30+ days ago
    • Promoted
    Embedded Linux Senior Software Engineer - Optimisation

    Embedded Linux Senior Software Engineer - Optimisation

    CanonicalWorkFromHome, Southland, New Zealand
    Embedded Linux Senior Software Engineer - Optimisation.Embedded Linux Senior Software Engineer - Optimisation.Embedded Linux Senior Software Engineer - Optimisation. Be among the first 25 applicants...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    CanonicalWorkFromHome, Southland, New Zealand
    Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiat...Show moreLast updated: 30+ days ago
    • Promoted
    Control Plane - Site Reliability Engineer (Hosted Infrastructure)

    Control Plane - Site Reliability Engineer (Hosted Infrastructure)

    ElasticWorkFromHome, Southland, New Zealand
    Control Plane - Site Reliability Engineer (Hosted Infrastructure).Elastic is the Search AI Company enabling real-time answers across data at scale. We integrate, scale, and evolve multi-cloud infras...Show moreLast updated: 20 days ago
    • Promoted
    Site Reliability Engineering Manager

    Site Reliability Engineering Manager

    CanonicalWorkFromHome, Southland, New Zealand
    Site Reliability Engineering Manager.Be among the first 25 applicants.Site Reliability Engineering Manager.Canonical is a leading provider of open-source software and operating systems for global e...Show moreLast updated: 30+ days ago
    • Promoted
    Control Plane - Site Reliability Engineer (Hosted Infrastructure)

    Control Plane - Site Reliability Engineer (Hosted Infrastructure)

    Elasticsearch B.V.WorkFromHome, Southland, New Zealand
    Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people.The Elastic Search AI...Show moreLast updated: 20 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Clover HealthWorkFromHome, Southland, New Zealand
    At Counterpart Health, we are transforming healthcare and improving patient care with our innovative primary care tool, Counterpart Assistant. By supporting Primary Care Physicians (PCPs), we are ab...Show moreLast updated: 14 days ago
    • Promoted
    Senior Software Engineer

    Senior Software Engineer

    Education Perfect LtdWorkFromHome, Southland, New Zealand
    Education Perfect is an EdTech platform designed to empower educators and amplify their impact in the classroom.We aim to enable teachers to personalise learning at scale with a range of powerful l...Show moreLast updated: 20 days ago
    • Promoted
    Staff / Principal Software Engineer - NZ Remote

    Staff / Principal Software Engineer - NZ Remote

    Auror LimitedWorkFromHome, Southland, New Zealand
    Please note : This role is only available for candidates working remotely from NZ.We are unable to consider any applications outside of NZ. Auror is empowering the retail industry to tackle theft and...Show moreLast updated: 8 days ago
    • Promoted
    Senior Site Reliability / Gitops Engineer

    Senior Site Reliability / Gitops Engineer

    CanonicalWorkFromHome, Southland, New Zealand
    Senior Site Reliability / Gitops Engineer role at Canonical.Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets.Our platfor...Show moreLast updated: 30+ days ago
    • Promoted
    Linux Engineering Manager - Optimisation for Latest Hardware

    Linux Engineering Manager - Optimisation for Latest Hardware

    CanonicalWorkFromHome, Southland, New Zealand
    Lead an engineering team that partners with the Linux engineers of a major silicon company, and works across the full Linux stack from kernel to GUI, to optimise Ubuntu, the world's most widely use...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Principal Software Engineer - Platform Engineering

    Senior Principal Software Engineer - Platform Engineering

    AtlassianWorkFromHome, Southland, New Zealand
    Senior Principal Software Engineer - Platform Engineering.Senior Principal Software Engineer - Platform Engineering.Senior Principal Software Engineer - Platform Engineering.Senior Principal Softwa...Show moreLast updated: 30+ days ago
    • Promoted
    Staff / Principal Software Engineer - NZ Remote

    Staff / Principal Software Engineer - NZ Remote

    AurorWorkFromHome, Southland, New Zealand
    Please note : This role is only available for candidates working remotely from NZ.We are unable to consider any applications outside of NZ). At Auror, we're empowering the retail industry to tackle t...Show moreLast updated: 1 day ago