Overview
Intermediate Site Reliability Engineer, Database Operations. Join to apply for the Intermediate Site Reliability Engineer, Database Operations role at GitLab.
GitLab is an open-core software company that develops an AI-powered DevSecOps platform used by more than 100,000 organizations. The Database Operations team owns the lifecycle of the PostgreSQL database engine for GitLab.com, focusing on reliability, scalability, performance, and security of the database and its supporting services.
Responsibilities
- Automating every operational task as a core requirement (e.g., package updates, configuration changes across environments, tooling for automatic provisioning of user-facing services).
- Responding to platform emergencies, alerts, and escalations from Customer Support.
- Ensure systems exist to manage software life-cycles with minimal manual effort.
- Develop an automated, multi-environment observability stack based on the existing SaaS system and extend it to predict capacity needs based on usage patterns.
- Plan for new service roll-outs, expansion and capacity management of existing services, and collaborate with users to optimize resource consumption.
As An SRE You Will
Work on database reliability and performance for GitLab.com from within the SRE team and collaborate with product teams to ship solutions.Analyze solutions and implement best practices for PostgreSQL clusters and components.Develop observability for relevant database metrics and achieve database objectives.Roll out changes to production with peers and help mitigate database-related production incidents.Provide on-call support on rotation and lend database expertise to engineering teams (e.g., migrations, queries, performance optimizations).Automate database infrastructure and provide self-service tools to enable engineering success.Use the GitLab product to operate GitLab.com efficiently and plan the growth of the database infrastructure.Design, build, and maintain core database infrastructure components to scale for hundreds of thousands of concurrent users.Support and debug production issues across services and stack layers.Design monitoring and alerting to focus on symptoms and proactive indicators rather than outages.Document actions to turn learnings into repeatable processes and automation.Projects You Could Work On
Review and implement database administration solutions (e.g., backups, performance tuning).Automate setup of replicas and backups using Ansible, Terraform, Chef and other tools.Develop self-service tooling for engineers via GitLab ChatOps.Provide technical guidance on database-related design methodologies and tuning.Review database migrations and optimize queries and schemas for performance.Respond to production incidents with focus on mitigating database issues.Contribute to infrastructure design and scalability considerations for data storage.Plan scaling steps for the database and evaluate capacity requirements.Qualifications
Experience running PostgreSQL in high-growth, large production environments using self-managed (VM, Kubernetes with PostgreSQL Operators) or DBaaS services.Hands-on experience using PostgreSQL internals data to design, build, and troubleshoot systems.Infrastructure automation, orchestration, and configuration management (e.g., Chef, Ansible, Puppet, Terraform).Solid understanding of SQL and PL / pgSQL.Significant experience in a large SaaS distributed systems production environment.Strong written and verbal English communication skills and collaboration abilities.Documentation mindset with a bias for delivering quickly and iterating.Proactive, go-for-it attitude and willingness to fix issues when you see something broken.Solid data modeling and data structure design skills.Bonus : programming skills (Ruby and / or Go) and experience with ClickHouse or similar OLAP databases.Additional Information
GitLab is an equal opportunity employer; policies are merit-based and apply without regard to race, color, religion, sex, national origin, age, disability, or other protected status. If accommodation is needed during the recruiting process, please notify us.Country Hiring Guidelines : GitLab hires globally. All roles are remote, though some may have location-based eligibility requirements.
Privacy Policy : Review our Recruitment Privacy Policy. This job posting does not include extraneous site notices or tracking links.
#J-18808-Ljbffr