SRE Team Lead (GameDev)
jobgether
Spain
Tiempo completo
428 ofertas más en Spain.
Sube tu CV y descubre cuáles encajan realmente contigo.
Accountabilities:
- Lead, mentor, and support a team of DevOps and SRE engineers, fostering technical growth and engineering excellence.
- Define and evolve Site Reliability Engineering and DevOps practices, processes, and standards across the organization.
- Contribute to technical strategy, infrastructure roadmaps, and the continuous improvement of engineering culture.
- Design, maintain, and enhance highly available, scalable, and resilient cloud-native infrastructure.
- Develop and expand monitoring, observability, alerting, and incident management capabilities to ensure system reliability.
- Participate in and coordinate on-call rotations while improving incident response and root cause analysis processes.
- Automate infrastructure provisioning, operational workflows, and repetitive tasks using Infrastructure as Code and scripting.
- Collaborate closely with development teams to improve system reliability, deployment pipelines, CI/CD processes, and overall platform performance.
- Promote Kubernetes-native approaches and provide technical mentorship on cloud and platform engineering practices.
- Support architectural decision-making and contribute to the evolution of cloud infrastructure and operational excellence.
Requirements
- 5+ years of experience in DevOps, Site Reliability Engineering, or related infrastructure-focused roles.
- Previous leadership experience or a strong desire and capability to take ownership of a technical team.
- Deep expertise in Kubernetes and container orchestration platforms, with at least 4 years of hands-on experience.
- Strong experience with Terraform and Infrastructure as Code practices.
- Proven experience working with Oracle Cloud Infrastructure and managing cloud-based environments.
- Solid background building and maintaining highly available and fault-tolerant systems.
- Experience managing both SQL databases (particularly PostgreSQL) and NoSQL technologies.
- Strong knowledge of observability tools, including Prometheus, Grafana, exporters, monitoring, and alerting systems.
- Proficiency in automation and scripting using Python and Bash.
- Strong understanding of CI/CD pipelines, GitOps methodologies, and platform engineering concepts.
- Excellent troubleshooting skills with a structured approach to root cause analysis and reliability improvements.
- Proactive mindset focused on automation, operational efficiency, and continuous improvement.
- Strong communication and collaboration skills with experience working across engineering teams.
- Nice to have: Certified Kubernetes Administrator (CKA) certification.
- Nice to have: experience with AWS and Google Cloud Platform.
- Nice to have: ability to read and understand JavaScript, TypeScript, or Ruby codebases.
Benefits
- Fully remote work environment with flexibility to work from the location that suits you best.
- Competitive compensation package.
- Opportunity to lead and shape infrastructure strategy within a rapidly growing technology organization.
- Clear career development framework with performance reviews, mentoring programs, and advancement opportunities.
- Dedicated learning budget for professional courses, certifications, workshops, and training programs.
- Corporate English lessons and access to educational resources and online libraries.
- Private medical insurance and mental health support programs.
- Generous paid vacation, public holidays, and sick leave.
- Monthly flexible benefits budget that can be used for hobbies, sports, wellness, and personal interests.
- Regular team-building activities, workshops, and company events.
- Collaborative, low-bureaucracy culture that encourages autonomy, innovation, and ownership.
- Opportunity to work with modern technologies and contribute to large-scale cloud-native platforms.
Este anuncio proviene de ats_lever. Ver anuncio original ↗