Site Reliability Engineer
ID: SRE-1
Description
About the Role
We are looking for a highly skilled Site Reliability Engineer to join our team and play a key role in driving innovation across the semiconductor industry. In this position, you will own the day-to-day reliability and operation of our systems and software infrastructure, working in close partnership with our development and engineering teams to ensure seamless deployment and consistent performance of our products. This is an opportunity to make a meaningful impact within a talented, experienced team. We offer a competitive salary and benefits package and a flexible work environment to support you in doing your best work.
Responsibilities
- Design, manage, and continuously evolve our infrastructure to support reliability and scale
- Drive infrastructure and configuration management automation to reduce toil and improve consistency
- Manage and optimize SLURM workload scheduling to maximize cluster efficiency and resource utilization
- Maintain and extend internal tooling, including a SLURM-based autoscaler, SLURM plugins, and CMDB tools
- Own and improve software delivery pipelines, automating deployment processes using GitLab CI
What We Offer
- Direct involvement in Series A, venture debt, and strategic model design from day one.
- Accelerated development path. Real growth potential as the Finance team scales.
- 4 days per week in the Barcelona office (city center), 1 day WFH
- Competitive package, including base salary and participation in the Virtual Share Plan.
- A collaborative, technical, and growth-oriented environment that values direct ownership and clear thinking.
Requirements
Strong experience in:
- Linux system administration (RHEL)
- Git
- Ansible (Configuration as Code)
- GitLab CI
- Docker ecosystem
- Bash, Python
- Networking knowledge
- Monitoring solution setup (Prometheus/Grafana)
- Strong problem solving and analytical skills
Experience in:
- SLURM - strong advantage
- Rust - strong advantage
- Distributed storage (Ceph) - strong advantage
- Podman
- Backup solutions
- ZFS
- VPN setup (Tailscale/Headscale)
- MikroTik
- Other scripting/programming languages
- EDA tools
- Windows system administration
- Identity management, authn/authz, OIDC, SAML
- Datacenter ecosystem knowledge near Barcelona
Seniority: Mid, Senior
This listing comes from Indeed.