Distributed Systems Engineer (m/w/d)
Daisytuner is building the software layer for the next generation of computing. We make complex software run efficiently on any processor—from CPUs and GPUs to novel accelerators—using a self\-learning compiler and cloud\-scale optimization infrastructure.
Our team brings together researchers and engineers from RWTH Aachen, TU Munich, TU Darmstadt, and ETH Zurich to tackle some of the hardest problems in systems software, compilers, and infrastructure.
We are building a continuous benchmarking and optimization platform for heterogeneous compute. Similar to how CI/CD systems continuously build and test software, our infrastructure continuously benchmarks, tunes, and learns from workloads running across CPUs, GPUs, and accelerators.
As a Distributed Systems Engineer, you will design and build the infrastructure that powers this platform. You will work on job orchestration, distributed execution, scheduling, data processing, and backend systems that allow our compiler infrastructure to operate at scale.
What you will be doing
- Build and evolve our distributed runner infrastructure across bare\-metal and cloud environments
- Design systems that schedule, execute, monitor, and recover benchmarking and optimization workloads
- Improve reliability, scalability, observability, and resource utilization across the platform
- Develop backend services and data pipelines that connect users, compiler services, and distributed workers
- Integrate performance\-critical native components into backend systems
- Help shape the architecture of our continuous benchmarking and optimization platform as we scale
- Work closely with compiler engineers to turn research ideas into production systems
- Background in Computer Science, Software Engineering, or a related field (degree or equivalent experience)
- Strong software engineering fundamentals
- Experience building production backend systems
- Experience with distributed systems, scheduling systems, orchestration platforms, CI/CD infrastructure, workflow engines, or similar backend infrastructure
- Experience reasoning about reliability, failure handling, concurrency, and scalability
- Experience working with databases and high\-volume data processing
- Strong programming skills in at least one backend or systems language (TypeScript, Go, C\+\+, Rust, Java, Python, or similar)
What we offer
- A small, highly technical team with direct impact on core technology
- Competitive compensation and potential equity participation
- The opportunity to work at the intersection of compilers, distributed systems, and heterogeneous computing
- Real ownership over infrastructure and architectural decisions
- The chance to build foundational technology for the future of compute
We are an equal opportunity employer and welcome applications from people of all backgrounds. We value diversity and believe that different perspectives make us stronger. We do not discriminate based on gender, nationality, ethnic origin, religion, disability, age, sexual orientation, or identity.
Job Types: Full\-time, Permanent
Work Location: Hybrid remote in Darmstadt
Diese Anzeige stammt von indeed. Originalanzeige ansehen ↗