via ats_lever · 15. Juni 2026 ·vor 1 Tag

Senior ML Engineer (Token Factory)

jobgether
Germany Vollzeit
62 weitere Jobs in Germany.
Lad deinen CV hoch und sieh, welche wirklich zu dir passen.
CV hochladen

Accountabilities:

  • Drive inference optimization efforts by identifying bottlenecks and implementing performance improvements across diverse LLM architectures, improving throughput and reducing latency and cost per token.

  • Contribute to the design and evolution of inference engines, including techniques such as speculative decoding, KV-cache optimization, and support for dense and MoE models.

  • Develop and productionize low-precision training and inference pipelines (e.g., FP8, MXFP4) to maximize efficiency on large GPU clusters.

  • Profile and analyze GPU workloads using modern tooling to identify performance constraints and guide architectural improvements.

  • Collaborate on scalable distributed training and inference systems, including sharding strategies, custom kernels, and hardware-aware optimizations.

  • Contribute to engineering best practices including testing, CI/CD, and maintainable production-grade ML systems.

Requirements


  • Strong understanding of machine learning fundamentals, particularly transformer architectures and large language models.

  • Hands-on experience profiling and optimizing GPU workloads using tools such as Nsight or PyTorch Profiler.

  • Deep knowledge of GPU architecture, including memory hierarchy and compute vs. memory trade-offs.

  • Familiarity with key LLM concepts such as attention mechanisms, RoPE, KV-cache, Flash Attention, and quantization techniques.

  • Experience with large-scale deep learning training, including distributed systems, sharding strategies, and custom kernel development.

  • Strong software engineering skills, with advanced proficiency in Python and modern ML frameworks.

  • Solid understanding of software engineering practices such as version control, CI/CD pipelines, and unit testing.

  • Strong communication skills with the ability to collaborate effectively in highly technical, cross-functional teams.
Benefits:
  • Competitive compensation package

  • Strong career development and continuous learning opportunities

  • Flexible work environment with high autonomy and ownership

  • Collaborative, innovation-driven engineering culture

  • Opportunity to work on frontier AI systems at massive scale

  • International, highly skilled, and diverse team environment

Der Markt für diese Art von Stelle

Ähnliche Angebote
62
Ingenieurwesen in Germany
Vollzeit
81%
der Ingenieurwesen-Angebote in Deutschland
Remote möglich
15%
der Ingenieurwesen-Angebote
jobgether

200 offene Stellen · Austria, Belgium, Denmark, France, Germany +11

📊 Ingenieurwesen · Deutschland
2.998
aktive Stellen
16.3%
Remote
Ø 4d
Ø online
Gefragte Skills
ExcelERPISOPythonAWSCI/CDSQLAzureAgileLean

Häufige Fragen

Wie viele Ingenieurwesen-Jobs gibt es in Germany?
Aktuell 62 Stellen im Bereich Ingenieurwesen in Germany auf AlmostHired, bei 20 verschiedenen Unternehmen. Unsere Daten werden täglich aktualisiert.
Bieten Ingenieurwesen-Stellen Home Office an?
15% der Ingenieurwesen-Angebote in Deutschland erlauben Remote-Arbeit, teilweise oder vollständig. Um gezielt nach Remote-Stellen zu filtern, nutze AlmostHired.
Wie erfahre ich, ob ich für diese Stelle passe?
Lad deinen CV hoch — unsere KI vergleicht dein Profil mit den Stellenanforderungen und zeigt dir einen präzisen Match-Score, inklusive passender und fehlender Skills.