Principal AI/ML Engineer
Company Overview
team.blue is an ecosystem of successful brands working together across regions to provide customers with everything they need to succeed online. The group comprises 60 brands, with a team of more than 3,000 experts serving 3.5 million customers across Europe and beyond.
team.blue's brands are a mix of traditional hosting businesses, which offer domain names, email, shared hosting, e-commerce, and server hosting solutions, and specialist SaaS providers, which offer adjacent products such as compliance, marketing, and team collaboration tools. This broad product offering makes the group a one-stop partner for online businesses and entrepreneurs across Europe.
Position Overview
team.blue is building the AI layer that runs across one of Europe's largest digital-services ecosystems, powering hosting, domains, email, and SaaS for millions of SMBs. As Principal AI/ML Engineer you will be the senior technical authority on AI systems end-to-end: from model research and fine-tuning through agentic orchestration, real-time inference, and production reliability. This is not a research-only role and not an MLOps-only role. You will do both, setting technical direction, shipping production AI, and raising the bar across a team that is moving fast.
Key Responsibilities:
Agentic AI Systems
- Architect and evolve our multi-agent orchestration platform (currently built on Hermes / Multica), including plugin systems, tool-use pipelines, observability hooks, and channel adapters (voice, telephony, messaging)
- Design and implement voice AI pipelines with sub-300 ms end-to-end latency targets: STT (VibeVoice-ASR, Whisper), real-time TTS with streaming (VibeVoice-Realtime), VAD (Silero), and SIP/RTP telephony integration
- Build and maintain RAG pipelines with retrieval quality measurement, re-ranking, and hybrid search over vector and keyword indexes
- Define MCP server architecture and tool-use contracts across internal and third-party integrations
Model Development & Fine-Tuning
- Fine-tune and evaluate LLMs (LoRA, QLoRA, DPO) for domain-specific tasks including customer support, classification, and structured extraction
- Evaluate and benchmark model quality using automated evals, human preference data, and domain-specific metrics (WER, DER, cpWER for speech; RAGAS / LLM-as-judge for RAG)
- Manage model lifecycle: experiment tracking, versioning, reproducibility, and promotion to production
Observability & Reliability
- Own the AI observability stack: Langfuse tracing, span-level LLM call instrumentation, cost tracking, and quality regression alerting
- Define and enforce guardrails: hallucination detection, PII redaction, output safety scanning, and rate-limiting across multi-tenant deployments
Platform & Pipelines
- Build data ingestion, preprocessing, and feature pipelines supporting model training and continual learning
- Drive CI/CD for ML: automated eval gating, shadow deployments, canary releases, and rollback triggers
Technical Leadership
- Set architectural standards for AI systems across the group; conduct design reviews and own ADRs for major decisions
- Mentor ML engineers and applied scientists; grow the team's capabilities in production AI, not just prototype AI
- Collaborate with Product and Commercial teams to translate business problems into ML problem formulations with clear success metrics
- Engage with external research partners and track emerging work (arXiv, conference proceedings, open-source releases) to identify signals worth productionizing
Experience & Skills:
- 8+ years in ML Engineering, Applied AI, or Research Engineering, with at least 2 years in a lead or staff-level role
- Deep, hands-on experience with LLMs in production: fine-tuning, RLHF/DPO, prompt engineering, RAG, and tool use
- Fluent in Python and the core ML stack: PyTorch, Transformers (HuggingFace), PEFT/LoRA
- Real experience with LLM inference serving (vLLM, TensorRT-LLM, or TGI) in a latency-sensitive production environment
- Practical knowledge of agentic frameworks: multi-agent coordination, tool-call orchestration, context/memory management, and observability (Langfuse, Opik, or equivalent)
- Experience with speech AI (ASR/TTS pipelines) or real-time audio systems is a strong plus
- Solid understanding of MLOps: experiment tracking (MLflow/W&B), model registries, containerization (Docker/Kubernetes), and CI/CD for ML
- Awareness of LLM-specific risks (hallucination, prompt injection, data leakage, fairness, and privacy) and how to mitigate them in production
- Strong communication skills: you can write a crisp design doc, run a productive architecture review, and explain tradeoffs to a non-technical stakeholder
Nice to have
- Experience with voice pipelines end-to-end: VAD → ASR → LLM → TTS → SIP/RTP telephony
- Multi-hop RAG with self-consistency, chain-of-thought reranking, or RAPTOR-style hierarchical retrieval
- Familiarity with MCP (Model Context Protocol) server design and tool-use contracts
- Contributions to open-source ML projects or published work (arXiv, NeurIPS, ACL, Interspeech, etc.)
- Experience with multimodal models (vision-language, audio-language)
- Knowledge of quantization techniques (GPTQ, AWQ, GGUF) and their quality/latency tradeoffs
Right to Work
At any stage, please be prepared to provide proof of eligibility to work in the country where you are applying. Unfortunately, we are unable to offer relocation packages or visa sponsorship.
Come as you are
Everyone is welcome here. Diversity & Inclusion are at our core. Far above any technical competence, we value respect, openness, and trusted collaboration. We do not tolerate intolerance.
ESG
At team.blue, our commitment to caring for the environment and each other is at the heart of everything we do. Our latest impact report showcases our ongoing ESG efforts and ambitious sustainability goals.
The most trusted digital enabler
team.blue is a leading digital enabler for companies and entrepreneurs. It serves over 3.3 million customers in Europe and has more than 3,000 experts to support them. Its goal is to shape technology and to empower businesses with innovative digital services.