Question
Full-time
5-10

ML Engineering Team Lead - Sovereign AI Engineering

4/19/2026

You will lead and grow a team of ML engineers while setting the technical direction for the company's ML platform, including training pipelines and model serving. Additionally, you will contribute to critical systems, maintain operational excellence, and ensure the successful deployment of models across various environments.

Working Hours

40 hours/week

Company Size

51-200 employees

Language

English

Visa Sponsorship

No

About The Company
Dream is a pioneering AI cybersecurity company delivering revolutionary defense through artificial intelligence. Our proprietary AI platform creates a unified security system safeguarding assets against existing and emerging generative cyber threats. Dream's advanced AI automates discovery, calculates risks, performs real-time threat detection, and plans an automated response. With a core focus on the "unknowns," our AI transforms data into clear threat narratives and actionable defense strategies. Dream's AI cybersecurity platform represents a paradigm shift in cyber defense, employing a novel, multi-layered approach across all organizational networks in real-time. At the core of our solution is Dream's proprietary Cyber Language Model, a groundbreaking innovation that provides real-time, contextualized intelligence for comprehensive, actionable insights into any cyber-related query or threat scenario.
About the Role

At Dream, we redefine cyber defense vision by combining AI and human expertise to create products that protect nations and critical infrastructure. This is more than a job; it’s a Dream job. Dream is where we tackle real-world challenges, redefine AI and security, and make the digital world safer. Let’s build something extraordinary together.


Dream's AI cybersecurity platform applies a new, out-of-the-ordinary, multi-layered approach, covering endless and evolving security challenges across the entire infrastructure of the most critical and sensitive networks. Built as part of a broader sovereign AI platform, our technology is designed to operate in on-premise, private cloud, and air-gapped environments, enabling nations to maintain full control over their data, infrastructure, and AI capabilities. Central to Dream's proprietary Cyber Language Models are innovative technologies that provide contextual intelligence for the future of cybersecurity.


At Dream, our talented team, driven by passion, expertise, and innovative minds, inspires us daily. We are not just dreamers, we are dream-makers.


The Dream Job

It starts with you - a technical leader driven to build both the ML platform and the engineering team behind it. You care about reliable infrastructure, great developer experience, and growing engineers through real ownership. You'll set the technical direction for Dream's ML platform - training pipelines, model serving, feature stores, experiment tracking, and compute orchestration - shaping how models reach production across cloud and on-prem, including air-gapped deployments. A significant part of the platform supports large language models, with unique challenges across training, evaluation, and inference in mission-critical environments. You stay close enough to the codebase to debug production issues, unblock your engineers, and make sound architecture calls.

If you want to make a meaningful impact, join Dream's mission and lead the team that builds the ML platform driving Sovereign AI products - this role is for you.



The Dream-Maker Responsibilities

  • Set technical direction for the ML platform - training pipelines, model serving, feature stores, experiment tracking, and compute orchestration - through RFCs, prototypes, design reviews, and build-vs-buy decisions
  • Lead and grow a team of ML Engineers - hire, mentor, pair on hard problems, and raise the bar through code and design reviews
  • Contribute to critical systems, debug production issues, and maintain deep context on the codebase to inform technical decisions
  • Own operational excellence for model serving - set and enforce SLAs, run capacity planning, and keep compute costs predictable
  • Establish ML engineering standards - reproducible experiments, automated evals, model packaging, CI/CD for models, and observability
  • Support the full lifecycle of Dream's models - from training on domain-specific data to low-latency inference powering production systems
  • Work closely with Data Platform, AI, Data Science, and Product teams - translate business priorities into engineering work and manage cross-team dependencies
  • Measure and improve developer experience - deploy friction, onboarding time, CI turnaround - as seriously as model performance

The Dream Skill Set

  • 6+ years in software engineering, ML engineering, or platform engineering, with hands-on experience building and operating ML infrastructure at scale.
  • 2+ years leading an engineering team - hiring, mentoring, conducting design reviews, and shipping alongside your team
  • Engineering craft - Strong Python, distributed systems design, testing, secure coding, API design, CI/CD discipline, and production ownership.
  • ML platform & serving - Model serving frameworks (e.g., Triton, TorchServe, vLLM, Ray Serve); model packaging, deployment pipelines, and inference optimization
  • Training infrastructure - Distributed training pipelines (e.g., frameworks like PyTorch, JAX) experiment orchestration and reproducibility
  • ML lifecycle tooling - Feature stores, model registries, experiment tracking (e.g., MLflow, Weights & Biases); dataset versioning and lineage
  • Data pipelines - Building training and inference data pipelines; familiarity with tools like Spark, Airflow/Dagster, and streaming ingestion
  • Comfortable with AI coding tools like Cursor, Claude Code, or Copilot


Nice to Have:

  • Experience operating in constrained environments - on-premise, private cloud, or air-gapped deployments
  • Hands-on experience with simulation environments, synthetic data generation, or reinforcement learning workflows
  • Platform & infra - Kubernetes, AWS, Terraform or similar IaC, CI/CD, observability, incident response
  • Hands-on data science or applied ML experience

Never Stop Dreaming...

If you think this role doesn't fully match your skills but are eager to grow and break glass ceilings, we’d love to hear from you! 


Requirements

null
Key Skills
PythonMachine Learning EngineeringDistributed SystemsModel ServingKubernetesCI/CDAPI DesignInfrastructure as CodePyTorchJAXMLflowWeights & BiasesSparkAirflowObservabilityLeadership
Categories
TechnologySoftwareEngineeringData & AnalyticsSecurity & Safety
Apply Now

Please let Dream know you found this job on InterviewPal. This helps us grow!

Apply Now
Prepare for Your Interview

We scan and aggregate real interview questions reported by candidates across thousands of companies. This role already has a tailored question set waiting for you.

Elevate your application

Generate a resume, cover letter, or prepare with our AI mock interviewer tailored to this job's requirements.