We are seeking a Machine Learning Researcher to join our team and help advance the state of the art in human-centric generative video models. Your work will focus on improving expression control, lip synchronisation, and overall realism in models such as WAN and Hunyuan. You’ll collaborate with a world-class team of researchers and engineers to build systems that can generate lifelike talking-head videos from text, audio, or motion signals—pushing the boundaries of neural rendering and avatar animation. We are hiring remotely across the EMEA region.
Key Responsibilities
Research and develop cutting-edge generative video models, with a focus on controllable facial expression, head motion, and audio-driven lip synchronisation.
Fine-tune and extend video diffusion models such as WAN and Hunyuan for better visual realism and audio-visual alignment.
Design robust training pipelines and large-scale video/audio datasets tailored for talking-head synthesis.
Explore techniques for controllable expression editing, multi-view consistency, and high-fidelity lip sync from speech or text prompts.
Work closely with product and creative teams to ensure models meet quality and production constraints.
Stay current with the latest research in video generation, speech-driven animation, and 3D-aware neural rendering.
Must Haves
Strong background in machine learning and deep learning, especially in generative models for video, vision, or speech.
Hands-on experience with video synthesis tasks such as face reenactment, lip sync, audio-to-video generation, or avatar animation.
Proficient in Python and PyTorch; familiar with libraries such as MMPose, MediaPipe, and dlib, or with image/video generation frameworks.
Experience training large models and working with high-resolution audio/video datasets.
Deep understanding of architectures such as transformers, diffusion models, and GANs, as well as motion representation techniques.
Proven ability to work independently and drive research from idea to implementation.
Strong problem-solving skills and the ability to work autonomously in a remote-first environment.
Nice to Have
PhD in Computer Vision, Machine Learning, or a related field, with publications in top-tier conferences (CVPR, ICCV, ICLR, NeurIPS, etc.).
Familiarity with or contributions to open-source projects in lip sync, video generation, or 3D face modelling.
Experience with real-time inference, model optimisation, or deployment for production applications.
Knowledge of adjacent areas like emotion modelling, multimodal learning, or audio-driven animation.
Experience working with or adapting models such as WAN, Hunyuan, or similar video diffusion models.