Agentic & Generative Edge AI Optimization Engineer · Hybrid
Location:
Region: Munich
Country: Germany
Role:
- Industry: Engineering, Technology
- Job Type: Contract / Freelance
Working Arrangement: On-site
Rate: Negotiable
Application Details:
- Date Posted:
Start Date: ASAP
Expiry Date: 28/02/2026
Job Reference: 1812202516
Agentic & Generative Edge AI Optimization Engineer · Hybrid
Job title – Agentic & Generative Edge AI Optimization Engineer
Location – Munich or Hamburg
Site expectation – 3x onsite per week
Contract – 12 months
Rate – Open to rate expectations, happy to discuss
Required:
- 5+ years of experience with software/AI engineers + deep exposure to LLMs, VLMs or systems performance.
- LLM Quantization techniques.
- Proficiency with AI frameworks (PyTorch, TensorFlow) and agentic frameworks (LangChain, Google ADK etc.)
- Knowledge of AI toolchains (CUDA, TensorRT, TFLite etc).
- Strong embedded background and NPU accelerators, including software architecture, build systems and version control.
- Familiarity with build systems (YOCTO, OpenEmbedded)
- Solid programming experience with C, C++, Python, Bash.
- Excellent communication skills in English.
What You’ll Do
- Optimize LLMs and multimodal models for on-device deployment
- Apply advanced quantization (8-bit, 4-bit, mixed precision), pruning, and distillation techniques for optimized models on NPU targets.
- Accelerate inference performance
- Implement system optimizations such as speculative decoding and efficient algorithms tailored for edge environments.
- Engineer agentic AI capabilities for tiny agents
- Enhance small language models for edge deployment while ensuring safety principles.
- Work with inference engines and deployment frameworks
- Deploy optimized models using Ollama, llama.cpp, ONNX Runtime, and TFLite for efficient NPU inference.
- Benchmark LLMs and agentic systems
- Design benchmarking pipelines for Generative and Agentic AI systems on-device.
- Develop demonstrators and proof-of-concepts
- Build PoCs for industrial safety monitoring, in-cabin sensing, and other edge AI applications.
- Move key technologies from research into product solutions
- Translate advanced optimization techniques and agentic AI features into production-ready implementations and collaborate with product teams.
If this role is of interest to you, submit your CV below.
g2 Recruitment are committed to equality of opportunity for all applications from individuals are encouraged regardless of age, disability, sex, gender reassignment, sexual orientation, pregnancy and maternity, race, religion or belief and marriage and civil partnerships or any other characteristic protected by law.