Your responsibilities will cover the entire lifecycle of an AI model, from receiving it from the research team to ensuring its efficient operation on the end device.

1. AI Model Optimization & Deployment:

Analyze trained AI models (from TensorFlow, PyTorch) to identify performance bottlenecks and resource requirements (memory, compute).
Convert models into inference-optimized formats such as ONNX, TensorRT, and TFLite.
Apply advanced optimization techniques, including quantization (INT8/FP16) and pruning, to reduce model size and accelerate processing speed while maintaining accuracy.
Profile and benchmark model performance on target hardware (e.g., NVIDIA Jetson, ARM CPUs) to ensure latency and throughput criteria are met.

2. Application Software Development:

Build high-performance applications and libraries in C++/Python to load, manage, and execute AI models in both Linux and Windows environments.
Develop end-to-end data processing pipelines, from pre-processing input data (images, video) to post-processing model outputs.
Create and maintain unit and integration tests to ensure the stability and accuracy of AI features.

3. Research & Improvement:

Stay current with the latest technologies, algorithms, and tools in embedded AI and efficient machine learning.
Research and experiment with new AI models to evaluate their feasibility and potential for product application.
Participate in troubleshooting, debugging, and continuously improving deployed AI systems to enhance performance and reliability.

4. Collaboration & Technical Support:

Collaborate closely with AI/ML scientists to understand model architectures and deployment requirements.
Work with hardware engineering teams to leverage specialized on-chip acceleration features.
Support other teams (such as QA and Product) in integrating and testing AI solutions.

Essential Qualifications:

3+ years of relevant professional experience.
Bachelor's or Master’s degree in Computer Science, Computer Vision, or a closely related technical field.
Proficiency in programming languages such as C/C++, Python, and/or CUDA.
Hands-on experience with edge device deployment technologies (e.g., TensorRT, TFlite, ONNX).
Experience in developing and deploying applications on Linux and Windows operating systems.
Solid understanding and practical experience with prominent AI frameworks (e.g., TensorFlow, PyTorch) for model development and training.
Experience or knowledge of deploying AI models directly onto specialized hardware chips.
Good command of the English language, both written and verbal.

Preferred Qualifications:

Relevant academic coursework, significant projects, or internships in Artificial Intelligence/Machine Learning or hardware optimization.
Active participation in technology communities, open-source contributions, or successful hackathon experiences.

Compensation & bonus: 13th & 14th salary, AIP bonus, Holidays, Tet, and Long year service …
Social insurance, Health insurance, Unemployment insurance: by Social Insurance and Labor Law
The regime of annual leave, company trip, and checkup examination
Award for marriage, newborn
We have AON insurance package for employee, spouse, and children every year
You will be trained, learned & work with the best technical managers who help you improve various dev skills & career path
You’ll love working in our dynamic environment employees, young & active
We love sport activities, as marathon, football, swimming,...
Working time: From Monday to Friday | 08:30-12:00 & 13:00-17.30