We are seeking a Senior AI Engineer with strong expertise in both system engineering and AI technologies to build high-performance, scalable infrastructures that support AI services, including Computer Vision and LLM/NLP applications.
You will design, develop, and maintain the system architecture for large-scale AI inference, training, and fine-tuning systems, ensuring seamless integration with applications, data pipelines, and infrastructure components.
Requirements
Bachelor’s degree in Computer Science, Software Engineering, or a related field, plus:
- 5+ years of development experience, with at least 3 years in AI/ML-related projects.
- Expert in Python (including asyncio, concurrency, multiprocessing).
- Experienced in data preprocessing, feature engineering, and evaluation techniques.
- Adept at working with databases (including non-relational and vector databases).
- Proficiency in containerization and orchestration (Docker Compose, Kubernetes).
- Familiarity with microservices architectures and message brokers (Kafka, RabbitMQ).
- Capable of managing task queues and asynchronous processing (Celery or similar).
- Skilled in monitoring & observability stacks (Prometheus, Grafana, ELK).
- Committed to security best practices, performance optimization, and system scalability.
- Well-versed in cloud platforms, including deploying distributed and edge AI systems.
- Solid understanding of CI/CD, version control (Git), and agile workflows.
- Enthusiastic about experimentation, rapid prototyping, and learning new technologies.
Role-Specific Expertise
A. Computer Vision
Preferred qualifications:
- Solid foundation and strong fundamentals in Computer Vision.
- Hands-on experience with inference toolchains (TensorRT, ONNX, Triton).
- Proficiency with PyTorch or TensorFlow, including inference and training workflows.
- Experience developing and optimizing deep learning pipelines for image/video tasks.
- Familiarity with Stable Diffusion, GANs, Vision Transformers, and VLMs is a strong plus.
B. LLM / NLP
Preferred qualifications:
- Strong understanding of Natural Language Processing and LLMs.
- Familiar with LangChain, LlamaIndex, and vector databases (FAISS, Milvus, Qdrant).
- Experience with inference engines (vLLM, Ollama, TGI, Text Generation Inference).
- Familiar with LLM fine-tuning, serving, and optimization workflows.
- Experience in embedding management, tokenization, and prompt optimization.
- Understanding Retrieval-Augmented Generation (RAG) pipelines is a strong advantage.