Description:
In this role, the successful candidate will focus on optimising the performance and efficiency of large-scale AI models, including large language models (LLMs) and other generative AI systems. This is a unique opportunity to shape the future of next-generation AI technologies within a dynamic and collaborative environment.
Key Responsibilities:
- Research and implement advanced optimisation algorithms to enhance the efficiency and performance of large model inference.
- Identify and address bottlenecks in the inference pipeline, developing innovative solutions for higher throughput, reduced latency, and more efficient memory utilisation.
- Conduct in-depth research on model quantisation techniques (e.g., INT8, INT4) to achieve optimal inference performance while maintaining model stability and accuracy.
- Design and refine speculative sampling algorithms to improve generation speed and output quality for large AI models.
- Translate cutting-edge research into deployable algorithmic tools and solutions for production use.
- Collaborate with internal engineering teams to seamlessly integrate optimised algorithms into scalable, real-world systems.
- Stay at the forefront of industry trends and advancements in AI and large models, fostering a culture of continuous innovation.
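To illustrate the kind of work involved, here is a minimal sketch of symmetric per-tensor INT8 post-training quantisation, one of the techniques named above. It is written in plain Python for clarity; function names and the toy weight values are illustrative, not part of any particular framework. Production work would use per-channel scales, calibration data, and a framework's quantisation toolkit.

```python
def quantize_int8(weights):
    """Symmetric per-tensor INT8 quantisation: map floats onto [-127, 127]."""
    scale = max(abs(w) for w in weights) / 127.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float values from the integer codes."""
    return [v * scale for v in q]

# Toy weight tensor (illustrative values only).
weights = [0.5, -1.27, 0.01, 1.0]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
# The worst-case rounding error of this scheme is half a quantisation step.
max_err = max(abs(a - b) for a, b in zip(weights, restored))
```

The accuracy/performance trade-off the role targets lives in choices like this: per-tensor vs. per-channel scaling, symmetric vs. asymmetric ranges, and QAT vs. PTQ calibration.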
Required Qualifications:
- Master’s degree or higher in Computer Science, Artificial Intelligence, Mathematics, or a related field (PhD strongly preferred).
- A minimum of 2 years of R&D experience in deep learning or a related field, with particular emphasis on large model optimisation.
- Strong knowledge of mainstream large model architectures (e.g., Transformers, LLMs) and their inference processes.
- Expertise in model quantisation techniques, including Quantisation-Aware Training (QAT) and Post-Training Quantisation (PTQ).
- Deep understanding of sampling and decoding strategies (e.g., top-k, top-p, temperature sampling) as well as speculative decoding, and of their optimisation strategies.
- Proficiency in deep learning frameworks such as PyTorch and TensorFlow, with experience in inference engines such as vLLM and SGLang.
- Advanced Python programming skills; experience with C++ or CUDA is advantageous.
- Solid foundation in algorithms and coding implementation.
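As a concrete picture of the speculative decoding work mentioned above, the sketch below runs one accept/reject round with toy categorical distributions standing in for a small draft model and the large target model. The accept rule (accept a drafted token with probability min(1, p_target/p_draft), otherwise resample from the residual distribution) is the standard speculative sampling scheme; all names and probabilities here are illustrative.

```python
import random

random.seed(0)  # deterministic for the example

def speculative_step(draft_p, target_p, vocab, k=4):
    """One round of speculative decoding with toy categorical models.

    draft_p / target_p: dicts mapping token -> probability (hypothetical
    stand-ins for a cheap draft model and the expensive target model).
    Returns the tokens kept from a k-token draft.
    """
    proposed = random.choices(vocab, weights=[draft_p[t] for t in vocab], k=k)
    accepted = []
    for tok in proposed:
        # Accept with probability min(1, p_target / p_draft).
        if random.random() < min(1.0, target_p[tok] / draft_p[tok]):
            accepted.append(tok)
        else:
            # On rejection: resample once from the residual distribution
            # max(p_target - p_draft, 0), then stop this round.
            residual = {t: max(target_p[t] - draft_p[t], 0.0) for t in vocab}
            z = sum(residual.values())
            if z > 0:
                accepted.append(random.choices(
                    vocab, weights=[residual[t] / z for t in vocab])[0])
            break
    return accepted

vocab = ["a", "b", "c"]
draft = {t: 1.0 / 3 for t in vocab}          # uniform draft model
target = {"a": 0.6, "b": 0.3, "c": 0.1}      # skewed target model
out = speculative_step(draft, target, vocab)
```

The appeal of the method is that accepted draft tokens cost only one target-model forward pass per round, while the accept/resample rule keeps the output distribution exactly that of the target model.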