AI Quality Engineer

1 day ago


Muscat, Muscat, Oman Elile AI Full time

About the Role

We are seeking a highly skilled AI Quality Engineer to join our National Large Language Model (LLM) Project. This key role will focus on establishing and implementing robust data quality frameworks, evaluation methodologies, and quality gates throughout the LLM development lifecycle. The ideal candidate will ensure our Arabic LLM meets the highest standards of performance, reliability, and cultural appropriateness before being deployed to 20,000 government employees.

Key Responsibilities

  • Design and implement comprehensive data quality frameworks specific to Arabic language datasets for LLM training and evaluation
  • Establish and enforce quality gates at each project phase (data preparation, model training, evaluation, and RAG implementation)
  • Develop detailed acceptance criteria for each phase gate requiring formal sign-off from key stakeholders
  • Create and implement quality metrics for data annotation, achieving >90% inter-annotator agreement and >95% cultural/contextual accuracy
  • Design and maintain data pipeline quality assurance processes for Arabic text normalization, diacritics standardization, and dialect variation mapping
  • Implement Arabic-specific tokenization optimization with >98% vocabulary coverage and >95% morphological accuracy
  • Develop comprehensive RAG quality measurement frameworks covering both retrieval metrics
  • Establish automated monitoring systems for continuous quality assessment with real-time dashboards
  • Create and enforce testing protocols for model evaluation across various Arabic language tasks
  • Implement robust regression testing frameworks to ensure model updates maintain or improve quality metrics
  • Develop protocols for bias detection and mitigation in both training data and model outputs
  • Support the implementation of benchmarking against global standards
  • Design human evaluation frameworks to assess model outputs qualitatively
  • Collaborate with data annotation teams to ensure high-quality ground truth data
  • Participate in weekly quality committee meetings and bi-weekly RAG performance reviews
  • Create and maintain quality documentation including processes, guidelines, and acceptance criteria

Requirements

  • Bachelor's or Master's degree in Computer Science, AI, Machine Learning, or related field
  • 4+ years of experience in AI/ML quality assurance, with specific focus on natural language processing
  • Strong understanding of LLM evaluation methodologies and benchmarking techniques
  • Experience establishing quality gates and acceptance criteria for AI systems
  • Hands-on experience with data quality frameworks and validation techniques
  • Experience implementing multi-level annotation review processes with clear metrics
  • Proficiency in designing data pipeline quality assurance systems for Arabic language processing
  • Experience with RAG quality assessment covering both retrieval and generation components
  • Ability to establish and track performance metrics against benchmarks.
  • Experience implementing automated testing frameworks and continuous integration for ML systems
  • Strong knowledge of bias detection and fairness assessment in AI systems
  • Familiarity with Arabic language and NLP challenges specific to Semitic languages
  • Experience with human evaluation protocols and annotation quality assessment
  • Proficiency in Python and relevant testing/quality assurance libraries
  • Understanding of statistical analysis techniques for model evaluation
  • Experience with data annotation platforms and quality control mechanisms
  • Knowledge of responsible AI practices and ethical considerations

Preferred Qualifications

  • Experience with LLM evaluation specifically for government or enterprise applications
  • Knowledge of Arabic-specific LLM benchmarks
  • Experience with RAG system evaluation and quality assurance
  • Familiarity with platforms like Scale AI, Humanloop, or other annotation/evaluation systems
  • Experience with hallucination detection and factual consistency verification
  • Knowledge of prompt engineering and prompt quality assessment
  • Experience with MLOps and quality gates in CI/CD pipelines for ML
  • Proficiency with data lineage tracking and documentation
  • Experience implementing A/B testing frameworks for model comparison
  • Familiarity with user experience testing for AI applications
  • Experience with security and privacy testing for AI systems
  • Knowledge of ROUGE, BLEU, BERTScore, and other NLP evaluation metrics
  • Experience creating custom metrics for domain-specific tasks
  • Experience participating in quality governance committees

What We Offer

  • Opportunity to contribute to a nationally significant AI project
  • Competitive compensation package
  • Collaboration with world-class AI teams and researchers
  • Professional development opportunities in cutting-edge AI quality assurance
  • Chance to establish quality standards for Arabic language AI
  • Work with advanced language models and state-of-the-art evaluation techniques
#J-18808-Ljbffr
  • AI Quality Engineer

    6 days ago


    Muscat, Muscat, Oman Prana Tree Full time

    Head of Technical Program Management & Talent AcquisitionJob Title: AI Quality EngineerAbout the Prana Tree LLC :Prana Tree LLC is an innovative IT consulting firm specializing in building next-generation business applications. We are committed to leveraging cutting-edge technologies to develop scalable, high-performance solutions for our clients across...


  • Muscat, Muscat, Oman beBee Careers Full time

    AI Quality Assurance EngineerThis role is focused on ensuring the highest standards of performance, reliability, and cultural appropriateness in our Large Language Model.Data Quality Frameworks: Design and implement comprehensive data quality frameworks specific to Arabic language datasets for LLM training and evaluation.Quality Gates: Establish and enforce...


  • Muscat, Muscat, Oman Elile AI Full time

    About the RoleWe are seeking an experienced AI Expert and Project Manager to join our National Large Language Model (LLM) Project, replacing ChatGPT usage in the workplace. As a key technical advisor, you will provide expertise across the full LLM stack, from model training and fine-tuning to deployment and RAG implementation.Key ResponsibilitiesProvide...


  • Muscat, Muscat, Oman beBee Careers Full time

    NLP Quality EngineerWe require a highly skilled NLP Quality Engineer to support our National LLM Project, with a specific focus on Arabic language processing.The ideal candidate will design and implement Arabic-specific data quality frameworks for LLM training and evaluation, establish and enforce quality gates across all phases, and define acceptance...

  • AI Engineer

    2 weeks ago


    Muscat, Muscat, Oman PhazeRo Full time

    About UsPhazeRo stands at the forefront of AI innovation, dedicated to bridging the gap between advanced technology and business solutions. We work across various sectors, such as Energy, Finance, and BioAI, to empower our clients with intelligent automation and data-driven insights. Our team is made up of visionary thinkers and expert engineers who are...

  • AI Developer

    5 days ago


    Muscat, Muscat, Oman beBee Careers Full time

    Artificial Intelligence EngineerWe are seeking an Artificial Intelligence Engineer to support the design, development, and deployment of AI models and solutions across various business domains.This role involves working with senior engineers and data scientists to process data, build machine learning pipelines, and implement AI algorithms that drive business...

  • AI Model Architect

    2 weeks ago


    Muscat, Muscat, Oman beBee Careers Full time

    **AI Engineering Expertise Wanted**About UsWe are seeking an experienced AI Engineer to join our dynamic team. In this role, you will design, implement, and maintain AI systems and applications that enhance our product offerings.Key Responsibilities:Architect and develop AI models and solutions that align with client requirements and business...


  • Muscat, Muscat, Oman beBee Careers Full time

    Arabic AI Quality Assurance SpecialistThis role involves establishing and enforcing quality gates at each project phase, designing and implementing comprehensive data quality frameworks specific to Arabic language datasets, and ensuring our Arabic LLM meets the highest standards of performance, reliability, and cultural...


  • Muscat, Muscat, Oman beBee Careers Full time

    Ambitious Technology Leader WantedWe are seeking an experienced professional to join our global Field Engineering team as an MLOps Field Engineer. In this role, you will work closely with enterprise sales leads to deliver solutions that enable the adoption of AI/ML in various industries.The ideal candidate will have a strong technical background, excellent...


  • Muscat, Muscat, Oman beBee Careers Full time

    Cloud Native AI/ML EngineerWe are seeking a highly skilled Cloud Native AI/ML Engineer to join our team. The successful candidate will be responsible for designing, implementing, and maintaining scalable, reliable, and automated MLOps pipelines for deploying large language models and other AI/ML models in cloud environments.Responsibilities:Design,...