Senior SQA Engineer (LLM) Remote Pakistan

TechSurge Inc • Karachi Division, Sindh • Posted June 02, 2026

About the Role

Responsibilities

  • Design and own the end-to-end QA strategy for the Conversational Banking Platform, covering functional, regression, performance, security, and AI-specific evaluation.
  • Build and maintain golden datasets, eval suites, and LLM-as-judge frameworks to validate conversational quality across intents, languages, and tenants.
  • Define the tenant onboarding QA gate: the certification checklist every new business unit must pass before going live.
  • Establish regression strategies for prompt changes, model upgrades, retrieval index updates, and guardrail policy changes.
  • Use Langfuse traces to drive evaluation: mine production failures, convert them into test cases, and close the loop with engineering.
  • Test NeMo Guardrails configurations against jailbreaks, prompt injection, off-topic drift, and false-positive over-blocking.
  • Validate governance and compliance behaviors: data residency, PII handling, regul...