Gen AI Architect
Are you interested in working with the World’s leading AI-first Quality Engineering Company? Ready to advance your career, team up with global thought leaders across industries and make a difference every day? Join us at QualityAI!
We are looking for a Gen AI Architect to join our growing team in United States!
Job Overview:
We are seeking a Generative AI Architect to lead the design, architecture, and delivery of enterprise-grade AI solutions powered by Large Language Models (LLMs), multi-modal AI, and Retrieval-Augmented Generation (RAG) pipelines. The ideal candidate will combine deep technical expertise in AI/ML systems with proven experience in enterprise architecture, ensuring solutions are scalable, secure, compliant, and aligned with business goals.
This role involves defining the technical roadmap for Generative AI initiatives, selecting and integrating AI frameworks, orchestrating model lifecycle management, and guiding cross-functional teams to deliver production-ready Gen AI solutions. You will be the go-to expert for translating high-level business needs into robust, future-proof AI architectures.
Key Responsibilities:
- Architect Gen AI Systems - Design and evolve architectures for LLM-powered applications, RAG workflows, multi-agent AI, and vector search integration.
- Technology Evaluation - Select and recommend AI frameworks, vector databases (Weaviate, Pinecone, Milvus), and orchestration tools (LangChain, LangGraph) that meet performance, scalability, and compliance needs.
- Prompt & Model Strategy - Define prompt engineering standards, fine-tuning approaches, and model governance guidelines for consistent and reliable outputs.
- Scalable API Design - Architect secure, high-performance RESTful APIs (e.g., FastAPI) for AI service integration.
- Data Architecture - Oversee the design and preparation of large, complex datasets (structured/unstructured) for training, fine-tuning, and inference.
- Cloud AI Integration - Architect and deploy AI workloads on AWS (Bedrock, SageMaker), Azure (OpenAI, ML), or GCP (Vertex AI) with multi-cloud readiness.
- Security & Compliance - Ensure solutions adhere to enterprise security policies, AI governance frameworks, and data privacy regulations (GDPR, HIPAA, SOC 2).
- Performance Optimization - Implement GPU optimization, model quantization, caching strategies, and distributed inference for real-time workloads.
- Leadership & Mentorship - Guide engineering and data science teams on best practices in Gen AI architecture, scalability, and ethical AI.
Required Skills and Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Engineering, or related technical discipline.
- 10+ years in software development/architecture, with 3+ years in AI/ML and at least 2 years in Generative AI system design.
- Proven experience architecting and deploying enterprise-scale LLM-based applications.
- Expertise in RAG techniques, vector database design, and semantic search optimization.
- Strong Python proficiency and familiarity with other enterprise languages (Java, C#, Go).
- Proficiency with Generative AI libraries and frameworks (LangChain, Hugging Face, Transformers).
- In-depth knowledge of REST API design, microservices, and event-driven architecture.
- Hands-on with multi-cloud AI services (AWS Bedrock, Azure OpenAI, GCP Vertex AI).
- Experience in MLOps, CI/CD automation (Azure DevOps, GitHub Actions, Jenkins, GitLab CI).
- Strong problem-solving, analytical, and communication skills.
Preferred Qualifications:
- Prior work with regulated industry data (finance, healthcare, insurance).
- Experience integrating multi-modal AI (text, image, audio, video) into enterprise solutions.
- Familiarity with open-source LLMs (LLaMA, Mistral, Ollama).
- AI and cloud architecture certifications (AWS ML Specialty, Azure AI Engineer Associate).
Benefits:
Why QualityAI?
QualityAI is an AI-first quality engineering company helping enterprises deploy and scale complex systems with greater confidence. Operating across data, models, platforms, infrastructure, and operational environments, the company provides assurance and engineering expertise that helps organizations ensure systems perform reliably in real-world conditions.
Formerly Qualitest, QualityAI supports global enterprises across regulated and technology-driven industries, combining deep engineering heritage with AI-enabled delivery, operational assurance, and lifecycle expertise to help clients achieve certainty at go-live.
- Be a part of a company who strives to support for diversity and inclusion in the workplace - we are one, we are many at QualityAI. Celebrate culture, share knowledge with engineers from around the globe, and inspire each other through our differences.
- Local and global opportunities - we offer you internal rotation and international mobility opportunities to grow your career.
- Clear view of your career and progression with the company - QualityAI is growing massively (since Jan 2021 - added more than 2000 engineers) and giving you the opportunity to grow with us.
- Work hard and play harder with our flexible and casual culture. Take a break from work and join an employee event, or enjoy the amenities and games provided from one of our Employees Centers.
- Never stop experimenting and learning with QualityAI Tech academy: 3000+ training courses, mentorship programs, technical tribes, sponsored certifications, leadership programs and much more.
- Earn bonuses via our Client Referral and Employee Referral Program’s. Refer and earn - tap your network for net-worth.
- A Competitive pay, the salary range for the role is $180,000 - $200,000.
If you like what you have read, send us your resume and let’s start talking!
Intrigued to find more about us?
Visit our website at https://www.QualityAIgroup.com/
If you like what you have read, send us your resume and let’s start talking!
LinkedIn: https://www.linkedin.com/company/qualityaigroup
Nearest Major Market: San Jose
Nearest Secondary Market: Palo Alto