Job Description
The Opportunity
We are building the infrastructure for the intelligent future. As a Senior Generative AI Architect, you will lead the design and deployment of next-generation Large Language Models (LLMs) and multimodal systems. Join our elite team in San Francisco to shape the technological landscape of 2026 and beyond.
Why Join Us?
β’ Work with cutting-edge technology in a fast-paced, high-impact environment.
β’ Competitive equity package and top-tier benefits.
β’ Collaborative culture focused on innovation, ethical AI, and scalability.
Key Responsibilities
You will be responsible for the full lifecycle of our AI initiatives, from prototype to production deployment.
Responsibilities
- Design, train, and fine-tune state-of-the-art Generative AI models (e.g., GPT, LLaMA, Claude) for enterprise applications.
- Optimize model inference latency and reduce token costs through advanced quantization and distillation techniques.
- Lead architecture reviews and mentor junior engineers in best practices for MLOps and Deep Learning.
- Collaborate with product and research teams to define roadmap requirements for future AI capabilities.
- Ensure model robustness, fairness, and data privacy compliance (GDPR/CCPA).
- Implement and manage CI/CD pipelines for model deployment using Kubernetes and cloud-native infrastructure.
Qualifications
- Ph.D. or Masterβs degree in Computer Science, Machine Learning, or a related quantitative field.
- 7+ years of professional experience in Software Engineering, with 3+ years focused on AI/ML.
- Deep expertise in Python, PyTorch, TensorFlow, or JAX.
- Proven track record of deploying production-ready NLP models at scale.
- Strong understanding of Transformer architectures, Attention mechanisms, and Reinforcement Learning from Human Feedback (RLHF).
- Experience with vector databases (Pinecone, Milvus) and RAG (Retrieval-Augmented Generation) pipelines.