Job Description
We are looking for a visionary Senior AI/LLM Architect to lead our R&D division in San Francisco. As we prepare for the 2026 technological landscape, you will architect scalable, high-performance machine learning systems that define the next generation of human-computer interaction.
In this role, you will bridge the gap between cutting-edge research and production-grade engineering, ensuring our AI models are not only state-of-the-art but also ethical, efficient, and safe.
Responsibilities
- Architect Design: Design and implement robust, scalable ML pipelines for Large Language Models and generative AI agents.
- Research Integration: Translate theoretical research papers into production-ready code and deployable models.
- Performance Optimization: Optimize model inference latency and throughput to support real-time applications.
- Team Leadership: Mentor junior engineers and data scientists, fostering a culture of innovation and technical excellence.
- Collaboration: Partner with product managers and designers to define AI capabilities for future roadmaps.
Qualifications
- Education: Masterβs or Ph.D. in Computer Science, Machine Learning, or a related field.
- Experience: 5+ years of experience in AI/ML engineering, with a focus on NLP and Deep Learning.
- Technical Stack: Proficiency in Python, PyTorch, TensorFlow, and distributed computing frameworks (e.g., Ray, Kubernetes).
- Knowledge: Deep understanding of Transformer architectures, attention mechanisms, and fine-tuning strategies.
- Communication: Excellent ability to communicate complex technical concepts to non-technical stakeholders.