Job Description
Are you ready to architect the future of Intelligence?
Nexus Future Systems is seeking a world-class AI Infrastructure Architect to lead our mission in 2026 and beyond. In this pivotal role, you will bridge the gap between cutting-edge machine learning research and rock-solid, scalable cloud infrastructure. You won't just manage servers; you will build the digital nervous system for the next generation of autonomous agents.
Why Join Us?
We are operating at the intersection of generative AI and next-gen computing. If you are passionate about solving complex scalability challenges and want to define the infrastructure standards for the industry, this is your opportunity to shape the future.
Responsibilities
- Design and deploy high-performance, fault-tolerant GPU clusters for large-scale LLM training.
- Architect serverless inference pipelines to reduce latency and optimize cost-efficiency.
- Collaborate with Data Science teams to translate research models into production-ready services.
- Implement robust security protocols and data governance frameworks for sensitive AI models.
- Drive automation strategies for CI/CD pipelines specifically tailored for machine learning workflows.
- Evaluate and integrate emerging hardware technologies to accelerate model training times.
Qualifications
- 10+ years of experience in software engineering and infrastructure architecture.
- Deep expertise in Kubernetes, Docker, and container orchestration.
- Proficiency in Python, Rust, or Go for low-level system programming.
- Proven track record of optimizing cloud costs (FinOps) in AWS or Azure environments.
- Strong understanding of distributed systems, networking, and high availability architecture.
- Experience with MLOps tools such as Kubeflow, MLflow, or Airflow.