Chief Architect - GPU & Autonomous Cloud Kubernetes Orchestration (Remote)

Living Talent Company


Date: 4 weeks ago
City: Edmonton, AB
Contract type: Full time
Remote
Autonomous Cloud Orchestration Software

Optimize CPU & GPU

K8s Control Layer

  • Startup (revenue-generating, Series A)
  • Company size: 30
  • Future unicorn
  • REMOTE first culture
  • Smart, fun, low-ego team culture
  • Compensation: Base Salary US $250k+, Equity

Key Responsibilities

  • Architecture & Development: Kubernetes-based ML/AI optimization platforms
  • Leadership & Collaboration: with C-staff, product management, engineering, and design partners.
  • Communication: Create detailed architecture diagrams, documents, and presentations.
  • User Experience Focus: for Infrastructure Admin and MLOps staff.
  • Open Source Community: Stay actively involved with CNCF and related projects.
  • Enterprise-Class Solutions: Drive & deliver solutions for enterprise-class data, ML, AI applications.
  • FinOps & SRE Best Practices: FinOps for cloud financial management, modern SRE practices.

Qualifications

  • Entrepreneurial, Startup Experience
  • 10 years+ infrastructure level software architecture and development.

Extensive Experience

  • Linux, Virtualization platforms (hands-on)
  • AWS, GCP or Azure.

Strong Experience

  • Kubernetes-based ML/AI systems (Kubeflow, Kueue, KServe, GPU Operators, DRA, Karpenter)

Deep Knowledge

  • ML/AI use cases & customer stories of model development, training, inference, & hardware accelerator usage (CPU, GPU, TPU).
  • Modern cloud-native architectures (scalability, availability, reliability, security, observability).
  • Proven track record of delivering complex distributed systems.
  • Active involvement in open-source communities, particularly CNCF and related projects.
  • Strong leadership and team collaboration skills.
  • Excellent communication skills, both verbal and written.

Preferred Qualifications

  • Knowledge of additional ML/AI frameworks and tools.
  • Experience in DevOps practices and tools.
  • Certification in Kubernetes or related technologies.
  • Awareness of FinOps and SRE best practices
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.

How to apply

To apply for this job you need to authorize on our website. If you don't have an account yet, please register.

Post a resume

Similar jobs

Fairmont Gold Attendant (Part Time)

Fairmont Breakers Long Beach, Edmonton, AB
2 hours ago
Company DescriptionYour team and working environment:Edmonton's "Chateau on the River" For more than 100 years, Fairmont Hotel Macdonald has effortlessly delivered timeless luxury in the heart of downtown Edmonton. Nestled upon the North Saskatchewan River Valley, the hotel's charm and ever-evolving elegance has earned it the spotlight as one of the city's most sought after locations.  A storied past, an...

Financial Senior Manager, Scientific Research & Experimental Development (SR&ED)

MNP, Edmonton, AB
1 day ago
Inspirational, innovative and entrepreneurial - this is how we describe our empowered teams. Combine your passion with purpose and join a culture that is thriving in the face of change.Make an impact with our Scientific Research and Experimental Development (SR&ED) team as a Senior Manager (or Manager). This diverse team of professionals delivers customized tax strategies within a complex and...

Deli Manager (Chappelle Food Store - North Central Co-op)

Central Alberta Co-op Ltd., Edmonton, AB
1 day ago
We invite applications for the position of Deli Manager to join our Food Division Team at our Chappelle Food Store located in Edmonton, Alberta. We are looking for an individual that has had success in leading a retail team to pursue a career with our Association!Open Availability Is Required.Who we are:Co-op does business differently. As a co-operative, we believe in...