Sr. CloudOps Engineer
Looking for CloudOps heroes to support next-generation AI sports analytics!
As a Senior CloudOps Engineer, you will serve as the backbone of our AI and data processing infrastructure. You will design, build, and maintain a robust, secure, and scalable cloud environment that enables AI engineers to train models and allows Acme AI and partners to deliver cutting-edge sports analyses. This role is critical for translating high-volume video and data needs into a high-performance, automated, and reliable platform.
KEY RESPONSIBILITIES
General responsibilities include:
- Designing and implementing core cloud infrastructure components, including virtual servers, networks, and security protocols.
- Ensuring the cloud environment is secure, available, and scalable for massive data loads.
- Implementing Infrastructure as Code (IaC) principles using tools like Terraform.
- Defining and enforcing secure, private networking and security best practices across the cloud environment.
- Automating workflows for data ingestion, processing, and model training.
- Setting up tools to support the workflows of AI engineers.
- Developing and maintaining automated deployment pipelines (CI/CD).
- Managing orchestration tools such as Prefect, Dagster, or Airflow for complex data pipelines.
- Monitoring system performance, costs, and availability.
- Setting up monitoring for current pipelines and critical services.
- Implementing data quality tooling to ensure the integrity and reliability of sports analysis data.
The key outputs we are looking for are virtual servers, networks, security protocols, and automated deployment pipelines.
EXPERIENCE + QUALIFICATION
- Bachelor's or equivalent degree in software or a technical discipline, with 4-7 years of experience in CloudOps and related areas.
- Cloud Platform Expertise: Extensive experience with cloud platforms, with a strong preference for GCP (general principles must be transferable).
- Containerization & Orchestration: Proficiency in Kubernetes and Helm.
- Infrastructure as Code (IaC): Hands-on experience with Terraform and building scalable cloud infrastructures.
- Programming & Scripting: Strong expertise in Python.
- Tools & Development: Proficiency in Git, Docker, and REST API development.
- Data & Databases: Proficiency in databases and writing complex SQL.
- Workflow Orchestration: Experience with tools like Prefect, Dagster, or Airflow.
- Security: Knowledge of best practices and secure, private networking in cloud environments.
- Code Quality: Ability to write clean, maintainable code using Python, SQL, and Docker.
NICE TO HAVES
- MLOps Experience: Experience with tracking and monitoring tools like MLflow.
- Data Platform Design: Mastery of designing high-performance data platforms and architectures.
- Python Best Practices: Familiarity with static typing and pytest.
WORK DETAILS
This is a full-time position, where you’ll be working in a supportive, fast-paced environment with access to learning and development opportunities.
- Type: Full-time, onsite with some remote flexibility
- Schedule: Standard 8-hour days, five days a week, under contract
- Location: Acme AI Ltd, Level 5, House 385, Road 6, Mirpur DOHS, Dhaka 1216
- Salary: BDT 80,000-140,000 (offered based on experience and technical test result)
BENEFITS
- Market-competitive salary based on experience
- Festival bonuses
- Access to company health fund (upon eligibility; since health and hospitalisation insurance sucks!)
- Quarterly profit-sharing (upon eligibility)