Cloud Operations Engineer

We are seeking a motivated and customer-focused CloudOps Engineer to ensure the reliability, scalability, and performance of cloud-based systems running on AWS. This role is ideal for a hands-on cloud operations professional who thrives in a fast-paced environment and enjoys solving complex infrastructure, monitoring, and security challenges.


You will work closely with Leads, SREs, DevOps, and Security teams to ensure 24/7 operational excellence across AWS environments. You will help implement automated guardrails, drive observability and remediation, and continuously improve the reliability, security, and cost efficiency of cloud platforms.



Key Responsibilities
  • Deliver remote CloudOps services, including incident management, SOC operations, AWS support, configuration, maintenance, and disaster recovery.

  • Monitor performance, availability, and health of AWS environments using tools such as CloudWatch, Datadog, and Opsgenie.

  • Provide proactive guidance and implementation support for monitoring and observability solutions.

  • Act as the primary point of contact for AWS Technical Support cases, managing cases end-to-end.

  • Perform detailed security investigations for SOC-related alerts and execute initial remediation for known issues.

  • Assist customers with AWS configuration questions and system integrations.

  • Collaborate with internal DevOps teams to resolve issues involving third-party tools and platforms.

  • Maintain clear, timely, and consistent communication with customers and internal stakeholders throughout incident and alert lifecycles.

  • Deliver exceptional customer service with a strong focus on responsiveness and quality.

  • Participate in on-call rotations and lead incident response for production issues.

  • Perform root cause analysis (RCA) and drive post-incident improvements.

  • Partner with CloudOps teams to improve operational maturity and reliability practices.

  • Collaborate with DevOps and Engineering teams to embed reliability into CI/CD pipelines.

  • Contribute to runbooks, reliability standards, and operational documentation.

  • Support governance, compliance, and security requirements through automation.




Requirements

Required Qualifications

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.

  • 2–5 years of hands-on experience operating and supporting AWS cloud environments.

  • Experience working as a NOC, SOC, CloudOps, or Advanced Support Engineer.

  • Strong knowledge of AWS services, including EC2, S3, CloudWatch, Lambda, ECS, VPC, and EBS.

  • Strong experience with Linux and Windows, including networking and security, in production environments.

  • At least 1 year of experience with monitoring and observability tools such as Datadog, Prometheus, Grafana, and CloudWatch.

  • Strong troubleshooting and problem-solving skills with the ability to learn independently.

  • High attention to detail, strong follow-up skills, and the ability to prioritize competing support requests.

  • Customer-centric mindset with a passion for service excellence.

  • Willingness to participate in an on-call rotation.

  • Ability to adapt quickly to new AWS services and evolving cloud technologies.

  • Excellent written and verbal communication skills in English.

  • Certifications in AWS, Datadog, or New Relic are preferred.

  • Prior experience working in a fast-growing, high-volume support environment.


Signs You May Be a Great Fit 
  • Impact: Play a pivotal role in shaping a rapidly growing venture studio through cloud-driven digital transformation.
  • Culture: Thrive in a collaborative, innovative environment that values creativity, ownership, and agility.
  • Growth: Access professional development opportunities and mentorship from experienced peers.
  • Benefits: Competitive salary, wellness packages, and flexible work arrangements that support your lifestyle and goals.