Cloud Operations Engineer
We are seeking a motivated and customer-focused CloudOps Engineer to ensure the reliability, scalability, and performance of cloud-based systems running on AWS. This role is ideal for a hands-on cloud operations professional who thrives in a fast-paced environment and enjoys solving complex infrastructure, monitoring, and security challenges.
You will work closely with Leads, SREs, DevOps, and Security teams to ensure 24/7 operational excellence across AWS environments. You will help implement automated guardrails, drive observability and remediation, and continuously improve the reliability, security, and cost efficiency of cloud platforms.
Deliver remote CloudOps services, including incident management, SOC operations, AWS support, configuration, maintenance, and disaster recovery.
Monitor performance, availability, and health of AWS environments using tools such as CloudWatch, Datadog, and Opsgenie.
Provide proactive guidance and implementation support for monitoring and observability solutions.
Act as the primary point of contact for AWS Technical Support cases, managing cases end-to-end.
Perform detailed security investigations for SOC-related alerts and execute initial remediation for known issues.
Assist customers with AWS configuration questions and system integrations.
Collaborate with internal DevOps teams to resolve issues involving third-party tools and platforms.
Maintain clear, timely, and consistent communication with customers and internal stakeholders throughout incident and alert lifecycles.
Deliver exceptional customer service with a strong focus on responsiveness and quality.
Participate in on-call rotations and lead incident response for production issues
Perform root cause analysis (RCA) and drive post-incident improvements
Partner with CloudOps teams to improve operational maturity and reliability practices
Collaborate with DevOps and Engineering teams to embed reliability into CI/CD pipelines
Contribute to runbooks, reliability standards, and operational documentation
Support governance, compliance, and security requirements through automation
Requirements
Required Qualifications
Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
2–5 years of hands-on experience operating and supporting AWS cloud environments.
Experience working as a NOC, SOC, CloudOps, or Advanced Support Engineer.
Strong knowledge of AWS services, including EC2, S3, CloudWatch, Lambda, ECS, VPC, and EBS.
Strong experience in Linux and Windows environments in networking and security within production environments.
At least 1+ years of experience with monitoring and observability tools such as Datadog, Prometheus, Grafana, and CloudWatch.
Strong troubleshooting and problem-solving skills with the ability to learn independently.
High attention to detail, strong follow-up skills, and the ability to prioritize competing support requests.
Customer-centric mindset with a passion for service excellence.
Willingness to participate in an on-call rotation.
Ability to adapt quickly to new AWS services and evolving cloud technologies.
Excellent written and verbal communication skills in English.
Certifications in AWS, Datadog, or New Relic are preferred.
Prior experience working in a fast-growing, high-volume support environment.
- Impact: Play a pivotal role in shaping a rapidly growing venture studio with Cloud-driven digital transformation.
- Culture: Thrive in a collaborative, innovative environment that values creativity, ownership, and agility.
- Growth: Access professional development opportunities, and mentorship from experienced peers.
- Benefits: Competitive salary, wellness packages, and flexible work arrangements that support your lifestyle and goals.