Jobs for People with MS: National MS Society

Job Information

MongoDB: Manager, Cloud Operations Engineering (Bengaluru, India)

MongoDB’s mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft Azure. Atlas allows customers to build and run applications anywhere—on premises, or across cloud providers. With offices worldwide and over 175,000 new developers signing up to use MongoDB every month, it’s no wonder that leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications.

Cloud Operations Engineers are responsible for building internal tools and automating processes. Day-to-day duties include creating and monitoring system alert dashboards, reviewing critical event and system logs, accessing the instances that underpin customers' production databases, and performing server administration duties, including performance troubleshooting. Applicants must be critical thinkers who are quick to detect, resolve, or escalate issues that are sometimes broad in scope and difficult to trace.

We are looking to speak to candidates who are based in Bengaluru for our hybrid working model.

Responsibilities

  • Help scale the Cloud Operations Engineering team with the strategic implementation and refinement of processes and tools

  • Provide career development feedback and advice to direct reports

  • Identify and measure team health indicators and performance metrics

  • Ensure proper team focus on priorities, objectives, and related deliverables

  • Collaborate with technical and non-technical teams across the company

  • Balance your time between leading your team, working on customer incidents and being involved in projects

  • Be a source of guidance and advice to your own team members and other teams within MongoDB

  • Build a relationship of trust with your team

  • Coordinate with a global team of Cloud Operations Engineers who are tasked with upholding our uptime guarantees to the MongoDB Atlas customer base

  • Participate in designing and building internal tools

  • Assist in scoping, designing and deploying systems that reduce Mean Time to Resolve for customer incidents

  • Monitor and detect emerging customer-facing incidents on the Atlas platform; assist in their proactive resolution

  • Automate internal processes, routine monitoring and troubleshooting tasks

  • Diagnose live incidents, differentiate between platform issues versus usage issues, and take the next steps toward resolution

  • Cooperate with our Product Management and Cloud Engineering organizations by identifying areas for improvement in the management applications powering the Atlas infrastructure

  • Coordinate and participate in a weekly on-call rotation, where you will handle short-term customer incidents (from direct surveillance or through alerts via our Technical Services Engineers)

Requirements

  • Management skills, with hands-on experience running small to mid-sized engineering teams in a rapid-growth environment

  • Strong diagnostic/troubleshooting process, with significant experience troubleshooting end-to-end technical issues in production environments

  • Experience supervising, leading, and monitoring the progress of software development projects

  • Patience, empathy, and a genuine desire to help others

  • Excellent communication skills, both written and verbal

  • Ability to think on your feet, remain calm under pressure, and find solutions to challenges in real-time

  • Experience as an on-call DevOps, SRE, or Cloud Operations engineer

  • Expertise with Linux system administration and networking technologies

  • Knowledge of database and distributed system operations and concepts

  • Knowledge of a wide range of web and internet technologies

  • Familiarity with Amazon Web Services and other Cloud infrastructure platforms (e.g. GCP, Azure)

  • Experience in monitoring, system performance data collection and analysis, and reporting

  • Ability to write programs and scripts that solve both short-term systems problems and long-term strategic objectives for the Atlas product

  • A CS/CE degree or equivalent experience

  • Experience with at least two of the following programming languages: Java, Go, Python, TypeScript

  • A keen interest in learning new skills and competencies

To drive the personal growth and business impact of our employees, we’re committed to developing a supportive and enriching culture for everyone. From employee affinity groups to fertility assistance and a generous parental leave policy, we value our employees’ wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it’s like to work at MongoDB, and help us make an impact on the world!

MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter.

MongoDB is an equal opportunities employer.

Req ID 1263073054
