Amazon¹s network is a key differentiator for Amazon Cloud Computing and Web Services (AWS), enabling the global operation of thousands of applications across hundreds of thousands of servers worldwide. The AWS Networking team develops and operates the network platform for all of Amazon including e-commerce products and cloud computing solutions. This platform is industry-leading for its efficiency, throughput and reliability, and it is critical to the success of hundreds of thousands of AWS customers.
Are you ready to own the network availability for the largest cloud network on the planet? The Network Reliability and Optimisation Manager is responsible for a team of 10-12 highly qualified network engineers who own operational escalations; in-depth troubleshooting and root cause analysis. This role has regional ownership for much of our global network. Our network reliability manager is expected to further enhance the highest level of availability AWS is well known for. They will identify and mitigate any network problem that could lead to customer impact. This team is the face of operations to our internal customers and assists with coordinated investigations. They continue to improve our operational environment through process and automation. Our leaders are technical and understand their environment to accurately represent operational issues and exercise high judgment during high severity events.
Our engineers, managers and leaders are innovators at heart; come join us and become integral to the technology company that is the past, present and future of real Cloud Computing.
Responsibilities Operational Excellence
As a manager within the Networking team you will be expected to drive operational excellence in everything we do. This includes creating sane processes and procedures to improve efficiency in our day-to-day tasks and projects.
As a Network Engineering manager you will be expected to drive quality into the metrics we report to assist us in focusing on the areas that give us the best ROI. This includes measurement of our issues, network capacity, vendor equipment/failures analysis and network performance. Technical Leadership
As a manager of a highly technical team, which has responsibility for operational availability of the Amazon global network, you will be expected to have a deep knowledge of your area. As part of your role, you will be required to review and approve network changes for your team. Additionally, you will on occasion need to develop a detailed, low-level understanding of network issues that do occur and to be able to represent those issues at operational management review meetings. Performance Management/Team Health
You will own all facets of performance and career management for the team. Regular one-on-one meetings with all team members are required. You will be expected to provide both technical and Œsoft skill¹ mentoring in order to maintain a well-rounded, world-class organization. This includes project management, quality audits and coordination of training sessions with senior-level engineers as well as day-to-day oversight of the team including scheduling of a 7x8x365 operational rota. Recruiting and Hiring
You will take the lead in hiring quality personnel who not only fit the needs of the current organization but also will allow the team to scale with platform and service growth. You will coordinate with Amazon and external recruiting staff to evaluate potential candidates, participate in initial phone screens and provide relevant guidance and feedback during on-site interview loops. You will also be responsible for ensuring that proper training takes place for all new hires. 12x7 Oncall
As the leader of this team, you will be expected to be part of a 12x7 management escalation rotation.
This is an amazing opportunity in terms of responsibility, interesting challenges and high visibility. We truly are looking for the highest quality candidates, so you should expect a rigorous interview process.