Returning Candidate?

DCSE Support Engineer

DCSE Support Engineer

Job ID 
Posted Date 
Vadata, Inc
Position Category 
Operations, IT, & Support Engineering
Recruiting Team 

Job Description

Amazon is building some of the largest distributed systems in the world, and we need smart people to support and engineer the next generation of compute and storage platforms. Amazon’s Data Center Support Engineering (DCSE) group provides escalations support worldwide with focus on continuous improvement. We have high standards for our infrastructure as well as our employees, and our systems are highly reliable, highly available, and turn scale into an advantage for our business and an asset to our customers. Our employees are super smart, driven to serve customers, and fun to work with.
As a Support Engineer with DCSE you will help troubleshoot, diagnose, and support massive distributed systems. You are a point of escalation for complex issues to dive deep for solutions as well as provide guidance and knowledge-shares. You will work directly with the various service owners and hardware design teams to collaborate on hardware issues within the fleet. You drive the team to improve operational efficiency for all services through root cause and trend analysis with the identification and development of SLA, metrics, monitors, procedures, tools, and documentation. You think proactively and work to prevent support issues before they are realized. You participate in design reviews to identify and mitigate support risks. You work closely with development and QA teams to improve the change management life-cycle. You have a deep understanding of production architecture and are familiar with operating system best practices. You identify, communicate and drive small production architecture changes. You design and develop complex high performing scripts and applications. You work with other Amazon leaders to share ideas and improve support within the company. You take a role in the strategic direction of the team. You play a significant role in hiring, mentoring, and training employees. You demonstrate excellent judgment when making decisions.
  • Help develop tools to identify and remediate hardware issues
  • Drive operational efficiency improvements over hardware fleet
  • Initiate service improvements in the production environment
  • Handle and troubleshoot support incidents within SLA’s
  • Assist in developing methods for incident reduction
  • Monitor various data sources for unidentified fleet issues
  • Participate in on-call rotation and provide after-hours support
  • Collaborate with outside teams to resolve customers issues
Shift work is required. Travel may be required up to 10% of the time.

Basic Qualifications

This position requires the applicant selected to obtain and maintain a Top Secret security clearance with Sensitive Compartmented Information (TS/SCI) eligibility and access. A US Government administered polygraph examination will be required. TS/SCI eligibility is not required to start; however, the applicant selected will be subject to a Single-Scope Background Investigation (SSBI) and must meet eligibility requirements for access to classified national security information. Applicants with a current SSBI, SBPR, or PPR, may be eligible for crossover in accordance with ICPG 704.4.

  • 3+ years overall development/technical support experience
  • Experience with support procedures and methodologies
  • Strong understanding of x86/x64 hardware platforms and components
  • Experience with large database driven websites and web technologies
  • 2+ years experience with a UNIX/Linux operating system
  • 1+ years scripting experience (e.g. Perl, Shell)
  • Experience with networking
  • Highly organized and able to coordinate with other team members
* Data Center’s are 24x7 environments. All interested candidates must be prepared to work on-call.
* Data Center’s are 24x7 environments. All interested candidates must be willing to work rotating shifts (day/night) as well as weekends. Data Center Technicians are also required to wear a pager for communication, information, emergency, and on-call situations.
* All candidates must possess a valid driver's license, their own vehicle, and will need to pass a driving record check to operate company vehicles. Travel between sites/locations during the workday may occur.
* Employment for all candidates is contingent upon pass a background check.

Preferred Qualifications

  • Bachelor’s Degree in a technical field of study
  • Excellent hardware troubleshooting skills
  • Excellent documentation skills
  • Understanding of production monitoring and metrics
  • Experience writing complex scripts
  • Experience in a 24/7 production environment
  • ITIL Certified

Amazon is an Equal Opportunity-Affirmative Action Employer – Minority / Female / Disability / Veteran / Gender Identity / Sexual Orientation.