Returning Candidate?

Systems Development - AWS Messaging Services

Systems Development - AWS Messaging Services

Job ID 
Posted Date 
Amazon Corporate LLC
Position Category 
Systems, Quality, & Security Engineering
Recruiting Team 
North American Teams - AWS

Job Description

Amazon Web Services (AWS) is the world leader in providing a highly reliable, scalable, low-cost infrastructure platform in the cloud that powers tens of thousands of businesses around the world! The messaging team owns and operates Simple Queue Service (SQS), which provides AWS customers with the cloud infrastructure for building highly scalable, asynchronous and fault tolerant distributed cloud applications. It’s a core architectural component of the critical systems for Amazon as well as many leading global enterprises running on AWS.

The messaging service and the team is growing fast, and is innovating in big and brand new feature areas. We are looking for a Systems Engineer who is obsessed with operational excellence, automation and high availability. How do you know if you are a good fit for us? You want to automate common and complex tasks in distributed fault-tolerant systems that operate at scale. You love dive deep into data to identify latency and availability root causes. You find data center build-outs, performance engineering, and other scaling activities to be a joy. Finally, you insist upon giving customers what they want: high quality, highly usable, always-on services.

In this position you’ll get to:
  • Work with developers to design, build, and manage massively scaled systems
  • Automate all aspects of systems management
  • Build distributed systems in new data centers and regions, and add/manage capacity in existing regions as our usage grows
  • Optimize the performance of our systems by analyzing and deploying new hardware configurations
  • Track the health of our services, identify problems, drive to root cause, and fix
  • Collaborate with some of the leading minds in distributed systems

Basic Qualifications

    • Bachelor’s Degree in Systems Engineering, Computer Science or related field, or relevant work experience
    • Experience in 24x7 production environments
    • 5+ years Linux experience and associated tools/languages
    • 3+ year experience building scripts, tooling, and automation for large-scale computing environments
    • 3+ year experience in at least two of: Python, Java, Perl, PHP, Ruby, Bash/Shell

    This position requires that applicant selected be a U.S. citizen and obtain and maintain a TS/SCI US Government clearance with polygraph. TS/SCI eligibility is not required to start; however, the applicant selected will be subject to a Single-Scope Background Investigation (SSBI) and must meet eligibility requirements for access to classified national security information. Applicants with a current SSBI, SBPR, or PPR, may be eligible for crossover in accordance with ICPG 704.4.

Preferred Qualifications

    • Experience with TCP/IP network troubleshooting and administration
    • Excellent troubleshooting skills at all levels, from application to network to host
    • Experience with systems management and monitoring software (home-grown or commercially available)
    • Experience with performance testing and tuning
    • Automation or monitoring framework experience, deployment or development
    • Experience with very large distributed systems such as multi-terabyte storage farms, and/or horizontally scaled request processing fleets
    • Experience with SQL scripts and database administration preferred
    • Advanced degree in computer science, mathematics, or a related field

    Amazon is an Equal Opportunity-Affirmative Action Employer – Minority / Female / Disability / Veteran / Gender Identity / Sexual Orientation