System Development Engineer

US-VA-Haymarket
4 months ago
Job ID
443573
Vadata, Inc
Position Category
Operations, IT, & Support Engineering
Recruiting Team
North American Teams - AWS

Job Description

Amazon is building some of the largest distributed systems in the world, and we need smart people to support and engineer the next generation of compute and storage platforms. Amazon’s Data Center Operations Support Engineering (DCOSE) group provides escalations support worldwide with focus on continuous improvement. We have high standards for our infrastructure as well as our employees, and our systems are highly reliable, highly available, and turn scale into an advantage for our business and an asset to our customers. Our employees are super smart, driven to serve customers, and fun to work with.
Data Center Operations DevOps Engineering (DCODE) is a team of crafty engineers within DCOSE that focuses on continuous improvement through infrastructure automation and tool development. As a System Developer Engineer with DCODE you will utilize Problem Management to dive deep, analyzing data for trends and systemic issues, then follow our Software Development Life Cycle (SDLC) to develop solutions or effect changes to eliminate problems from our environment. You enjoy developing front-end applications and dashboards that would enable the business to make informed decisions. Your creativity and understanding of the business needs will drive agile development to keep up with ever-changing customer demand. You will also support the underlying infrastructure that hosts our applications through Availability, Performance and Capacity Management. You will work directly with the various service owners and hardware design teams to collaborate on hardware issues within the fleet. You think proactively and work to prevent support issues before they are realized. You participate in design reviews to identify and mitigate support risks. You work closely with development and QA teams to improve the change management life-cycle. You have a deep understanding of production architecture and are familiar with operating system best practices. You identify, communicate and drive small production architecture changes. You design and develop complex high performing scripts and applications. You work with other Amazon leaders to share ideas and improve support within the company. You take a role in the strategic direction of the team. You play a significant role in hiring, mentoring, and training employees. You demonstrate excellent judgment when making decisions.
Responsibilities

  • Help develop tools to identify and remediate hardware issues
  • Enable the business through development of front-end reporting tools
  • Drive operational efficiency improvements over hardware fleet
  • Initiate service improvements in the production environment
  • Handle and troubleshoot support incidents within SLA’s
  • Assist in developing methods for incident reduction
  • Monitor various data sources for unidentified fleet issues
  • Participate in on-call rotation and provide after-hours support
  • Collaborate with outside teams to resolve customers issues

Basic Qualifications

General

  • Understanding and evaluating risk with cost benefit analysis Business
  • Strong Communication skills both written and verbal, able to work across teams and stake holders

Software Development

  • A System Development Engineer must be able to independently develop and maintain a project with little to no supervision. In some cases, they may be the lead on a smaller project. Is responsible for identifying and defining the finer details in the project’s architecture and is responsible for implementing such requirements.
  • Experience with Software Development Life Cycle processes
  • On top of the basic coding skills you should be proficient in common data structures (stack vs queue/linked lists/trees), be able to design a datatype and also implement overloaded operators, API development, and debugging.
  • Have a strong software development skill set, preferably Python or Ruby, with JavaScript experience
  • Coding/Implementation
  • Being able to translate requirements into working functioning code
  • Developing code that is portable
  • Developing testing protocols
  • Server maintenance
  • Documentation
  • Software design and planning
  • Gathering requirements

Systems Administration

  • Experience installing, implementing and maintaining a webservice
  • Solid understanding troubleshooting Linux operating systems
  • Understanding how to query and modify databases

* Amazon.com Data Center’s are 24x7 environments. All interested candidates must be prepared to work on-call.

* All candidates must possess a valid driver's license, their own vehicle, and will need to pass a driving record check to operate company vehicles. Travel between sites/locations during the workday may occur.

* Employment for all candidates is contingent upon pass a background check.

Preferred Qualifications

Preferred Qualifications
  • Bachelor’s Degree in a technical field of study
  • Experience building applications on Amazon Web Services
  • Experience working in an ITIL environment
  • Big Data experience
  • Experience deploying or managing servers in large-scale, geographically diverse environments.
  • Deep understanding of enterprise level server and storage hardware components
  • Exposure to Linux internals and operating system fundamentals
  • Able to read core files and perform crash dump analysis
Amazon is an Equal Opportunity-Affirmative Action Employer – Minority / Female / Disability / Veteran / Gender Identity / Sexual Orientation.
Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed