Amazon Dedicated Cloud Engineer, Infrastructure Engineering Reliabilty and Operations (IREO)
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.Do you want to support the efforts of the Intelligence Community’s mission by providing the highest availability, scalability and security requirements possible? Are you passionate to learn how to operate services at scale while deep diving issues and harnessing your customer obsession? If so, then look no further!!!Amazon is building some of the largest distributed systems in the world, and we need astute people to support and engineer the next generation of compute and storage platforms for our customer. Amazon’s IREO (Infrastructure Reliability Engineering Organization ) team provides support worldwide with a focus on continuous improvement. We have exceptionally high standards for our infrastructure as well as our employees, and our systems are highly reliable, highly available, and turn scale into an advantage for our business and an asset to our customers. Our employees are exceptional, driven to serve customers, and fun to work with.The IREO team is looking for experienced people who are willing to own solutions, insist on the highest standards for our customers and can both think of the big picture but dive deep into solutions when the situation warrants. The skills and traits we are looking for include:- Have strong Linux/Unix Systems Administration knowledge to include reading, understanding and execution of shell scripts- Work with internal Software Development teams to drive improvement of the systems/services within the team's scope- Be a focal point for the escalation for complex issues and dive deep for solutions as well as provide guidance and knowledge-sharing with your teammates- Work directly with the various service owners and hardware design teams to collaborate on hardware issues within the fleet- Ability to understand, execute and improve documented procedures and effectively generate documentation to communicate status- Identify areas to improve operational efficiency for all services through root cause and trend analysis with the identification and development of SLA, metrics, monitors, procedures, tools, and documentation- Have, or be willing to learn, the capability to automate day to day tasks and develop/build software and/or servicesThis position requires that the candidate selected be a US Citizen and must currently possess and maintain an active TS/SCI security clearance with polygraph. The position further requires the candidate to opt into a commensurate clearance for each government agency for which they perform AWS work. Key job responsibilities- Support software services before they go live in Amazon Dedicated Clouds through activities such as testing, integration and establishing key infrastructure- Maintain software services once they are live by measuring and monitoring availability and overall health of the environment - Scale systems sustainably through mechanisms, automation, and evolve systems by pushing for changes that improve reliability and velocity- Actively involved in hiring and building the AWS team- Effective communication with internal customers, coordinating between teams, developers, escalation points- Periodically support a 24/7 on-call rotationAbout the teamWhy AWSAmazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses. Diverse ExperiencesAmazon values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying. Work/Life BalanceWe value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud. Inclusive Team CultureHere at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness. Mentorship and Career GrowthWe’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.BASIC QUALIFICATIONS- High school or equivalent diploma, or A+ or CND (Certified Network Defender) or Network+ or Security+- Current, active US Government Security Clearance of TS/SCI with Polygraph- 5+ years of administrative experience with Linux or cloud platform environments- Ability to read, understand and utilize existing scripts. ...