KBR
Houston, TX (Remote)

Full-Time


Title:

Senior Cloud Systems Administrator (REMOTE POSSIBLE)

KBR is seeking a Senior Cloud Systems Administrator (SA) to join the Human Performance Research team. We are looking for someone who is an experienced generalist with knowledge in several different Information technology system disciplines such as hardware, software and network system integration, configuration, testing/troubleshooting, maintenance, and documentation.

The Senior Cloud Systems Administrator will provide Systems and Operations Administration support within the Agile Research and Development Integrated Systems (ARDIS) platform. The SA will work closely with a team of data architects, software engineers, data scientists and other disciplines to coordinate system/application installation, configuration, upgrades, monitoring and repairs. Must be able to work under minimal supervision, handling complex issues and problems, referring only the most complex issues to higher-level staff.

The Cloud SA manages the operational health, reliability, performance, security, and resiliency of infrastructure in support of daily operations. They enable modern cloud DevOps activities, provide advanced operational support, and perform problem resolution activities. They are evangelists for efficiency and work to continuously automate our cloud environment. They troubleshoot production issues, respond to incidents and collaborate with DevOps Engineers on projects to improve system resiliency and reliability. The ideal candidate will possess comprehensive knowledge of subject matter including networking architecture, system administration, and scripting languages.

~This position has the possibility to be remote for the right candidate~

***Must be U.S. Citizen***

Responsibilities:

+ Implement and manage cloud infrastructure and network, includes researching production issues

+ Deploy, manage, and assist architects in designing automated system implementation and configuration

+ Maintain performance metrics and monitoring for the application architecture and operations environment; create improvements where appropriate

+ Establish, measure, and optimize IT operations processes to promote stability, performance, and throughput

+ Work closely with DevOps engineers and operations DBAs to manage services with high resiliency, availability, and performance

+ Team with network and security personnel to implement cloud structure, connectivity, and security

+ Implement monitoring tools, develop automated provisioning, and develop self-healing automation

+ Troubleshoot operations and database issues for internal and external customers

+ Deploy application updates and manage infrastructure

+ Proactively monitor and address problems before they happen to improve system resiliency

+ Perform incident resolution and root cause analysis of critical outages

+ Implement solutions to systematic failures

+ Provide incident support, including after hours

+ Document the environments and processes; resolve audit findings and maintain logs

Required Experience, Education, & Skills:

+ Bachelor’s degree in computer science, engineering or related field

+ 5 to 7 years of related experience

+ 3 to 5 years of experience with cloud environments, Microsoft Azure and/or Amazon Web Services Associate or higher-level certification preferred

+ In lieu of formal education 13 – 15 years of related experience

+ Expert level experience with

+ Windows and Linux-based systems administration skills in a cloud environment

+ Network knowledge such as: SSL certs, CIDR blocks, DNS, SSH Keys, TCP dump, etc.

+ Managing Kubernetes and Docker

+ Monitoring tools such as ELK, AWS CloudWatch, etc.

+ Strong scripting skills with: PowerShell, Python, Perl, YAML, etc.

+ Accomplishment with continuous integration and delivery services including: GitLab, Jenkins, Azure DevOps, GitHub, AWS CodeBuild, etc.

+ Competency with CloudFormation templates or other Infrastructure-As-Code (IaC) tools.

+ In-depth knowledge of internet protocols – HTTP, SSH, RDP, etc.

+ Managing IAM accounts, security groups, permissions, and policies

+ Developing System Security Plans and reference documentation

Desired Experience & Skills:

+ Experience supporting Dept of Defense customers, sensitive/protected information

+ Knowledge of Rest and GraphQL

+ Familiarity of DoD Cloud System Requirements Guide (SRG)

+ Supporting platforms progressing through FedRAMP audit, accreditation, and certification

+ DoD Risk Management Framework and Authority to Operate process

KBR is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, disability, sex, sexual orientation, gender identity or expression, age, national origin, veteran status, genetic information, union status and/or beliefs, or any other characteristic protected by federal, state, or local law.

Recommended Skills

  • Administration
  • Agile Methodology
  • Auditing
  • Automation
  • Cloud Platform System
  • Cloudformation