Site Reliability Engineer (SRE)

Job Location US-VA-McLean
Posted Date 2 weeks ago(3/24/2020 10:34 AM)
Job ID
Clearance Requirement
Public Trust


Design. Disrupt. Repeat.
Be an agent of change on a team committed to achieving client-focused, mission-driven excellence. Steampunk is looking for an experienced Site Reliability Engineer with an appetite for taking on new challenges.

Who We Are
Steampunk is the explosive collision of human-centered design and traditional government contracting. An employee-owned company with a startup mindset and time-tested approaches tailored for the federal government, we’re passionate about creating solutions that are impactful, practical, scalable, and most importantly, that meet our clients’ ever-changing needs.
At Steampunk, we believe in disrupting the status quo and setting the pace in the ecosystem of government contractors, while repurposing tried-and-true methodologies. We believe in empowering our people to find creative solutions to intractable problems. We believe the best environment in which to grow and thrive is outside our comfort zone. While good design makes for a good product, we believe human-centered design makes for an excellent one.

We also believe effective teams are powered by diverse perspectives, backgrounds, and experiences. To that end, Steampunk is an equal opportunity employer committed to promoting diversity of race, gender, sexual orientation, religion, ethnicity, national origin, disability status, and protected veteran status, amongst our ranks.  Additionally, we participate in the E-Verify program.

Why Steampunk?
Our people are the very core of what we do; their expertise and hunger for new and exciting challenges fuel our relentless pursuit of mission success. As part of our team of “Punks,” you’ll test the status quo, explore new boundaries, and set the bar high for how government clients expect to engage with contractors.

Because we value our employees’ work/life balance (and believe those who work hard deserve to play hard), we offer a very competitive benefits package, including telework/flex scheduling, health/dental with orthodontics/vision insurance upon hire, paid time off with a sell-back benefit and carryover option, 11 Federal Holidays, 100% paid military leave, 100% 401(k) plan match upon hire, professional development/education reimbursement, all flexible spending accounts, and more


As a Steampunk Site Reliability Engineer (SRE), you will be responsible for working with program development teams, infrastructure and platform services teams, and traditional operations and maintenance teams to embrace and embody a shared responsibility for the reliability of the applications and infrastructure. As an SRE, your responsibility is to bridge the gap between traditional software development and COTS deployment teams and IT operations. This role is more of a proactive for of operations, maintenance and quality assurance. SREs are dedicated full-time to working with and at times contributing to software teams to improve the reliability, performance, and cost efficiency of systems in production, fixing issues, responding to incidents, and usually taking on-call responsibilities.

There are a wide variety of responsibilities you will be delivering in this role:

Monitoring systems to make sure you have the insight into system performance, health, availability and what is happening internally in the system.
Understand what to monitor based on the system(s) you are managing, where to store the monitoring data, who can access historical monitoring data, and how to look at the data to make determinations about future actions.
Provide monitoring and alerting on symptoms and not on outages.
When systems are not performing as expected, or when something is broken, alert the appropriate personnel and respond accordingly.
Think about systems - edge cases, failure modes, behaviors, specific implementations.
Prevent incidents from happening across an organization’s infrastructure.
Document actions so your findings turn into repeatable actions–and then into automation.
Improve the deployment process to make it as boring as possible.
Design, build and maintain core infrastructure pieces that allow source code repositories, binary repositories and automation framework to scale to support many thousands of concurrent users.
Plan the growth of source code repository, binary repository and automation framework.
Debug production issues across services and levels of the stack.
Have an urge to collaborate and communicate asynchronously.
Have an urge to document all the things so you don't need to learn the same thing twice.
Have an enthusiastic, go-for-it attitude. When you see something broken, you can't help but fix it.
Have an urge for delivering quickly and iterating fast.
Share our values, and work in accordance with those values.


Bachelor’s degree and at least 5 years of IT experience
Eligible to obtain and maintain and government security clearance
Experience in System Engineering in one or more areas including telecommunications concepts, computer languages, operating systems, database/Data Base Management System (DBMS) and middleware
Experience with source code and binary repository products and techniques (GitHub, GitLab, BitBucket, Artifactory, Nexus, etc.)
Experience with messaging systems, collaboration software, application-based firewall and proxy server(s), and operating systems
Experience with Linux and Windows operating systems, along with scripting tools and techniques such as Bash, CSH, KSH, ZSH, etc. and/or Powershell.
Experience with configuration management tools/systems like Chef, Terraform, Ansible, etc.

 Preferred Skills:
Knowledge and experience with Agile and DevSecOps methodologies
Knowledge and experience with NewRelic and/or other AIOps platforms
Have programming skills – Javascript, Ruby and/or Go
Experience with Nginx, HAProxy, Docker, Kubernetes or similar technologies
Able and willing to work independently and in a fast-paced environment with tight deadlines, with minimal supervision
Excellent interpersonal skills, as well as excellent communication skills, verbal and written to both technical and non-technical audiences that are in a geographically dispersed environment (conference calls, Skype, face-to-face)

About steampunk


Steampunk is a Change Agent in the Federal contracting industry, bringing new thinking to clients in the Homeland, Federal Civilian, Health and DoD sectors.  Through our Human-Centered delivery methodology, we are fundamentally changing the expectations our Federal clients have for true shared accountability in solving their toughest mission challenges.  As an employee owned company, we focus on investing in our employees to enable them to do the greatest work of their careers – and rewarding them for outstanding contributions to our growth. If you want to learn more about our story, visit


We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. Steampunk participates in the E-Verify program. 


Sorry the Share function is not working properly at this moment. Please refresh the page and try again later.
Share on your newsfeed

Need help finding the right job?

We can recommend jobs specifically for you! Click here to get started.