We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results

Site Reliability Engineer

Karsun Solutions, LLC
United States
12825 Worldgate Drive (Show on map)
Dec 02, 2025

Why Karsun?

Join Karsun Solutions to grow your career with the company transforming possible for the US Government.

At Karsun, collaboration drives our community. We're committed to building an environment where team members from diverse backgrounds can innovate, learn and grow with us. Here at Karsun, the only limit to your potential is the limit of your curiosity.

Join Team Karsun, and Find Your Next!

Summary:

As a Site Reliability Engineer, you will help build out and run production environments, automate operations and maintain and support infrastructure. Drive and establish Service level objectives (SLOs) and metrics to meet reliability expectations of multiple applications.

What You'll Be Doing:

Kubernetes, CI/CD & Platform Engineering

  • Develop and maintain applications on Kubernetes container platform using Helm charts, K8s configurations, and GitOps workflows for repeatable and consistent deployments.
  • Monitor and troubleshoot complex issues involving container networking, zero-downtime availability, scaling behavior, and cluster reliability.
  • Architect, deploy, and optimize resilient cloud-native systems in AWS using services EKS, Lambda, RDS, Aurora, S3, and VPC networking components.
  • Build self-service deployment capabilities for development teams, enabling application deployments through standardized pipelines.
  • Integrate security scanning tools (SAST, SCA, secrets detection, container scanning) into the build pipeline to ensure DevSecOps alignment.
  • Implement automated release strategies using blue/green, canary, feature flags, and zero-downtime deployment patterns.
  • Implement all infrastructure and configuration using Terraform, CloudFormation, CDK, or Ansible, ensuring consistent and repeatable deployments.
  • Develop robust Python, or Bash scripts to streamline operational tasks.

Observability, Monitoring & Security

  • Implement and manage observability stacks using CloudWatch, Prometheus, Grafana, ELK/Opensearch, Jaeger/Zipkin, or DataDog for full stack visibility.
  • Develop proactive alerting strategies to minimize false positives and ensure actionable notifications; establish performance dashboards to measure system reliability and drive continuous improvement
  • Conduct root cause analysis (RCA) for production incidents and drive long-term remediation through automated guardrails.
  • Partner with security teams to implement vulnerability management, patch automation, and continuous compliance monitoring.
  • Lead blameless post-incident reviews and drive implementation of resilient engineering patterns such as retries, graceful degradation, chaos testing, and redundancy strategies.

Collaboration & Leadership

  • Work closely with software engineers, architects, product owners, and security stakeholders to design reliable systems that support mission-critical government applications.
  • Coach development teams on cloud-native principles, observability, performance tuning, and infrastructure best practices.
  • Advocate for SRE/DevOps culture, driving automation-first mindset and continuous improvement across the engineering organization.

Required Qualifications:

  • Bachelor's degree in computer science, Engineering, or a related field and 8-10 years of relevant experience
  • 5+ years in SRE, Platform Engineering and DevOps supporting operations and maintenance for cloud-native, scalable, and highly available applications,
  • Expertise in scripting (Python, Bash, Go preferred).
  • Deep understanding of cloud computing platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Kubernetes).
  • Experience with monitoring, logging, and observability tools like DataDog, AWS Cloudwatch, Splunk etc.
  • Experience with service mesh technologies (Istio, Linkerd) and GitOps platforms (ArgoCD/FluxCD).
  • Knowledge of infrastructure as code tools (e.g., Terraform, Ansible) and CI/CD pipelines.
  • Experience deploying enterprise software within AWS Services such as EKS, RDS, EC2, Elastic Load Balancers, Lambda.
  • Strong problem-solving and analytical skills, with a keen attention to detail.
  • Ability to obtain and maintain a Public Trust clearance.

Preferred Qualifications:

  • AWS Professional-Level and/or Kubernetes Certification
  • Experience with chaos engineering, performance testing, or advanced networking.
  • Understanding of platform product-thinking and developer experience optimization
  • Experience supporting US federal government contracts

Things to Know:

Commitment to Non-Discrimination

All qualified applicants will receive consideration for employment without regard to disability, status as a protected veteran or any other status protected by applicable federal, state, local, or international law.

Salary Range

The proposed salary range for this role is $150,000.00 to $165,000.00 USD. The salary range provided is a good faith estimate representative of all experience levels. Karsun considers several factors when extending an offer, including but not limited to, the role, function and associated responsibilities, a candidate's work experience, location, education/training, and key skills.

Third Party Resumes: Karsun does not accept unsolicited resumes through or from search firms or staffing agencies. All unsolicited resumes will be considered the property of Karsun and Karsun will not be obligated to pay a placement fee.

Clearance Information

This position requires the eligibility to obtain a security clearance. The Defense Industrial Security Clearance Office (DISCO), an agency of the Department of Defense, handles and adjudicates the security clearance process. More information about Security Clearances can be found on the US Department of State government website: https://www.state.gov/m/ds/clearances/c10978.htm

Location

To be considered for this role, you must reside in one of the following states: CA, CO, DC, FL, GA, IL, MD, NJ, NY, NC, OH, OK, PA, SC, TX, VA, WV.

Applicants must be authorized to work in the United States on a permanent basis. Due to recent federal changes impacting visa programs, we are not currently considering candidates who require employment-based visa sponsorship (including H-1B).

Applied = 0

(web-df9ddb7dc-rwcm4)