Sr. Site Reliability Engineer Job at Talent Groups, Mckinney, TX

NDBndG55WWFRMU5XemtPR1JtSHVHdDk1Z2c9PQ==
  • Talent Groups
  • Mckinney, TX

Job Description

Senior Site Reliability Engineer (Contract to Hire)

Location: McKinney, TX (Hybrid, 2–3 days onsite)

Must be authorized to work in the U.S.

Overview:

Our client is seeking a Senior Site Reliability Engineer to lead platform reliability and traffic enforcement in a Kubernetes-hosted SASE (Secure Access Service Edge) environment. This role ensures high availability, observability, and fair multi-tenant traffic handling across distributed systems.

Key Responsibilities:

Platform Reliability & Operations

  • Own uptime (target: 99.99%) and stability of multi-region Kubernetes environments.
  • Architect resilient, scalable infrastructure with proactive capacity planning and automated remediation.
  • Lead incident response, root cause analysis, disaster recovery, and change management.

Observability & Monitoring

  • Build a full-stack observability pipeline (Prometheus, OpenTelemetry, Grafana, etc.).
  • Implement golden signals, tracing, and alerting to drive real-time performance insights.
  • Develop automation for issue detection and resolution.

Kubernetes & Infrastructure

  • Manage full Kubernetes lifecycle (upgrades, autoscaling, GitOps automation).
  • Integrate and optimize OpenStack-based infrastructure beneath Kubernetes.
  • Enforce security compliance, resource efficiency, and FinOps best practices.

Traffic Enforcement & Networking

  • Design a Kubernetes-native traffic control layer for per-tenant/session enforcement.
  • Implement CRDs, custom controllers, and service mesh (e.g., Istio, Linkerd) for dynamic policy management.
  • Operate SDN telemetry agents (Cilium Hubble, WireGuard) and integrate with observability stack.

Leadership & Strategy

  • Contribute to infrastructure architecture and reliability strategy.
  • Mentor team members and promote Kubernetes best practices.
  • Partner cross-functionally across engineering, security, and product teams.

Required Skills:

  • Kubernetes in production across multi-region architectures.
  • Observability tools: Prometheus, OpenTelemetry, Grafana, Jaeger, Loki.
  • Strong Linux networking (tc, nftables, WireGuard, iptables).
  • Infrastructure automation: Helm, Terraform, ArgoCD/Flux (GitOps).
  • Programming: Go (preferred), Python/Bash scripting.
  • Familiarity with OpenStack (Nova, Neutron, Ceph) and CNI (Cilium preferred).

Preferred Experience:

  • Service mesh deployment (Istio, Linkerd), multi-cluster tools (Fleet, Rancher).
  • Chaos engineering frameworks (Chaos Mesh, Litmus).
  • Developer platform abstraction on Kubernetes.
  • FinOps cost optimization practices.
  • Edge Kubernetes and NFV/SDN background.
  • Active participation in the Kubernetes community.

Job Tags

Contract work,

Similar Jobs

Coleman|Nourian

Contract Paralegal Job at Coleman|Nourian

 ...A legal department seeks a Contract Paralegal to assist with litigation cases. Experience in submission of all types of filing required (e-filing), including the state of PA and federal filings. The Paralegal will conduct legal research; prepare documents, reports, and... 

Delaware North

Runner, Petco Park Job at Delaware North

**The opportunity**Delaware North Sportservice is hiring seasonal Food Runners to join our team at Petco Park in San Diego, California. As a Food Runner, you will be responsible for expediting food from the kitchen to our guests as quickly as possible while responding... 

LHH Recruitment Solutions

Administrative Assistant Job at LHH Recruitment Solutions

 ...Job Description Job Description LHH is partnering with a company in the construction industry to search for a temporary to hire Administrative Assistant. Your main tasks will be greeting customers, answering phone calls, maintaining the office space, and managing... 

Arrow Exterminators

Real Estate and Home Inspections Specialist Job at Arrow Exterminators

 ...Job Summary The REI Specialist will work with Home Inspection companies, Home Builders and Property Management companies processing inspection requests. Job Responsibilities Act as liaison between Home Inspection companies, Home Builders and Property Management... 

Infosys

Senior C++ Developer Job at Infosys

 ...Infosys is seeking a Senior C++ Developer - This positions primary responsibility will be to translate software requirements into working and maintainable solutions within the existing application frameworks. The chosen candidate will apply technical proficiency across...