Sr. Site Reliability Engineer Job at Talent Groups, Mckinney, TX

NDBndG55WWFRMU5XemtPR1JtSHVHdDk1Z2c9PQ==
  • Talent Groups
  • Mckinney, TX

Job Description

Senior Site Reliability Engineer (Contract to Hire)

Location: McKinney, TX (Hybrid, 2–3 days onsite)

Must be authorized to work in the U.S.

Overview:

Our client is seeking a Senior Site Reliability Engineer to lead platform reliability and traffic enforcement in a Kubernetes-hosted SASE (Secure Access Service Edge) environment. This role ensures high availability, observability, and fair multi-tenant traffic handling across distributed systems.

Key Responsibilities:

Platform Reliability & Operations

  • Own uptime (target: 99.99%) and stability of multi-region Kubernetes environments.
  • Architect resilient, scalable infrastructure with proactive capacity planning and automated remediation.
  • Lead incident response, root cause analysis, disaster recovery, and change management.

Observability & Monitoring

  • Build a full-stack observability pipeline (Prometheus, OpenTelemetry, Grafana, etc.).
  • Implement golden signals, tracing, and alerting to drive real-time performance insights.
  • Develop automation for issue detection and resolution.

Kubernetes & Infrastructure

  • Manage full Kubernetes lifecycle (upgrades, autoscaling, GitOps automation).
  • Integrate and optimize OpenStack-based infrastructure beneath Kubernetes.
  • Enforce security compliance, resource efficiency, and FinOps best practices.

Traffic Enforcement & Networking

  • Design a Kubernetes-native traffic control layer for per-tenant/session enforcement.
  • Implement CRDs, custom controllers, and service mesh (e.g., Istio, Linkerd) for dynamic policy management.
  • Operate SDN telemetry agents (Cilium Hubble, WireGuard) and integrate with observability stack.

Leadership & Strategy

  • Contribute to infrastructure architecture and reliability strategy.
  • Mentor team members and promote Kubernetes best practices.
  • Partner cross-functionally across engineering, security, and product teams.

Required Skills:

  • Kubernetes in production across multi-region architectures.
  • Observability tools: Prometheus, OpenTelemetry, Grafana, Jaeger, Loki.
  • Strong Linux networking (tc, nftables, WireGuard, iptables).
  • Infrastructure automation: Helm, Terraform, ArgoCD/Flux (GitOps).
  • Programming: Go (preferred), Python/Bash scripting.
  • Familiarity with OpenStack (Nova, Neutron, Ceph) and CNI (Cilium preferred).

Preferred Experience:

  • Service mesh deployment (Istio, Linkerd), multi-cluster tools (Fleet, Rancher).
  • Chaos engineering frameworks (Chaos Mesh, Litmus).
  • Developer platform abstraction on Kubernetes.
  • FinOps cost optimization practices.
  • Edge Kubernetes and NFV/SDN background.
  • Active participation in the Kubernetes community.

Job Tags

Contract work,

Similar Jobs

IMS

Leadership Development Trainee - Entry Level Job at IMS

 ...proud to lead meaningful campaigns across the Bay Area. As a Leadership Development Trainee, you will begin your journey in a hands-on, entry-level role while receiving direct mentorship and training from experienced leaders. Youll build the foundational skills needed... 

United States Army

Military Intelligence Linguist Job at United States Army

 ...qualifications and position. Guaranteed promotion opportunities. Additional Career Opportunities: Upon successful completion of first term contract, you are guaranteed up to 5 interviews with your choice 1,200 industry leading organizations including Disney, Tesla, and Coca-... 

Selking International Trucks

Diesel Mechanic Job at Selking International Trucks

Selking International is looking for a Heavy-Duty Truck Technician at our Elkhart, IN Dealership. CDL is preferred but not required. Selking International is hiring all levels of skills and experience Job Description Perform Preventative Maintenance, General ...

Hoag Memorial Hospital - 4699 Jamboree Road92660

Travel Health & Wellness Exercise Coach - $1,803 per week Job at Hoag Memorial Hospital - 4699 Jamboree Road92660

Job Details Seeking a passionate and experienced Health Coach, specializing in Exercise and cross functional training in health and wellness. Play a pivotal role in helping patients understand the impact of movement on health. Collaborate with Physician and NP...

Jobs via Dice

Junior Python Developer Job at Jobs via Dice

 ...looking for a strong Consultant for one of our precious clients in USA. If you are interested kindly share profile atTitle: Python Developer /Location: Jersey City, NJ (Onsite)Job Type: Full TimeResponsibilities:Collaborate with other team members and develop new...