Job Description
| Job Role | Site Reliability Engineer (SRE) |
| Duration | Long-term Contract (through 2026) |
| Location | Greenwood Village, CO 80111 |
| Work Model | Hybrid |
Onsite Expectations
- Contractors:
- Flexible onsite schedule
- Expected to come onsite a few times per week during ramp-up
- Once ramped up: 1 2 times per month
- Full-Time Employees:
Project Scope & Platform Overview
The Configuration Management (CMX) Team experimentation and configuration management platforms used across all customer-facing products.
Platform Responsibilities
- Internal A/B testing and experimentation platform
- Ensures:
- Safe product releases
- No negative customer impact
- Product enhancements drive measurable improvements
TDCS (Targeted Delivery Client Services) Platform
- Real-time configuration management tool
- Used across:
- Streaming applications
- Customer web portals
- Client-facing platforms
- Enables:
- Market-specific targeting
- Customer-specific experiences
- Controlled experiments and testing variants
Role Focus
- Primary support for TDCS platform
- Migrating infrastructure into a dedicated AWS account
- Maintain:
- High availability
- Low latency
- Platform is business-critical :
- Called multiple times daily by every application
- Directly impacts customer access to streaming/video services
Top Skills Required
- 6+ years of DevOps / SRE experience in large, complex environments
- Strong development background (ability to read and understand code)
- AWS
- Terraform (Infrastructure as Code)
- Kubernetes
- GitLab or similar CI/CD tools
- Datadog or similar monitoring tools
Job Description
The Applied AI and Data Science Program brings together data scientists, data engineers, and software engineers to empower Spectrum teams to safely release, test, and evaluate product changes. The mission is to deliver targeted, dynamic customer experiences while providing leaders with data-driven insights.
As a Senior Site Reliability Engineer , you will deploy, monitor, support, and optimize Charter's experimentation and configuration management platforms hosted on AWS. You'll work closely with software engineers, test engineers, and DevOps teams to ensure these mission-critical systems remain highly available and performant .
Key Responsibilities
Release Management
- Build and deploy application, service, and infrastructure releases
- Validate system integrity post-deployment
- Document release notes
Production Support
- Maintain 99.999% availability of critical systems
- Ensure smooth operation of infrastructure and applications
- Keep infrastructure resources up to date
- Participate in on-call rotation for incidents and outages
- Perform root cause analysis for production issues
Monitoring & Alerting
- Implement monitoring and alerting policies across systems
- Build and enhance dashboards
- Monitor:
- Errors and unexpected behavior
- Latency and resource consumption
- System degradation
- Proactively mitigate issues
- Alert stakeholders when SLAs are at risk
Optimization
- Manage scaling strategies aligned with project goals
- Optimize system behavior and resource utilization
Team Collaboration
- Assist with user support
- Act as the system architecture and deployment expert
- Coordinate with onshore and offshore teams
- Develop bug fixes as needed
Primary Qualifications
- Expertise with monitoring tools such as Datadog and/or Splunk
- Strong experience with AWS services (EKS, S3, DocumentDB, etc.)
- Experience supporting containerized microservices
- Proficient with Terraform and AWS Console
- Experience with performance benchmarking and testing
- Hands-on deployment of cloud-based applications
- Git-based source control experience (GitLab preferred)
- Bachelor's degree or equivalent experience
Secondary / Nice-to-Have Qualifications
- 6+ years of SDLC experience
- Familiarity with:
- Python, Node.js, React, TypeScript, GraphQL
- Experience with:
- SQL and NoSQL databases
- Docker, Kubernetes, Redis
- ORMs over relational databases
- Exposure to experimentation platforms and statistical testing
- Master's degree or higher
Thanks and Regards
Monu Singh Chauhan | 1Point System LLC
Technical Recruiter
monu.singh@1pointsys.com
LinkedIn: linkedin.com/in/monu-singh-chauhan-610857204
115 Stone Village Drive Suite C Fort Mill, SC 29708
An E-Verified company | An Equal Opportunity Employer
DISCLAIMER: If you have received this email in error or prefer not to receive such emails in the future, please notify by replying with a ''REMOVE'' in the subject line and your email address shall be removed immediately from the mailer list.
Job Tags
Long term contract, Full time, For contractors, Immediate start, Flexible hours,