SRE / Site Reliability Engineer
Charlotte, NC (On-Site)
Job Description:
Position: SRE Engineer
Location: Onsite(Charlotte, NC)
Duration: 12 Months
Client: Tcs/Vanguard
Visa : GC/Citizen
Job Description:
We are looking for a proactive and skilled Site Reliability Engineer (SRE) to join our dynamic team. The ideal candidate will be responsible for maintaining the reliability, availability, and performance of production systems with a strong focus on AWS infrastructure and incident management (MIM). You will play a key role in handling live production issues, attending MIM calls, and supporting our cloud-based systems.
Required Skills:
- 6+ years of experience as a Site Reliability Engineer, DevOps Engineer, or in a related production support role.
- Hands-on experience with AWS services (EC2, S3, Lambda, CloudWatch, RDS, VPC, etc.).
- Strong understanding of incident management practices, with experience attending and driving MIM calls.
- Solid experience with Linux/Unix systems administration.
- Proficiency in scripting languages like Python, Shell, or Bash.
- Experience with monitoring and alerting tools such as Datadog, CloudWatch, Prometheus, PagerDuty, etc.
- Familiarity with CI/CD pipelines and infrastructure-as-code tools like Terraform or CloudFormation.
- Strong communication skills and ability to coordinate across teams during high-pressure situations.
- SRE Enginer need to attend MIM calls ( major incident calls ) . AWS and production support.
Key Skills:
- aws