TikTok USDS
2024 - Present
Site Reliablility Engineer, Cloud Infrastructure
Build, design and maintain production cloud infrastructure.
- Build, maintain, and optimize cloud automation solutions using Ansible, Terraform, cloud-init, and Python, reducing deployment times and increasing infrastructure consistency
- Design, provision, and manage server infrastructure across multi-cloud and hybrid environments, ensuring scalability and reliability
- Implement advanced load balancers with layer 3/layer 4 and layer 7 load balancers
- Create comprehensive monitoring and observability solutions using cloud-native tooling, Grafana, and Kibana, enabling proactive issue detection and rapid incident response
- Perform in-depth troubleshooting and root cause analysis across complex bare-metal and virtualized cloud infrastructure, minimizing service disruptions
- Cloud
- Networking
- SRE
- Load Balancer
- Python