Senior Site Reliability Engineer

Company Overview

Global Technology Services is a rapidly expanding organization based in Medellín, Colombia. We have one of the most influential networks in software development and IT services for the entertainment, financial, and logistics sectors. Our projected growth offers professionals many opportunities to advance their careers. Joining our team means working with large engineering teams across Latin America, the Philippines, and the United States, and contributing to cutting-edge developments in multiple industries.

Position Title

Senior Site Reliability Engineer

Location

Remote-US

What you will be doing

We are seeking a highly experienced Senior Site Reliability Engineer to own and evolve the reliability, security, observability, and operational maturity of our cloud platform. This is not a traditional SRE role. We are looking for an engineer who operates with an AI-native mindset and uses AI as a core operational force multiplier across infrastructure, incident response, automation, compliance, and operational excellence.

Required Skills & Experience

To excel in this role, you should possess:

AI-Native SRE Operations (Hard Requirement)

- Expert use of AI tools and agentic workflows to automate infrastructure and SRE tasks.
- Hands-on experience using AI for Terraform development, incident triage, log analysis, runbook creation, postmortems, operational automation, CI/CD pipeline generation, and reducing repetitive operational work.
- Strong understanding of AI capabilities, limitations, and necessary validation processes.
- Ability to clearly articulate AI workflows, tooling choices, operational safeguards, and production outcomes.

Cloud Infrastructure & AWS (Hard Requirement)

- 10+ years managing production infrastructure for SaaS platforms, including 5+ years of senior AWS ownership.
- Deep expertise with AWS services such as ECS, VPC, IAM, RDS, S3, CloudFront, Route53, Lambda, API Gateway, CloudWatch, Secrets Manager, and related security and governance services.
- Advanced Terraform experience managing multi-account, multi-workspace infrastructure.
- Strong understanding of provider versioning, state management, drift detection and remediation, dependency management, and infrastructure blast radius analysis.
- Proven experience resolving production infrastructure drift safely.
- Significant experience leading production incidents as the accountable owner.
- Ability to operate calmly and effectively during high-severity outages.
- Proven experience authoring detailed postmortems and operational remediation plans.
- Strong understanding of operational risk management and production recovery procedures.

Observability & Monitoring

- Proven experience leading production incidents, driving root-cause analysis, and creating remediation plans.
- Strong background in observability, monitoring, logging, distributed tracing, and alerting using tools such as Grafana.
- Experience owning CI/CD pipelines, deployment strategies, infrastructure automation, and operational workflows.

Systems, Security & Compliance

- Strong Linux administration, containerization (Docker), networking, and scripting skills.
- Experience with security best practices, identity management (SAML, OIDC, SCIM), and compliance frameworks such as SOC 2, ISO 27001, HIPAA, or PCI.
- Comfortable working directly with auditors and maintaining compliance controls.

Nice to Have

- Experience supporting Spring Boot or JVM-based systems in production.
- Experience with runtime security or EDR tooling such as Falco.
- Experience automating joiner/mover/leaver identity workflows using SCIM and IdP tooling.
- AWS certifications, including AWS Solutions Architect Professional, AWS DevOps Engineer Professional, and AWS Security Specialty.
- Ability to read and debug Kotlin or Java backend services from an SRE perspective.
- React/NodeJS/Backstage developer experience.
- MuleSoft API Management experience.

Soft Skills

- Excellent verbal and written communication, able to convey ideas clearly.
- Highly autonomous and proactive, taking ownership of tasks.
- Adaptable to fast-paced, dynamic work environments.
- Responsive and reliable across channels, including email and Slack, consistently delivering results.
- Able to add immediate value to the client, contributing effectively from the first week.

Why you will love GTS

- Join a powerful tech workforce and help us change the world through technology.
- Professional development opportunities with international customers.
- Collaborative work environment.
- Career path and mentorship programs that will lead to new levels.

Join Lean Tech and contribute to shaping the data landscape within a dynamic and growing organization. Your skills will be honed, and your contributions will play a vital role in our continued success.

Lean Tech is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.

Common Interview Questions And Answers

1. HOW DO YOU PLAN YOUR DAY?

What this question is really asking: When do you focus and start working seriously? What hours do you work best? Are you a night owl? An early bird? Remote teams can be made up of people working different shifts around the world, so you won't necessarily be stuck with a 9-5 schedule if it's not for you...

2. HOW DO YOU USE THE DIFFERENT COMMUNICATION TOOLS IN DIFFERENT SITUATIONS?

When you're working on a remote team, there's no way to chat in the hallway between meetings or catch up on the latest project during an office carpool. Therefore, virtual communication will be absolutely essential to get your work done...

3. WHAT DOES "WORKING REMOTELY" REALLY MEAN TO YOU?

Many people want to work remotely because of the flexibility it allows. You can work anywhere and at any time of the day...

4. WHAT DO YOU NEED IN YOUR PHYSICAL WORKSPACE TO SUCCEED IN YOUR WORK?

With this question, companies are looking to see what equipment they may need to provide you with and to verify how aware you are of what remote working could mean for you physically and logistically...

5. HOW DO YOU PROCESS INFORMATION?

Several years ago, I was working on a team to plan a big event. My supervisor made us all work as a team before the big day. One of our activities was to find out how each of us processes information...

6. HOW DO YOU MANAGE THE CALENDAR AND THE PROGRAM? WHICH APPLICATIONS / SYSTEM DO YOU USE?

Or you may receive even more specific questions, such as: What's on your calendar? Do you plan blocks of time to do certain types of work? Do you have an open calendar that everyone can see?...

7. HOW DO YOU ORGANIZE FILES, LINKS, AND TABS ON YOUR COMPUTER?

Just like your schedule, how you track files and other information is very important. After all, everything is digital!...

8. HOW DO YOU PRIORITIZE WORK?

The day I watched Marie Forleo's film about separating the important from the urgent, my life changed. Not all remote jobs are fast-paced, but most of them are...

9. HOW DO YOU PREPARE FOR A MEETING AND PREPARE A MEETING? WHAT DO YOU SEE HAPPENING DURING THE MEETING?

Just as communication is essential when working remotely, so is organization. Because you won't have those opportunities in the elevator or a casual conversation in the lunchroom, you should take advantage of the little time you have in a video or phone conference...

10. HOW DO YOU USE TECHNOLOGY ON A DAILY BASIS, IN YOUR WORK AND FOR YOUR PLEASURE?

This is a great question because it shows your comfort level with technology, which is very important for a remote worker because you will be working with technology all the time...