Job Description
We are seeking a Technical Operations Lead with an AI-first approach to oversee and optimize our infrastructure. This is a high-level, pivotal role requiring a candidate who doesn’t just maintain systems but actively seeks to automate them using AI and modern DevOps practices.
You will ensure the stability, scalability, and cost-efficiency of our platforms while leading the charge in implementing AI-driven monitoring, automated remediation, and intelligent CI/CD workflows. The ideal candidate will possess deep hands-on experience with AWS, DevOps tools, and system administration across Windows and Linux environments.
Job Title
- Technical Operations Lead
Work Schedule
- Day Shift
Job Qualifications/Requirements
- Minimum of 7 to 10 years of professional working experience in IT/Technical Operations (excluding university and internship years).
- Proven track record of managing complex cloud environments at scale.
- Technical Proficiency:
• Cloud: Extensive experience with AWS services (RDS, Lambda, SQS, EC2, CloudFront, AWS Backup).
• Databases: High proficiency in relational databases (specifically MariaDB/MySQL).
• CI/CD: Advanced experience with Jenkins and modern containerization (Docker).
• Systems: Competence in managing both Windows and Linux server environments.
• Security: Hands-on experience with SSL certificate management and security best practices.
• AI Integration: Familiarity with using AI tools (LLMs, Copilots, or AI-based monitoring) to speed up troubleshooting and scripting.
- Preferred Skills:
• Infrastructure as Code (IaC) tools (e.g., CloudFormation, Terraform).
• Expert-level scripting (e.g., Python, Bash, PowerShell).
• Experience with Synology NAS or on-prem to cloud networking.
Job Responsibilities
- AI-Driven Infrastructure Management: Maintain and monitor AWS-based production and development environments, prioritizing the use of AI tools for predictive scaling and anomaly detection.
- Intelligent Deployments: Coordinate and execute production deployments, utilizing automated testing and AI-enhanced rollback procedures to ensure zero-impact releases.
- Proactive Issue Resolution: Diagnose infrastructure-related issues with a focus on “automated healing”—implementing long-term, self-correcting solutions rather than manual patches.
- Cost Optimization & FinOps: Leverage AI-driven analytics to identify and implement aggressive cost-reduction strategies across our AWS spend without compromising performance.
- Hyper-Automation: Refine CI/CD pipelines by automating every possible touchpoint, reducing manual intervention through scripting and intelligent orchestration.
- Cross-Functional Support: Provide expert technical assistance to development, QA, and customer support, acting as a bridge between high-level architecture and daily operations.
- Living Documentation: Maintain comprehensive infrastructure documentation (Deployment, Architecture, DR) using modern, searchable, and AI-compatible formats.
Good luck and God Bless!


