Sr Cloud Architect
Sr Cloud Architect
Heredia, CR, 40101
Job Description:
About the role:
A Senior Cloud Engineer is responsible for designing, implementing, and managing cloud infrastructure and services to align with organizational goals and technical requirements. This role involves developing cloud architecture strategies, deploying and managing cloud-based solutions, and automating processes to streamline operations. Ensuring the security and compliance of cloud environments is crucial, as is optimizing performance for efficiency and cost-effectiveness.
A day in this role:
- Provide technical leadership across multiple teams, from different business units, guiding architectural
- decisions and ensuring adherence to best practices
- Architect, build, document, and maintain Cloud standards and processes
- Collaborate extensively with centralized architecture, engaging in deep collaboration to align
- cloud strategies with enterprise architecture principles and standards. Additionally,
- demonstrate the ability to flex into Business units, understanding their processes and ways of
- doing things, and integrating cloud solutions to support their specific needs and objectives.
- Lead projects and new application implementations
- Create new Terraform architecture and modules to provision AWS resources
- Guide teams on best practices in the cloud
- Provide mentorship and training to junior and senior engineers, sharing knowledge and
- experience in cloud technologies, system wide architectural design, and best practices.
- Lead project management activities, including decomposing work into manageable tasks, coordinating with stakeholders, and tracking progress.
- Demonstrate expertise in Kubernetes, including cluster management, deployment strategies, and best practices for container orchestration.
- Work with Software Development teams to write and tune their application Helm charts for EKS
- Create, manage, and administrate Kubernetes running on EKS
- Possess advanced skills in AWS, including advanced networking, security,and optimization techniques,
- and stay current with new AWS services and features.
- Create and modify Jenkins pipelines to support CI and automation
- Performance Engineering, load testing, hotspot isolation, and remediation
- POC new solutions and production in the cloud
- Configure APM, SLO, SLA and alerting via Dynatrace
- Configure log metrics and analysis via Splunk
- Build and manage CI deployment process for all environments
- Support and enable teams to migrate from on-prem environments into AWSLearning: Research and explore new cloud technologies, services, and best practices to inform and enhance cloud strategies
The expertise requested:
- Expert level experience and understanding of Terraform
- Expert level diagraming and explaining complex or complicated systems
- Expert level experience with AWS serviceso EC2, ASG, SG, ALB/NLB/WAF, ACL, Routing, Route53, Express Connect/Transit Gateway, EC2
- Image Builder, EKS, ECS, ECR, Lambda
- Experienced in AWS database technologies
- Experienced using Git based CI/CD pipelines
- Experienced in utilizing various infrastructure and dependency discovery tools/agents
- Experienced writing Jenkins files and Jenkins Shared Libraries
- Expert level with EKS creation and administration
- Expert level with Kubernetes application deployment and management
- Experienced writing and maintaining custom application Helm charts and Helm template libraries
- Expert level written and oral communication skills
- Expert level scripting in languages such as powershell, Bash, Python, Perl, and/or GoLang
Desired or nice to have expertise:
- AWS big data services
- Explaining & presenting complex to complicated technical matters to technical and non-technical people
- Strong ability to decompose large complex bodies of work into actionable stories for others to engageand execute
- Ability to communicate to both technical and executive level people on problem & opportunity things and progress towards goals
- Experience with leading application migrations into the cloud according to best practices, standards and cloud-native architecture.
- You are passionate about building and running distributed systems at scale in production. You understand the challenges and trade-offs to be made when building and deploying systems top roduction.
- Create automation for anything that is repeatable and can be automated.
- Expertise in working with container deployment and orchestration technologies at scale with strong knowledge of the fundamentals to include service discovery, deployments, monitoring, scheduling, load balancing.
- Knowledge of best practices related to security, performance, and high availability.
- Highly skilled in identifying performance bottlenecks, identifying anomalous system behavior, and
- determining the root cause of incidents.
- Experience working with APM and log aggregation tools as well as configuring the integrations and monitoring needed to leverage these tools.
- You've demonstrated the ability to effectively work collaboratively across functions and communicate between many different teams.
- Interest in designing, analyzing, and troubleshooting large-scale distributed systems.
- Systematic problem-solving approach coupled with strong communication skills and a sense ofownership and drive.
- Ability to debug and optimize code and automate routine tasks
Our Core values are focus to inclusion and diversity, all qualified applicants will be considered for employment and will go thru a fair recruitment process regardless of their race, religion, gender identity, sexual orientation, national origin or disability status.
Your journey with us begin begins here!!