Cloud Operations Specialist
BIP.com
Office
Tirana, Albania
Full Time
Enter Our World.
Transform With Us.
Become part of BIP – xTech, the BIP Center of Excellence specializing in consulting and innovative services in Big Data, Data Science, AI, Cloud, Blockchain, DeFi, IoT, and Networking.
We are looking for a Cloud Operations Specialist to join our Application Maintenance and Operations Cloud team.
Are you our new colleague?
Profile Description:
Highly motivated and detail-oriented operational professional, specializing in cloud infrastructure maintenance and monitoring. The ideal candidate will have a strong foundation in at least one major cloud provider (Microsoft Azure, Google Cloud Platform, Amazon AWS) and a proven ability to perform operational tasks to ensure high availability, security, and performance of cloud services.
Proficiency in Kubernetes for daily operational maintenance is essential, along with a solid understanding of core cloud components such as Virtual Machines (VMs), databases, storage, and networking services.
The role demands a 24/7 on-call commitment, based on work-shift managed inside the team, requiring constant monitoring, proactive issue detection, and swift response to incidents. Excellent English communication skills are crucial for effective collaboration within a global team.
Required Skills:
- Cloud Provider Expertise: Proficiency in at least one major public cloud platform (i.e. Microsoft Azure, Google Cloud Platform, Amazon AWS), a competence certification is preferred
- Operational Maintenance of Cloud Services:
o Monitoring & Alerting: Advanced experience with:
o Resource Monitoring: Utilizing Azure Monitor or Google Cloud Monitoring to track CPU, memory, and storage usage, and setting automated alerts for resource thresholds.
o Application Performance Monitoring (APM): Monitoring application response times, latency, and error rates to ensure optimal health and performance.
o Alert Configuration & Incident Response: Setting up multi-level alerts based on priority, with escalation paths and automated incident response workflows.
- Proactive Anomaly Detection: Using AI-driven tools to detect abnormal patterns in resource usage, network traffic, and application behavior.
- Logging & Auditing: Experience with Azure Log Analytics, Google Cloud Logging, and ELK Stack for managing and analyzing system logs
- Log Collection & Retention: Configuring log retention policies and organizing logs for long-term storage and compliance.
- Event Correlation: Analyzing logs across systems for root cause identification and compliance purposes.
- Backup & Recovery: Expertise in Azure Backup, Google Cloud Backup, and Disaster Recovery solutions, including regular backups, recovery testing, and managing retention policies.
• Kubernetes Operational Maintenance:
- Node Health & Scaling: Monitoring node status, scaling clusters, and managing node replacements or upgrades.
- Pod and Service Management: Handling pod deployments, resource monitoring, load balancing, and troubleshooting failures.
- Cluster Configuration Updates: Applying updates to cluster configurations, ensuring security patches, and optimizing performance.
- Security Maintenance: Implementing role-based access controls (RBAC) within Kubernetes, monitoring vulnerabilities, and applying updates.
• Operational Maintenance of Core Cloud Services:
- Virtual Machines (VMs): Provisioning VMs, monitoring resource usage, managing snapshots and backups, and applying OS patches.
- Database Management: Managing backups, monitoring query performance, applying security patches, and scaling databases like Azure SQL and Google Cloud SQL.
- Storage Management: Monitoring storage usage, configuring lifecycle policies, and securing access in services like Azure Blob Storage and Google Cloud Storage.
• Networking Services Configuration:
- Virtual Private Networks (VPNs): Managing VPN connections, monitoring traffic, and ensuring secure connectivity.
- Firewall & Security Groups: Configuring and managing firewall rules and security groups to control traffic and maintain network security.
- Load Balancing & Traffic Management: Configuring and monitoring load balancers (Azure Load Balancer, Google Cloud Load Balancing) for efficient traffic distribution and high availability.
- VPC/Subnet Management: Maintaining and configuring Virtual Networks (VNets) or Virtual Private Clouds (VPCs), subnets, routing tables, and IP schemes for optimized and secure networking.
- DNS Management: Configuring and maintaining DNS records using services like Azure DNS and Google Cloud DNS to ensure reliable domain resolution.
Why Bip?
- Growth & Development
- Over 300 courses on emerging technologies and business trends, tailored development programs, and training and people-care initiatives designed to support your professional and personal growth.
Flexibility & Work-Life Integration
Agile working with the possibility to plan remote and on-site days in coordination with your manager and project needs, a Solidarity Time Bank to donate or access additional leave hours in times of personal difficulty, and a culture that fosters balance between work and personal life.
Health & Benefits
Comprehensive health insurance, discounted medical check-ups, platforms dedicated to mental and physical well-being, and a supplementary welfare plan. Meal vouchers and other exclusive benefits are also included.
Family & Parenthood
Concrete support for new parents: 100% salary integration for the first 3 months of parental leave or a one-time bonus, additional leave days for fathers, and initiatives to support employees both during leave and upon returning to work.
Inclusion & Values
We embrace individuality and ensure equal opportunities for all. We are committed to fostering an ethical, fair, and welcoming workplace, including active policies supporting protected categories (L. 68/99).
Next Steps
Once we receive your resume, we will take the time to evaluate it carefully.
If there’s a match with this or other open positions within the Group, we’ll reach out to start getting to know each other.
About Us
Founded in 2003, we built on the historical expertise of consulting and added two key ingredients: innovation and digitalization.
Thanks to this journey, we now have over 5,000 professionals, operating in 13 countries, with over 4,500 completed projects and the latest expertise in Digital Transformation, Data Science, Cybersecurity, Industry 4.0, IoT, and disruptive technologies, which we leverage across all market sectors.
We help our clients make a difference by creating large-scale quality through a formula focused on three pillars: Value, People, and Technology.
We believe in the value of excellence, which guides our actions, and we adopt an ethical and fair approach towards everyone who chooses to work with us, fostering an environment where people can grow together through the blending of diverse skills.
Cloud Operations Specialist
Office
Tirana, Albania
Full Time
September 23, 2025