Lead Site Reliability Engineer
Why Valtech? We’re advisors, visionaries, creatives and techies. We embrace all things digital. We talk to each other. We have fun. We love our clients. We’re looking ahead. We are global.
We are looking for a Lead Site Reliability Engineer to join our network of more than 6,000 doers and makers working out of our 60+ offices in 22 countries.
Valtech has established itself as one of the largest independent global groups aiming at creating digital experiences that improve human lives while transforming the future of our clients’ businesses. We work with some of the world’s best-known brands from across the retail, manufacturing, distribution, mobility, travel, health, public, and finance sectors. And our promise to the market is that we get things done. Together. We transform by doing!
The ideal candidate's skills:
/ A proven background in reliability engineering, covering observability, resilience and performance
/ Commercial experience of observability, from data ingestion to alerting and issue resolution. Elastic is highly desirable, but AppDynamics, Datadog, etc. are also considered
/ Comfortable with logging frameworks, log manipulation and shipping
/ Capable of designing, testing and implementing resilience on new and existing AWS solutions
/ Performance testing is a bonus but not a must-have
/ Comfortable coding in one or more languages such as Python, Go, Java, NodeJS etc.
/ Commercial experience of using AWS Services including EC2, ECS, Serverless (Lambda)
/ You’ll know how to debug complex, high-availability production environments
/ Networking knowledge: load balancing, TCP/HTTP, etc.
/ Comfortable with Linux operating systems and able to create and maintain shell scripts
/ Demonstrable range of in-depth technical knowledge/experience of handling complex software and platform architectures
/ In-depth level of technical knowledge/experience in building cloud solutions that have security, reliability, scalability, high availability and concurrency built-in from the outset.
/ Background and relevant current experience in a hands-on Observability/SRE/Platform Engineering role is needed
/ Knowledge of IaaS deployment tools such as Terraform
/ Competent in using source control, preferably Git-based
/ Elastic Observability or OpenTelemetry experience
/ Working knowledge of continuous integration systems such as Jenkins and GitLab
/ Elasticsearch internals experience a big plus
/ Performance testing experience
/ AWS Certification
/ Docker development experience is desirable
As a Lead Site Reliability Engineer you’ll be expected to:
/ Develop and maintain Observability solutions using Elastic Observability and OpenTelemetry
/ Design, test and implement resilience on new and existing AWS solutions
/ Assist tribes with performance testing
/ Build and maintain solutions developed on AWS
/ Help the tribes enable observability features and develop solutions where none currently exist. Also document the process for future reference
/ Assist with the creation and maintenance of pipelines to manage the observability components
/ Monitor and report usage of our cloud solutions
/ Advise on the selection of the most appropriate technologies for the task
/ Ensure the delivery pipeline for your IaaS code has optimal quality controls built in to support testing, deployment, reporting and task management
/ Select appropriate quality controls to complete assigned tasks, including code-driven deployment, infrastructure deployment, automated testing and effective operational monitoring
/ Handle alerting and incident response
/ Supply appropriate information and analysis to support the resolution of issues and incidents with the tribes’ observability
/ As a consultant and a link between developers and our clients, you are expected to develop expertise both in technology and in communicating complex concepts and rationale to non-techies. We’ll encourage and support this with frequent opportunities to share ideas internally. We also have consultants who frequently speak at regional, national and global conferences
What do we offer in return?
- Private health insurance
We hope you will never need it, but nevertheless, we offer private health insurance to all our employees.
- Education program
We never stop learning, that’s why we offer our employees an educational program with training and certification.
- Wellbeing program
We all deserve to live a healthy and well-balanced life. It's not an option, it's a necessity!
- One simple rule
We really want you to enjoy your work, so we set up one simple rule: none of our teammates will work on the same project in the same industry for a very long time, and they will always be following the latest technology standards. You can never get bored working at Valtech!
- Work from home
Our jobs wrap around our lives – not the other way around. Wherever you feel you can be most comfortable and productive as well, we make sure to respect your choice.
- Social events
We enjoy spending time together, not only at work. Ski trips, karting, laser tag, wine tasting, picnics, cooking classes… you name it – we’ve done it! There are plenty of cool events to join and to get to know your colleagues.
- Company sponsored Multisport card
- Food vouchers
- Competitive conditions
Besides a competitive salary and 24 days of vacation, you will join annual company events with the whole team.
- Challenging projects
Ready for a challenge? We guarantee you’ll find challenging projects at Valtech!
- Cool colleagues
What’s the most important thing in a job? Cool colleagues with whom you spend most of your time during the week. We have a lot of them!
- Honest feedback
Honesty, openness and respect are among our core values. We encourage an open feedback culture in order to build trust and grow together.
25 vacation days
... and a lot of fun and growing opportunities
Our company values
We SHARE our knowledge with our clients and colleagues all over the world. We value different opinions and embrace open discussions. We DARE to go into unknown territories. We dare to speak up and be totally honest. We CARE about the end-user experience, our clients' businesses, and the quality of the things we make. We want to make the world a better place through the work we do.
Say hello to your future. Apply!
- Valtech Bulgaria, Sofia
- Remote status: Hybrid Remote
About Valtech Bulgaria
Valtech Bulgaria is part of a global company focused on business transformation. Currently, we are a growing team of 30 makers, thinkers, marketers, creatives and developers, determined to make things happen.
Challenging projects, new technologies and a multicultural environment let us create experiences that improve human lives and make our clients' businesses grow.
#onevaltech means that you will always work with the best team across the globe and be supported whatever the challenge. We transform by doing.