24 February 2020

Site Reliability Engineer (pyton)

Site Reliability Engineer (pyton)

To our new customer in Central Stockholm we are now looking for a site reliability Engineer (pyton)
Start: ASAP
Lenght: 6 month, or more

More info
At our customer more than 100 billion events flow through our streaming infrastructure and platform every day. We are looking to find someone eager to help us engineer and manage the Kafka environment constituting the heart of this ecosystem.
We believe that you share our passion for learning new things, coding (primarily in Python), quality, automation, continuous improvements, and actively building and upholding a great culture. Above all, we would like to see that you have a genuine interest in streaming.
Your role
Our job is to build effective, stable and reliable large scale infrastructure tools and services for our platform, games, and product teams. We strive to empower developers to be autonomous and flexible. We continuously work to create self service models for our tech in close collaboration with development teams.

We engineer and provide the shared infrastructure platform serving all of our games, as well as environments for developers and supporting tech like observability, log management, and event transport. This includes everything from working in our data centers, writing code for full stack orchestration and automation, troubleshooting distributed systems and resolving production incidents.

This includes:
• Datacenter colocation management
• Production network engineering
• Linux platforms, on-prem and hybrid clouds
• Databases, event-transport, and orchestration of java platforms
• Alerting, monitoring, and Incident resolution

We care deeply about our culture and believe in:
• Continuous improvement of everything we do
• An inclusive and diverse workplace
• Iterate tech changes in small steps
• Infrastructure as code
• Automation and coding as much as possible
• Spreading and sharing ideas openly
• Collaboration and peer reviews
• Being accountable and taking responsibility for our actions and results
• Blame-free and respectful problem solving
• Asking for help

Skills to create thrills
• Experience working with streaming data platforms
• Strong development skills in Python, and some knowledge of Java, Perl, SQL, or similar
• Experience automating and orchestrating distributed systems as well as creating internal tools such as service discovery integrations or metrics pipelines
• Interest or experience in database technologies like MySQL, Cassandra, HDFS/Hadoop, etc
• Monitoring systems like OpenTSDB, InfluxDB, Graphite, etc
• Log management systems like Graylog, the ELK stack, etc
• Orchestration frameworks like Ansible, Salt, etc
• Familiarity with Linux performance tools
• Ability to communicate proficiently in written and spoken English

 

Please contact [email protected]

The position is filled