We are looking for a software engineer to help us build the next generation of the Azure compute infrastructure that powers Microsoft first-party and third-party customers. This is a rare opportunity to define and drive the next frontier of compute infrastructure, one that can meet ever-growing compute demand and optimize hardware utilization through allocation algorithms and packing. Although we are a production team, we work very closely with an applied research team, so we stay abreast of the latest cutting-edge research and participate in research publications. If you are interested in pushing the envelope on cloud-scale distributed systems technology, graph optimization, and allocation, and in building a rock-solid platform on top of them with 99.999% availability and low latency, we would love to hear from you!
Design and develop a robust, distributed, scalable, and resilient platform for Azure Computing, including services and tools. Develop a pub-sub system on top of open-source ZooKeeper/Kafka and contribute to the open-source community. Handle live-site monitoring, tooling, and customer engagement on issues and features that affect customers, including proactive live-site detection and mitigation.
Experience in the following areas is a must:
- Delivering cloud services in an engineering role
- Distributed systems design and analysis
- 10+ years of coding in Java/C#/C++
- Experience with messaging systems such as Kafka and RabbitMQ, and with complex event processing (CEP) engines such as Spark
- A BS or MS degree in Computer Science, or 12+ years of equivalent industry experience