Job Description
That's where you come in—you can help us to:
- Define standards & maintain libraries for monitoring, logging, distributed tracing, alarming, and provisioning across 100+ developers.
- Build software to monitor the health of IB and MSG systems via black-box health testing frameworks and other approaches.
- Bring best practices and consistency to our infrastructure.
- Improve the resiliency of our infrastructure through stress tests and chaos engineering.
- Standardize our performance and capacity planning environment to ensure our systems have room to grow as we continue adding features and users.
What’s in it for you:
- A critical part of our mission is fostering a culture of reliability across Engineering teams in CC – you’ll be able to make a significant impact on the design choices and decisions that go into developing MSG and IB infrastructure.
- This is an opportunity to forge your own path and drive the engineering culture forward; making our infrastructure best–in-class will be your main mission, so you’ll have many opportunities to create and implement your own improvements.
- You’ll be involved from design to deployment, to ensure our infrastructure is reliable, performant and scalable.
- We’ll send you to professional conferences and meetups to keep up with the technology space outside Bloomberg and apply that knowledge to building and improving our processes and products.
Our projects include:
- Building downstream and upstream caller reports to quickly identify bottlenecks and dependencies of our system using Apache Spark and distributed tracing infrastructure.
- Creating black-box health testing frameworks to monitor the health of IB and MSG.
- Establishing standards and building dashboards, libraries and tools for metric collection, visualization, and alarming.
- Establishing procedures around scalability, failover, Service Level Objectives (SLOs), cluster provisioning, deployment strategies, etc. with the goal of improving the robustness of our infrastructure.
You’ll need to have:
- 4+ years working with an object-oriented programming language (C/C++, Python, Java, etc.).
- A Degree in Computer Science, Engineering, Mathematics, similar field of study or equivalent work experience.
- An understanding of Computer Science fundamentals such as data structures and algorithms.
- Demonstrated experience designing and implementing large scale distributed systems.
- BA, BS, MS, PhD in Computer Science, Engineering or related technology field.
Job ID: 125717