Principal Software Engineer- AIOps

Principal Software Engineer- AIOps

Job Overview

Location
Hyderabad, Telangana
Job Type
Full Time Job
Job ID
37981
Date Posted
4 months ago
Recruiter
Aarav
Job Views
24

Job Description

Are you looking to make a real difference in Microsoft’s mission to empower every person and organization to achieve more, with the power of cloud computing? Do you want to work at the forefront of Cloud Computing to design, deliver & operate cloud-scale systems that are the foundation of the World’s Computer which is Azure? Do you want to be part of a highly motivated and passionate team that works together to do what it takes for our customers to be successful while having fun and learning along the way? 

Azure Core is Azure's most profitable business and growing incredibly fast! We, the Azure Core Compute team in IDC, pride ourselves in working without boundaries to deliver the cloud platform capable of running any customer application/workload. And we mean ANY! Think of any large Microsoft service; Teams, Bing, Exchange, Xbox, SQL Azure, Office 365, HDInsight, COSMOS DB…. Or customers such as Accenture, Adobe, Walgreens, Walmart, Flipkart, In-Mobi and the multitude of the large/medium/small shopping, banking, finance, gaming, logistics, enterprises, data analytics/management, storage services of the world. They run on infrastructure we build. 

As the owners of the platform that runs such a diversity of applications, we need to guarantee QoS for our customers. Metrics like uptime/availability/reliability and service performance take on a new meaning at the scale we operate. Our customers want 5 9’s+ of availability, transparent maintenance & updates to their systems, resiliency of their workloads despite H/W and system failures that happen all the time in the datacentre, ability to deploy & dynamically scale large number of VMs/containers reliably.  

 

Ensuring the QoS at scale requires us to architect and build systems across all layers and we work on architectural improvements across the entire Azure Core stack (Azure Compute Manager, Azure Host OS and Host Agent, Compute Resource Provider, Azure Networking etc) to achieve the SLAs, along with leveraging AI/ML techniques to analyze large volumes of telemetry data for continuous quality evaluation and optimization of these services. 

You:  

Are curious, looking for opportunities to learn and adapt. 

Are passionate about making customers successful. 

Are into solving challenging/innovative technical problems with an iterative and measurable approach. 

Are passionate about building high scale systems and services.  

Bring a diverse/fresh perspective in what you do. 

 

We

Are the team for you!! 

Responsibilities

In this role, you will help realize the team’s vision to “Build and operate world class E-E engineering systems & services that contribute to measurable improvements in the experience for our customers & quality of the platform and scales with the growth/complexity of Azure Core systems”. ​Achieving this vision is critical for the success of the Azure platform and guaranteeing QoS for our customers. 

You will work on building, deploying, and operating intelligence based world-class services that validate, observe, and measure the quality of components in all layers of the Azure Core stack, and that enable customers evaluate and optimize their workloads on Azure. 

 

You will leverage existing and build new platforms and solutions powered by AI/ML which: 

  • Enable experimentation environment generation based on intelligent selection and replay of patterns learnt from production towards evaluation of Azure core services and applications. 
  • Analyze large volumes of telemetry data in quick time to detect and predict quality degradations, recommend mitigations enabling auto-healing and prevention. 
  • Provide intelligent monitoring, proactive recommendations, automated root cause analysis and self-healing mechanisms that empower customers with accelerated onboarding, continuous evaluation and optimization of their workloads on Azure. 

The systems you build will be the common infrastructure extended and utilized by multiple partner teams within Azure and beyond, and as part of customer facing Azure services. 

Qualifications

  • 12+ years of experience in commercial software development. (Required) 
  • BS/MS/PhD in Computer Science or equivalent industry experience. (Required) 
  • Demonstrated problem solving and debugging skills. (Required) 
  • Experience in designing, developing, deploying, and operating reliable systems preferably with experience in distributed systems fundamentals. (Required) 
  • Knowledge and experience in data engineering, statistics, AI/ML and in building large scale data systems. (Required) 
  • Experience in technical leadership in driving engineering roadmaps, leading v-teams, mentoring and helping others grow technically. (Required) 
  • Experience writing maintainable code in C++/C#/Java/Python. (Required)  
  • Ability to write and debug code which requires good understanding of threading and asynchronous programming fundamentals.  (Required) 
  • Ability to engage in site-reliability engineering practices. (Required) 
  • Experience with or understanding of CI/CD pipelines & DevOps. (Preferred) 
  • Experience developing software hosted in Azure, AWS, or other similar Cloud platforms (Preferred). 
  • Data driven approach to solving problems iteratively and measuring success. 
  • Commitment to collaboration and teamwork and ability to deliver via influence. 

#idccompute #principalic

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.

 Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

Job ID: 37981

Similar Jobs

Meta

Full Time Job

Principal software engineer- aiops Principal software engineer- aiops

Meta is embarking on the most transformative change to its business and technolo...

Full Time Job

Deloitte

Full Time Job

Principal software engineer- aiops Principal software engineer- aiops

Deloitte’s Enterprise Performance professionals are leaders in optimizing...

Full Time Job

Labcorp

Full Time Job

Principal software engineer- aiops Principal software engineer- aiops

Job Duties/Responsibilities:Determine the acceptability of specimens for testing...

Full Time Job

Braintrust

Full Time Job

Principal software engineer- aiops Principal software engineer- aiops

• JOB TYPE: Direct Hire Position (no agencies/C2C - see notes below)â€...

Full Time Job

Cookies

This website uses cookies to ensure you get the best experience on our website.

Accept