Senior Cloud Engineer
Detroit
Friday, 29 May 2026
The Senior Cloud Engineer – System Resiliency is responsible for designing, building, and operating highly reliable, fault-tolerant cloud platforms and applications. This role applies software and systems engineering principles to prevent outages proactively, minimize blast radius, and ensure rapid recovery when failures occur. This engineer treats reliability as a first-class product feature and partners closely with application teams, architecture, platform engineering, security, and operations to embed resiliency patterns, automation, and observability into all critical systems. Leads the company’s digital factory by setting the cloud strategy for automation and deployment of cloud infrastructure and services. Oversees application development and hosting, including processes involving architecting, engineering, deploying, and operationally managing the underlying logical and physical cloud computing infrastructure. Works with multiple product and platform teams to support complex digital initiatives across the business. Guides less experienced team members—Span of control 0; individual contributor. Key Accountabilities Leads development of standards and/or cloud vendor product selections for each infrastructure tower in coordination with traditional engineering and network teams Develops and implements cloud automation strategies for automating and deploying cloud infrastructure Oversees and manages development and operations (DevOps) automation, including set-up and configuration of continuous integration/continuous deployment (CI/ CD) processes for digital factory products Configures and provisions next generation information technology (NGI) and physical stacks in private cloud environment, including rack-level design and other support activities Improves cloud product reliability, availability, maintainability, and cost/benefit impacts, including developing fault-tolerant tools to ensure robustness of the cloud infrastructure Oversees capacity across public and private cloud resource pools, including automating scale down/up of environments Provides guidance to less experienced cloud developers in optimizing and automating cloud engineering activities (e.g., real time migration, provisioning, and deployment) Minimum Education & Experience Requirements This is a multi-track base requirement job; education and experience requirements can be satisfied through one of the following three options: Bachelor’s degree and 6 years of experience in the technology field, inclusive of 4 years with strategy development, architecture, design, and implementation of cloud initiatives leveraging agile and development and operations (DevOps) methodologies; OR Associate degree and 8 years of experience in the technology field, inclusive of 4 years with strategy development, architecture, design, and implementation of cloud initiatives leveraging agile and development and operations (DevOps) methodologies; OR High school diploma or GED and 10 years of experience in the technology field, including 4 years with strategy development, architecture, design, and implementation of cloud initiatives leveraging agile and development and operations (DevOps) methodologies Other Qualifications Preferred: Bachelor’s degree in computer science, management information systems, or engineering Experience developing cloud strategies for automating and deploying cloud infrastructure Experience leading the configuration, deployment, and operation of public cloud services, including experience with public cloud providers and related private/public zones, etc. Other Requirements: Deep understanding of software development lifecycles and cloud economics, including knowledge of consumption-driven total cost of ownership (TCO) Strong understanding of high availability, disaster recovery, and distributed systems. Expertise in applying Infrastructure-as-Code (Ia. C) to deploy resilient cloud infrastructure, including Terraform and Azure-specific Ia. C tools (e.g ARM, Bicep) Experience designing and implementing highly available cloud infrastructure, including cross-Availability Zone and cross-region implementations. Experience with Site Reliability Engineering practices. Ability to create and set up automated deployments (e.g., blue/green or red/black) to meet application operational requirements Advanced knowledge in infrastructure as code (e.g., DevOps, etc.) Advanced knowledge of security implications related to public and private cloud infrastructure design Advanced knowledge of network architectures suitable for different cloud topologies, with familiarity with user expectations / operational level agreements (OLA) for cloud services Advanced knowledge of standard networking protocols and components (e.g., load balancing, etc.) Additional Information Incumbents may engage in all or some combination of the activities and accountabilities and utilize a variety of the competencies cited in this description depending upon the organization and role to which they are assigned. This description is intended to describe the general nature and level of work performed by incumbents in this job. It is not intended as an all-inclusive list of accountabilities or responsibilities, nor is it intended to limit the rights of supervisors or management representatives to assign, direct and control the work of employees under their supervision. PRIVACY NOTICE TO CALIFORNIA JOB APPLICANTS At DTE Energy, we are committed to providing an inclusive workplace where everyone feels welcome and a sense of belonging. We seek individuals with a heart for service, a passion to help our communities prosper, and ideas to help shape the future of energy. We are proud to be an equal opportunity, employer that considers all qualified applicants without regard to race, color, sex, sexual orientation, gender identity, age, religion, disability, national origin, citizenship, height, weight, genetic information, marital status, pregnancy, protected veteran status or any other status protected by applicable federal and/or state laws.