Principal Observability Engineer- Dallas, TX or Tampa, FL- hybrid

New Today

Hello, I hope you are doing well! My name is Aditya Kumar and I am a Staffing Specialist at Stellent IT LLC. I am reaching out to you on an exciting job opportunity with one of our clients. If are you Comfortable with this Position then please share me your updated resume.
Role: Principal Observability Engineer Location: Dallas, TX or Tampa, FL- hybrid Duration: Long Term
Job Description: We are seeking a skilled Architect experienced in AIOps, Observability, and SRE engineering practices to join our team to enhance IT software engineering through Architecture and Governance practices. In this role, you will be responsible for designing, prototyping, testing, and documenting solutions that mature the enterprise observability, performance, resilience and overall reliability of our IT systems and applications.
Principal Responsibilities and Outputs: Architectural Guidance: Publish technology strategies and supporting architectures to mature business and technology operations to enable AI/MLOps. Standards and Best Practices: Publish observability standards and best practices for adopting new and existing frameworks or technologies. Technical Solutions: Translate business goals into technical solutions designs to include descriptive and diagnostic capabilities through engineering at delivery satisfying non-functional requirements for business solutions. Delivery Enhancements: Create actionable Observability Driven Development procedures to ensure consistent adoption of open standard (i.e. OTel, MELTS) industry frameworks. AI Augmented Testing: Deliver strategies to help enable more AI-Augmented testing capabilities empower federated execution and central enterprise governance. Communication and Education: Develop and routinely publish communication as well as training and education sessions for knowledge transfer and raising awareness of current or future enterprise direction. Reliability Design: Design and implement full stack applications for reliability and integration patterns to enable more operational predictability and prescriptive disruption response. Monitoring and Alerting: Establish appropriate monitoring and alerting standards for performance, scalability, availability, and reliability.
Experience:
Distributed Applications: Minimum of 10 years? experience in the design and implementation of distributed applications. Networking and Infrastructure: Minimum of 5 years' experience in networking, infrastructure, middleware, and database architecture. Highly Available Architecture: Minimum of 5 years' experience in highly available architecture and solution implementation. Disaster Recovery: Minimum of 5 years' experience with industry patterns, methodologies, and techniques across disaster recovery disciplines. Knowledge and Skills: Problem-Solving: Ability to solve problems and engineer solutions that meet resiliency requirements. Independent Work: Ability to work independently with minimal supervision. Public Cloud Environment: Strong knowledge of AWS and Azure cloud environment is a plus. Performance Analysis: Experience with performance analysis, tuning, and engineering is a plus. Monitoring Tools: Knowledge of monitoring tools such as CloudWatch, CloudTrail, Splunk, and other application monitoring tools. Tech
Aditya Kumar Sr. Technical Recruiter Email: Aditya@stellentit.com Address: 505 Knolle Court Saint Augustine, FL 32092 Telephone: +1 732-795-9133
www.stellentit.com
Location:
Tampa