Search for Jobs

11 Results
AT&T
Plano, TX, United States (on-site)
2 days ago
AT&T
Bothell, WA, United States (on-site)
2 days ago
AT&T
Dallas, TX, United States (on-site)
2 days ago
AT&T
Dallas, TX, United States (on-site)
2 days ago
AT&T
Plano, TX, United States (on-site)
2 days ago
AT&T
Plano, TX, United States (on-site)
2 days ago
AT&T
Plano, TX, United States (on-site)
2 days ago
AT&T
Redmond, WA, United States (on-site)
2 days ago
AT&T
Middletown, NJ, United States (on-site)
9 days ago
AT&T
Plano, TX, United States (on-site)
16 days ago
AT&T
Middletown, NJ, United States (on-site)
16 days ago
1 - 11 Results of 11
AT&T
Plano, Texas, United States (on-site)
2 days ago

Description

Principal Software Site Reliability Engineer needed by AT&T Services, Inc. in Plano, Texas to work 24x7 Problem and Incident Management impact, RCA assessment, communication for consumer online sales, account management, support websites, and mobile apps. Define Service Level Objectives (SLOs), track, drive availability, service metrics, and accomplishment of operational SLOs. Analyze GTOC enterprise Incidents, including implementing automated tracking and reporting of system, customer, business impacts from site outages, incidents, and critical defects. Analyze progress and accomplishment against Service Level Objectives (SLOs) and identify gap closures. Coordinate with GTOC, Digital Product Delivery including PO, PM, Dev, QA Operations, site reliability engineers, infrastructure, network, and 3rd Party vendors to resolve reported problems. Lead root-cause analysis (RCA) for complex outages, incidents, defects, and tracking resolution through completion. Provide training to teams and audit RCAs to ensure blameless post-mortems were conducted as per established principles. Develop tools, scripts, queries, and perform data analysis of weekly, month, YTD incidents and problems to determine chronic and recurring root causes and applications with a high frequency of incidents. Partner with Site Reliability Engineers (SREs), DevOps teams, network, infrastructure, security and fraud services to establish proactive and automated monitoring for chronic root causes. Develop improvement plans and drive established improvement plans through to resolution. Coordinate with external teams, such as Iconic Launch Command Center, for heightened support of SPT defect, and incident triage, and providing status during major launch events. Utilize Unix and Linux command line. Work within Agile, Scrum, and Kanban development team. Provide technical skills: HTML5, CSS3, JavaScript, React, Angular, Node.js, REST services, Oracle DB, MQ, Git, Jira, Jenkins, Docker, and Kubernetes. Utilize tools including DynaTrace, Kibana, Grafan, Splunk, postman, SoapUI, jMeter, GIT, Jenkins, and Maven.


Requires a Bachelor's degree, or foreign equivalent degree in Electronic Engineering or Computer Engineering, and 5 years of progressive, post-baccalaureate experience in the job offered or 5 years of progressive, postbaccalaureate experience coordinating with GTOC, Digital Product Delivery including PO, PM, Dev,QA Operations, site reliability engineers, infrastructure, network and 3rd Party vendors to drive resolution of reported problems; leading root-cause analysis (RCA) for complex outages, incidents, defects, and tracking resolution through completion; utilizing Unix and Linux command line; working within Agile, Scrum, and Kanban development team; providing technical skills: HTML5, CSS3, JavaScript, React, Angular, Node.js, REST services, Oracle DB, MQ, Git, Jira, Jenkins, Docker, and Kubernetes; and utilizing tools including DynaTrace, Kibana, Grafan, Splunk, postman, SoapUI, jMeter, GIT, Jenkins, and Maven.

Our Principal Software Site Reliability Engineer's earn between $134992.00  to $254,300 yearly. Not to mention all the other amazing rewards that working at AT&T offers. From health insurance to tuition reimbursement and paid time off to discounts on products and services just to name a few. There is a lot to be excited about around here. Individual starting salary within this range may depend on geography, experience, expertise, and education/training.

AT&T is an Affirmative Action/Equal Opportunity Employer, and we are committed to hiring a diverse and talented workforce.  EOE/AA/M/F/D/V

 *np*

 

 

Our Principal Software Site Reliability Engineer's earn between $170,000.00  to $ 254,300 yearly. Not to mention all the other amazing rewards that working at AT&T offers. From health insurance to tuition reimbursement and paid time off to discounts on products and services just to name a few. There is a lot to be excited about around here. Individual starting salary within this range may depend on geography, experience, expertise, and education/training.




Job Information

  • Job ID: 68035371
  • Workplace Type: On-Site
  • Location:
    Plano, Texas, United States
  • Company Name For Job: AT&T
  • Position Title: Principal Software Site Reliability Engineer
  • Job Function: Information Technology
  • Job Type: Full-Time
Jobs You May Like
Filters
Workplace Type
Job Function
State