Texas A&M University — Aggie Innovation Platform Site Reliability Engineer

Our Commitment
Texas A&M University is committed to enriching the learning and working environment for all visitors, students, faculty, and staff by promoting a culture that embraces inclusion, diversity, equity, and accountability.  Diverse perspectives, talents, and identities are vital to accomplishing our mission and living our core values.

Who we are
The Division of Information Technology provides reliable and accessible IT services to elevate and enhance Texas A&M University. We provide IT leadership to the campus community while enabling the research, education and service mission of Texas A&M. With trusted services and innovative solutions, we are changing the technology landscape on campus.  To learn more about IT at Texas A&M University visit us at: https://it.tamu.edu/

What we want
The Senior IT Professional II (Site Reliability Engineer II) is responsible for providing technical leadership for identity management projects or services. Provides technical oversight for the application of and compliance with technical standards. May coordinate the technical activities of a support team. Completes reports and summaries for management and/or users including project status reports, problem reports, and progress summaries.

Required Education and Experience:

  • Bachelor’s degree in applicable field or equivalent combination of education and experience
  • Eight years of experience in multiple technology areas such as system administration, DevOps, collaborative software development, customer support, application support, project management, database administration, system reporting, access management, system security, and/or disaster recovery

Required Knowledge, Skills, and Abilities:

  • Must be able to work in a collaborative team environment.
  • Ability to multi-task and work cooperatively with a diverse range of people.
  • Must have strong interpersonal skills.

Preferred Education and Experience:

  • Bachelor of Science degree
  • Programming experience with at least two of the following languages: Node.js, Python, Ruby, Go, or Bash.
  • Knowledge of and experience using databases, particularly MySQL.
  • Knowledge of and experience with data analysis.
  • Knowledge of and experience writing REST web services
  • Knowledge of and experience consuming cloud web services (Azure and Google APIs in particular).
  • Knowledge of and experience with PowerShell.
  • Knowledge of and experience with Docker, containers, and related technologies.
  • Knowledge of and experience with Kubernetes on-premise and in one or more public clouds (AWS, GCP, Azure).
  • Experience with at least one of the following automation technologies: Chef, Ansible, and/or Puppet.
  • Experience, including actual pull requests, with Github or Gitlab.
  • Knowledge of and experience with CI/CD methodologies.
  • Knowledge of and experience with Microsoft, Linux, and Mac operating systems (Windows Server 2012 & 2016, Windows 10, CentOS, Mac OS X).
  • Knowledge and experience with Microsoft Active Directory and OpenLDAP.
  • General familiarity with network protocols and theory (TCP/IP, UDP, ICMP, MAC addresses, IP packets, DNS, OSI layers, and load balancing, etc.).
  • General familiarity with principles of project management and service management framework (e.g., ITIL/ITSM).
  • Knowledge of and experience with DevOps methodologies.

Preferred Knowledge, Skills, and Abilities:

  • Advanced cross-disciplined IT skills, advanced analysis and troubleshooting/problem-solving, client relations skills, requirement assessment and analysis, project management methodology, understands context/interrelationships, and proficiency of ITIL.
  • Experience with Objectives and Key Results methodologies is highly desirable.

Preferred Licenses and Certifications:

  • ITIL Foundations, PMP.

Responsibilities:

  • Scripting – Maintains, develops, and documents scripts to maintain infrastructure services.
  • Server Administration – Provides technical guidance and oversight for server administration. Sets-up and configures large and complex servers. Develops complex system logic and configuration. Conducts complex server performance analyses and tuning. Coordinates routine audits of systems and software.
  • Problem Management – Oversees and coordinates the analysis of system logs. Coordinates and monitors the problem management process to include backup support. Troubleshoots complex network problems. Provides Tier III support.
  • Data Security – Oversees the maintenance of system security, and for protecting and recovering client data. Develops disaster recovery plans for complex systems.
  • Documentation – Oversees the process used to document server support methods, procedures, and configuration.
  • New Technology Planning, Evaluation, Deployment, and System Integration Testing – Coordinates the evaluation of new technologies. Makes recommendations based on the evaluation of new technologies for their applicability to the client’s needs. Creates, evaluates, and approves plans for the implementation of new technology deployments and system integration testing.
  • Project Planning Support – Collaborates with the project leader to develop work plans and time schedules for projects, including outlining phases, identifying personnel, and computing equipment requirements.
  • Common – May coordinate the technical activities of a project team. Completes reports and summaries for management and/or users including status reports, problem reports, progress summaries, and system utilization reports. Serves as a senior member of an information resource team responsible for setting technical direction. Performs some of the duties of a Site Reliability Engineer I. Performs other duties as assigned.
  • Professional Development – Participates in training and professional development sessions.

All positions are security-sensitive. Applicants are subject to a criminal history investigation, and employment is contingent upon the institution’s verification of credentials and/or other information required by the institution’s procedures, including the completion of the criminal history check.

Equal Opportunity/Affirmative Action/Veterans/Disability Employer committed to diversity.

Related: