InTelligent Automation Engineering
The Automation Engineer will play a crucial role on our InTelligent Automation Engineering Team by connecting the dots between code, infrastructure, and production system availability. They will actively engage in early warning troubleshooting to stop a product outage.
The right candidate for this position will have a background in hosted application instrumentation technologies, be technically proficient in system engineering, operating system diversity, and have a firm understanding of providing operational excellence in technical environments.
ACTIVITIES
- Develop and maintain system integrations between technology operations tools such as DataDog, ServiceNow, and VictorOps.to improve overall technology automation
- Build and execute steps in a documented runbook to resolve incidents in a timely manner and further develop automated techniques to avoid future incidents
- Work with other Engineering and Development colleagues to identify a work-around if permanent solution cannot be reached and further automate work-around solution
- Continuously improve tooling capabilities by adapting the instrumentation tools to provide more meaningful and actionable alerts
- Proactively contribute to the Enterprise Knowledge Base to further reduce future incident resolution times
- Collaborate with technology SCRUM teams to design and implement performance benchmarks for each application, and report results
- Develop processes to proactively respond and automate alerts for critical business transactions and applications
- Triage and resolve production incidents or issues reported by the instrumentation platforms
- Trouble-shoot Linux and Windows system and application logs to resolve irregularities
- Trouble-shoot database performance issues, including stored procedures and table structure issues
- Analyze application logging, memory dumps, trace routes and other forensic data such as .NET, Java, or Spring stack traces to help determine the source of incidents
- Utilizing technical expertise, directly pinpoint the appropriate SCRUM team resource to help resolve production issues
- Manage and automate DevOps tools such as VictorOps and DataDog
REQUIREMENTS
- Bachelors or Associate degree completed, preferable in the IT Field
- Be willing to work non-standard shifts
- ITIL Certifications a plus
- OpenShift, AWS, Azure, or other advanced cloud-technology certifications a plus
- MCSE, RHCSA or CompTIA certifications a plus
KNOWLEDGE/SKILLS/ABILITIES
- Exposure to advanced availability instrumentation systems such as New Relic, Prometheus, Grafana, Zabbix, and DataDog
- Scripting familiarity in languages such as Bash, Python, PowerShell or PERL
- Familiarity with concepts such as IAAC, CI/CD, and DevOps
- Trouble-shooting and escalation procedures
- Must demonstrate the ability to work proactively
- Must demonstrate a high level of ethics and integrity