|
Search Jobvertise Jobs
|
Jobvertise
|
SITE RELIABILITY ENGINEER Location: US-TX-Dallas Email this job to a friend
Report this Job
JD: We are looking for candidates to take the role of Site Reliability Engineer and help us to: Administer and support the managed file transfer applications supported in the Bank. Collaborate with other IT and business groups, and readily share information to resolve production problems related to file transfer integrations. Incident resolution and root cause analysis. Platform-level monitoring and health check execution. Provide technical support for the execution and troubleshooting of file transfers. Responsible for automating file transfers, as well as creating monitoring, scheduling, and alerting mechanisms. Adhering to change control processes and procedures. The key responsibilities are: determine the reliability of our digital products, technology services, and the infrastructure that underpins them minimize the risk and impact of failures by engineering operational improvements, such as predictive monitoring, auto scaling or self-healing respond to production incidents to gain first-hand experience of operational hotspots and to identify the root causes of problems collect and analyze operational data, define and monitor key metrics to identify and communicate areas for improvement apply a broad range of engineering practices with a focus on reliability, from instrumentation, performance analysis, and log analytics to automated testing, deployment, and operations ensure the quality, security, reliability, and compliance of our solutions by applying our digital principles and implementing both functional and non-functional requirements
SOFTHQ INC
|