Technical Skills & Tools:
- Unix/Linux, Shell scripting, Jenkins, Splunk, Dynatrace, Kibana, Weblogic Server, SQL, Postman, Jmeter, Soap UI, Confluence, Rabbit MQ, Kafka, JMS Apache Tomcat Server, XML, JSON, Java Script, TIBCO
- Good To have: Java and AWS
Job Description:
Monitoring and Incident Management:
- Monitor backend services, particularly servers, application infrastructure, and partner files.
- Support and troubleshoot issues, investigate incidents, and analyze system metrics, logs, traffic, and configuration changes.
- Improve and maintain monitoring and alerting systems by testing and deploying new functionalities.
Root Cause Analysis and Troubleshooting:
- Conduct root cause analysis of production errors and resolve technical issues.
- Troubleshoot problems and identify bottlenecks in the software development process.
- Use automated tools for troubleshooting to minimize errors.
Technical Expertise:
- Extensive experience in configuration and deployment automation with various app servers, such as Weblogic servers.
- Strong scripting skills (Shell, Linux) and experience with Linux command-line/administration.
- Understanding of network protocols (TCP/IP, Reverse Proxy).
- Experience with Git and release management processes.
- Ability to troubleshoot API-driven services by checking server logs and third-party APIs.
- Strong problem-solving skills and the ability to work under pressure.
Automation and Scripting:
- Develop scripts to automate visualization and operational processes.
- Create new features and tools for automating the troubleshooting and investigation process.
System Deployment and Maintenance:
- Deploy updates and fixes and provide Level 2 technical support.
- Build tools to reduce the occurrence of errors and improve customer experience.
- Regularly assess infrastructure and realign configurations to minimize errors.
Customer Requirements and Project Management:
- Understand customer requirements and issues raised in production.
- Implement various development, testing, and automation tools, and manage IT infrastructure.
- Collaborate with developers and partners to manage code releases and work on issues and tickets.
Reporting and Documentation:
- Draft reports and summarize information following investigations and incidents.
- Document procedures and update documentation and operational processes.
Kindly regard your application as unsuccessful if you have not heard from the agency within 2 weeks.
Apply Now