【上海Site Reliability Engineer职位招聘_大连思泰克科技有限公司招工招聘信息】-51米多多招聘网

Responsibilities
61 Drive day-to-day site reliability engineering functions including maintenance and incident resolution for all Visa China applications, products, and services.
61 Perform ongoing/Proactive analysis of Visa systems and applications to detect potential problems with little or no guidance.
61 Support critical applications, ensuring stability through proactive mitigation and automation.
61 Contribute to root cause analysis and run the production environment by monitoring availability and taking a holistic view of system health.
61 Provide inputs into and build software and systems to manage platform infrastructure and applications with the guidance from the global teams.
61 Provide inputs into increasing reliability of supported applications.
61 Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and supporting for continual improvements.
61 Provide primary operational support and engineering for multiple large-scale distributed software applications.
61 Responsible for providing 24X7 Application Support across multiple visa applications.
61 Responsible for evaluating the Issues reported by Visa clients and partners and provide solutions.
61 Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding.
61 Partner with development teams to improve services through rigorous testing and release procedures.
61 Participate in system design consulting, platform management, and capacity planning.
61 Create sustainable systems and services through automation and uplifts.
61 Balance product and site reliability with well-defined service-level objectives.
Basic Qualifications
61 Strong work ethic, ability to work in fast-paced, team-oriented environment with minimal guidance. Good oral and written communication skills.
61 5+ years of relevant work experience with a bachelor’s degree, or 3+ years of relevant work experience with an Advanced Degree (e.g., Masters, MBA, JD, MD).
61 Minimum 3 years’ experience in working with Linux and UNIX.
61 Minimum 3 years’ working experience on Java micro services technologies(Restful APIs).
61 Minimum 3 years’ experience in DevOps/Production support profile working in a globally distributed team.
61 This role also requires working knowledge of production support processes such as incident/change/problem management, call triaging, escalation procedures as such.
Preferred Qualifications
61 Intermediate level of experience on Kubernetes (preferred Openshift and Mirantis) or Docker
61 Experienced in working as a SRE or Cloud DevOps Engineer, System Administrator and Automation Expert(Ansible, Python scripts).
61 Experienced in Integrated Docker container orchestration framework using Kubernetes by creating pods, config Maps, deployments using Jenkins.
61 Experienced in Kubernetes to deploy scale, load balance and manage Docker containers with multiple namespace versions.
61 Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.
61 Basic level knowledge on Akamai/Cloudflare, Active-Active setup Application along with Disaster Recovery.
61 Experienced in working with Oracle/Postgress DB’s, DML/DDL SQLs and Store PROCs
61 Experience with Service Now and ticketing workflows is preferred.
61 Working experience with monitoring tools like Prometheus, Grafana, Splunk, Humio or any other monitoring tools/processes will be advantageous.
61 Prior working experience with Card and transaction domains will be advantageous.
61 Should have a technical and business mindset.
61 Familiar with ITIL or similar framework – this is a highly audited highly regulated industry where process plays a crucial role in our day-to-day operations.