Primary responsibilities: 80Ensuring availability, performance, security, and scalability of production systems. 80Serve as primary POC for LiveOps/GCX to address service incidents and troubleshoot/triage as appropriate. 80Work with the core engineering team to address bugs and deploy services across different environments. 80Collaborate with infrastructure teams to scale for service capacity and traffic spikes of 20x normal load during T1 events. 80Provide recommendations for architecture and process improvements. Secondary responsibilities: 80Create and maintain service health dashboards to monitor production systems. 80Implement alarms where there are gaps in service health monitoring 80Maintaining and updating eBay Live SOP Requirements: 80Experience with modern programming languages, e.g. Java, Go, C++ 80Experience in building large, reliable, scalable distributed systems 80Experience with building event-driven applications 80Experience with designing and building RESTful APIs 80Experience with modern DevOps principles and continuous delivery Work closely w/ colleagues and customers in different functional groups and remote offices 80Hands on experience with GraphQL is a big plus 80Experience on web socket is a big plus Need to take shift on weekend every two weeks(Take extra two days off next week)