SRE Engineer – REMOTE

Website Medable

Company Description

Medable’s mission is to get effective therapies to patients faster. We provide an end-to-end, cloud-based platform with a flexible suite of tools that allows patients, healthcare providers, clinical research organizations and pharmaceutical sponsors to work together as a team in clinical trials. Our solutions enable more efficient clinical research, more effective healthcare delivery, and more accurate precision and predictive medicine. Our target audiences are patients, providers, principal investigators, and innovators who work in healthcare and life sciences.

Our vision is to accelerate the path to human discovery and medical cures. We are passionate about driving innovation and empowering consumers. We are proactive, collaborative, self-motivated learners who are committed, bold, and tenacious. We are dedicated to making this world a healthier place.

Job Description

• Develop a roadmap for the Engineering team covering application and infrastructure monitoring and availability
• Identify innovative ways to analyze logs to uncover and report defects and to validate the application
• Incorporate best practices for high availability and reliability of the application and infrastructure
• Meet the compliance requirements for the software development process
• Continuously improve the monitoring process to meet Service Level Objectives
• Participate in and optimize the on-call rotation process
• Implement tools and develop automation scripts for smooth CI/CD of software releases
• Identify critical metrics to monitor and alert on issues with software applications
• Use tools to apply data modeling and predictive analysis to anticipate issues
• Work closely with other teams to resolve production issues and perform root cause analysis of incidents
• Document solutions, SRE architectural patterns, and best practices
• Other duties as assigned

Qualifications

• 5+ years of experience managing SRE operations in a complex production environment
• Experience with automation using Python, Go, or an equivalent programming language
• Experience with YAML, JSON, bash shell scripting, and Linux command-line utilities
• Experience working with any of the following configuration management tools: Ansible, Chef, Puppet, Salt
• Experience working with CI/CD tools such as Jenkins, GitLab, or equivalent, as well as Artifactory and Fastlane, for both web and mobile platforms
• Knowledge of Infrastructure-as-Code tools such as Terraform, CloudFormation, or Google Cloud Deployment Manager
• Experience monitoring and debugging Docker and Kubernetes environments
• Experience with monitoring solutions such as Datadog, Prometheus, Grafana, or equivalent
• Experience working with log management tools
• Experience implementing security best practices
• Solid troubleshooting experience in a large-scale production environment
• Expertise in AWS and GCP cloud infrastructure and related services

Additional Information

All your information will be kept confidential according to EEO guidelines.

To apply for this job, please visit careervault.io.