Ryan Cocks, Developer in London, United Kingdom
Ryan is available for hire
Hire Ryan

Ryan Cocks

Verified Expert  in Engineering

DevOps Engineer and Software Developer

Location
London, United Kingdom
Toptal Member Since
July 15, 2019

Ryan是一位经验丰富的可靠和可扩展的生产云系统软件工程师. He specializes in DevOps, microservices, architecting applications, and application-level observability. 他在云基础设施和后端工作方面有扎实的背景. He has good soft skills and has worked in teams of all sizes. Ryan在理解需求背后的业务需求方面具有出色的能力.

Portfolio

BCG - Gamma
Datadog,站点可靠性工程(SRE), Amazon EC2, DevOps,微服务...
Toptal Project
Amazon Web Services (AWS), Terraform, Terragrunt...
Global Fashion Group
Amazon Web Services (AWS)、DevOps、Docker、Amazon S3 (AWS S3)、AWS CodeBuild...

Experience

Availability

Full-time

Preferred Environment

亚马逊网络服务(AWS)、MacOS、谷歌云、Docker、Git、Kubernetes、Node.js, ECS

The most amazing...

...project I've worked on was the Rosetta project for Apple. 这是一个动态二进制转换器,用于在X86上执行PPC二进制文件.

Work Experience

Site Reliability Engineer (Datadog Specialist)

2021 - 2023
BCG - Gamma
  • Worked with multiple product teams within the organization, designing their observability (monitoring) solutions.
  • Guided teams on architectural considerations for observability. Defined observability best practices and coached the various teams.
  • 努力尽可能接近客户可见问题的实时意识.
  • 针对不同级别的严重程度,将警报分割成不同的路径.
  • 开发Terraform,为Kubernetes集群和规范架构(fe/be+db)应用程序设置仪表板和警报(Datadog).
Technologies: Datadog,站点可靠性工程(SRE), Amazon EC2, DevOps,微服务, JavaScript, Kubernetes, Terraform, Flux, Monitoring, Application Monitoring, Infrastructure Monitoring, Cloud Infrastructure, Infrastructure as Code (IaC), Containers, GitHub Actions, Amazon Web Services (AWS), Back-end Performance, Database Performance, Cloud Engineering, VPS/VDS, GitOps, SIEM, Dashboard Development, Technical Documentation, Data Visualization, Load Balancers, DNS, Unix, Performance Analysis, Team Leadership, AWS ALB, Cloud, Transport Layer Security (TLS), AWS Cloud Architecture, Amazon Aurora

Site Reliability Engineer (ECS)

2020 - 2021
Toptal Project
  • 系统中易受高负载影响的重新架构部分, 因此,在交通高峰黑色星期五期间,完美的性能没有下降.
  • 在新的基础设施上推出了他们网站的新版本. Completed with only 10 minutes of planned downtime. 该项目两年的总停机时间不到三个小时.
  • Implemented alerting and monitoring for the new clusters.
  • Customized Fastly CDN to provide outage mitigation. 将不可靠的第三方API的端点包装为cdn管理的端点,如果主API的延迟很高,该端点可以重定向到备份.
  • 指导团队根据十二要素应用原则和SRE最佳实践改进他们的架构设计.
  • 为部署的服务创建了terraform管理的AWS Fargate集群.
Technologies: Amazon Web Services (AWS), Terraform, Terragrunt, Amazon Simple Queue Service (SQS), Datadog, Sentry, Amazon CloudWatch, Amazon Elastic Container Service (Amazon ECS), AWS Fargate, Amazon EC2, Fastly, Amazon CloudFront CDN, Site Reliability Engineering (SRE), Monitoring, Application Monitoring, Infrastructure Monitoring, CI/CD Pipelines, Cloud Infrastructure, Infrastructure as Code (IaC), Configuration Management, Containers, GitHub Actions, AWS DevOps, Amazon RDS, Amazon S3 (AWS S3), Back-end Performance, Cloud Engineering, VPS/VDS, Continuous Integration (CI), Continuous Delivery (CD), AWS Lambda, DevSecOps, GitOps, Dashboard Development, Technical Documentation, Data Visualization, APIs, Load Balancers, DNS, Web Application Firewall (WAF), Network Administration, GitHub, Unix, Performance Analysis, Cloud Architecture, AWS ALB, AWS CLI, Cloud, AWS IAM, Transport Layer Security (TLS), AWS Cloud Architecture, Amazon Virtual Private Cloud (VPC), Amazon Aurora

Site Reliability Engineer (EKS)

2019 - 2019
Global Fashion Group
  • 创建了新的terraform管理的AWS EKS Kubernetes集群(多区域).
  • 执行实时集群迁移到新的Kubernetes集群,零停机时间.
  • 将PHP后端拆分为多个微服务,从而提高了可靠性和可伸缩性.
  • Moved from self-hosted services to AWS-managed ones, improving reliability using Redis and SQL databases.
  • 用AWS CodePipeline取代Jenkins,降低了维护成本.
  • Replaced legacy storage with S3, resulting in improved reliability.
  • 重新设计了数据库使用,消除了高负载期间的瓶颈.
Technologies: Amazon Web Services (AWS)、DevOps、Docker、Amazon S3 (AWS S3)、AWS CodeBuild, AWS CodePipeline, Helm, Terraform, Redis, Kubernetes, Site Reliability Engineering (SRE), Monitoring, Application Monitoring, Infrastructure Monitoring, CI/CD Pipelines, Cloud Infrastructure, Infrastructure as Code (IaC), Configuration Management, Containers, GitHub Actions, AWS DevOps, Amazon RDS, Back-end Performance, Database Performance, Cloud Engineering, VPS/VDS, Continuous Integration (CI), Continuous Delivery (CD), MySQL, AWS Lambda, DevSecOps, GitOps, Dashboard Development, Technical Documentation, Data Visualization, APIs, Load Balancers, DNS, Network Administration, NGINX, Amazon EKS, GitHub, Unix, Performance Analysis, Team Leadership, Cloud Architecture, AWS ALB, AWS CLI, Cloud, Memcached, AWS IAM, Transport Layer Security (TLS), AWS Cloud Architecture, Amazon Virtual Private Cloud (VPC), Amazon Aurora

DevOps Engineer and Release Manager

2016 - 2018
HERE Technologies
  • Designed and developed Jenkins deployment pipelines into AWS. 使用作业DSL对Jenkins管道的编程生成做出了贡献.
  • Set up the production Docker on Amazon EC2 instances.
  • 运行AWS自动伸缩、微服务、Kafka、Flink和窗口流处理.
  • 开发物联网专用测试,为生产提供连续的测试数据. 这使我们能够构建实时仪表板,以确定复杂微服务系统的哪个部分出现了故障.
Technologies: Amazon Web Services (AWS), DevOps, Terraform, Node.js, JavaScript, Scala, Apache Kafka, Apache Flink, Microservices, Grafana, Splunk, Jenkins, Kubernetes, Docker, Monitoring, CI/CD Pipelines, Containers, Ansible, Cloud Engineering, Linux Server Administration, VPS/VDS, Continuous Integration (CI), Continuous Delivery (CD), AWS CloudFormation, Dashboard Development, Technical Documentation, Data Visualization, APIs, Cloud Architecture, Cloud, Transport Layer Security (TLS), AWS Cloud Architecture

Test Lead

2015 - 2016
HERE Technologies
  • Oversaw the analytics and A/B testing using Apptimize and Amplitude.
  • Developed test strategies for mobile devices.
技术:HockeyApp, Amplitude, Apptimize, iOS, Android, Containers, Ansible

Test Lead

2013 - 2014
Auckland Transport
  • 定义并执行全市关键基础设施的测试策略.
  • Created tooling to optimize work methods.
Technologies: Ruby on Rails (RoR), MySQL, Ruby

Test Lead

2012 - 2013
Serato, Inc.
  • Oversaw and mentored junior developers.
  • Introduced tools and processes for bug tracking, test management, peer review, crash report collection and analysis, beta test cycles, 改善客户支持和产品管理团队之间的沟通.
  • Tested iOS apps.
  • 帮助Scrum团队在测试和质量控制中采用最佳实践.
Technologies: Testing, Engineering, Ruby

Test Team Manager

2011 - 2012
IBM
  • Oversaw the management and technical rigor for a team of 11 testers. This included five products in flight from IBM's virtualization, security, operating system performance, and failover stacks.
  • 通过关注快速迭代反馈,改变了开发和QA团队的互动方式. This reduced the release cycles from 2-3 months down to 2-3 weeks.
  • Successfully oversaw two new major product launches.
Technologies: Virtual Machines, C++, Containers, Team Leadership

Project Manager

2010 - 2011
IBM
  • 管理一个小型软件团队的开发和发布周期.
Technologies: Ruby on Rails (RoR), C++, Containers

C++ Developer

2001 - 2009
Transitive
  • Developed automated testing infrastructure, 包括工具链(交叉链接和引导构建系统), assembly, linkers, CPU, and memory management architecture (SPARC, x86, X86_64, ARM, Itanium), and Linux kernel patching and building.
  • 开发了动态二进制转换器,可以为一个处理器加载二进制文件,并使用UNIX内核接口(系统调用)在另一个处理器上执行它们。.
  • 担任专业性能分析团队的首席工程师. 学习性能分析和改进的原理,并在客户现场性能低于预期时应用于解决性能问题.
Technologies: Linux, C++, Containers, Back-end Performance, Software Engineering, Linux Server Administration, DNS, Network Administration, Ruby, Unix, Performance Analysis

Observability Expert

在三大咨询公司之一担任内部可观察性专家. 我是组织中开发团队的主要联系人,他们希望改进部署的可观察性, specifically with Datadog at the client's request. 我为各种产品和项目设计了可观察性解决方案,涵盖了AWS和Azure上的ECS和Kubernetes. 因为很多产品都是在Kubernetes上使用规范的前端/后端架构, I produced Terraform to install baseline standard monitoring. This entailed monitoring the Kubernetes clusters, databases, LBs, front-end and back-end services, Watch Dogs, SLOs, and uptime.

我参与了Kubernetes监控的设置,成为了这个领域的专家. 我还为Kubernetes集群开发了用于快速态势感知的定制仪表板. Bringing together monitoring (and alerting) on OOMs, crash-loop backoff, container restarts, resource usage vs. 限制、节点资源、pod所需状态和不可用的部署副本.

Automated Stocks and Crypto Trading Systems

我在加密货币和股票/外汇交易领域广泛从事个人项目. I did low-frequency swing trading but used this as a personal project to keep my developer skills honed; I worked as a developer for ten years before specializing in DevOps.

I performed backtesting in Python real-time systems as Node.js microservices deployed on Kubernetes.

Languages

Perl, Bash, c++ 98, JavaScript, Ruby, TypeScript, c++, Scala, Python, SQL

Libraries/APIs

Terragrunt, Node.js, Jenkins Job DSL, Amazon EC2 API, PubSubJS

Tools

Jenkins, Amazon Elastic Container Service (Amazon ECS), Terraform, Git, Fastly, GitHub, Sentry, Google Kubernetes Engine (GKE), Amazon EKS, RabbitMQ, Helm, Amazon Simple Queue Service (SQS), Amazon CloudWatch, AWS Fargate, Amazon CloudFront CDN, NGINX, Amazon Virtual Private Cloud (VPC), Splunk, Grafana, AWS CodeBuild, Amazon Simple Notification Service (Amazon SNS), Bitbucket, Ansible, AWS CloudFormation, AWS CLI, AWS IAM

Paradigms

Microservices, DevOps, Agile, Continuous Integration (CI), Continuous Delivery (CD), DevSecOps, Testing

Platforms

Docker, Apache Kafka, Kubernetes, Linux, Amazon Web Services (AWS), Unix, AWS ALB, Amazon EC2, AWS Lambda, DigitalOcean, MacOS, Android, iOS, HockeyApp, Apache Flink, Google Cloud Platform (GCP)

Storage

Datadog, Amazon S3 (AWS S3), Redis, Memcached, Amazon Aurora, Google Cloud, MongoDB, PostgreSQL, JSON, Database Performance, MySQL

Other

Monitoring, Site Reliability Engineering (SRE), Infrastructure Monitoring, CI/CD Pipelines, Infrastructure as Code (IaC), Containers, AWS DevOps, Cloud Engineering, GitOps, Dashboard Development, Technical Documentation, APIs, Load Balancers, DNS, Performance Analysis, Cloud, AWS Cloud Architecture, Virtual Machines, Lambda Functions, Application Monitoring, Cloud Infrastructure, Configuration Management, GitHub Actions, Amazon RDS, ECS, Back-end Performance, Software Engineering, VPS/VDS, Network Administration, Team Leadership, Cloud Architecture, Transport Layer Security (TLS), Engineering, Apptimize, Amplitude, Google Cloud Functions, AWS CodePipeline, Scrum Master, Financial APIs, Stock Trading, Forex Trading, TradingView, Linux Server Administration, SIEM, Data Visualization, Web Application Firewall (WAF)

Frameworks

Ruby on Rails (RoR), Flux

Industry Expertise

Trading Systems

2014 - 2014

Scrum Master in Scrum

Clarus (Agile Coaching) - New Zealand

2012 - 2012

ISTQB Foundation Certificate in Software Testing

ISTQB - New Zealand

1998 - 2000

Bachelor of Science Degree in Computer Science

The University of Manchester - United Kingdom

JANUARY 2014 - PRESENT

Scrum Master

Clarus (scrum.org)

JANUARY 2012 - PRESENT

ISTQB

ISTQB

Collaboration That Works

How to Work with Toptal

在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.

1

Share your needs

在与Toptal领域专家的电话中讨论您的需求并细化您的范围.
2

Choose your talent

在24小时内获得专业匹配人才的简短列表,以进行审查,面试和选择.
3

Start your risk-free talent trial

Work with your chosen talent on a trial basis for up to two weeks. Pay only if you decide to hire them.

Top talent is in high demand.

Start hiring