Company name
Undisclosed
Job category
Telecommunications infrastructure design and build (carrier/ISP)
Job description
[About the company]
Our client is a leading technology innovator, building a real-time analytics data platform that powers hundreds of services across e-commerce, fintech, digital content, and communications. They are dedicated to providing foundational insights into users, products, and markets, driven by a passion for engineering excellence and continuous improvement.

[Role & Responsibilities]
As a Senior Site Reliability Engineer specializing in their Analytics Platform team, you will be instrumental in ensuring the reliability, scalability, performance, and security of their core data infrastructure. You will drive architectural excellence and mentor a growing team, tackling complex distributed systems challenges and contributing to the optimization of large-scale data pipelines. You will also play a key role in migrating critical components to Google Cloud Platform (GCP), advocating for and implementing robust cloud security best practices.

Main tasks include:
- Architecting, developing, and deploying solutions to automate, maintain, operate, and optimize large-scale data pipelines.
- Conducting system-wide analysis and performance tuning for capacity planning and bottleneck identification.
- Implementing and refining monitoring, alerting, and incident response strategies to meet SLAs, SLOs, and SLIs.
- Leading and assisting in the migration of critical data components to GCP, emphasizing secure cloud architecture and IAM.
- Designing and implementing security controls and automation within GCP environments.
- Ensuring system resilience through high-availability and disaster recovery mechanisms.
- Enhancing and maintaining CI/CD pipelines for applications in Java, Node.js, and Scala.
- Providing expert technical guidance and troubleshooting support to cross-functional teams.
- Mentoring junior and mid-level SREs and software engineers.
Required experience
- Minimum 4 years of professional experience in application development, primarily with Python.
- Minimum 2 years of experience designing and operating distributed systems handling large volumes of data in near real-time.
- Minimum 4 years of experience with Linux operating system internals.
- Minimum 2 years of experience managing infrastructure in both bare-metal and cloud environments (GCP, AWS, Azure).
- Minimum 2 years of experience with cloud security principles and practices (IAM, network security, data encryption).
- Minimum 2 years of experience with Infrastructure as Code tools like Terraform, Ansible, or Chef.
- Minimum 3 years of experience with monitoring, logging, and alerting systems, and defining/tracking SLAs, SLOs, and SLIs.
- Experience with setting up, testing, and monitoring distributed relational databases.
- Minimum 3 years of experience with CI/CD pipelines using Jenkins.
- Minimum 3 years of experience maintaining and operating containerized applications (Docker, Kubernetes).
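Insurance
Health insurance, employees' pension insurance, employment insurance
Holidays and leave
National holidays
Salary
Annual salary: 6,000,000 to 9,000,000 yen
Bonus
0
Employment period
No fixed term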