About the Project
DBot Software is building a new, comprehensive digital commerce + design marketplace platform for a large design and eCommerce company operating in the US and Thailand. Alongside the new platform build, we also maintain and modernize a large, business-critical existing IT system.
We're hiring a DevOps Lead & Technical Project Manager with a clear DevOps-first focus. You will own cloud infrastructure, CI/CD, observability, security fundamentals, and production reliability, and you will communicate confidently with international stakeholders in excellent English.
The Role
This role is primarily hands-on DevOps / cloud engineering with operational ownership. You will build the foundation that allows engineering teams to deliver frequently and safely.
You will:
- design and operate production-grade cloud infrastructure
- build/own CI/CD pipelines and release safety patterns
- ensure database reliability and performance as part of platform stability
- implement monitoring/alerting and incident routines
- coordinate with engineering and product on delivery readiness (light PM, not the main focus)
Key Responsibilities
Cloud & Infrastructure Ownership
- Design, implement, and maintain cloud environments (AWS/Azure/GCP) for dev/stage/prod
- Build and manage Infrastructure as Code (Terraform/CloudFormation/Bicep or equivalent)
- Manage IAM, networking, secrets, and environment configurations securely
- Drive platform scalability, cost efficiency (FinOps mindset), and operational resilience
CI/CD & Release Engineering
- Build and maintain robust CI/CD pipelines (automated testing gates, promotion flows)
- Implement safe deployment strategies (blue/green, canary, rollbacks)
- Standardize release processes and ensure reproducible deployments
- Improve developer experience: fast pipelines, clean environments, reliable tooling
Observability & Reliability Engineering
- Implement and own observability: logs, metrics, tracing, dashboards, alerting
- Define operational KPIs (SLOs/SLAs), error budgets where applicable, and uptime practices
- Lead incident response routines (triage, mitigation, postmortems, prevention)
- Create runbooks, on-call guidelines (if applicable), and operational documentation
Database & Data Reliability
- Ensure database reliability: backups, restore tests, high availability / DR concepts
- Improve performance: indexing, query tuning guidance, connection management, caching strategy support
- Support safe migrations and release coordination around schema/data changes
- Define standards for retention, privacy, access logging, and auditability (where required)
Cross-Team Collaboration (Light PM)
- Coordinate release readiness across teams and time zones (US/Thailand)
- Surface risks early (infra, security, performance, reliability) and drive resolution
- Provide clear status updates and technical communication to stakeholders
Required Skills & Experience (Must-Have)
- Senior experience in DevOps / Cloud Engineering (production operations, not just setup)
- Strong cloud expertise in AWS or Azure or GCP (architecture + operations)
- Strong CI/CD expertise and release engineering ownership
- Strong Infrastructure as Code experience
- Strong understanding of databases (SQL, performance tuning basics, backup/restore, HA/DR concepts)
- Solid foundation in security basics: IAM, secrets management, network security, least privilege
- Hands-on container experience (Docker) and comfort with modern deployment patterns
- Proven ability to troubleshoot production issues end-to-end (logs, metrics, infra, app symptoms)
English Requirement
- Excellent English communication skills (spoken + written)
- You must be comfortable leading technical discussions, writing clear updates, and aligning stakeholders across international teams.
Nice to Have (Strong Plus)
- Kubernetes (EKS/AKS/GKE) or managed container orchestration experience
- Experience with marketplace/eCommerce systems (traffic peaks, search, orders, payments)
- Monitoring stacks like Datadog, Grafana/Prometheus, ELK, OpenTelemetry
- Experience supporting Node.js-based platforms and modern web stacks (React/Next.js) is helpful but not required
- Security/compliance familiarity (audit trails, SOC2/ISO-style controls, vulnerability routines)
Why Join DBot Software
- High-impact ownership: you shape how the platform runs in production
- Real-world enterprise context: large system, real users, real consequences
- International environment (US/Thailand), professional engineering culture
- Hybrid/remote flexibility depending on project needs
- Office in central Bangkok (near Phrom Phong BTS)
- Health insurance + life insurance
- 12-20 days annual leave (depending on level/contract)
How to Apply
Apply via LinkedIn and include:
- CV / LinkedIn profile
- 3 short bullets on:
  - the largest production system you operated (cloud + scale)
  - your CI/CD approach (tools + deployment strategy)
  - your strongest database work (what you improved, measurable if possible)