20 messages
Job opportunities (hiring or looking)
Archive: https://archive.sweetops.com/jobs/
X
Xeniaabout 1 month ago(edited)
👋 Hello, team!
NEW JOB OPENING: DevOps Engineer
https://www.activeprime.com/devopsengineer (full details and apply here)
Company: ActivePrime, Inc. (A Silicon Valley Data Quality SaaS company)
Location: FULLY REMOTE-You must be based either in the USA, Canada or Europe
Visa Sponsorship: N/A
Role Type: Independent Contractor for 30-40 hours per week
Hourly Rate: Starting at $65/hour + depending on experience
About the company
We are a dynamic, remote-first company building user-friendly, cutting-edge data cleansing
SaaS applications for CRM data. We focus on Salesforce and Heroku but remain fundamentally
CRM and data agnostic, serving customers with a distributed Silicon Valley team.
The Role
We're seeking a versatile DevOps Engineer to help scale our infrastructure and operations. This
is a "wear many hats" position perfect for someone who thrives in a fast-paced, distributed
environment and enjoys building systems that grow with the business.
What You'll Do
● Design and evolve AWS infrastructure using Python and 3rd party libraries.
● Build and maintain provisioning systems that scale with our growth.
● Assist with and eventually lead the provisioning system and software release process.
● Develop internal tools for employees and partners using our stack (Python, FastAPI, nicegui, PostgreSQL, Vue.js).
● Manage security policies and respond to customer security questionnaires.
● Support our remote-first culture through robust, reliable systems.
What We're Looking For
● Strong experience with AWS infrastructure and Python automation.
● Proven ability to work independently in a remote environment.
● Excellent async communication skills and self-management.
● Experience building scalable systems from the ground up.
● Comfortable with security best practices and compliance requirements.
● Adaptability to wear multiple hats and tackle diverse challenges.
● We release often, including some weekends. We need you to be available on these weekends.
Our Stack
AWS, Python, FastAPI, NiceGUI, PostgreSQL, Vue.js, plus various 3rd party integrations
NEW JOB OPENING: DevOps Engineer
https://www.activeprime.com/devopsengineer (full details and apply here)
Company: ActivePrime, Inc. (A Silicon Valley Data Quality SaaS company)
Location: FULLY REMOTE-You must be based either in the USA, Canada or Europe
Visa Sponsorship: N/A
Role Type: Independent Contractor for 30-40 hours per week
Hourly Rate: Starting at $65/hour + depending on experience
About the company
We are a dynamic, remote-first company building user-friendly, cutting-edge data cleansing
SaaS applications for CRM data. We focus on Salesforce and Heroku but remain fundamentally
CRM and data agnostic, serving customers with a distributed Silicon Valley team.
The Role
We're seeking a versatile DevOps Engineer to help scale our infrastructure and operations. This
is a "wear many hats" position perfect for someone who thrives in a fast-paced, distributed
environment and enjoys building systems that grow with the business.
What You'll Do
● Design and evolve AWS infrastructure using Python and 3rd party libraries.
● Build and maintain provisioning systems that scale with our growth.
● Assist with and eventually lead the provisioning system and software release process.
● Develop internal tools for employees and partners using our stack (Python, FastAPI, nicegui, PostgreSQL, Vue.js).
● Manage security policies and respond to customer security questionnaires.
● Support our remote-first culture through robust, reliable systems.
What We're Looking For
● Strong experience with AWS infrastructure and Python automation.
● Proven ability to work independently in a remote environment.
● Excellent async communication skills and self-management.
● Experience building scalable systems from the ground up.
● Comfortable with security best practices and compliance requirements.
● Adaptability to wear multiple hats and tackle diverse challenges.
● We release often, including some weekends. We need you to be available on these weekends.
Our Stack
AWS, Python, FastAPI, NiceGUI, PostgreSQL, Vue.js, plus various 3rd party integrations
E
W
Williamabout 1 month ago
I really need a job,proficiency in Ai and a personal assistant
C
Coleabout 1 month ago
Full-Stack Founding Engineer @ Trove
Targeting $80k - $125k CAD + 1.0% - 2.5% Equity
Toronto-based if possible, but open to fully remote work with periodic in-person work sessions for the right person
Trove is building the behavioral identity layer for the internet — a way to measure character, values, and taste through interactive fiction, not forms. Think Bandersnatch meets psychometrics, with real-world payoff.
We've shipped two viral campaigns with zero marketing spend, hit 77% D3 retention, and have inbound interest from employers who want what we're building. Pre-seed, current investors include Betaworks, Slack Fund, True Ventures and RRE. Now we're looking for a founding engineer to help us build the product that we already know works.
This is a 0→1 role. You'll own large chunks of the stack, think about what to do with behavioral data, and ship things that have never been built before. Full description (required reading) and application link here: https://trovewithin.notion.site/Full-Stack-Founding-Engineer-2a7a0346862c80089b83fdb17b4efc91
Targeting $80k - $125k CAD + 1.0% - 2.5% Equity
Toronto-based if possible, but open to fully remote work with periodic in-person work sessions for the right person
Trove is building the behavioral identity layer for the internet — a way to measure character, values, and taste through interactive fiction, not forms. Think Bandersnatch meets psychometrics, with real-world payoff.
We've shipped two viral campaigns with zero marketing spend, hit 77% D3 retention, and have inbound interest from employers who want what we're building. Pre-seed, current investors include Betaworks, Slack Fund, True Ventures and RRE. Now we're looking for a founding engineer to help us build the product that we already know works.
This is a 0→1 role. You'll own large chunks of the stack, think about what to do with behavioral data, and ship things that have never been built before. Full description (required reading) and application link here: https://trovewithin.notion.site/Full-Stack-Founding-Engineer-2a7a0346862c80089b83fdb17b4efc91
S
Suzuki Renabout 1 month ago
Hello Everyone
We are currently seeking talented professionals to join our growing engineering team and I am sure you will be great fit for one of our roles. Open roles include Full Stack Developers, AI Engineers, DevOps Engineers, and Technical Speakers so on.
If you are passionate about building innovative solutions and working in a collaborative, forward-thinking environment, we would love to hear from you.
To learn more about these opportunities and apply, please visit: https://nttsoftware.org/careers
If you have any questions or would like to connect directly, feel free to send me a DM.
We are currently seeking talented professionals to join our growing engineering team and I am sure you will be great fit for one of our roles. Open roles include Full Stack Developers, AI Engineers, DevOps Engineers, and Technical Speakers so on.
If you are passionate about building innovative solutions and working in a collaborative, forward-thinking environment, we would love to hear from you.
To learn more about these opportunities and apply, please visit: https://nttsoftware.org/careers
If you have any questions or would like to connect directly, feel free to send me a DM.
J
joshmyersabout 1 month ago
Any US based folks with production GCP experience (and battle scars) looking for a role? DM me
K
Kevin Southwickabout 1 month ago(edited)
Hey all -- Kevin Southwick here, open to contract, consulting, and FTE opportunities.
I've spent 15 years in cybersecurity across a range of roles and environments. Most recently I've been deep in AWS cloud security, building AWSight (awsight.com), a no nonsense CSPM platform that runs automated security checks across AWS environments. That work has kept me sharp on the infrastructure and tooling side, but my background is broader than cloud.
What I bring:
• Cloud security assessments and posture management across AWS environments
• IAM architecture, hardening, and least-privilege design
• Vulnerability assessment and hands-on remediation
• Compliance program support and audit readiness
• Security architecture review for cloud-native and hybrid environments
• Incident response planning and tabletop support
• Experience with both startup and enterprise environments
• Originally have a SOC, threat modelling, and DFIR background
• Active certs include: CCSP and AWS Certified Security - Specialty
I'm comfortable going heads-down on a specific engagement, embedding with an existing security team, or taking on a broader advisory role depending on what you need. I've worked across commercial and government environments: SAM registered, CAGE code confirmed, GovCloud experience.
US-based, fully remote, available now.
https://www.linkedin.com/in/southwick-awsight/
I've spent 15 years in cybersecurity across a range of roles and environments. Most recently I've been deep in AWS cloud security, building AWSight (awsight.com), a no nonsense CSPM platform that runs automated security checks across AWS environments. That work has kept me sharp on the infrastructure and tooling side, but my background is broader than cloud.
What I bring:
• Cloud security assessments and posture management across AWS environments
• IAM architecture, hardening, and least-privilege design
• Vulnerability assessment and hands-on remediation
• Compliance program support and audit readiness
• Security architecture review for cloud-native and hybrid environments
• Incident response planning and tabletop support
• Experience with both startup and enterprise environments
• Originally have a SOC, threat modelling, and DFIR background
• Active certs include: CCSP and AWS Certified Security - Specialty
I'm comfortable going heads-down on a specific engagement, embedding with an existing security team, or taking on a broader advisory role depending on what you need. I've worked across commercial and government environments: SAM registered, CAGE code confirmed, GovCloud experience.
US-based, fully remote, available now.
https://www.linkedin.com/in/southwick-awsight/
C
CK24 days ago(edited)
Hey 👋
If your infra was built to ship fast, not to scale — this might be for you.
Most early teams hit the same wall: the cloud setup that got you to launch is now the thing slowing you down (or keeping you up at night). Deployments are nerve-wracking. The bill keeps growing. Nobody's sure what's actually running in prod.
I run CloudSRE Consulting and have 25+ years of experience across software engineering, systems architecture, and infrastructure roles — with the last 10 years focused exclusively on cloud infrastructure, platform engineering, and site reliability engineering across AWS, GCP, and Azure.
What I bring:
• Reduced infrastructure costs by 42% and improved uptime from 99.7% → 99.95% on a SaaS platform processing $500M+ in annual transactions
• Cut deployment time by 67% (45 min → 15 min) and deployment errors by 85% through GitOps and Terraform migrations
• Scaled platforms 300% while maintaining sub-5-minute deployment velocity
• Led SOC2 Type II certifications with zero audit findings across multiple engagements
• Built end-to-end GitOps pipelines (ArgoCD, FluxCD, GitHub Actions) across 20+ microservice environments
• Implemented HashiCorp Vault secrets management, eliminating credential-related incidents entirely
• Reduced MTTR from 90 → 35 minutes through observability stack improvements and runbook automation
• Delivered $240K+ in annualized cloud cost savings through rightsizing, Savings Plans, and lifecycle policies
I'm comfortable going deep on a specific problem (a messy Terraform codebase, a reliability gap before a big launch, a cloud bill that won't stop growing), embedding with an existing platform or SRE team, or taking on a broader infrastructure ownership role depending on what you need.
Core stack: Terraform · Kubernetes (EKS/GKE/AKS) · GitHub Actions · ArgoCD · FluxCD · Helm · Prometheus · Grafana · HashiCorp Vault · AWS · GCP · Azure
India-based (IST), fully remote, available now. Works well with EU, ME, APAC, and US East Coast timezones.
hello@cloudsre.consulting
🔗 https://www.cloudsre.consulting/
If your infra was built to ship fast, not to scale — this might be for you.
Most early teams hit the same wall: the cloud setup that got you to launch is now the thing slowing you down (or keeping you up at night). Deployments are nerve-wracking. The bill keeps growing. Nobody's sure what's actually running in prod.
I run CloudSRE Consulting and have 25+ years of experience across software engineering, systems architecture, and infrastructure roles — with the last 10 years focused exclusively on cloud infrastructure, platform engineering, and site reliability engineering across AWS, GCP, and Azure.
What I bring:
• Reduced infrastructure costs by 42% and improved uptime from 99.7% → 99.95% on a SaaS platform processing $500M+ in annual transactions
• Cut deployment time by 67% (45 min → 15 min) and deployment errors by 85% through GitOps and Terraform migrations
• Scaled platforms 300% while maintaining sub-5-minute deployment velocity
• Led SOC2 Type II certifications with zero audit findings across multiple engagements
• Built end-to-end GitOps pipelines (ArgoCD, FluxCD, GitHub Actions) across 20+ microservice environments
• Implemented HashiCorp Vault secrets management, eliminating credential-related incidents entirely
• Reduced MTTR from 90 → 35 minutes through observability stack improvements and runbook automation
• Delivered $240K+ in annualized cloud cost savings through rightsizing, Savings Plans, and lifecycle policies
I'm comfortable going deep on a specific problem (a messy Terraform codebase, a reliability gap before a big launch, a cloud bill that won't stop growing), embedding with an existing platform or SRE team, or taking on a broader infrastructure ownership role depending on what you need.
Core stack: Terraform · Kubernetes (EKS/GKE/AKS) · GitHub Actions · ArgoCD · FluxCD · Helm · Prometheus · Grafana · HashiCorp Vault · AWS · GCP · Azure
India-based (IST), fully remote, available now. Works well with EU, ME, APAC, and US East Coast timezones.
hello@cloudsre.consulting
🔗 https://www.cloudsre.consulting/
A
Akshay Ghalme23 days ago
👋 Hey SweetOps! I'm Akshay — a DevOps/Cloud Infrastructure Engineer based in Pune, India with ~4 years of production AWS experience.
🔍️ Actively looking for remote DevOps / Cloud Infrastructure / SRE roles with US or EU companies.
🛠️ What I work with daily:
• AWS (IAM, VPC, EC2, EKS, S3, Cost Optimization)
• Terraform • Kubernetes • Docker
• Jenkins • GitHub Actions • ArgoCD • GitOps
• Prometheus + Grafana + Loki + OpenTelemetry
• Trivy • Cosign • Kyverno • Checkov (supply-chain security)
📌 What I've actually shipped:
• 💰️ Reduced AWS infrastructure costs by ~80% through right-sizing, gp2→gp3 migration, Reserved Instance planning & traffic re-architecture
• 🌐 Managed multi-tenant SaaS infra for 1,000+ customer subdomains across 30+ countries at 99.9% uptime
• 🔒️ Zero public DB exposure — least-privilege IAM, private subnets, passed all security audits
• ⚡️ Zero-downtime deployments with Jenkins CI/CD + automated rollback
• AWS Certified Solutions Architect – Associate (March 2026)
💡 I'm the kind of engineer who thinks in systems, cares about the 2 AM on-call experience, and brings production-grade thinking to every decision — not just "it works in staging" energy.
Open to full-time remote contracts or salaried roles. Happy to chat — DM me or find me at 👉️ akshayghalme.com
🔍️ Actively looking for remote DevOps / Cloud Infrastructure / SRE roles with US or EU companies.
🛠️ What I work with daily:
• AWS (IAM, VPC, EC2, EKS, S3, Cost Optimization)
• Terraform • Kubernetes • Docker
• Jenkins • GitHub Actions • ArgoCD • GitOps
• Prometheus + Grafana + Loki + OpenTelemetry
• Trivy • Cosign • Kyverno • Checkov (supply-chain security)
📌 What I've actually shipped:
• 💰️ Reduced AWS infrastructure costs by ~80% through right-sizing, gp2→gp3 migration, Reserved Instance planning & traffic re-architecture
• 🌐 Managed multi-tenant SaaS infra for 1,000+ customer subdomains across 30+ countries at 99.9% uptime
• 🔒️ Zero public DB exposure — least-privilege IAM, private subnets, passed all security audits
• ⚡️ Zero-downtime deployments with Jenkins CI/CD + automated rollback
• AWS Certified Solutions Architect – Associate (March 2026)
💡 I'm the kind of engineer who thinks in systems, cares about the 2 AM on-call experience, and brings production-grade thinking to every decision — not just "it works in staging" energy.
Open to full-time remote contracts or salaried roles. Happy to chat — DM me or find me at 👉️ akshayghalme.com
S
S.Taro21 days ago
Hey everyone 👋
I’m a full-stack developer & AI engineer with 8 years of experience helping startups and product teams design, build, and scale high quality products. I focus on delivering systems that are easy to use, reliable in production, and built to scale from day one.
💡 What I do
• Build end-to-end web platforms (SaaS, dashboards, marketplaces, internal tools)
• Develop and integrate AI features (chatbots, copilots, automation, agents)
• Design scalable backend architectures & APIs
• Build data pipelines (scraping, ETL, real-time processing)
• Optimize, refactor, and scale existing systems
🚀 How I help clients
• Turn ideas into production-ready products (MVP → scale)
• Fix slow, unstable, or hard-to-maintain systems
• Add AI that actually delivers business value (not just demos)
• Act as a long-term technical partner, not just a coder
🧩 Tech Stack
• Frontend: React, Next.js, Nuxt, TypeScript
• Backend: Node.js, Python (FastAPI, Django), Golang, Rust
• AI/ML: OpenAI, Claude, Hugging Face, LangChain, LlamaIndex, RAG pipelines, vector DBs (Pinecone, Weaviate, FAISS)
• Data & Realtime: PostgreSQL, MongoDB, Redis, WebSockets, GraphQL
• DevOps & Cloud: AWS, GCP, Docker
If you’re building something ambitious and need a reliable senior engineer to take it from idea to scalable product, feel free to reach out 🙌
https://dev.aoi-webstudio.com
WhatsApp : <tel:+819032703814|+81 90-3270-3814>
Email : suzukitaro0512@gmail.com
I’m a full-stack developer & AI engineer with 8 years of experience helping startups and product teams design, build, and scale high quality products. I focus on delivering systems that are easy to use, reliable in production, and built to scale from day one.
💡 What I do
• Build end-to-end web platforms (SaaS, dashboards, marketplaces, internal tools)
• Develop and integrate AI features (chatbots, copilots, automation, agents)
• Design scalable backend architectures & APIs
• Build data pipelines (scraping, ETL, real-time processing)
• Optimize, refactor, and scale existing systems
🚀 How I help clients
• Turn ideas into production-ready products (MVP → scale)
• Fix slow, unstable, or hard-to-maintain systems
• Add AI that actually delivers business value (not just demos)
• Act as a long-term technical partner, not just a coder
🧩 Tech Stack
• Frontend: React, Next.js, Nuxt, TypeScript
• Backend: Node.js, Python (FastAPI, Django), Golang, Rust
• AI/ML: OpenAI, Claude, Hugging Face, LangChain, LlamaIndex, RAG pipelines, vector DBs (Pinecone, Weaviate, FAISS)
• Data & Realtime: PostgreSQL, MongoDB, Redis, WebSockets, GraphQL
• DevOps & Cloud: AWS, GCP, Docker
If you’re building something ambitious and need a reliable senior engineer to take it from idea to scalable product, feel free to reach out 🙌
https://dev.aoi-webstudio.com
WhatsApp : <tel:+819032703814|+81 90-3270-3814>
Email : suzukitaro0512@gmail.com
A
A. Fahmy21 days ago(edited)
Hi All,
I'm sharing a new opportunity: Founding Engineer at Elyos.ai!
Location: London, United Kingdom
Workplace: On-site
Skills: Python, TypeScript, Cloud infrastructure, Event-driven systems, Third-party integrations
~ £150-170K/annum
Build and ship core product features, work with customers, and shape AI voice agents for trades businesses as a founding engineer.
Notes:
Eligibility:
• Proficiency with at least Python or TypeScript
• Experience building and shipping AI products end-to-end (DM me for details / courses in order to prepare)
• Comfort working across the stack - frontend, backend, infrastructure
• Ability to work directly with customers or non-technical stakeholders
Check it out: https://jobs.techtree.dev/job/da1cf5a3-49c5-4a97-808f-b93c3c0d8794?tp=04b74924-4168-417c-91a8-2ec52e5b28a1
I'm sharing a new opportunity: Founding Engineer at Elyos.ai!
Location: London, United Kingdom
Workplace: On-site
Skills: Python, TypeScript, Cloud infrastructure, Event-driven systems, Third-party integrations
~ £150-170K/annum
Build and ship core product features, work with customers, and shape AI voice agents for trades businesses as a founding engineer.
Notes:
Eligibility:
• Proficiency with at least Python or TypeScript
• Experience building and shipping AI products end-to-end (DM me for details / courses in order to prepare)
• Comfort working across the stack - frontend, backend, infrastructure
• Ability to work directly with customers or non-technical stakeholders
Check it out: https://jobs.techtree.dev/job/da1cf5a3-49c5-4a97-808f-b93c3c0d8794?tp=04b74924-4168-417c-91a8-2ec52e5b28a1
A
Alex Kwan20 days ago(edited)
Hi All,
My company, Brown and Caldwell, is looking to bring on an engineer well versed in Azure cloud infrastructure. The job title is Data & AI Platform Engineer but the work will involve a good amount of DevOps and Cloud Infrastructure work. See details in the job description. The role is remote but we will not be considering candidates outside the US. Also we will not sponsor applicants for work visas for this position.
My company, Brown and Caldwell, is looking to bring on an engineer well versed in Azure cloud infrastructure. The job title is Data & AI Platform Engineer but the work will involve a good amount of DevOps and Cloud Infrastructure work. See details in the job description. The role is remote but we will not be considering candidates outside the US. Also we will not sponsor applicants for work visas for this position.
E
M
S
Sven Verleyen15 days ago
Hi All 👋
Cloudpepper is hiring a Senior Platform / DevOps Engineer.
Stack: Symfony/PHP, Ansible, Linux, PostgreSQL, Nginx, Python
We run the platform behind 10,000+ Odoo instances. This is a senior role spanning backend platform work in Symfony/PHP and ownership of the Linux + Ansible automation underneath.
Location: Remote with CET overlap, or Brussels-based
Comp: $150k–$180k/year
Small team, high ownership, very little bureaucracy.
Full role + apply: https://cloudpepper.io/careers/platform-engineer/
Cloudpepper is hiring a Senior Platform / DevOps Engineer.
Stack: Symfony/PHP, Ansible, Linux, PostgreSQL, Nginx, Python
We run the platform behind 10,000+ Odoo instances. This is a senior role spanning backend platform work in Symfony/PHP and ownership of the Linux + Ansible automation underneath.
Location: Remote with CET overlap, or Brussels-based
Comp: $150k–$180k/year
Small team, high ownership, very little bureaucracy.
Full role + apply: https://cloudpepper.io/careers/platform-engineer/
A
A. Fahmy15 days ago
🚀 Hiring Alert: Founding Engineer at Elastics
An exciting early-stage opportunity for engineers who enjoy ownership, fast execution, and building systems at the cutting edge of AI and financial markets.
📍 Location: 🇵🇱 Warsaw, Poland
🏢 Workplace: On-site - $120k
⚠️ Please note: This role does not sponsor a work visa.
What you will do:
✅️ Design and build infrastructure for AI agents in prediction markets
✅️ Create real-time data pipelines and execution engines
✅️ Build asynchronous and distributed backend systems
✅️ Integrate exchanges, APIs, websockets, and live market data
✅️ Develop risk systems and market discovery dashboards
✅️ Support Web3 integrations including wallets and on-chain data
✅️ Deploy and scale services using Docker, CI/CD, and cloud platforms
Ideal background:
🔹 5+ years of backend engineering experience
🔹 Strong skills in Go, Python, PostgreSQL, Kafka, Docker
🔹 Experience with distributed systems and financial data pipelines
🔹 Interest in trading, markets, and quantitative systems
🔹 Startup mindset with strong ownership and speed
🔹 Degree in Computer Science, Mathematics, or related field
A brilliant chance to join early, work directly with founders, and shape the future of AI-native trading infrastructure.
Apply here:
https://jobs.techtree.dev/job/cb7b2018-bc53-44be-ae3b-057aa66515e3?tp=04b74924-4168-417c-91a8-2ec52e5b28a1
An exciting early-stage opportunity for engineers who enjoy ownership, fast execution, and building systems at the cutting edge of AI and financial markets.
📍 Location: 🇵🇱 Warsaw, Poland
🏢 Workplace: On-site - $120k
⚠️ Please note: This role does not sponsor a work visa.
What you will do:
✅️ Design and build infrastructure for AI agents in prediction markets
✅️ Create real-time data pipelines and execution engines
✅️ Build asynchronous and distributed backend systems
✅️ Integrate exchanges, APIs, websockets, and live market data
✅️ Develop risk systems and market discovery dashboards
✅️ Support Web3 integrations including wallets and on-chain data
✅️ Deploy and scale services using Docker, CI/CD, and cloud platforms
Ideal background:
🔹 5+ years of backend engineering experience
🔹 Strong skills in Go, Python, PostgreSQL, Kafka, Docker
🔹 Experience with distributed systems and financial data pipelines
🔹 Interest in trading, markets, and quantitative systems
🔹 Startup mindset with strong ownership and speed
🔹 Degree in Computer Science, Mathematics, or related field
A brilliant chance to join early, work directly with founders, and shape the future of AI-native trading infrastructure.
Apply here:
https://jobs.techtree.dev/job/cb7b2018-bc53-44be-ae3b-057aa66515e3?tp=04b74924-4168-417c-91a8-2ec52e5b28a1
S
Stephen Oholendt14 days ago
Principal SRE @ Upstart (Remote-US / Remote-Canada) - NO RELO TO US / CANADA
Hi everybody! We’re hiring a Principal SRE at Upstart to help define how reliability and performance are built across a publicly traded, rapid-paced, tech-first company with 1,500+ employees, and 700+ engineers.
This role sits at the intersection of software engineering + SRE, but at this level it’s about setting strategy, driving adoption, and influencing org-wide engineering practices - but we also need exceptional hands-on skills (not just architects).
We’ve historically leaned into production engineering, but the shift now is toward performance engineering and proactive reliability. Think latency, efficiency, cost, and user experience - not just uptime.
What you’d actually do:
• Define and drive SRE principles across dozens of engineering teams
• Partner with leadership on defining & executing a long-term reliability + observability strategy
• Lead initiatives like distributed tracing, RUM, LCP, and performance standards
• Build and scale self-healing systems (including GenAI/LLM-driven approaches)
• Influence incident management, ML reliability, and engineering velocity across the org
• Own cross-functional initiatives end-to-end
This is a high-impact, highly cross-functional role working with Product Eng, DevEx, Data, and ML to raise the bar on SRE excellence company-wide
Must-haves (please self-select):
• ~10+ years across both SWE + SRE (not purely ops)
• Proven track record driving SRE practices org-wide (not just within your team)
• Strong background in observability (distributed tracing, RUM, performance metrics)
• Experience building internal tooling / platforms from scratch
• Hands-on expertise with several languages such as Python, Go, or TypeScript + modern cloud infra (AWS, K8s, Terraform)
• Comfortable operating as a technical leader and influencing senior stakeholders
Big plus!
• Prior experience building/maturing, SRE-related programs (ex. SLO Program)
• Experience successfully applying GenAI for SRE automation
If you’re someone who’s set reliability strategy at scale and cares about performance as much as uptime or the Google SRE playbook, this is probably a good fit.
---
Tech: AWS, EKS, Terraform, CDK, Datadog, Python/Go/TS, Istio, etc.
Salary Range (US): $195,300—$270,400 USD
Salary Range (Canada): $182,800—$230,000 CAD
Visa Sponsorship: Yes (but you have to currently be in US or Canada - no exceptions)
---
If interested (or you know somebody else that is), please DM me w a resume or link to your profile/experience, or apply at the following link (then DM me after): https://careers.upstart.com/jobs/principal-site-reliability-engineer?gh_src=83373b341us
Hi everybody! We’re hiring a Principal SRE at Upstart to help define how reliability and performance are built across a publicly traded, rapid-paced, tech-first company with 1,500+ employees, and 700+ engineers.
This role sits at the intersection of software engineering + SRE, but at this level it’s about setting strategy, driving adoption, and influencing org-wide engineering practices - but we also need exceptional hands-on skills (not just architects).
We’ve historically leaned into production engineering, but the shift now is toward performance engineering and proactive reliability. Think latency, efficiency, cost, and user experience - not just uptime.
What you’d actually do:
• Define and drive SRE principles across dozens of engineering teams
• Partner with leadership on defining & executing a long-term reliability + observability strategy
• Lead initiatives like distributed tracing, RUM, LCP, and performance standards
• Build and scale self-healing systems (including GenAI/LLM-driven approaches)
• Influence incident management, ML reliability, and engineering velocity across the org
• Own cross-functional initiatives end-to-end
This is a high-impact, highly cross-functional role working with Product Eng, DevEx, Data, and ML to raise the bar on SRE excellence company-wide
Must-haves (please self-select):
• ~10+ years across both SWE + SRE (not purely ops)
• Proven track record driving SRE practices org-wide (not just within your team)
• Strong background in observability (distributed tracing, RUM, performance metrics)
• Experience building internal tooling / platforms from scratch
• Hands-on expertise with several languages such as Python, Go, or TypeScript + modern cloud infra (AWS, K8s, Terraform)
• Comfortable operating as a technical leader and influencing senior stakeholders
Big plus!
• Prior experience building/maturing, SRE-related programs (ex. SLO Program)
• Experience successfully applying GenAI for SRE automation
If you’re someone who’s set reliability strategy at scale and cares about performance as much as uptime or the Google SRE playbook, this is probably a good fit.
---
Tech: AWS, EKS, Terraform, CDK, Datadog, Python/Go/TS, Istio, etc.
Salary Range (US): $195,300—$270,400 USD
Salary Range (Canada): $182,800—$230,000 CAD
Visa Sponsorship: Yes (but you have to currently be in US or Canada - no exceptions)
---
If interested (or you know somebody else that is), please DM me w a resume or link to your profile/experience, or apply at the following link (then DM me after): https://careers.upstart.com/jobs/principal-site-reliability-engineer?gh_src=83373b341us
R
Rishi13 days ago
Hey folks,
I'm Rishikesh — Senior Platform & DevOps Engineer with 7 years of experience, currently available after my contract ended due to layoffs from Invisible Technologies.
What I've been building:
• End-to-end CI/CD and platform infra for 15+ production client environments — 6+ months with zero unplanned downtime
• Re-architected Celery worker pools (single-queue → multi-queue) that cut a representative task from ~10 min → 1–2 min, measured in Datadog APM
• Deployed OpenBao across 5 production K8s clusters for in-cluster secrets management, replacing a SaaS tool to meet client compliance requirements
• Built Backstage service-bootstrap templates used as the daily golden path for new service creation across the engineering org
• Provisioned AWS infrastructure, CI/CD, and secrets management for RL / agentic training environments (Meta ARE-inspired isolation architecture)
• Worked on evaluating and implementing service mesh in the infra (Istio and Linkerd)
Stack: Python (Django, FastAPI, Celery), Kubernetes, Terraform/OpenTofu, GitHub Actions, ArgoCD, Postgres, AWS/GCP
Looking for: Senior/Staff Platform Engineer, SRE, or DevOps. Remote preferred. (Flexible to any timezone)
Open to contract or FTE.
Happy to share my resume — DM me or reply here.
Linkedin: https://www.linkedin.com/in/rishikesh-vijay-gajelli/
Github: github.com/AlphaRishi1229
I'm Rishikesh — Senior Platform & DevOps Engineer with 7 years of experience, currently available after my contract ended due to layoffs from Invisible Technologies.
What I've been building:
• End-to-end CI/CD and platform infra for 15+ production client environments — 6+ months with zero unplanned downtime
• Re-architected Celery worker pools (single-queue → multi-queue) that cut a representative task from ~10 min → 1–2 min, measured in Datadog APM
• Deployed OpenBao across 5 production K8s clusters for in-cluster secrets management, replacing a SaaS tool to meet client compliance requirements
• Built Backstage service-bootstrap templates used as the daily golden path for new service creation across the engineering org
• Provisioned AWS infrastructure, CI/CD, and secrets management for RL / agentic training environments (Meta ARE-inspired isolation architecture)
• Worked on evaluating and implementing service mesh in the infra (Istio and Linkerd)
Stack: Python (Django, FastAPI, Celery), Kubernetes, Terraform/OpenTofu, GitHub Actions, ArgoCD, Postgres, AWS/GCP
Looking for: Senior/Staff Platform Engineer, SRE, or DevOps. Remote preferred. (Flexible to any timezone)
Open to contract or FTE.
Happy to share my resume — DM me or reply here.
Linkedin: https://www.linkedin.com/in/rishikesh-vijay-gajelli/
Github: github.com/AlphaRishi1229
E
Erik Liu12 days ago
Hi Everybody! I am a senior software engineer for over 9 years. Now I am seeking a good partner who skilled DevOps or Data Engineering in US or Canada. Must be a real man, NOT SCAM...
M