Local Development Environments
Rapidly onboard new developers. Efficiently iterate on features.
Rapid Software Development...
Reproducible 100% of the time. Spin up as many environments as needed and treat them all the same.
Treat everything as "Infrastructure as Code" for more manageable environments. Reduce the human element wherever possible.
EASY TO USE
Developers should be able to get up and running and contribute their first Pull Request on day one.
Simple enough that anyone on your team should be able to use it.
You need a process to reliably release software at any time and without downtime.
Confidence that it works...
Leverage Codefresh, GitHub Actions, or other systems like AWS CodeBuild and Jenkins to build and test every commit. Know exactly which commit broke the build every time.
Deploy exactly what was tested to any cluster using immutable containers. Identify problems before they get into production. Run identical environments to eliminate headaches.
Preview Environments enable any branch or Pull Request to be deployed as a short-lived ephemeral environment. Unlimited environments ensure developers are unblocked to test their changes.
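As a small illustration of one detail behind preview environments, the sketch below derives a per-branch environment name that is safe to use as a Kubernetes namespace (lowercase letters, digits, and hyphens, at most 63 characters). The function name and the "preview" prefix are hypothetical and only meant to show the idea, not how any particular pipeline names its environments.

```python
import re


def preview_namespace(branch, prefix="preview"):
    """Turn a branch name into a DNS-label-safe environment name.

    Hypothetical helper: lowercases the branch, replaces every run of
    characters outside a-z and 0-9 with a single hyphen, and trims the
    result to the 63-character limit for DNS labels.
    """
    slug = re.sub(r"[^a-z0-9]+", "-", branch.lower()).strip("-")
    name = f"{prefix}-{slug}" if slug else prefix
    # Truncate, then drop any trailing hyphen left by the cut.
    return name[:63].rstrip("-")
```

A branch such as feature/Add_Login would map to preview-feature-add-login, giving each branch or Pull Request its own predictable, disposable namespace.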
Zero downtime, rolling deployments are accomplished automatically using Kubernetes with Helm. Need a service mesh like Istio? No problem.
Continually test every change made to your infrastructure and ensure all systems are go. "Operations by Pull Request" ensures that anyone who can open a Pull Request is capable of contributing.
Infrastructure as code means it can be tested as code.
Easy rollbacks when things don't work as expected. Just revert to the previous deployment without bending over backward.
Zero downtime, rolling deployments are accomplished automatically by Kubernetes.
Improve overall stability by catching problems early. Treat every problem as an opportunity to eliminate future headaches.
Site Reliability Engineering
Monitor everything that your organization depends on to meet SLAs, which means keeping an eye on both internal and external services.
Dashboards provide an overview of everything at a glance and provide the necessary transparency across departments. Get everyone on the same page and working towards the same goals by giving them the insights they need to do it.
KEY PERFORMANCE INDICATORS
KPIs provide the benchmarks for success. They give a concrete indicator when things are working or broken. Alert based on thresholds instead of discrete events. Generate actionable notifications that escalate only when it matters to On-Call Engineers.
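As a rough illustration of alerting on thresholds rather than discrete events, the sketch below tracks an error rate over a sliding window of recent requests and fires only while that rate exceeds a limit, instead of on every individual failure. The class name, window size, and 5% threshold are hypothetical choices and are not tied to any particular monitoring product.

```python
from collections import deque


class ErrorRateAlert:
    """Alert on a sustained error rate instead of on single failed requests.

    Hypothetical sketch: the window size and threshold are illustrative values.
    """

    def __init__(self, window_size=100, threshold=0.05):
        self.window = deque(maxlen=window_size)  # keeps only the most recent outcomes
        self.threshold = threshold

    def record(self, ok):
        """Record one request outcome and return True if the alert should fire."""
        self.window.append(0 if ok else 1)
        # Wait for a full window so a single early failure cannot trigger an alert.
        if len(self.window) < self.window.maxlen:
            return False
        error_rate = sum(self.window) / len(self.window)
        return error_rate > self.threshold
```

With this shape, one failed request in an otherwise healthy window stays quiet, while a sustained rise in failures produces a notification worth escalating.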
Monitor internal services for both availability and correctness. Aggregate and report on logs collected from all services across all machines.
MONITORING AS CODE
External services are just as integral to the performance of your product as internal ones. Monitor all dependencies as if they were your own. Escalate before their problems become yours.
Know your limits...
Collect and ship logs somewhere for easy reporting.
Reporting on logs requires visualization of events because that's the only way to make sense of mounds of data.
OPTIMIZE & REPEAT
Integrate with monitoring and alerting so that critical events are not lost.
Training and Support
Foster an engineering culture that fuses ops and dev by cross-training engineers to achieve maximum productivity and complete business continuity.
Foster a DevOps culture...
DevOps involves constant cross-training of engineers to achieve business continuity at the human-level.
Live pairing with your team via Zoom helps them pick things up quickly. Schedule time easily with any member of our team.
Best Practices exist to teach hard lessons more easily.
Cloud Technologies are evolving at an astonishing rate. Get help staying on top of the latest & greatest tech without getting overwhelmed.
Security & Compliance
Implement a strategy that bakes security into the DNA of the organization and addresses both technological attack vectors and social engineering.
Protect your business...
Cloud security involves hardening all components, restricting access with SSO/MFA, and having a bird's eye view of everything going on to quickly remediate any incident.
On-prem security is just as important as cloud security. Protect your intellectual property (IP) from being compromised. Lock down laptops, Wi-Fi, and physical access. A company is only as secure as its weakest link.
Auditing is the ongoing process of surfacing anomalous events happening across all systems by combing through logs aggregated centrally in tools like Splunk, Sumo Logic, or Kibana/Elasticsearch.
Secrets management ensures there's a formal process for storing, securing, and rotating passwords and keys. Well-designed solutions help ensure your company will not be tomorrow's headline news.
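One small piece of such a process, finding secrets that are overdue for rotation, might look like the sketch below. The secret names, the inventory format, and the 90-day policy are all hypothetical; a real setup would read this metadata from whichever secrets manager is in use.

```python
from datetime import datetime, timedelta, timezone


def overdue_secrets(last_rotated, max_age_days=90, now=None):
    """Return the sorted names of secrets last rotated more than max_age_days ago.

    last_rotated maps a secret name to a timezone-aware datetime
    (a hypothetical inventory format used only for this sketch).
    """
    now = now or datetime.now(timezone.utc)
    cutoff = now - timedelta(days=max_age_days)
    return sorted(name for name, rotated in last_rotated.items() if rotated < cutoff)
```

Running a check like this on a schedule turns the rotation policy into something that can be reported on and alerted on, rather than a rule that depends on someone remembering it.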
Our "Best Practices" exist to teach hard lessons more easily.
Gain the upper hand...
We'll perform a comprehensive 12-factor assessment on your code base.
We'll review your GitHub organization to make sure you're taking maximum advantage of the platform, including a comprehensive security assessment.
We'll review your Dockerfiles and Docker Compositions to make sure you're making best use of the tools.
We'll review your usage of Kubernetes and make recommendations on how to better leverage the platform to your advantage.
Hundreds of Terraform Modules
We are the largest provider of high-quality, well-maintained, 100% Open Source (Apache 2.0) Terraform Modules. All modules are tested with Terratest. Pull Requests welcome! View our Terraform Modules
Dozens of Helm Charts
What makes them special is that we've developed these charts to integrate with third-party services like GitHub for authentication (OAuth2) and Duo for MFA. View our Helm Charts
Helpful Slack Community
Join our community. It's FREE! This is the best place to talk shop, ask questions, solicit feedback, and work together as a community to build sweet infrastructure. Join our Slack Community
Here you'll find comprehensive guides and documentation to help you start working with the Cloud Posse technology stack as quickly as possible, as well as support if you get stuck. Read our Docs
Free Weekly "Office Hours"
Every week we hold a conference call via Zoom for our community members to share what they are working on and ask questions. Join our next call
Frequently Asked Questions
- Take our quiz to find out if we are a good fit!
- Book a discovery call to go over your exact challenges.
- If we can help, we'll execute a Mutual NDA (ours or yours), then collaborate with you on our Engagement Workbook using Google Docs.
- Once we agree on the general scope, we'll prepare a comprehensive Statement of Work (SOW) detailing the entire project.
- Once the Master Services Agreement (MSA) and SOW are executed, we'll send an invoice for the deposit and first Sprint.
- Work will commence shortly thereafter.
We work with companies anywhere in the world.
While most of our customers are based in the United States, we've worked with companies in the United Kingdom, Germany, Australia, Hong Kong, India, Argentina, etc. Our team is distributed across the US and Eastern Europe.
We can start as soon as you sign our Statement of Work. Typically we see this process take 2-3 weeks from the first introductory call to the start of our engagement.
Here's the checklist we'll need to complete before we can start:
- Execute Mutual NDA (ours or yours)
- Collaborate on Engagement Workbook via Google Docs
- Execute Statement of Work and Master Services Agreement
- Deposit Payment
We can kick off the initial introductory call immediately, so please make sure that you schedule it today.
After talking with you and assessing if we're a proper fit, we'll execute a Mutual NDA and then send over an Engagement Workbook so we can gather all the requirements for your project and estimate the cost.
We can easily add additional sprints to a Scope of Work. We just need to agree on what goes into a Sprint, which will determine the number of Sprints required.
Our typical engagement model begins with a complete platform rollout. This includes roughly 6-8 sprints, each one 1-2 weeks in duration. During this time we set up all AWS Accounts with IAM federation, CloudTrail audit logs, a comprehensive release engineering process, total observability with our Site Reliability Engineering (SRE) sprint, Remote Access Management (Teleport and Keycloak), and GitOps "Operations by Pull Request".
The first engagement takes roughly 3-4 months to complete. These engagements have extremely well-defined project plans. Ask us and we can show you what that looks like.
Customers most often decide to keep us on after the initial engagement for follow-up work.
We provide entirely optional ongoing support for customers who've gone through our DevOps Accelerator.
By and large, most of our customers take over the day-to-day management of their infrastructure.
We're still here, though, to help out anywhere you need it.
We do not provide 24×7 “on-call” (aka PagerDuty) support.
We offer all of our customers ongoing support for as long as they need it. Choose what's right for you.
- We provide free weekly support via our “Office Hours” webinars every Wednesday at 11:30 am PST. These calls last one hour and we'll answer as many of your questions as we can.
- We also provide optional support retainers which include a fixed block of hours that go towards maintenance and support. You'll have direct access to our team via a shared Slack channel in addition to the ability to schedule one-on-one calls via Zoom.