Today, just in time for KubeCon 2021, I am happy to announce the beta availability of Workflows. For me, this is our most exciting product announcement to date: a completely new capability that expands the definition of what Komodor is, as it charts the course for its next evolution.
Let me start with the feature first. In a nutshell, Workflows is a series of smart algorithms that operate within the “depths” of Komodor. Listening to the signals Komodor collects, Workflows algorithms can automatically:
Typically, what I described above could take hours and would likely require the involvement of several team members working with multiple tools.
With Workflows, however, it takes a mere second for the entire process to complete, turning troubleshooting into an effortless experience – something that anyone can do on the fly.
As interesting as this all may sound, our vision for Workflows is much broader. In this post I'll dive into what ‘Workflows’ is and the foundations it lays for the future.
The goal of Komodor is to take the complexity out of Kubernetes troubleshooting. Setting out, this meant building a tool that would streamline the process of root cause analysis: a tool that quickly answers the “who did what?” question by taking inventory of all changes and pinpointing the one that caused fires in production.
Having wasted an untold number of hours on this exact question ourselves, we thought this was a good place to start. It turns out that many other folks felt the same way and, by now, we have gotten used to (but not tired of) hearing “…this used to take HOURS” from our customers.
But we heard other things as well. From talking to dozens of organizations using Kubernetes we learned about the knowledge gaps that prevented some of the developers from being fully autonomous. The most common themes were:
As a result, in many organizations, the last mile of troubleshooting still fell on the shoulders of a few domain experts (e.g., SRE or DevOps leads) who carried the responsibility for fixing all production issues – big or small.
This created a bottleneck, but also a perfect opportunity for us to step in and improve the process. After all, we had already built a tool that helped those experts troubleshoot at record speeds… Why not bake their expertise into the product, and have every developer troubleshoot common issues on the fly?
This is how Workflows came to be. It started with us sitting down and mapping the different actions we expected our users to take when faced with a certain K8s issue. Very quickly we saw that, with the benefit of the right insights, these actions fell into predictable patterns that could be distilled into a series of checks.
If A1, C3, E3 and F5 are true, then do X, and so forth… The actual algorithms, however, are infinitely more complex. To demonstrate, here is a very small sub-segment of the workflow for the ImagePullBackOff error:
Figure: a sub-segment of the ImagePullBackOff workflow
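To make the "if A1, C3, E3 and F5 are true, then do X" idea more concrete, here is a minimal sketch of how such a series of checks could be expressed as a rule table. This is an illustration only, not Komodor's actual implementation: the fact names, the rules, and the suggestion texts are all hypothetical, and the real algorithms are far more involved.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional

# Hypothetical: the results of individual checks, keyed by check name.
Facts = Dict[str, bool]


@dataclass
class Rule:
    name: str
    conditions: List[str]  # every listed check must be true for the rule to match
    suggestion: str


# Hypothetical rules for an ImagePullBackOff scenario (illustrative only).
RULES = [
    Rule(
        name="missing-tag",
        conditions=["image_pull_backoff", "registry_reachable", "tag_not_found"],
        suggestion="Check the image tag in the workload spec.",
    ),
    Rule(
        name="bad-credentials",
        conditions=["image_pull_backoff", "registry_reachable", "auth_failed"],
        suggestion="Verify the image pull secret.",
    ),
    Rule(
        name="registry-down",
        conditions=["image_pull_backoff", "registry_unreachable"],
        suggestion="Check network access to the registry.",
    ),
]


def suggest(facts: Facts, rules: List[Rule] = RULES) -> Optional[str]:
    """Return the suggestion of the first rule whose conditions all hold."""
    for rule in rules:
        if all(facts.get(check, False) for check in rule.conditions):
            return rule.suggestion
    return None  # no rule matched, so no suggestion is made


if __name__ == "__main__":
    observed = {
        "image_pull_backoff": True,
        "registry_reachable": True,
        "auth_failed": True,
    }
    print(suggest(observed))
```

Keeping the rules as data rather than nested `if` statements is one possible design choice: it makes the order of evaluation explicit and lets new checks be added without touching the matching logic.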
After automating the steps an expert would take to fix the issue, our next question was: Just how opinionated did we want Komodor to be?
We definitely were not planning to be in the AIOps game. On the other hand, we were trying to automate troubleshooting and minimize the workload on our users.
With the above in mind, for the beta version, we settled on the following principles:
Example of Workflows recommendations
Above you can see the result – a detailed summary of checks run and the suggestion for the fix that appears in the context of our main dashboard.
This is just the first iteration, and it can and likely will change, but it feels like a good place to start. With the initial feedback from the beta validating our approach and concept, we look forward to expanding the functionality to cover more troubleshooting scenarios, while also improving the user experience.
If you are interested in Workflows and want to learn more, or even join our beta program, please use this link to apply.
As I’ve mentioned, this is just the beginning. As to what’s next, I don’t want to reveal too much but I can share that we already have plans for customization that will allow admins to create their own playbooks, granularly addressing the specific needs of their organization.
Imagine having full control of what checks our platform executes, in what order, and how their results shape the suggestion for the end users… That could be very powerful for any organization looking to streamline K8s troubleshooting and improve control over its processes. And who said we even have to stop at troubleshooting?
We are thinking big, and this is just a taste. Stay tuned!