
6 posts tagged with "architecture"

· 4 min read

Cell based architecture

Bulkhead or Cell-based architecture is an architectural pattern adopted for constructing highly available, scalable, and fault-tolerant enterprise applications.

The concept of the bulkhead pattern draws inspiration from the architecture of ship hulls, wherein the hull is divided into multiple cells or sections to prevent the entire ship from sinking if one section is damaged.

Vertical Partitioning of Walls dividing the Ship’s hull

In the bulkhead architecture approach, ships manage the risk of sinking by isolating compromised compartments. The vertical partitioning of walls divides the ship's interior into self-contained, watertight compartments, aiming to contain a hull breach within a specific section of the ship.

A similar concept, known as cell-based architecture, is applied when designing scalable enterprise applications with dynamic workloads.

Key Components of Cell Based Architecture

Cell Router

A cell router is a key component in a cell-based architecture. Traffic to every individual cell is routed through the cell router. If a cell fails, only the traffic directed to that cell is impacted, so the effect on the overall system is minimal. The cell router is responsible for receiving requests, determining their destination based on cell partitioning algorithms, and forwarding them to the destination cell.

Blast Radius

Represents the area or range of systems, components, or processes that may be affected, directly or indirectly, by a single cell failure event. For example, if a system is built with four cells and a service in one cell goes down, only 25% of the traffic is affected. So, the blast radius here is approximately 25%.

Traffic distribution

The cell router plays a critical role in orchestrating traffic distribution among cells in a cell-based architecture. Traffic distribution works by routing incoming requests to the appropriate cell or service instance based on predefined routing rules or policies. Cells are partitioned using a partition key; a simple or composite partition key can be used to distribute traffic between cells. The partitioning strategy could be range partitioning, hash partitioning, or list partitioning, and the chosen strategy influences how the partition key is used to distribute traffic across partitions.
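
To make the idea concrete, below is a minimal sketch of hash partitioning in Go. It assumes a hypothetical router that maps a customer ID (the partition key) onto one of four cells; a real cell router would add health checks, weighted routing, and cell rebalancing.

package main

import (
	"fmt"
	"hash/fnv"
)

// Cell represents one self-contained deployment, addressed by its base URL.
type Cell struct {
	Name    string
	BaseURL string
}

// cells is the static list of cells the router can direct traffic to.
var cells = []Cell{
	{Name: "cell-1", BaseURL: "https://cell-1.example.internal"},
	{Name: "cell-2", BaseURL: "https://cell-2.example.internal"},
	{Name: "cell-3", BaseURL: "https://cell-3.example.internal"},
	{Name: "cell-4", BaseURL: "https://cell-4.example.internal"},
}

// routeByPartitionKey hashes the partition key (for example, a customer ID)
// and maps it onto a cell, so the same key always lands in the same cell.
func routeByPartitionKey(partitionKey string) Cell {
	h := fnv.New32a()
	h.Write([]byte(partitionKey))
	return cells[int(h.Sum32())%len(cells)]
}

func main() {
	for _, key := range []string{"customer-1001", "customer-1002", "customer-1003"} {
		cell := routeByPartitionKey(key)
		fmt.Printf("%s -> %s (%s)\n", key, cell.Name, cell.BaseURL)
	}
}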

Self-Contained Unit (Cell)

Self-contained modules are the first-class citizens of any cell-based architecture. Such a module serves the logical/business purpose of a feature or service along with the non-functional components around it, such as APIs, a load balancer, and a database. In short, a cell in a cell-based architecture is a self-contained, self-sustaining instance of an application that can be deployed, scaled, and observed independently. Cells are isolated from each other at a logical level, so the failure of one cell does not affect other cells, reducing the overall impact on the software. In a cell-based architecture, a cell does not share the state of its encompassed services with other cells in the system.

Cell Health Check

Before forwarding a request, the cell router layer may perform health checks on the target cell or service instance to ensure that the cell can handle the request. If the target instance is unhealthy or unavailable, the router may reroute the request to a healthy cell or trigger automatic recovery mechanisms.
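
Building on the previous sketch, here is a rough illustration of such a pre-routing health check, assuming each cell exposes a hypothetical /health endpoint (it additionally imports "errors", "net/http", and "time"); a production router would typically cache health status rather than probe on every request.

// isHealthy probes a cell's assumed /health endpoint with a short timeout.
func isHealthy(cell Cell) bool {
	client := http.Client{Timeout: 500 * time.Millisecond}
	resp, err := client.Get(cell.BaseURL + "/health")
	if err != nil {
		return false
	}
	defer resp.Body.Close()
	return resp.StatusCode == http.StatusOK
}

// pickHealthyCell prefers the cell chosen by the partition key, but fails over
// to any other healthy cell if the preferred one does not pass its health check.
func pickHealthyCell(partitionKey string) (Cell, error) {
	preferred := routeByPartitionKey(partitionKey)
	if isHealthy(preferred) {
		return preferred, nil
	}
	for _, cell := range cells {
		if cell.Name != preferred.Name && isHealthy(cell) {
			return cell, nil
		}
	}
	return Cell{}, errors.New("no healthy cell available")
}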

Database Replication

In a cell-based architecture, database replication is used to synchronize data between cells to ensure consistency and availability. Changes made to data in one cell need to be propagated to other cells where that data is relevant. Replication mechanisms vary based on the specific database technology used, but they typically replicate data changes asynchronously or semi-synchronously between cells.

When to use Cell Based Architecture?

For systems that require high scalability and the ability to handle large volumes of traffic, cell-based architecture allows for horizontal scaling by adding more cells as needed.

For applications with dynamic workloads that require rapid scaling up or down based on demand, cell-based architecture provides the flexibility to scale individual cells independently, optimizing resource utilization.

· 10 min read

If you missed the first installment of our Knative series, you can catch up by diving into our previous blog post: Dive into Knative—Explore Serverless with Kubernetes

Overview

Technology Conversations

The key aspects and benefits of Knative Serving:

  1. Serverless Platform: Knative Serving is a serverless platform built on top of Kubernetes.
  2. Deployment Simplification: It simplifies the deployment of containerized applications on Kubernetes.
  3. Auto-scaling: Automatically scales applications based on demand, ensuring optimal resource utilization.
  4. Traffic Management: Provides features for managing traffic routing, allowing seamless updates and rollbacks.
  5. Focus on Development: Abstracts away infrastructure management complexities, enabling developers to focus on writing and deploying code.
  6. Cloud-Native Applications: Facilitates the development of modern, scalable, and resilient cloud-native applications.

For an introductory exploration of Knative Serving, delve into our dedicated Knative Serving section.

Knative Serving Architecture

Architecture Diagram

Knative Serving consists of several components forming the backbone of the Serverless Platform. This blog explains the high-level architecture of Knative Serving.

Architecture

Components

  • Activator: The activator is part of the data plane. It is responsible for queuing incoming requests if a Knative Service is scaled to zero. It communicates with the autoscaler to bring scaled-to-zero Services back up and forwards the queued requests. The activator can also act as a request buffer to handle traffic bursts.
  • Autoscaler: The autoscaler is responsible for scaling Knative Services based on configuration, metrics, and incoming requests.
  • Controller: The controller manages the state of Knative resources within the cluster. It watches several objects, manages the lifecycle of dependent resources, and updates the resource state.
  • Queue-Proxy: The Queue-Proxy is a sidecar container in the Knative Service's Pod. It is responsible for collecting metrics and enforcing the desired concurrency when forwarding requests to the user's container. It can also act as a queue if necessary, similar to the activator.
  • Webhooks: Knative Serving has several webhooks responsible for validating and mutating Knative resources.

HTTP Request Flows

This section explains the behavior and flow of HTTP requests to an application running on Knative Serving.

HTTP Request Flows

  1. Initial Request: When a user sends an HTTP request to your Knative service, it first hits the ingress gateway.
  2. Routing Decision: The ingress gateway examines the request to determine which Knative service should handle it based on the requested domain name.
  3. Service Activation: Knative Serving keeps your service definition deployed at all times, but it may scale the running instances down to zero. When a request arrives and no instances are running, it promptly activates a new instance by spinning up a pod.
  4. Scaling Decision: Knative Serving checks the current load and decides how many instances of the service need to be running to handle incoming requests efficiently.
  5. Activator Interaction: For the first request, traffic goes to the activator. The activator asks the autoscaler to scale up one pod to serve the initial request, ensuring rapid response and availability.
  6. Request Handling: The request is then forwarded to one of the instances of your service, where your application code processes it.
  7. Containerized Environment: Within each pod, there are two containers:
    • User Container: This container hosts your application code, serving user requests.
    • Queue Container: This container monitors metrics and observes concurrency levels.
  8. Auto-scaling Based on Concurrency: When the concurrency exceeds the default level, the autoscaler spins up new pods to handle the increased concurrent requests, ensuring optimal performance.
  9. Response: After processing the request, your service generates a response, which is sent back through the same flow to the user who made the initial request.
  10. Scaling Down: If there is no more traffic or if the traffic decreases significantly, Knative Serving may scale down the number of running instances to save resources.

Revisions

  • Revisions are Knative Serving resources representing snapshots of application code and configuration.
  • They are created automatically in response to updates in a Configuration spec.
  • Revisions cannot be directly created or updated; they are managed through Configuration changes.
  • Deletion of Revisions can be forced to handle resource leaks or remove problematic Revisions.
  • Revisions are generally immutable, but may reference mutable Kubernetes resources like ConfigMaps and Secrets.
  • Changes in Revision defaults can lead to syntactic mutations in Revisions, affecting configuration without altering their core behavior.

Autoscaling

Kubernetes Autoscaling Options

Kubernetes Autoscaling Options

Knative Serving provides automatic scaling, or autoscaling, for applications to match incoming demand. This is provided by default, by using the Knative Pod Autoscaler (KPA).

For example, if an application is receiving no traffic and scale to zero is enabled, Knative Serving scales the application down to zero replicas. If scaling to zero is disabled, the application is scaled down to the minimum number of replicas specified for applications on the cluster. Replicas are scaled up to meet demand if traffic to the application increases.
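
As a rough illustration of the concurrency-based scaling idea (a simplified sketch, not Knative's actual implementation), the Go snippet below derives a desired replica count from the observed concurrency and a per-pod target, clamped between a minimum and maximum scale; a minimum of zero corresponds to scale-to-zero behavior.

package main

import (
	"fmt"
	"math"
)

// desiredReplicas divides observed in-flight requests by the per-pod target,
// rounds up, and clamps the result to [minScale, maxScale].
func desiredReplicas(observedConcurrency, targetPerPod float64, minScale, maxScale int) int {
	if targetPerPod <= 0 {
		return minScale
	}
	want := int(math.Ceil(observedConcurrency / targetPerPod))
	if want < minScale {
		want = minScale
	}
	if want > maxScale {
		want = maxScale
	}
	return want
}

func main() {
	fmt.Println(desiredReplicas(0, 100, 0, 10))    // no traffic -> 0 (scaled to zero)
	fmt.Println(desiredReplicas(250, 100, 0, 10))  // 250 concurrent requests -> 3 pods
	fmt.Println(desiredReplicas(5000, 100, 0, 10)) // burst -> clamped to maxScale of 10
}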

Supported Autoscaler types

Knative Serving supports the implementation of Knative Pod Autoscaler (KPA) and Kubernetes' Horizontal Pod Autoscaler (HPA).

  • Knative Pod Autoscaler (KPA)
    • Part of the Knative Serving core and enabled by default once Knative Serving is installed.
    • Supports scale to zero functionality.
    • Does not support CPU-based autoscaling.
  • Horizontal Pod Autoscaler (HPA)
    • Not part of the Knative Serving core; it must be enabled after Knative Serving is installed.
    • Does not support scale to zero functionality.
    • Supports CPU-based autoscaling.

Knative Serving Autoscaling System

APIs

  1. PodAutoscaler (PA):
    • API: podautoscalers.autoscaling.internal.knative.dev
    • It's an abstraction that encompasses all possible PodAutoscalers, with the default implementation being the Knative Pod Autoscaler (KPA).
    • The PodAutoscaler manages the scaling target, the metric used for scaling, and other relevant inputs for the autoscaling decision-making process.
      1. Scaling Target: The PodAutoscaler determines what resource it should scale. This could be the number of pods, CPU utilization, memory consumption, or any other metric that indicates the workload's demand.
      2. Metric for Scaling: It specifies which metric or metrics should be used to make scaling decisions. For example, it might use CPU utilization to decide when to add or remove pods based on workload demand.
      3. Other Inputs: The PodAutoscaler considers additional factors beyond just the scaling metric. These could include constraints, policies, or thresholds that influence scaling decisions. For instance, it might have rules to prevent scaling beyond a certain limit or to ensure a minimum number of pods are always running.
    • PodAutoscalers are automatically created from Revisions by default.
  2. Metric:
    • API: metrics.autoscaling.internal.knative.dev
    • This API controls the collector of the autoscaler, determining which service to scrape data from, how to aggregate it, and other related aspects.
      1. Collector Control: The API controls the collector component of the autoscaler. The collector is responsible for gathering data related to the performance and behavior of the services being monitored for autoscaling.
      2. Data Scraping: It determines which service or services the autoscaler should scrape data from. This involves collecting relevant metrics such as CPU utilization, request latency, or throughput from the specified services.
      3. Aggregation: The API defines how the collected data should be aggregated. This could involve calculating averages, sums, or other statistical measures over a specific time window to provide a meaningful representation of the service's performance.
      4. Other Related Aspects: Beyond data collection and aggregation, the API likely handles other aspects such as data retention policies, thresholds for triggering scaling actions, and configurations for interacting with the autoscaler's decision-making process.
    • Metrics are automatically generated from PodAutoscalers by default.
  3. ServerlessServices (SKS):
    • API: serverlessservices.networking.internal.knative.dev
    • It's an abstraction layer built on top of Kubernetes Services, managing the data flow and the switch between using the activator as a buffer or routing directly to application instances.
    • SKS creates two Kubernetes services for each revision: a public service and a private service.
    • The private service points to the application instances, while the public service endpoints are managed directly by the SKS reconciler.
    • SKS operates in two modes: Serve and Proxy.
      1. In Serve mode, traffic flows directly to the revision's pods.
      2. In Proxy mode, traffic is directed to activators.
    • ServerlessServices are created from PodAutoscalers.

Scaling up and down (steady state)

steady state

  • Steady State Operation:
    • The autoscaler operates continuously at a steady state.
    • It regularly scrapes data from the currently active revision pods to monitor their performance.
  • Dynamic Adjustment:
    • As incoming requests flow into the system, the scraped values of performance metrics change accordingly.
    • Based on these changing metrics, the autoscaler dynamically adjusts the scale of the revision.
  • SKS Functionality:
    • The ServerlessServices (SKS) component keeps track of changes to the deployment's size.
    • It achieves this by monitoring the private service associated with the deployment.
  • Public Service Update:
    • SKS updates the public service based on the changes detected in the deployment's size.
    • This ensures that the public service endpoints accurately reflect the available instances of the revision.

Scaling to zero

  • Scaling to Zero Process (1):
    • A revision scales down to zero when there are no more requests in the system.
    • All data collected by the autoscaler from revision pods and the activator reports zero concurrency, indicating no active requests.
  • Activator Preparation:
    • Before removing the last pod of the revision, the system ensures that the activator is in the path and reachable.
  • Proxy Mode Activation (4.1):
    • The autoscaler, which initiated the decision to scale to zero, directs the SKS to switch to Proxy mode.
    • In Proxy mode, all incoming traffic is routed to the activators.
  • Public Service Probing:
    • The SKS's public service is probed continuously to ensure it returns responses from the activator.
    • Once the public service reliably returns responses from the activator and a configurable grace period (set via scale-to-zero-grace-period) has elapsed, the final scale-down step proceeds.
  • Final Scaling Down (5):
    • The last pod of the revision is removed, marking the successful scaling down of the revision to zero instances.

Scaling from zero

  • Scaling Up Process:
    • If a revision is scaled to zero and a request arrives for it, the system needs to scale it up.
    • As the SKS is in Proxy mode, the request reaches the activator.
  • Request Handling:
    • The activator counts the incoming request and reports its appearance to the autoscaler (2.1).
    • It then buffers the request and monitors the SKS's private service for endpoints to appear (2.2).
  • Autoscaling Cycle (3):
    • The autoscaler receives the metric from the activator and initiates an autoscaling cycle.
    • This process determines the desired number of pods based on the incoming request.
  • Scaling Decision (4):
    • The autoscaling process concludes that at least one pod is needed to handle the incoming request.
  • Scaling Up Instructions (5.1):
    • The autoscaler instructs the revision's deployment to scale up to N > 0 replicas to accommodate the increased demand.
  • Serve Mode Activation (5.2):
    • The autoscaler switches the SKS into Serve mode, directing traffic to the revision's pods directly once they are up.
  • Endpoint Probing:
    • The activator monitors the SKS's private service for the appearance of endpoints.
    • Once the endpoints come up and pass the probe successfully, the respective address is considered healthy and used to route the buffered request and any additional requests that arrived in the meantime (8.2).
  • Successful Scaling Up:
    • The revision has successfully scaled up from zero to handle the incoming request.

Conclusion

In summary, we've explored the core concepts of Knative Serving, from its architecture to scaling mechanisms. Next, we'll dive into practical implementation in our upcoming blog. Also, stay tuned for the integration of the serverless component into the WeDAA Platform, making prototyping and deployment faster and easier than ever.

· 4 min read

Introduction

In the realm of software development, there is often a need for implementing features dynamically, toggling functionalities, and rolling out changes seamlessly without disrupting user experience. Imagine this scenario: you're working on a high-stakes project, and you need to introduce a new feature. However, releasing it to all users at once might be risky. What if there are bugs? What if users don't like it? This is where feature flags come to the rescue.

The Tale of Dynamic Feature Rollouts

Let's delve into a hypothetical scenario. Meet Adam, a software engineer working on a cutting-edge e-commerce platform. Their team is gearing up to introduce a new payment gateway, which promises to enhance user experience and reduce failure rates. However, they're wary of unforeseen bugs that might surface during the rollout. Plus, they're unsure whether the new payment flow will resonate well with all users.

Here's where feature flags come into play. By leveraging feature flags, Adam and their team can deploy the new payment gateway to a small subset of users initially. They can monitor its performance, gather feedback, and make necessary tweaks without affecting the entire user base. Once they're confident in the feature's stability and user acceptance, they can gradually roll it out to all users, mitigating risks and ensuring a smooth transition.

Understanding Feature Flags

Feature flags, also known as feature toggles or feature switches, are a powerful technique used in software development to enable or disable certain features at runtime. They provide developers with fine-grained control over feature rollout, allowing them to manage feature releases, perform A/B testing, and mitigate risks associated with deploying new functionalities.

Hands-on

In this blog, we'll explore how to implement feature flags using Flagsmith in a Go application built on the Go-Micro framework.

Generate prototype from WeDAA

Use the architecture below as a reference and generate code from WeDAA.

A Go Micro Service

Setup Flagsmith

Flagsmith is a feature flag tool that lets you manage features across web, mobile and server side applications.

It also provides a free account for its SaaS offering. Sign up, create an organisation, and add a feature flag.

Flagsmith setup

Flagsmith SDK

Include the Flagsmith SDK in go.mod:

github.com/Flagsmith/flagsmith-go-client/v3 v3.4.0

Payment Handler

In this snippet, we initialize the Flagsmith client with our API key, retrieve the status of a feature flag, and conditionally execute feature-specific functionality based on the flag's status.

// src/handlers/payments.go
package handler

import (
	"context"
	"net/http"

	flagsmith "github.com/Flagsmith/flagsmith-go-client/v3"
)

type PaymentsHandler struct{}

func (handler *PaymentsHandler) ProcessPayment(response http.ResponseWriter, request *http.Request) {
	// Initialize the Flagsmith client with the environment API key.
	client := flagsmith.NewClient("<YOUR_FLAGSMITH_API_KEY>")

	// Fetch the current environment flags; errors are ignored here for brevity.
	flags, _ := client.GetEnvironmentFlags(context.TODO())
	isEnabled, _ := flags.IsFeatureEnabled("payment_gateway")

	// Route to the new or old payment gateway based on the flag's status.
	if isEnabled {
		response.Write([]byte(`{ "message": "New Payment Gateway" }`))
	} else {
		response.Write([]byte(`{ "message": "Old Payment Gateway" }`))
	}
}
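
Creating the Flagsmith client on every request keeps the example short. In a real service you would typically create the client once, for example at startup, and reuse it across requests; a hypothetical variant of the handler could look like this.

var flagClient = flagsmith.NewClient("<YOUR_FLAGSMITH_API_KEY>")

func (handler *PaymentsHandler) ProcessPayment(response http.ResponseWriter, request *http.Request) {
	flags, err := flagClient.GetEnvironmentFlags(request.Context())
	if err != nil {
		http.Error(response, `{ "message": "feature flag lookup failed" }`, http.StatusInternalServerError)
		return
	}
	isEnabled, _ := flags.IsFeatureEnabled("payment_gateway")
	if isEnabled {
		response.Write([]byte(`{ "message": "New Payment Gateway" }`))
	} else {
		response.Write([]byte(`{ "message": "Old Payment Gateway" }`))
	}
}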

Payment Controller

A sample controller with API to simulate payments.

// src/controllers/payments.go
package controllers

import (
	"net/http"

	"github.com/gorilla/mux"

	"payments/handlers"
)

// paymentsHandler is the handler instance the routes below delegate to.
var paymentsHandler = &handler.PaymentsHandler{}

type PaymentsController struct{}

func (paymentsController PaymentsController) RegisterRoutes(r *mux.Router) {
	r.Handle("/api/payments", http.HandlerFunc(paymentsHandler.ProcessPayment)).Methods(http.MethodGet, http.MethodOptions)
}

Register Payments Controller

Add the following code to the registerRoutes function of main.go in src:

func registerRoutes(router *mux.Router) {
	registerControllerRoutes(controllers.ManagementController{}, router)
	registerControllerRoutes(controllers.PaymentsController{}, router) // Register Payments Controller
}

Execution

  1. Run the Go Micro Service using the following commands

    go mod tidy
    go run .
  2. Check health of the service

    curl -i -H "Accept: application/json" http://localhost:6060/management/health/readiness

    The response should be as follows:

    HTTP/1.1 200 OK
    Access-Control-Allow-Headers: Origin, Content-Type, Accept,Authorization
    Access-Control-Allow-Methods: GET, POST, PUT, DELETE, OPTIONS
    Access-Control-Allow-Origin: *
    Content-Type: application/json
    Date: Wed, 17 Apr 2024 19:30:53 GMT
    Content-Length: 64

    {"components":{"readinessState":{"status":"UP"}},"status":"UP"}
  3. Test the new Payment API

    curl -i -H "Accept: application/json" http://localhost:6060/api/payments

    The response will depend on the feature flag, indicating whether the new or old payment gateway is used.

    HTTP/1.1 200 OK
    Access-Control-Allow-Headers: Origin, Content-Type, Accept,Authorization
    Access-Control-Allow-Methods: GET, POST, PUT, DELETE, OPTIONS
    Access-Control-Allow-Origin: *
    Content-Type: application/json
    Date: Wed, 17 Apr 2024 19:33:33 GMT
    Content-Length: 36

    { "message": "New Payment Gateway" }

Conclusion

Feature flags revolutionize the way software is developed and released. They empower developers to iterate quickly, gather feedback, and deliver value to users with confidence. By adopting feature flags in your development workflow, you can mitigate risks, improve deployment agility, and ultimately, delight your users with timely and impactful features.

· 8 min read

What is serverless?

Serverless is a cloud-native development model that allows developers to build and run applications without having to manage servers.

There are still servers in serverless, but they are abstracted away from app development. A cloud provider handles the routine work of provisioning, maintaining, and scaling the server infrastructure. Developers can simply package their code in containers for deployment.

Once deployed, serverless apps respond to demand and automatically scale up and down as needed.

Serverless Computing: A Catering Service Analogy

Catering Service Analogy

Imagine you're hosting a dinner party. In a traditional hosting scenario, you'd have to plan everything from cooking the food to setting the table and serving your guests. This is like managing servers in traditional computing – you have to handle all the details yourself.

Now, consider a serverless approach as hiring a catering service for your party. You tell them what you need, and they take care of everything – from cooking the food to setting up and serving. You don't have to worry about the kitchen logistics or cleaning up afterward; you can focus on enjoying the party with your guests. Similarly, in serverless computing, you provide your code, and the cloud provider takes care of the infrastructure, scaling, and management, allowing you to focus on writing and improving your application.

Kubernetes-Powered Serverless: Introducing Knative

Serverless Framework Knative

In the rapidly evolving landscape of cloud computing, serverless technology has become increasingly popular for its simplicity in deploying applications without worrying about infrastructure. Knative, built on top of Kubernetes (k8s), extends the power of Kubernetes to manage serverless workloads seamlessly. While major cloud providers like AWS, Google Cloud, and Microsoft Azure offer their serverless solutions, Knative stands out as an open-source, platform-agnostic framework.

Collaboratively developed by industry leaders like Google and Red Hat, Knative abstracts away the complexities of deploying, scaling, and managing containerized applications, allowing developers to focus solely on writing code without worrying about infrastructure management. Knative simplifies serverless deployments across diverse cloud environments, revolutionizing the way applications are developed and deployed in modern cloud-native architectures.

Exploring Knative Features: Simplifying Serverless Deployment

Serverless refers to running back-end programs and processes in the cloud. Serverless works on an as-used basis, meaning that companies only pay for what they use. Knative is a platform-agnostic solution for running serverless deployments.

Knative Features

  • Simpler Abstractions: simplifies the YAML configuration process by providing custom CRDs (Custom Resource Definitions), streamlining the abstraction layers and making development workflows more straightforward.

  • Autoscaling: autoscaling feature seamlessly adjusts resource allocation, scaling applications down to zero and up from zero based on demand.

  • Progressive Rollouts: Customize your rollout strategy with Knative's Progressive Rollouts feature, offering flexibility to select the ideal approach based on your specific requirements.

  • Event Integrations: Easily manage events from diverse sources with Knative's Event Integrations, streamlining event handling for seamless integration.

  • Handle Events: Effortlessly trigger handlers from the event broker with Knative's event handling capabilities, ensuring seamless integration and streamlined workflow.

  • Pluggable: Knative's pluggable architecture ensures seamless integration and extension within the Kubernetes ecosystem, providing flexibility and scalability for diverse use cases.

Knative Components

Knative has two main components that empower teams working with Kubernetes. Serving and Eventing work together to automate and manage tasks and applications.

Serving Eventing

  • Knative Serving: Allows running serverless containers in Kubernetes with ease. Knative takes care of the details of networking, autoscaling (even to zero), and revision tracking. Teams can focus on core logic using any programming language.
  • Knative Eventing: Allows universal subscription, delivery and management of events. Build modern apps by attaching compute to a data stream with declarative event connectivity and developer friendly object models.

Knative Serving

Knative Serving defines a set of objects as Kubernetes Custom Resource Definitions (CRDs). These objects get used to define and control how your serverless workload behaves on the cluster:

Knative Serving

Savita Ashture, CC BY-SA 4.0

  • Service: A Knative Service describes a combination of a route and a configuration as shown above. It is a higher-level entity that does not provide any additional functionality. It should make it easier to deploy an application quickly and make it available. You can define the service to always route traffic to the latest revision or a pinned revision.

  • Route: The Route describes how a particular application gets called and how the traffic gets distributed across the different revisions. Several revisions can be active in the system at any given time, depending on the use case. It is the route's responsibility to split the traffic and assign it to revisions.

  • Configuration: The Configuration describes what the corresponding deployment of the application should look like. It provides a clean separation between code and configuration and follows the Twelve-Factor App methodology. Modifying a configuration creates a new revision.

  • Revision: The Revision represents the state of a configuration at a specific point in time. A revision, therefore, gets created from the configuration. Revisions are immutable objects, and you can retain them for as long as useful. Several revisions per configuration may be active at any given time, and you can automatically scale up and down according to incoming traffic.

Knative Serving focuses on:

  • Rapid deployment of serverless containers.
  • Autoscaling, including scaling pods down to zero.
  • Support for multiple networking layers such as Ambassador, Contour, Kourier, Gloo, and Istio for integration into existing environments.
  • Point-in-time snapshots of deployed code and configurations.

Knative Eventing

Knative Eventing is a collection of APIs that enable you to use an event-driven architecture with your applications. You can create components that route events from event producers to event consumers, known as sinks, that receive events.

Use-cases

General areas of application are:

  • Publishing an event without creating a consumer. You can send events to a broker as an HTTP POST, and use binding to decouple the destination configuration from your application that produces events.

  • Consuming an event without creating a publisher. You can use a trigger to consume events from a broker based on event attributes.

  • IoT, network monitoring, application monitoring, website testing and validation, and mobile app front-end processes that act as event generators.

Use Knative eventing when:

  • You want to publish an event without creating a consumer. You can send events to a broker as an HTTP POST, and use binding to decouple the destination configuration from your application that produces events.

  • You want to consume an event without creating a publisher. You can use a trigger to consume events from a broker based on event attributes. The application receives events as an HTTP POST.

  • You want to create components that route events from event producers to event consumers, known as sinks, that receive events. Sinks can also be configured to respond to HTTP requests by sending a response event.

Knative Eventing

Eventing Components

Components

  • Sources: Knative eventing sources are objects that generate events and send them to a sink. They are created by instantiating a custom resource (CR) from a source object. There are different types of sources, such as PingSource, ApiServerSource, KafkaSource, etc., depending on the event producer.

  • Sinks: Knative eventing sinks are objects that receive events from sources or other components. They can be Addressable or Callable resources that have an address defined in their status.address.url field. Addressable sinks can receive and acknowledge an event delivered over HTTP, while Callable sinks can also respond to HTTP requests by sending a response event. Knative Services, Channels, and Brokers are all examples of sinks.

  • Brokers: Knative eventing brokers are objects that define an event mesh for collecting a pool of events. Brokers provide a discoverable endpoint for event ingress, and use triggers for event delivery. Event producers can send events to a broker by posting the event.

  • Channels: Channels are custom resources that define a single event-forwarding and persistence layer. You can connect channels to various backends for sourcing events, such as In-Memory, Kafka, or GCP PubSub. You can also fan-out received events, through subscriptions, to multiple destinations, or sinks. Examples of sinks include brokers and Knative services.

  • Subscriptions: Knative subscriptions are objects that enable event delivery from a channel to an event sink, also known as a subscriber. A subscription specifies the channel and the sink to deliver events to, as well as some sink-specific options, such as how to handle failures.

  • Triggers: Knative Triggers are objects that enable seamless integration with external event sources, allowing applications to react dynamically to incoming events, fostering the development of scalable, event-driven architectures.

Conclusion

In this overview, we've explored serverless computing with Knative on Kubernetes, covering core concepts, features, and components. Stay tuned for practical implementations and real-world use cases in upcoming blogs, unlocking Knative's full potential for your projects. With Knative, the future of serverless on Kubernetes is brighter than ever.

Furthermore, I'm excited to announce that our platform, WeDAA, will be hosting these upcoming blogs. WeDAA is committed to providing innovative solutions, and soon, we'll be incorporating serverless capabilities into our platform. Keep an eye out for our future updates, as we continue to evolve and enhance our services to meet your needs.

Continue your exploration of Knative by diving into our next blog on Knative Serving Definitive Guide to Knative Serving—A Deep Dive into Theory and Architecture!

· 2 min read

Throughout the long history of software development, one debate has persisted: should we prioritize solid application architecture and adhere to best development practices before building a working prototype, or should we quickly create a functional prototype to validate the idea before investing considerable time and resources in identifying technology and architecture?

Given the rapid evolution of technologies and the increasing demands of business requirements, the right approach is to emphasize speed while ensuring the quality and robustness of the application and its architecture.

Rapid Application Prototyping (RAP) offers a valuable method to put ideas into action and comprehend both the technical and functional aspects of a solution. Rapid Application Prototyping (RAP) is an approach that prioritizes building and displaying the minimum viable functional view of an application as soon as possible.

A few essential aspects of platforms supporting Rapid Application Prototyping:

Modularity: A prototype should be developed using modern modular architecture patterns, enabling easy integration or modification of business features and technical solutions. Modularity facilitates the construction and maintenance of smaller, more manageable components.

Modularity

Loose Coupling: Modular components of the application should possess well-defined interfaces to encourage loose coupling among them. Loose coupling simplifies the integration of new features or technologies.

Loose Coupling

Scalability: A RAP platform should support the construction of a scalable architecture, enabling preliminary horizontal scaling of modular components. Scalability is crucial for creating resilient and high-performance systems.

Scalability

Resilience and Robustness: In the event of a component failure or issue, the entire system should not necessarily collapse. Failures should be contained within the affected module or service, minimizing their impact on other parts of the application. The modules or services within an application should demonstrate robustness, meaning they can gracefully handle failures, unexpected conditions, and varying loads while maintaining overall functionality and availability.

Resilience and Robustness

The WeDAA engineering platform empowers developers to build Rapid Application Prototypes (RAP) quickly, with all the essential features required for building a well-architected enterprise application.

· 5 min read

The Story

Sample Message Broker App

Imagine we're building a simple e-commerce application. When a customer places an order, it's not instantly whisked away by elves. Instead, the order details – a message filled with product information, shipping address, and payment details – gets sent to a queue managed by a message broker.

Meanwhile, our order processing system sits like a hungry rabbit, constantly checking the queue for new messages. Once it grabs an order message, it springs into action: verifying payment, notifying the warehouse, and sending updates to the customer. All without the two systems ever needing to directly talk to each other!

This decoupling is the superpower of message brokers. Applications don't need to know the specifics of each other's internal workings. They simply send and receive messages, leaving the orchestration to the broker. This makes systems more flexible, scalable, and resilient.

Let's delve deeper into this rabbit hole, using RabbitMQ as our trusty guide.

The Technology

RabbitMQ is a popular open-source message broker, and it's a great starting point to understand the magic behind these event-driven systems.

  • It is used worldwide at small startups and large enterprises.
  • It is lightweight and easy to deploy on premises and in the cloud.
  • It can be deployed in distributed and federated configurations to meet high-scale, high-availability requirements.
  • It runs on many operating systems and cloud environments, and provides a wide range of developer tools for most popular languages.

The Concepts

RabbitMQ Concepts

  • Message: It is the fundamental unit of communication in RabbitMQ. It contains the data being sent from the producer to the consumer. It is like a post carrying a message.
  • Producer: A producer is an application or component that sends messages to RabbitMQ. It is like a person sending the post.
  • Consumer: A consumer is an application or component that receives and processes messages from RabbitMQ. It is a person receiving the post.
  • Queue: A queue is a buffer that stores messages until they are consumed. Messages are placed in queues by producers and retrieved by consumers. It is a postbox that stores messages of a person.
  • Exchange: An exchange is a routing mechanism that receives messages from producers and routes them to queues. It is like a post office.
  • Routing Key: A routing key is a property of a message that is used by exchanges to determine which queues should receive the message. This is like a mailing address for a post.
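
To see how these concepts map onto code, here is a small, self-contained Go producer sketch using the github.com/rabbitmq/amqp091-go client. The exchange, queue, and routing key names are placeholders for illustration and are separate from the generated project used in the tutorial below.

package main

import (
	"context"
	"log"

	amqp "github.com/rabbitmq/amqp091-go"
)

func main() {
	// Connect to a local RabbitMQ broker and open a channel.
	conn, err := amqp.Dial("amqp://guest:guest@localhost:5672/")
	if err != nil {
		log.Fatal(err)
	}
	defer conn.Close()

	ch, err := conn.Channel()
	if err != nil {
		log.Fatal(err)
	}
	defer ch.Close()

	// Exchange: the "post office" that routes messages.
	if err := ch.ExchangeDeclare("orders_exchange", "topic", true, false, false, false, nil); err != nil {
		log.Fatal(err)
	}

	// Queue: the "postbox" that stores messages until they are consumed.
	q, err := ch.QueueDeclare("orders_queue", true, false, false, false, nil)
	if err != nil {
		log.Fatal(err)
	}

	// Binding: messages with this routing key (the "mailing address") go to the queue.
	if err := ch.QueueBind(q.Name, "orders.created", "orders_exchange", false, nil); err != nil {
		log.Fatal(err)
	}

	// Producer: publish a message to the exchange with the routing key.
	err = ch.PublishWithContext(context.Background(), "orders_exchange", "orders.created", false, false,
		amqp.Publishing{ContentType: "text/plain", Body: []byte("order #42 placed")})
	if err != nil {
		log.Fatal(err)
	}
	log.Println("message published")
}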

The Tutorial

Generate prototype from WeDAA

Use the architecture below as a reference and generate a project from WeDAA.

All the code mentioned in the blog will be generated by WeDAA. It can be further extended as necessary.

Sample RabbitMQ WeDAA Architecture

RabbitMQ Configuration

The RabbitMQConfigOrdersToInventory class in the orders service registers the Queue, Exchange, Binding, MessageConverter, and AmqpTemplate as beans for auto-configuration in Spring AMQP.

import org.springframework.amqp.core.AmqpTemplate;
import org.springframework.amqp.core.Binding;
import org.springframework.amqp.core.BindingBuilder;
import org.springframework.amqp.core.Queue;
import org.springframework.amqp.core.TopicExchange;
import org.springframework.amqp.rabbit.connection.ConnectionFactory;
import org.springframework.amqp.rabbit.core.RabbitTemplate;
import org.springframework.amqp.support.converter.Jackson2JsonMessageConverter;
import org.springframework.amqp.support.converter.MessageConverter;
import org.springframework.context.annotation.Bean;
import org.springframework.context.annotation.Configuration;

@Configuration
public class RabbitMQConfigOrdersToInventory {

    public static final String QUEUE = "OrdersToInventory_message_queue";
    public static final String EXCHANGE = "OrdersToInventory_message_exchange";
    public static final String ROUTING_KEY = "OrdersToInventory_message_routingKey";

    // Queue that buffers messages until the inventory service consumes them.
    @Bean
    public Queue queueOrdersToInventory() {
        return new Queue(QUEUE);
    }

    // Topic exchange the orders service publishes to.
    @Bean
    public TopicExchange exchangeOrdersToInventory() {
        return new TopicExchange(EXCHANGE);
    }

    // Binds the queue to the exchange using the routing key.
    @Bean
    public Binding bindingOrdersToInventory() {
        return BindingBuilder.bind(this.queueOrdersToInventory()).to(this.exchangeOrdersToInventory()).with(ROUTING_KEY);
    }

    // Serializes message payloads as JSON.
    @Bean
    public MessageConverter messageConverter() {
        return new Jackson2JsonMessageConverter();
    }

    @Bean
    public AmqpTemplate template(ConnectionFactory connectionFactory) {
        RabbitTemplate template = new RabbitTemplate(connectionFactory);
        template.setMessageConverter(messageConverter());
        return template;
    }
}

Message Producer

The RabbitMQProducerOrdersToInventory class in the orders service sends a message to the exchange every 15 seconds.

// Publishes a message to the exchange every 15 seconds.
@Scheduled(cron = "0/15 * * * * *")
public void publishMessage() {
    RabbitMessageModel message = new RabbitMessageModel();
    message.setMessage("Publishing this message from orders with key: " + RabbitMQConfigOrdersToInventory.QUEUE);
    message.setDateTime(new Date());
    // Send via the exchange and routing key registered in the configuration above.
    template.convertAndSend(RabbitMQConfigOrdersToInventory.EXCHANGE, RabbitMQConfigOrdersToInventory.ROUTING_KEY, message);
    logger.info("Message Published Successfully");
}

Message Consumer

The RabbitMQConsumerOrdersToInventory in the inventory service starts receiving messages as soon as they are sent by the producer.

// Start consuming from the queue; auto-ack is enabled, so messages are
// acknowledged as soon as they are delivered.
msgs, err := channel.Consume(
	queueName, // queue
	"",        // consumer tag (auto-generated)
	true,      // auto-ack
	false,     // exclusive
	false,     // no-local
	false,     // no-wait
	nil,       // args
)
if err != nil {
	logger.Fatalf("Failed to register consumer: %s", err)
}

// Log each received message; block forever so the consumer keeps running.
forever := make(chan bool)
go func() {
	for d := range msgs {
		logger.Infof("Received Message: %s\n", d.Body)
	}
}()
<-forever
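
The snippet above assumes that a connection, channel, and queueName have already been set up by the generated code. A minimal sketch of that setup with the github.com/rabbitmq/amqp091-go client might look like the following; the generated project may wire this differently, and only the queue name is taken from the configuration shown earlier.

conn, err := amqp.Dial("amqp://guest:guest@localhost:5672/")
if err != nil {
	logger.Fatalf("Failed to connect to RabbitMQ: %s", err)
}
defer conn.Close()

channel, err := conn.Channel()
if err != nil {
	logger.Fatalf("Failed to open a channel: %s", err)
}
defer channel.Close()

queueName := "OrdersToInventory_message_queue"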

The execution

  1. Boot up the RabbitMQ server

    WeDAA provides a Docker Compose file for starting a RabbitMQ server quickly. It can be found in both the inventory and orders services.

    The RabbitMQ server can be started from the orders service using the command below.

    docker compose -f src/main/docker/rabbitmq.yml up --wait

    RabbitMQ's management console can be accessed on http://localhost:15672/

    Default username: guest, password: guest

  2. Start the orders service

    In the sample architecture, the orders service acts as the producer. Start the service using the following command:

    ./mvnw

    Once the service is started, it can be seen from the logs that messages are sent periodically.

     2024-01-15T20:35:00.015+05:30  INFO 55955 --- [rs-scheduling-1] .o.c.r.RabbitMQProducerOrdersToInventory : Message Published Successfully 
    2024-01-15T20:35:15.008+05:30 INFO 55955 --- [rs-scheduling-1] .o.c.r.RabbitMQProducerOrdersToInventory : Message Published Successfully
    2024-01-15T20:35:30.003+05:30 INFO 55955 --- [rs-scheduling-1] .o.c.r.RabbitMQProducerOrdersToInventory : Message Published Successfully
    2024-01-15T20:35:45.002+05:30 INFO 55955 --- [rs-scheduling-1] .o.c.r.RabbitMQProducerOrdersToInventory : Message Published Successfully
  3. Start the inventory service

    In the sample architecture, the inventory service acts as the consumer.

    Build and start the service using the following commands

    go mod tidy
    go run .

    Once started, inventory service starts consuming the messages sent by orders service.

     2024-01-15 20:41:33  file=rabbitmq/RabbitMQConsumerOrdersToInventory.go:51 level=info Received Message: {"id":1,"message":"Publishing this message from orders with key: OrdersToInventory_message_queue","dateTime":1705331085013}
    2024-01-15 20:41:33 file=rabbitmq/RabbitMQConsumerOrdersToInventory.go:51 level=info Received Message: {"id":2,"message":"Publishing this message from orders with key: OrdersToInventory_message_queue","dateTime":1705331100012}
    2024-01-15 20:41:33 file=rabbitmq/RabbitMQConsumerOrdersToInventory.go:51 level=info Received Message: {"id":3,"message":"Publishing this message from orders with key: OrdersToInventory_message_queue","dateTime":1705331115005}
    2024-01-15 20:41:33 file=rabbitmq/RabbitMQConsumerOrdersToInventory.go:51 level=info Received Message: {"id":4,"message":"Publishing this message from orders with key: OrdersToInventory_message_queue","dateTime":1705331130001}
  4. Track activity on RabbitMQ management console

RabbitMQ Exchange

RabbitMQ Queue

The Conclusion

This blog gives a head start on making use of RabbitMQ to orchestrate your event-driven microservice application architectures.

Learn more from: https://www.rabbitmq.com/getstarted.html