How to deploy machine learning models from Clojure with clear inference boundaries, container builds, and operational practices that survive production traffic.
Deploying ML models in Clojure applications is mostly an operational design problem. The difficult part is rarely “how do I call a model?” The harder questions are how the model is versioned, how its inputs are validated, how its behavior is observed, and how new versions are rolled out and rolled back.
That is why a good deployment lesson should treat the model as part of a serving system, not as a magical function hidden behind one HTTP route.
There are two common shapes.

The first shape is embedded inference: the model loads inside the same Clojure service that handles business logic.

This works best when:

- the model is small enough to load into the application's JVM
- one team owns both the business logic and the model
- the latency budget leaves no room for an extra network hop

Advantages:

- no per-prediction network or serialization overhead
- a single artifact to build, deploy, and roll back

Risks:

- the model competes with business logic for heap and CPU
- shipping a new model means redeploying the whole service
The second shape is external inference: the Clojure system calls a dedicated model-serving service or internal inference component.

This works best when:

- the model is large, GPU-bound, or served by a non-JVM runtime
- several consumers need the same predictions
- the model team ships on a different cadence than the application team

Advantages:

- the model scales and deploys independently of the application
- the serving runtime can be whatever suits the model

Risks:

- every prediction pays a network and serialization cost
- there is one more service to monitor, secure, and keep available
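For the external shape, the call from Clojure is plain HTTP. A minimal sketch using the JDK's built-in `java.net.http` client follows; the service URL, the `/predict` route, and the payload shape are assumptions, not part of any specific model server's API:

```clojure
(ns myapp.remote-inference
  (:require [clojure.data.json :as json])
  (:import [java.net URI]
           [java.net.http HttpClient HttpRequest
                          HttpRequest$BodyPublishers HttpResponse$BodyHandlers]
           [java.time Duration]))

(def ^HttpClient client (HttpClient/newHttpClient))

(defn predict-request
  "Builds a POST request carrying the feature map as JSON.
  A timeout keeps a slow model from stalling the business request."
  ^HttpRequest [url features]
  (-> (HttpRequest/newBuilder (URI/create url))
      (.timeout (Duration/ofMillis 500))
      (.header "Content-Type" "application/json")
      (.POST (HttpRequest$BodyPublishers/ofString (json/write-str features)))
      (.build)))

(defn remote-predict
  "Sends features to a hypothetical model service and parses the JSON reply."
  [url features]
  (let [resp (.send client (predict-request url features)
                    (HttpResponse$BodyHandlers/ofString))]
    (json/read-str (.body resp) :key-fn keyword)))
```

Separating request construction from sending keeps the timeout and serialization choices testable without a live model service.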
Older examples often spend too much time on which routing library serves the endpoint. That matters less than the contract the endpoint exposes: validated inputs, responses that carry the model version, and behavior you can observe in production.
Any Clojure HTTP stack can expose inference. The stronger design question is whether the service contract makes model behavior traceable and operable.
A minimal embedded example can stay plain:
```clojure
(ns myapp.inference
  (:require [ring.adapter.jetty :refer [run-jetty]]
            [ring.util.response :refer [response content-type]]
            [clojure.data.json :as json]))

;; Explicit, swappable model state: the version travels with every response.
(defonce model-state (atom {:version "2026-03-01"
                            :loaded? true}))

(defn predict [input]
  {:model-version (:version @model-state)
   :prediction "placeholder"
   :input-size (count input)})

(defn handler [request]
  (let [payload (json/read-str (slurp (:body request)) :key-fn keyword)]
    (-> (predict payload)
        json/write-str
        response
        (content-type "application/json"))))

(defn start! []
  (run-jetty handler {:port 3000 :join? false}))
```
The important thing here is not Ring itself. It is that the response carries a model version and that the service has an explicit loaded-model state.
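That loaded-model state can also drive a health endpoint, so a failed model load takes the instance out of rotation instead of letting it serve garbage. A sketch, reusing an atom shaped like the one in the example above (the route wiring is left out):

```clojure
(ns myapp.health
  (:require [clojure.data.json :as json]))

;; Mirrors the model-state atom from the embedded example.
(defonce model-state (atom {:version "2026-03-01" :loaded? true}))

(defn health-response
  "Returns 200 only while a model is actually loaded; 503 otherwise,
  so load-balancer health checks stop routing traffic to this instance."
  []
  (let [{:keys [version loaded?]} @model-state]
    {:status  (if loaded? 200 503)
     :headers {"Content-Type" "application/json"}
     :body    (json/write-str {:model-version version :loaded loaded?})}))
```

Exposing the version here as well gives operators one place to confirm what is actually deployed.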
A production inference system should usually expose:

- a health or readiness endpoint tied to the loaded-model state
- the model version, both in responses and in a status endpoint
- latency, throughput, and error metrics per model version

Without that, incident response becomes guesswork. If predictions suddenly drift or latency spikes, the team needs to know:

- which model version was serving at the time
- when that model was loaded
- whether the shape or distribution of inputs changed
For a modern Clojure application, container builds should not assume an old lein uberjar workflow by default. A current pattern is:
- deps.edn for dependencies
- tools.build for jar or uberjar creation

```dockerfile
FROM clojure:temurin-21-tools-deps AS build
WORKDIR /app
COPY . .
RUN clojure -T:build uber

FROM eclipse-temurin:21-jre
WORKDIR /app
COPY --from=build /app/target/app.jar /app/app.jar
EXPOSE 3000
ENTRYPOINT ["java", "-jar", "/app/app.jar"]
```
The exact image choice depends on your runtime constraints. The important point is that the build pipeline reflects the current Clojure toolchain and produces one explicit artifact.
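The `clojure -T:build uber` step assumes a `build.clj` at the project root. A sketch of one is below; the source paths, output path, and main namespace are assumptions that must match your project:

```clojure
(ns build
  (:require [clojure.tools.build.api :as b]))

(def class-dir "target/classes")
(def uber-file "target/app.jar") ; the path the Dockerfile copies
(def basis (delay (b/create-basis {:project "deps.edn"})))

(defn uber
  "Builds a standalone uberjar: clean, copy sources, AOT-compile, package."
  [_]
  (b/delete {:path "target"})
  (b/copy-dir {:src-dirs ["src" "resources"] :target-dir class-dir})
  (b/compile-clj {:basis @basis :src-dirs ["src"] :class-dir class-dir})
  (b/uber {:class-dir class-dir
           :uber-file uber-file
           :basis @basis
           :main 'myapp.inference})) ; assumed entry namespace
```

Pinning the output to a single known path keeps the Dockerfile's `COPY --from=build` step stable across builds.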
Kubernetes becomes useful when you need:

- more than one replica behind a stable service address
- rolling updates and quick rollback between model versions
- health probes that keep broken instances out of traffic
A simple Deployment and Service model is still a good baseline:
```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: ml-inference
spec:
  replicas: 3
  selector:
    matchLabels:
      app: ml-inference
  template:
    metadata:
      labels:
        app: ml-inference
    spec:
      containers:
        - name: ml-inference
          image: registry.example.com/ml-inference:2026-03-28
          ports:
            - containerPort: 3000
---
apiVersion: v1
kind: Service
metadata:
  name: ml-inference
spec:
  selector:
    app: ml-inference
  ports:
    - port: 80
      targetPort: 3000
```
This is not interesting because it is Kubernetes. It is interesting because it gives the team explicit rollout, scaling, and traffic primitives.
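Those primitives only help if Kubernetes can tell a healthy instance from a broken one. A sketch of probes for the container spec above, assuming the service exposes a health route (the `/health` path is an assumption):

```yaml
          readinessProbe:
            httpGet:
              path: /health
              port: 3000
            periodSeconds: 10
          livenessProbe:
            httpGet:
              path: /health
              port: 3000
            initialDelaySeconds: 30
            periodSeconds: 30
```

Tying the readiness probe to the loaded-model state means a pod that failed to load its model never receives traffic, and a rolling update halts instead of replacing working pods with broken ones.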
The service should tell you:

- request rate, latency percentiles, and error rate
- which model version produced each prediction
- whether inputs are passing validation

If the model is external, also measure:

- latency and error rate of calls to the model service
- timeout and retry behavior under load
That is what turns a model endpoint into a production service rather than a lab demo.
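A lightweight way to get those numbers without committing to a metrics library is to wrap the predict function. A sketch, where the atom-backed counters stand in for a real metrics backend:

```clojure
(ns myapp.metrics)

;; In-process counters; a production service would export these to
;; Prometheus or another metrics backend instead of an atom.
(defonce stats (atom {:requests 0 :errors 0 :total-ms 0}))

(defn instrumented
  "Wraps an inference fn so every call records count, latency, and errors.
  The wrapped fn is a drop-in replacement for the original."
  [predict-fn]
  (fn [input]
    (let [start (System/nanoTime)]
      (try
        (let [result (predict-fn input)
              elapsed-ms (quot (- (System/nanoTime) start) 1000000)]
          (swap! stats #(-> %
                            (update :requests inc)
                            (update :total-ms + elapsed-ms)))
          result)
        (catch Exception e
          (swap! stats update :errors inc)
          (throw e))))))
```

Average latency is then `(/ total-ms requests)`, and the error counter feeds alerting; the same wrapper works whether the model is embedded or behind HTTP.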
```mermaid
flowchart LR
    A["Client or Upstream Service"] --> B["Inference API"]
    B --> C["Model Version + Feature Validation"]
    C --> D["Prediction"]
    D --> E["Response with Model Metadata"]
    B --> F["Metrics, Traces, and Drift Signals"]
```
The key thing to notice is that the model is only one step in the path. Validation, versioning, and observability are part of the serving system.
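The validation step in the diagram can be made concrete with clojure.spec; the feature keys and ranges below are hypothetical, since a real model would pin its exact feature schema:

```clojure
(ns myapp.validate
  (:require [clojure.spec.alpha :as s]))

;; Hypothetical feature schema for illustration only.
(s/def ::age (s/and number? #(<= 0 % 130)))
(s/def ::country string?)
(s/def ::features (s/keys :req-un [::age ::country]))

(defn check-features
  "Returns nil when input matches the schema, otherwise spec's explanation
  data, which can be logged and returned as a 400 instead of letting a
  malformed input reach the model."
  [input]
  (when-not (s/valid? ::features input)
    (s/explain-data ::features input)))
```

Rejecting malformed inputs before prediction both protects the model and gives the drift signals in the diagram a clean baseline: a rising rejection rate is itself an early warning.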