From 153e97715543188212a366eeccecf112f5115e8c Mon Sep 17 00:00:00 2001 From: Ettore Di Giacinto Date: Mon, 22 Jul 2024 17:35:10 +0200 Subject: [PATCH] Update distributed_inferencing.md Signed-off-by: Ettore Di Giacinto --- docs/content/docs/features/distributed_inferencing.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/docs/content/docs/features/distributed_inferencing.md b/docs/content/docs/features/distributed_inferencing.md index b7ce41a9..1ab3fa55 100644 --- a/docs/content/docs/features/distributed_inferencing.md +++ b/docs/content/docs/features/distributed_inferencing.md @@ -11,7 +11,7 @@ This functionality enables LocalAI to distribute inference requests across multi LocalAI supports two modes of distributed inferencing via p2p: - **Federated Mode**: Requests are shared between the cluster and routed to a single worker node in the network based on the load balancer's decision. -- **Worker Mode**: Requests are processed by all the workers which contributes to the final inference result (by sharing the model weights). +- **Worker Mode** (aka "model sharding" or "splitting weights"): Requests are processed by all the workers which contributes to the final inference result (by sharing the model weights). ## Usage