Update distributed_inferencing.md

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2025-06-13 04:28:10 +00:00 · 2024-07-22 17:35:10 +02:00
parent 7d61de63ae
commit 153e977155
1 changed files with 1 additions and 1 deletions
--- a/docs/content/docs/features/distributed_inferencing.md
+++ b/docs/content/docs/features/distributed_inferencing.md
@ -11,7 +11,7 @@ This functionality enables LocalAI to distribute inference requests across multi
 LocalAI supports two modes of distributed inferencing via p2p:

 - **Federated Mode**: Requests are shared between the cluster and routed to a single worker node in the network based on the load balancer's decision.
- **Worker Mode**: Requests are processed by all the workers which contributes to the final inference result (by sharing the model weights).
+- **Worker Mode** (aka "model sharding" or "splitting weights"): Requests are processed by all the workers which contributes to the final inference result (by sharing the model weights).

 ## Usage