LocalAI

mirror of https://github.com/mudler/LocalAI.git synced 2025-05-26 03:54:18 +00:00

History

feat: include tokens usage for streamed output (#4282 )

Use pb.Reply instead of []byte with Reply.GetMessage() in llama grpc to get the proper usage data in reply streaming mode at the last [DONE] frame

Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>

2024-11-28 14:47:56 +01:00

elevenlabs

feat: track internally started models by ID (#3693 )

2024-10-02 08:55:58 +02:00

explorer

feat(explorer): make possible to run sync in a separate process (#3224 )

2024-08-12 19:25:44 +02:00

jina

feat: track internally started models by ID (#3693 )

2024-10-02 08:55:58 +02:00

localai

feat(silero): add Silero-vad backend (#4204 )

2024-11-20 14:48:40 +01:00

openai

feat: include tokens usage for streamed output (#4282 )

2024-11-28 14:47:56 +01:00