|
3cddf24747
|
feat: Centralized Request Processing middleware (#3847)
* squash past, centralize request middleware PR
Signed-off-by: Dave Lee <dave@gray101.com>
* migrate bruno request files to examples repo
Signed-off-by: Dave Lee <dave@gray101.com>
* fix
Signed-off-by: Dave Lee <dave@gray101.com>
* Update tests/e2e-aio/e2e_test.go
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
---------
Signed-off-by: Dave Lee <dave@gray101.com>
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
|
2025-02-10 12:06:16 +01:00 |
|
|
03974a4dd4
|
feat: tokenization with llama.cpp (#4724)
feat: tokenization
Signed-off-by: shraddhazpy <shraddha@shraddhafive.in>
|
2025-02-02 17:39:43 +00:00 |
|
|
6daef00d30
|
chore(refactor): drop unnecessary code in loader (#4096)
* chore: simplify passing options to ModelOptions
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* chore(refactor): do not expose internal backend Loader
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
|
2024-11-08 21:54:25 +01:00 |
|
|
3acd767ac4
|
chore: simplify model loading (#3715)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
|
2024-10-02 08:59:06 +02:00 |
|
|
5488fc3bc1
|
feat: tokenization endpoint (#3710)
endpoint to access the tokenizer
Signed-off-by: shraddhazpy <shraddha@shraddhafive.in>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Co-authored-by: Dave <dave@gray101.com>
|
2024-10-02 08:56:18 +02:00 |
|