ggml : optimize for ppc64le using VSX intrinsics (ggml/784)

* optimize for ppc64le using VSX intrinsics

* 1. code clean up by removing comments about overflow concern.

2. fix typo in suffix of scaling.

* Continue to fix typo in suffix of scaling for QK_K <> 256

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
This commit is contained in:
Hong Bo PENG 2024-05-12 17:17:18 +08:00 committed by Georgi Gerganov
parent 5a863fbe18
commit 40aeeeecc4

File diff suppressed because it is too large Load Diff