whisper.cpp/bindings/go
Stavros Panakakis 3853d83d73
go : add tests and update bindings (#2425)
Update Go version to ^1.23, the actions/setup-go
to v5, actions/checkout to v4 and
github.com/stretchr/testify to v1.9.0.

Add test cases for the following model
struct methods:
 - New
 - Close
 - NewContext
 - IsMultilingual
 - Languages

Add test cases for the following context
struct methods:
 - SetLanguage
 - IsMultilingual
 - Language
 - Process
2024-09-20 15:36:12 +03:00
..
examples whisper : remove speed_up and phase_vocoder* functions (#2198) 2024-05-31 11:37:29 +03:00
pkg/whisper go : add tests and update bindings (#2425) 2024-09-20 15:36:12 +03:00
samples go : run go mod tidy before building examples + fix permissions (#296) 2022-12-22 16:34:20 +02:00
.gitignore go : bindings updated so they can be used in third party packages. (#379) 2023-01-06 19:32:28 +02:00
doc.go bindings : initial import of golang bindings (#287) 2022-12-20 08:54:33 +02:00
go.mod go : add tests and update bindings (#2425) 2024-09-20 15:36:12 +03:00
go.sum go : add tests and update bindings (#2425) 2024-09-20 15:36:12 +03:00
LICENSE go : run go mod tidy before building examples + fix permissions (#296) 2022-12-22 16:34:20 +02:00
Makefile go : fix CUDA build (#2416) 2024-09-15 12:23:56 +03:00
params.go go : add beamsize/entropythold/maxcontext to context interface (#2350) 2024-08-28 17:09:01 +03:00
README.md go : fix CUDA build (#2416) 2024-09-15 12:23:56 +03:00
whisper_test.go go : improve progress reporting and callback handling (#1024) 2023-06-25 14:07:55 +03:00
whisper.go go : add beamsize/entropythold/maxcontext to context interface (#2350) 2024-08-28 17:09:01 +03:00

Go bindings for Whisper

This package provides Go bindings for whisper.cpp. They have been tested on:

  • Darwin (OS X) 12.6 on x64_64
  • Debian Linux on arm64
  • Fedora Linux on x86_64

The "low level" bindings are in the bindings/go directory and there is a more Go-style package in the bindings/go/pkg/whisper directory. The most simple usage is as follows:

import (
	"github.com/ggerganov/whisper.cpp/bindings/go/pkg/whisper"
)

func main() {
	var modelpath string // Path to the model
	var samples []float32 // Samples to process

	// Load the model
	model, err := whisper.New(modelpath)
	if err != nil {
		panic(err)
	}
	defer model.Close()

	// Process samples
	context, err := model.NewContext()
	if err != nil {
		panic(err)
	}
	if err := context.Process(samples, nil, nil); err != nil {
		return err
	}

	// Print out the results
	for {
		segment, err := context.NextSegment()
		if err != nil {
			break
		}
		fmt.Printf("[%6s->%6s] %s\n", segment.Start, segment.End, segment.Text)
	}
}

Building & Testing

In order to build, you need to have the Go compiler installed. You can get it from here. Run the tests with:

git clone https://github.com/ggerganov/whisper.cpp.git
cd whisper.cpp/bindings/go
make test

This will compile a static libwhisper.a in a build folder, download a model file, then run the tests. To build the examples:

make examples

To build using cuda support add GGML_CUDA=1:

GGML_CUDA=1 make examples

The examples are placed in the build directory. Once built, you can download all the models with the following command:

./build/go-model-download -out models

And you can then test a model against samples with the following command:

./build/go-whisper -model models/ggml-tiny.en.bin samples/jfk.wav

Using the bindings

To use the bindings in your own software,

  1. Import github.com/ggerganov/whisper.cpp/bindings/go/pkg/whisper (or github.com/ggerganov/whisper.cpp/bindings/go into your package;
  2. Compile libwhisper.a (you can use make whisper in the bindings/go directory);
  3. Link your go binary against whisper by setting the environment variables C_INCLUDE_PATH and LIBRARY_PATH to point to the whisper.h file directory and libwhisper.a file directory respectively.

Look at the Makefile in the bindings/go directory for an example.

The API Documentation:

Getting help:

  • Follow the discussion for the go bindings here

License

The license for the Go bindings is the same as the license for the rest of the whisper.cpp project, which is the MIT License. See the LICENSE file for more details.