ggml_quantize_init method

void ggml_quantize_init(
  int type
)
  • ggml_quantize_init can be called multiple times with the same type. It only initializes the quantization tables on the first call, or on the first call after ggml_quantize_free. ggml_quantize_chunk calls it automatically for convenience.

  • ggml_quantize_free frees any memory allocated by ggml_quantize_init. Call it at the end of the program to avoid memory leaks.

note: both functions are thread-safe.
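
The behavior described above (initialize once per type, rebuild after a free, safe to call from several threads) can be illustrated with a small C sketch. This is not ggml's implementation: every demo_* name, the table size, and the table contents are hypothetical stand-ins chosen for illustration, and a single pthread mutex is just one assumed way to get thread safety.

```c
#include <assert.h>
#include <pthread.h>
#include <stdlib.h>

/* Hypothetical stand-ins for illustration only; none of these names exist in ggml. */
enum { DEMO_TYPE_COUNT = 4, DEMO_TABLE_SIZE = 256 };

static pthread_mutex_t demo_lock = PTHREAD_MUTEX_INITIALIZER;
static float *demo_tables[DEMO_TYPE_COUNT]; /* NULL means "not initialized" */
static int demo_build_count;                /* number of times a table was built */

/* Build the table for `type` only if it does not exist yet. */
void demo_quantize_init(int type) {
    assert(type >= 0 && type < DEMO_TYPE_COUNT);
    pthread_mutex_lock(&demo_lock);
    if (demo_tables[type] == NULL) {
        float *table = malloc(DEMO_TABLE_SIZE * sizeof *table);
        if (table != NULL) {
            for (int i = 0; i < DEMO_TABLE_SIZE; i++) {
                table[i] = (float)i / DEMO_TABLE_SIZE; /* placeholder contents */
            }
            demo_tables[type] = table;
            demo_build_count++;
        }
    }
    pthread_mutex_unlock(&demo_lock);
}

/* Release every table; a later init call rebuilds on demand. */
void demo_quantize_free(void) {
    pthread_mutex_lock(&demo_lock);
    for (int i = 0; i < DEMO_TYPE_COUNT; i++) {
        free(demo_tables[i]); /* free(NULL) is a no-op */
        demo_tables[i] = NULL;
    }
    pthread_mutex_unlock(&demo_lock);
}

/* Chunk-style entry point that initializes lazily, so callers need not
   call init themselves. Returns 1 if the table is available, else 0. */
int demo_quantize_chunk(int type) {
    demo_quantize_init(type);
    pthread_mutex_lock(&demo_lock);
    int ready = demo_tables[type] != NULL;
    pthread_mutex_unlock(&demo_lock);
    return ready;
}
```

In this sketch, calling demo_quantize_init twice with the same type builds the table once, and calling it again after demo_quantize_free builds it a second time, which mirrors the contract stated in the bullets.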

Implementation

void ggml_quantize_init(
  int type,
) {
  return _ggml_quantize_init(
    type,
  );
}