LlmInference.LlmInferenceOptions

public static abstract class LlmInference.LlmInferenceOptions

Options for setting up an LlmInference.

Nested Classes

class LlmInference.LlmInferenceOptions.Builder
    Builder for LlmInference.LlmInferenceOptions.

Public Constructors

LlmInferenceOptions()

Public Methods

static LlmInference.LlmInferenceOptions.Builder builder()
    Instantiates a new LlmInferenceOptions builder.

abstract int maxTokens()
    The total length of the kv-cache.

abstract String modelPath()
    The path that points to the TFLite model file.

abstract int randomSeed()
    Random seed for sampling tokens.

abstract float temperature()
    Randomness when decoding the next token.

abstract int topK()
    The number of top-scoring tokens to sample from at each decoding step.

Public Constructors

public LlmInferenceOptions ()

Public Methods

public static LlmInference.LlmInferenceOptions.Builder builder ()

Instantiates a new LlmInferenceOptions builder.
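
A minimal sketch of typical setup, assuming the nested Builder exposes AutoValue-style setters (setModelPath, setMaxTokens, setTopK, setTemperature, setRandomSeed) matching the getters documented below, that the resulting options are passed to LlmInference.createFromOptions, and that the import path shown is correct for this package:

    import android.content.Context;
    import com.google.mediapipe.tasks.genai.llminference.LlmInference;

    final class LlmSetup {
      // Builds options via the builder and creates an inference engine.
      static LlmInference create(Context context) {
        LlmInference.LlmInferenceOptions options =
            LlmInference.LlmInferenceOptions.builder()
                .setModelPath("/data/local/tmp/llm/model.tflite") // hypothetical path
                .setMaxTokens(512)    // total input + output token budget
                .setTopK(40)          // sample from the 40 highest-scoring tokens
                .setTemperature(0.8f) // > 0.0f enables sampling randomness
                .setRandomSeed(101)   // fixed seed for reproducible output
                .build();
        return LlmInference.createFromOptions(context, options);
      }
    }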

public abstract int maxTokens ()

The total length of the kv-cache. In other words, this is the total number of input + output tokens the model needs to handle.
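
Because the budget covers prompt and response together, a long prompt leaves fewer tokens for generation. A quick illustration of the arithmetic (the figures are hypothetical):

    int maxTokens = 512;     // total kv-cache length, configured at setup
    int promptTokens = 400;  // hypothetical tokenized prompt length
    int responseBudget = maxTokens - promptTokens;
    // responseBudget == 112: at most ~112 tokens can be generated
    // before the kv-cache is exhausted.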

public abstract String modelPath ()

The path that points to the TFLite model file.

public abstract int randomSeed ()

Random seed for sampling tokens.

public abstract float temperature ()

Randomness when decoding the next token. A value of 0.0f means greedy decoding.

public abstract int topK ()

The number of top-scoring tokens to sample from at each decoding step. A value of 1 means greedy decoding.
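
temperature and topK interact at each decoding step: logits are divided by the temperature, the K highest-scoring tokens are kept, and the next token is drawn from the renormalized distribution. The sketch below illustrates this standard scheme, not the library's internal implementation; it shows why topK = 1 (or temperature = 0.0f) collapses to greedy decoding. Seeding the Random with the randomSeed() value makes the draw reproducible.

    import java.util.Arrays;
    import java.util.Comparator;
    import java.util.Random;

    final class TopKSampler {
      /** Samples a token index from logits using top-K sampling with temperature. */
      static int sample(float[] logits, int topK, float temperature, Random rng) {
        if (topK == 1 || temperature == 0.0f) {
          // Greedy decoding: always take the highest-scoring token.
          int best = 0;
          for (int i = 1; i < logits.length; i++) {
            if (logits[i] > logits[best]) best = i;
          }
          return best;
        }
        // Token indices ordered by descending logit.
        Integer[] order = new Integer[logits.length];
        for (int i = 0; i < order.length; i++) order[i] = i;
        Arrays.sort(order, Comparator.comparingDouble(i -> -logits[i]));
        int k = Math.min(topK, logits.length);

        // Softmax over the temperature-scaled top-K logits
        // (shifted by the max logit for numerical stability).
        double[] weights = new double[k];
        double maxLogit = logits[order[0]];
        double sum = 0.0;
        for (int i = 0; i < k; i++) {
          weights[i] = Math.exp((logits[order[i]] - maxLogit) / temperature);
          sum += weights[i];
        }
        // Draw one token from the renormalized distribution.
        double r = rng.nextDouble() * sum;
        for (int i = 0; i < k; i++) {
          r -= weights[i];
          if (r <= 0) return order[i];
        }
        return order[k - 1];
      }
    }

For example, sample(logits, 40, 0.8f, new Random(randomSeed)) draws from the 40 most likely tokens, while sample(logits, 1, 0.8f, new Random(randomSeed)) always returns the argmax.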