Base class for chat models. It extends the BaseLanguageModel class and provides methods for generating chat completions from input messages.

Properties

ParsedCallOptions: Omit<ChatOpenAICallOptions, never>
caller: AsyncCaller

The async caller should be used by subclasses to make any async calls, which will thus benefit from the concurrency and retry logic.

frequencyPenalty: number

A value that influences the probability of generated tokens appearing based on their cumulative frequency in generated text. Positive values will make tokens less likely to appear as their frequency increases and decrease the likelihood of the model repeating the same statements verbatim.

modelName: string

Model name to use.

n: number

The number of completion choices to generate per prompt as part of an overall completions response. Because this setting can generate many completions, it can quickly consume your token quota. Use it carefully and set reasonable values for maxTokens and stop.

presencePenalty: number

A value that influences the probability of generated tokens appearing based on their existing presence in generated text. Positive values will make tokens less likely to appear when they already exist and increase the model's likelihood to output new topics.
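To make the difference between frequencyPenalty and presencePenalty concrete, here is a minimal sketch of one commonly described adjustment: each token's logit is reduced by its count times the frequency penalty, plus a flat presence penalty if it has appeared at all. The helper `applyPenalties` and the sample logits are hypothetical; the model's actual behavior may differ.

```typescript
// Illustrative only: `applyPenalties` is a hypothetical helper, not a library API.
// Assumed formula: logit - count * frequencyPenalty - (count > 0 ? presencePenalty : 0)
function applyPenalties(
  logits: Record<string, number>,
  generated: string[],
  frequencyPenalty: number,
  presencePenalty: number,
): Record<string, number> {
  // Count how often each token has already been generated.
  const counts: Record<string, number> = {};
  for (const token of generated) {
    counts[token] = (counts[token] ?? 0) + 1;
  }

  const adjusted: Record<string, number> = {};
  for (const token of Object.keys(logits)) {
    const count = counts[token] ?? 0;
    const logit = logits[token] ?? 0;
    // Frequency penalty grows with every repeat; presence penalty is applied once.
    adjusted[token] = logit - count * frequencyPenalty - (count > 0 ? presencePenalty : 0);
  }
  return adjusted;
}

const penalized = applyPenalties(
  { cat: 2, dog: 2, fish: 2 },
  ["cat", "cat", "dog"],
  0.5, // frequencyPenalty
  0.25, // presencePenalty
);
console.log(penalized);
```

In this toy run, "cat" (seen twice) is penalized more than "dog" (seen once), and "fish" (never seen) is untouched.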

streaming: boolean

Whether to stream the results. Enabling streaming disables tokenUsage reporting.

temperature: number

The sampling temperature to use that controls the apparent creativity of generated completions. Higher values will make output more random while lower values will make results more focused and deterministic. It is not recommended to modify temperature and top_p for the same completions request as the interaction of these two settings is difficult to predict.

topP: number

An alternative to sampling with temperature called nucleus sampling. This value causes the model to consider the results of tokens with the provided probability mass. As an example, a value of 0.15 will cause only the tokens comprising the top 15% of probability mass to be considered. It is not recommended to modify temperature and top_p for the same completions request as the interaction of these two settings is difficult to predict.
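The idea of restricting sampling to a probability mass can be sketched with a toy filter. This is only an illustration of nucleus sampling in general, not the model's implementation; `nucleusFilter` and the sample distribution are hypothetical.

```typescript
// Illustrative only: a toy nucleus (top-p) filter over a token distribution.
type TokenProb = { token: string; prob: number };

function nucleusFilter(dist: TokenProb[], topP: number): TokenProb[] {
  // Sort from most to least likely without mutating the caller's array.
  const sorted = [...dist].sort((a, b) => b.prob - a.prob);
  const kept: TokenProb[] = [];
  let cumulative = 0;
  for (const entry of sorted) {
    kept.push(entry);
    cumulative += entry.prob;
    // Stop once the kept tokens cover the requested probability mass.
    if (cumulative >= topP) break;
  }
  // Renormalize so the surviving probabilities sum to 1.
  return kept.map((e) => ({ token: e.token, prob: e.prob / cumulative }));
}

const sampleDist: TokenProb[] = [
  { token: "the", prob: 0.5 },
  { token: "a", prob: 0.25 },
  { token: "an", prob: 0.125 },
  { token: "this", prob: 0.125 },
];

// With topP = 0.75 only the two most likely tokens remain candidates.
const nucleus = nucleusFilter(sampleDist, 0.75);
console.log(nucleus.map((e) => e.token));
```

A lower topP shrinks the candidate set, which is why it narrows output in a way that overlaps with lowering temperature.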

verbose: boolean

Whether to print out response text.

azureExtensionOptions?: AzureExtensionsOptions
azureOpenAIApiCompletionsDeploymentName?: string
azureOpenAIApiKey?: string

API key to use when making requests to Azure OpenAI.

azureOpenAIEndpoint?: string

Endpoint to use when making requests to Azure OpenAI.

callbacks?: Callbacks
logitBias?: Record<string, number>

A map between GPT token IDs and bias scores that influences the probability of specific tokens appearing in a completions response. Token IDs are computed via external tokenizer tools, while bias scores reside in the range of -100 to 100 with minimum and maximum values corresponding to a full ban or exclusive selection of a token, respectively. The exact behavior of a given bias score varies by model.
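The following sketch shows why a bias of -100 behaves like a ban, assuming the bias is simply added to a token's logit before sampling. The helpers `applyLogitBias` and `softmax`, and the token IDs "1001" and "1002", are hypothetical and chosen only for illustration.

```typescript
// Illustrative only: assumes the bias is added to the logit before softmax.
function applyLogitBias(
  logits: Record<string, number>,
  logitBias: Record<string, number>,
): Record<string, number> {
  const biased: Record<string, number> = { ...logits };
  for (const tokenId of Object.keys(logitBias)) {
    const bias = logitBias[tokenId] ?? 0;
    if (bias < -100 || bias > 100) {
      throw new RangeError("bias for token " + tokenId + " must be within [-100, 100]");
    }
    // Only tokens present in the vocabulary slice are adjusted.
    if (tokenId in biased) biased[tokenId] = (biased[tokenId] ?? 0) + bias;
  }
  return biased;
}

function softmax(logits: Record<string, number>): Record<string, number> {
  const ids = Object.keys(logits);
  // Subtract the maximum logit for numerical stability.
  const max = Math.max(...ids.map((id) => logits[id] ?? 0));
  const exps: Record<string, number> = {};
  let total = 0;
  for (const id of ids) {
    const e = Math.exp((logits[id] ?? 0) - max);
    exps[id] = e;
    total += e;
  }
  const probs: Record<string, number> = {};
  for (const id of ids) probs[id] = (exps[id] ?? 0) / total;
  return probs;
}

// Two equally likely tokens; the second one receives the minimum bias.
const biasedProbs = softmax(applyLogitBias({ "1001": 1, "1002": 1 }, { "1002": -100 }));
console.log(biasedProbs);
```

After the bias, the probability of token "1002" is vanishingly small, so it is effectively never selected.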

maxTokens?: number

Maximum number of tokens to generate in the completion. -1 returns as many tokens as possible given the prompt and the model's maximum context size.

metadata?: Record<string, unknown>
modelKwargs?: Record<string, any>

Holds any additional parameters that are valid to pass to openai.createCompletion that are not explicitly specified on this class.

name?: string
stop?: string[]

A collection of textual sequences that will end completions generation.

tags?: string[]
timeout?: number

Timeout to use when making requests to OpenAI.

user?: string

An identifier for the caller or end user of the operation. This may be used for tracking or rate-limiting purposes.

Methods

  • Assigns new fields to the dict output of this runnable. Returns a new runnable.

    Parameters

    • mapping: RunnableMapLike<Record<string, unknown>, Record<string, unknown>>

    Returns Runnable<any, any, RunnableConfig>

  • Parameters

    • messages: BaseMessageLike[]

      An array of BaseMessage instances.

    • Optional options: string[] | ChatOpenAICallOptions

      The call options or an array of stop sequences.

    • Optional callbacks: Callbacks

      The callbacks for the language model.

    Returns Promise<BaseMessage>

    A Promise that resolves to a BaseMessage.

    Deprecated

    Use .invoke() instead. Will be removed in 0.2.0.

    Makes a single call to the chat model.

  • Parameters

    • promptValue: BasePromptValueInterface

      The value of the prompt.

    • Optional options: string[] | ChatOpenAICallOptions

      The call options or an array of stop sequences.

    • Optional callbacks: Callbacks

      The callbacks for the language model.

    Returns Promise<BaseMessage>

    A Promise that resolves to a BaseMessage.

    Deprecated

    Use .invoke() instead. Will be removed in 0.2.0.

    Makes a single call to the chat model with a prompt value.

  • Generates chat results for each list of input messages.

    Parameters

    • messages: BaseMessageLike[][]

      An array of arrays of BaseMessage instances.

    • Optional options: string[] | ChatOpenAICallOptions

      The call options or an array of stop sequences.

    • Optional callbacks: Callbacks

      The callbacks for the language model.

    Returns Promise<LLMResult>

    A Promise that resolves to an LLMResult.

  • Generates chat results from the input prompt values.

    Parameters

    • promptValues: BasePromptValueInterface[]

      An array of BasePromptValue instances.

    • Optional options: string[] | ChatOpenAICallOptions

      The call options or an array of stop sequences.

    • Optional callbacks: Callbacks

      The callbacks for the language model.

    Returns Promise<LLMResult>

    A Promise that resolves to an LLMResult.

  • Parameters

    • Optional suffix: string

    Returns string

  • Parameters

    Returns Promise<number>

  • Get the parameters used to invoke the model

    Parameters

    Returns any

  • Parameters

    • text: string

      The text input.

    • Optional options: string[] | ChatOpenAICallOptions

      The call options or an array of stop sequences.

    • Optional callbacks: Callbacks

      The callbacks for the language model.

    Returns Promise<string>

    A Promise that resolves to a string.

    Deprecated

    Use .invoke() instead. Will be removed in 0.2.0.

    Predicts the next message based on a text input.

  • Parameters

    • messages: BaseMessage[]

      An array of BaseMessage instances.

    • Optional options: string[] | ChatOpenAICallOptions

      The call options or an array of stop sequences.

    • Optional callbacks: Callbacks

      The callbacks for the language model.

    Returns Promise<BaseMessage>

    A Promise that resolves to a BaseMessage.

    Deprecated

    Use .invoke() instead. Will be removed in 0.2.0.

    Predicts the next message based on the input messages.

  • Generate a stream of events emitted by the internal steps of the runnable.

    Use to create an iterator over StreamEvents that provide real-time information about the progress of the runnable, including StreamEvents from intermediate results.

    A StreamEvent is a dictionary with the following schema:

    • event: string - Event names are of the format: on_[runnable_type]_(start|stream|end).
    • name: string - The name of the runnable that generated the event.
    • run_id: string - Randomly generated ID associated with the given execution of the runnable that emitted the event. A child runnable that gets invoked as part of the execution of a parent runnable is assigned its own unique ID.
    • tags: string[] - The tags of the runnable that generated the event.
    • metadata: Record<string, any> - The metadata of the runnable that generated the event.
    • data: Record<string, any>

    Below is a table that illustrates some events that might be emitted by various chains. Metadata fields have been omitted from the table for brevity. Chain definitions have been included after the table.

    | event | name | chunk | input | output |
    |---|---|---|---|---|
    | on_llm_start | [model name] | | {'input': 'hello'} | |
    | on_llm_stream | [model name] | 'Hello' OR AIMessageChunk("hello") | | |
    | on_llm_end | [model name] | | 'Hello human!' | |
    | on_chain_start | format_docs | | | |
    | on_chain_stream | format_docs | "hello world!, goodbye world!" | | |
    | on_chain_end | format_docs | | [Document(...)] | "hello world!, goodbye world!" |
    | on_tool_start | some_tool | | {"x": 1, "y": "2"} | |
    | on_tool_stream | some_tool | {"x": 1, "y": "2"} | | |
    | on_tool_end | some_tool | | | {"x": 1, "y": "2"} |
    | on_retriever_start | [retriever name] | | {"query": "hello"} | |
    | on_retriever_chunk | [retriever name] | {documents: [...]} | | |
    | on_retriever_end | [retriever name] | | {"query": "hello"} | {documents: [...]} |
    | on_prompt_start | [template_name] | | {"question": "hello"} | |
    | on_prompt_end | [template_name] | | {"question": "hello"} | ChatPromptValue(messages: [SystemMessage, ...]) |

    Parameters

    Returns AsyncGenerator<StreamEvent, any, unknown>
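Consuming such an event stream usually means iterating with `for await` and filtering on the event name. The sketch below uses a local `StreamEvent` interface that mirrors the schema above and a mock generator, `mockStreamEvents`, in place of a real model. The `chunk`, `input`, and `output` keys inside `data` are assumptions based on the table columns.

```typescript
// Local stand-in that mirrors the StreamEvent schema described above.
interface StreamEvent {
  event: string;
  name: string;
  run_id: string;
  tags: string[];
  metadata: Record<string, any>;
  data: Record<string, any>;
}

// Hypothetical mock of an event stream; a real one would come from the runnable.
async function* mockStreamEvents(): AsyncGenerator<StreamEvent> {
  const base = { name: "mock-model", run_id: "run-1", tags: [], metadata: {} };
  yield { ...base, event: "on_llm_start", data: { input: "hello" } };
  yield { ...base, event: "on_llm_stream", data: { chunk: "Hello" } };
  yield { ...base, event: "on_llm_stream", data: { chunk: " human!" } };
  yield { ...base, event: "on_llm_end", data: { output: "Hello human!" } };
}

// Concatenate only the streamed chunks, ignoring start and end events.
async function collectStreamedText(events: AsyncIterable<StreamEvent>): Promise<string> {
  let text = "";
  for await (const e of events) {
    if (e.event === "on_llm_stream") {
      text += String(e.data.chunk);
    }
  }
  return text;
}

collectStreamedText(mockStreamEvents()).then((text) => console.log(text));
```

Filtering on `event` (and, where needed, on `name` or `tags`) keeps the consumer independent of how many intermediate steps the runnable contains.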

  • Stream all output from a runnable, as reported to the callback system. This includes all inner runs of LLMs, Retrievers, Tools, etc. Output is streamed as Log objects, which include a list of jsonpatch ops that describe how the state of the run has changed in each step, and the final state of the run. The jsonpatch ops can be applied in order to construct state.

    Parameters

    Returns AsyncGenerator<RunLogPatch, any, unknown>
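Applying the patches in order to rebuild state can be sketched with a deliberately small JSON Patch interpreter. It handles only `add` and `replace` and ignores JSON Pointer escaping, so it is not a complete RFC 6902 implementation. The `ops` field name and the sample state keys (`streamed_output`, `final_output`) are assumptions made for illustration.

```typescript
// Illustrative only: minimal subset of JSON Patch ("add" and "replace").
type PatchOp = { op: "add" | "replace"; path: string; value: unknown };
type LogPatch = { ops: PatchOp[] };

function applyOp(state: any, op: PatchOp): any {
  // "" addresses the whole document; "/a/b" addresses state.a.b.
  const keys = op.path.split("/").slice(1);
  if (keys.length === 0) return op.value;

  // Walk down to the parent of the target location.
  let parent = state;
  for (const key of keys.slice(0, -1)) parent = parent[key];

  const last = keys[keys.length - 1];
  if (Array.isArray(parent) && op.op === "add") {
    // "-" means append; otherwise insert at the given index.
    if (last === "-") parent.push(op.value);
    else parent.splice(Number(last), 0, op.value);
  } else {
    parent[last] = op.value;
  }
  return state;
}

// Note: the state is mutated in place as patches are applied.
function buildState(patches: LogPatch[]): any {
  let state: any = undefined;
  for (const patch of patches) {
    for (const op of patch.ops) state = applyOp(state, op);
  }
  return state;
}

const samplePatches: LogPatch[] = [
  { ops: [{ op: "replace", path: "", value: { streamed_output: [], final_output: null } }] },
  { ops: [{ op: "add", path: "/streamed_output/-", value: "Hello" }] },
  { ops: [{ op: "add", path: "/streamed_output/-", value: " human!" }] },
  { ops: [{ op: "replace", path: "/final_output", value: "Hello human!" }] },
];

const finalState = buildState(samplePatches);
console.log(finalState);
```

In practice a maintained JSON Patch library is a safer choice than a hand-rolled interpreter like this one.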

  • Default implementation of transform, which buffers input and then calls stream. Subclasses should override this method if they can start producing output while input is still being generated.

    Parameters

    Returns AsyncGenerator<BaseMessageChunk, any, unknown>
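The buffer-then-stream behavior described above can be sketched as an async generator that drains its input before producing any output. The names `bufferThenStream`, `inputChunks`, and `shout` are hypothetical, and strings stand in for the real input and message chunk types.

```typescript
// Illustrative only: drain the whole input, then delegate to a stream function.
async function* bufferThenStream(
  input: AsyncIterable<string>,
  stream: (fullInput: string) => AsyncIterable<string>,
): AsyncGenerator<string> {
  let buffered = "";
  // No output is produced until the input is fully consumed.
  for await (const chunk of input) buffered += chunk;
  yield* stream(buffered);
}

// Hypothetical input that arrives in two pieces.
async function* inputChunks(): AsyncGenerator<string> {
  yield "hel";
  yield "lo";
}

// Hypothetical stream function: emits one uppercased character at a time.
async function* shout(text: string): AsyncGenerator<string> {
  for (const ch of text.split("")) yield ch.toUpperCase();
}

async function collectTransformed(): Promise<string[]> {
  const out: string[] = [];
  for await (const piece of bufferThenStream(inputChunks(), shout)) out.push(piece);
  return out;
}

collectTransformed().then((out) => console.log(out.join("")));
```

A subclass that can emit output per input chunk would replace the buffering loop with incremental processing, which is the override case the description mentions.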

  • Bind lifecycle listeners to a Runnable, returning a new Runnable. The Run object contains information about the run, including its id, type, input, output, error, startTime, endTime, and any tags or metadata added to the run.

    Parameters

    • params: {
          onEnd?: ((run, config?) => void | Promise<void>);
          onError?: ((run, config?) => void | Promise<void>);
          onStart?: ((run, config?) => void | Promise<void>);
      }

      The object containing the callback functions.

      • Optional onEnd?: ((run, config?) => void | Promise<void>)
          • (run, config?): void | Promise<void>
          • Called after the runnable finishes running, with the Run object.

            Parameters

            Returns void | Promise<void>

      • Optional onError?: ((run, config?) => void | Promise<void>)
          • (run, config?): void | Promise<void>
          • Called if the runnable throws an error, with the Run object.

            Parameters

            Returns void | Promise<void>

      • Optional onStart?: ((run, config?) => void | Promise<void>)
          • (run, config?): void | Promise<void>
          • Called before the runnable starts running, with the Run object.

            Parameters

            Returns void | Promise<void>

    Returns Runnable<BaseLanguageModelInput, BaseMessageChunk, ChatOpenAICallOptions>
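The order in which the lifecycle callbacks fire can be illustrated with a small wrapper around an async function. This is a simplified stand-in, not the library's implementation: `withListenersSketch` and the reduced `Run` shape are hypothetical, and the optional `config` argument is omitted.

```typescript
// Illustrative only: a reduced Run shape and a wrapper that fires listeners.
type Run = {
  id: string;
  input: unknown;
  output?: unknown;
  error?: string;
  startTime: number;
  endTime?: number;
};
type Listener = (run: Run) => void | Promise<void>;
interface Listeners {
  onStart?: Listener;
  onEnd?: Listener;
  onError?: Listener;
}

function withListenersSketch<I, O>(
  fn: (input: I) => Promise<O>,
  listeners: Listeners,
): (input: I) => Promise<O> {
  return async (input: I): Promise<O> => {
    const run: Run = { id: "run-1", input, startTime: Date.now() };
    // onStart fires before the wrapped function runs.
    await listeners.onStart?.(run);
    let output: O;
    try {
      output = await fn(input);
    } catch (err) {
      // onError fires if the wrapped function throws; the error is rethrown.
      run.error = String(err);
      run.endTime = Date.now();
      await listeners.onError?.(run);
      throw err;
    }
    // onEnd fires after a successful run, with the output recorded on the Run.
    run.output = output;
    run.endTime = Date.now();
    await listeners.onEnd?.(run);
    return output;
  };
}

const lifecycle: string[] = [];
const wrapped = withListenersSketch(async (x: string) => x.toUpperCase(), {
  onStart: () => {
    lifecycle.push("start");
  },
  onEnd: (run) => {
    lifecycle.push("end:" + String(run.output));
  },
});

wrapped("hi").then((out) => console.log(out, lifecycle));
```

As with the documented method, the wrapper returns a new callable and leaves the original function untouched.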

  • Type Parameters

    • RunInput = BaseLanguageModelInput

    • RunOutput extends ZodObject<any, any, any, any, {}> = ZodObject<any, any, any, any, {}>

    Parameters

    • __namedParameters: {
          includeRaw: true;
          name: string;
          schema: Record<string, any> | ZodEffects<RunOutput, output<RunOutput>, input<RunOutput>>;
          method?: "functionCalling" | "jsonMode";
      }
      • includeRaw: true
      • name: string
      • schema: Record<string, any> | ZodEffects<RunOutput, output<RunOutput>, input<RunOutput>>
      • Optional method?: "functionCalling" | "jsonMode"

    Returns Runnable<RunInput, {
        parsed: RunOutput;
        raw: BaseMessage;
    }, RunnableConfig>

  • Type Parameters

    • RunInput = BaseLanguageModelInput

    • RunOutput extends ZodObject<any, any, any, any, {}> = ZodObject<any, any, any, any, {}>

    Parameters

    • __namedParameters: {
          name: string;
          schema: Record<string, any> | ZodEffects<RunOutput, output<RunOutput>, input<RunOutput>>;
          includeRaw?: false;
          method?: "functionCalling" | "jsonMode";
      }
      • name: string
      • schema: Record<string, any> | ZodEffects<RunOutput, output<RunOutput>, input<RunOutput>>
      • Optional includeRaw?: false
      • Optional method?: "functionCalling" | "jsonMode"

    Returns Runnable<RunInput, RunOutput, RunnableConfig>

  • Model wrapper that returns outputs formatted to match the given schema.

    Type Parameters

    • RunInput extends BaseLanguageModelInput = BaseLanguageModelInput

      The input type for the Runnable, expected to be the same input for the LLM.

    • RunOutput extends ZodObject<any, any, any, any, {}> = ZodObject<any, any, any, any, {}>

      The output type for the Runnable, expected to be a Zod schema object for structured output validation.

    Parameters

    • __namedParameters: {
          name: string;
          schema: Record<string, any> | ZodEffects<RunOutput, output<RunOutput>, input<RunOutput>>;
          includeRaw?: boolean;
          method?: "functionCalling" | "jsonMode";
      }
      • name: string
      • schema: Record<string, any> | ZodEffects<RunOutput, output<RunOutput>, input<RunOutput>>
      • Optional includeRaw?: boolean
      • Optional method?: "functionCalling" | "jsonMode"

    Returns Runnable<RunInput, RunOutput, RunnableConfig> | Runnable<RunInput, {
        parsed: RunOutput;
        raw: BaseMessage;
    }, RunnableConfig>

    A new runnable that calls the LLM with structured output.
