Use `Union` and `Literal` for multiple choices #1420

rlouf · 2025-02-17T16:24:57Z

For instance to choose between the numbers 1, 2 and 3:

model(prompt, Literal[1, 2, 3])

And to generate a json object that is valid to either the User or the Customer model:

model(prompt, Union[User, Customer])

The text was updated successfully, but these errors were encountered:

RobinPicard · 2025-02-17T19:48:26Z

Right now Choice accepts for its definition either an Enum or a list[str], neither of which can be used directly.
Would do you think about:

making Choice accept Enum, list[str], Union and Literal for its definition
also letting the user use Enum, Union and Literal directly when calling the model/generator and at the top of the Generator function turning them into a Choice instance.

The advantage would be to give lots of options to the user, but still limit the number of different types the models have to handle in the format_input method of their ModelTypeAdapter. The downside is that the user would not be able to use the model.generate method directly (but do we want them to do that?).

rlouf · 2025-02-17T21:06:44Z

This is a v1.0 issue where they will be no distinct choice, json, format or regex functions and everything will be inferred from the output type specified by the user.

The equivalent of choice would be model(prompt, Literal[x, y, z]) or model(prompt, Enum). model(prompt, list[x]) now means "generate a list of X". model(prompt, Union[X, Y]) means "generate X or Y", which is more general than choice (unless it is model(prompt, Union[Literal[x], Literal[y]] which is equivalent to model(prompt, Literal[x, y]).

The way to think about it is "if this call were encapsulated in a function, what would this function's output type be?":

def foo(prompt: str) -> Literal[1, 2, 3]:
    return model(prompt, Literal[1, 2, 3])

def bar(prompt: str) -> List[Client]:
    return model(prompt, List[Client])

rlouf · 2025-02-17T21:10:06Z

Tangentially related, this makes me think that we need to make the output type of [X]Generation.__call__ match the output type specified by the user so mypy can look for type errors in a user's code.

rlouf added enhancement impact/user interface Related to improving the user interface structured generation Linked to structured generation labels Feb 17, 2025

rlouf added this to the 1.0 milestone Feb 17, 2025

rlouf assigned RobinPicard Feb 17, 2025

This was referenced Feb 17, 2025

Dynamically change the output type annotation of [X]Generator.__call__ to match the output type requested by the user #1424

Open

Refactor the type system #1441

Draft

rlouf linked a pull request Feb 23, 2025 that will close this issue

Refactor the type system #1441

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use `Union` and `Literal` for multiple choices #1420

Use `Union` and `Literal` for multiple choices #1420

rlouf commented Feb 17, 2025

RobinPicard commented Feb 17, 2025

rlouf commented Feb 17, 2025

rlouf commented Feb 17, 2025

Use Union and Literal for multiple choices #1420

Use Union and Literal for multiple choices #1420

Comments

rlouf commented Feb 17, 2025

RobinPicard commented Feb 17, 2025

rlouf commented Feb 17, 2025

rlouf commented Feb 17, 2025

Use `Union` and `Literal` for multiple choices #1420

Use `Union` and `Literal` for multiple choices #1420