Optional api
Optional baseURL
Optional client
Optional max
Optional metadata
Optional parallel
Optional service
Specifies the processing tier (e.g. 'auto', 'default', 'priority', 'flex').
Optional store
Optional strict
Optional temperature
Optional tool
Optional use
Whether to use the WebSocket API.
true
Upper bound for the number of tokens that can be generated for a response.