ExtractVlmTextOperation

Extract text from a document using vision language models (VLMs). VLMs can understand document layout and structure more intelligently than traditional OCR.

Properties

Name	Type	Required	Description
llm_spec	LlmSpec	Yes
preprocessing_configuration	Optional[VlmPreprocessingConfig]	No
image_spec	Optional[ImageSpec]	No
output_format	TextOutputFormat	Yes
page_range	Optional[PageRange]	No
type	Literal["extractVlmText"]	Yes	None

[Back to Model list] [Back to API list] [Back to README]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ExtractVlmTextOperation

Properties

FilesExpand file tree

ExtractVlmTextOperation.md

Latest commit

History

ExtractVlmTextOperation.md

File metadata and controls

ExtractVlmTextOperation

Properties