Optional chunkchunkDocument represents whether to separate the document into chunks.
Optional chunkchunkSize represents the size of chunks for the documents.
Optional embedembedImages represents whether to embed images from the document.
Optional enableOCRenableOCR represents whether to enable OCR for document parsing.
file represents the document file to upload (can be File object or path string).
Optional injectioninjection represents whether to check the output for a prompt injection.
Optional outputoutputFormat represents the output format for the content of the document.
Optional piipii represents whether to check the output for PII.
Optional replacereplaceMethod represents the method to replace any found PII.
Optional toxicitytoxicity represents whether to check the output for toxicity.
DocumentExtractInput represents the input for document extraction.