Method that reads the buffer contents and metadata based on the type of
filePathOrBlob
, and then calls the parse()
method to parse the
buffer and return the documents.
Promise that resolves with an array of Document
objects.
Optional
splitter: BaseDocumentTransformer<DocumentInterface<Record<string, any>>[], DocumentInterface<Record<string, any>>[]>A Promise that resolves with an array of Document instances, each split according to the provided TextSplitter.
A method that takes a raw
buffer and metadata
as parameters and
returns a promise that resolves to an array of Document
instances. It
uses the extractRawText
function from the mammoth
module to extract
the raw text content from the buffer. If the extracted text content is
empty, it returns an empty array. Otherwise, it creates a new
Document
instance with the extracted text content and the provided
metadata, and returns it as an array.
The raw buffer from which to extract text content.
The metadata to be associated with the created Document
instance.
A promise that resolves to an array of Document
instances.
Static
importsStatic method that imports the readFile
function from the
fs/promises
module in Node.js. It is used to dynamically import the
function when needed. If the import fails, it throws an error
indicating that the fs/promises
module is not available in the
current environment.
Promise that resolves with an object containing the readFile
function.
A class that extends the
BufferLoader
class. It represents a document loader that loads documents from DOCX files.