Method that reads the buffer contents and metadata based on the type of
filePathOrBlob
, and then calls the parse()
method to parse the
buffer and return the documents.
Promise that resolves with an array of Document
objects.
Optional
splitter: BaseDocumentTransformer<DocumentInterface<Record<string, any>>[], DocumentInterface<Record<string, any>>[]>A Promise that resolves with an array of Document instances, each split according to the provided TextSplitter.
A method that takes a raw
buffer and metadata
as parameters and
returns a promise that resolves to an array of Document
instances. It
uses the parseOfficeAsync
function from the officeparser
module to extract
the raw text content from the buffer. If the extracted powerpoint content is
empty, it returns an empty array. Otherwise, it creates a new
Document
instance with the extracted powerpoint content and the provided
metadata, and returns it as an array.
The buffer to be parsed.
The metadata of the document.
A promise that resolves to an array of Document
instances.
Static
importsStatic method that imports the readFile
function from the
fs/promises
module in Node.js. It is used to dynamically import the
function when needed. If the import fails, it throws an error
indicating that the fs/promises
module is not available in the
current environment.
Promise that resolves with an object containing the readFile
function.
A class that extends the
BufferLoader
class. It represents a document loader that loads documents from PDF files.