> ## Documentation Index > Fetch the complete documentation index at: https://r2r-patch-fix-ingestion.mintlify.site/llms.txt > Use this file to discover all available pages before exploring further. # Ingestion > Ingesting files with R2R. ## Document Ingestion and Management ### Ingest Files Ingest files or directories into your R2R system: ```javascript const files = [ { path: 'path/to/file1.txt', name: 'file1.txt' }, { path: 'path/to/file2.txt', name: 'file2.txt' } ]; const metadatas = [{ key1: 'value1' }, { key2: 'value2' }]; const ingestResponse = await client.ingestFiles(files, { metadatas, user_ids: ['user-id-1', 'user-id-2'], }); ``` The response from the R2R system after ingesting the files. ```bash [{'message': 'Ingestion task queued successfully.', 'task_id': '6e27dfca-606d-422d-b73f-2d9e138661b4', 'document_id': 'c3291abf-8a4e-5d9d-80fd-232ef6fd8526'}, ...] ``` An array of file paths, File objects, or objects with path and name properties to ingest. An optional array of metadata objects corresponding to each file. An optional array of document IDs to assign to the ingested files. An optional array of user IDs associated with the ingested files. The ingestion config override parameter enables developers to customize their R2R chunking strategy at runtime. Which chunking provider to use. Options are "r2r", "unstructured\_local", or "unstructured\_api". Which chunking method to apply. Options are "by\_title", "basic", "recursive", or "character". The average size of chunks, in tokens. The default overlap between chunks. Sets a maximum size on output chunks. Combine chunks smaller than this number of characters. Maximum number of characters per chunk. Whether to include coordinates in the output. Encoding to use for text files. Types of image blocks to extract. Content type for uncompressed gzip files. Name of the high-resolution model to use. Whether to include original elements in the output. Whether to include page breaks in the output. List of languages to consider for text processing. Whether to allow sections to span multiple pages. Start a new chunk after this many characters. Languages to use for OCR. Format of the output. Number of characters to overlap between chunks. Whether to overlap all chunks. Whether to infer table structure in PDFs. Threshold for considering chunks similar. Types of tables to skip inferring. Concurrency level for splitting PDFs. Whether to split PDFs by page. Page number to start processing from. Strategy for processing. Options are "auto", "fast", or "hi\_res". Strategy for chunking. Options are "by\_title" or "basic". Whether to generate unique IDs for elements. Whether to keep XML tags in the output. ### Update Files Update existing documents: ```javascript const files = [ { path: '/path/to/updated_file1.txt', name: 'updated_file1.txt' } ]; const document_ids = ['document-id-1']; const updateResponse = await client.updateFiles(files, { document_ids, metadatas: [{ key: 'updated_value' }] // to overwrite the existing metadata }); ``` The response from the R2R system after updating the files. ```bash {'results': {'processed_documents': [{'id': '9f375ce9-efe9-5b57-8bf2-a63dee5f3621', 'title': 'aristotle_v2.txt'}], 'failed_documents': [], 'skipped_documents': []}} ``` An array of File objects or objects with path and name properties to update. An array of document IDs corresponding to the files being updated. An optional array of metadata objects for the updated files. The ingestion config override parameter enables developers to customize their R2R chunking strategy at runtime. Which chunking provider to use, `r2r` or `unstructured`. Selecting `unstructured` is generally recommended when parsing with `unstructured` or `unstructured_api`. Which chunking method to apply? When using unstructured, `by_title` or `basic` are supported. The average size of chunks, in tokens. The default overlap between chunks. Sets a maximum size on output chunks. ### Documents Overview Retrieve high-level document information, restricted to user files, except when called by a superuser where it will then return results from over all users: ```javascript const documentsOverview = await client.documentsOverview(); ``` An array of objects containing document information. ```bash [ { 'document_id': '9fbe403b-c11c-5aae-8ade-ef22980c3ad1', 'version': 'v1', 'size_in_bytes': 73353, 'metadata': {}, 'status': 'success', 'user_id': '2acb499e-8428-543b-bd85-0d9098718220', 'title': 'aristotle.txt', 'created_at': '2024-07-21T20:09:14.218741Z', 'updated_at': '2024-07-21T20:09:14.218741Z', 'metadata': {'x': 'y'} }, ... ] ``` An optional array of document IDs to filter the overview. ### Document Chunks Fetch chunks for a particular document: ```javascript const documentId = '9fbe403b-c11c-5aae-8ade-ef22980c3ad1'; const chunks = await client.documentChunks(documentId); ``` An array of objects containing chunk information. ```bash [{ 'text': 'Aristotle[A] (Greek: Ἀριστοτέλης Aristotélēs, pronounced [aristotélɛːs]; 384–322 BC) was an Ancient Greek philosopher and polymath...', 'user_id': '2acb499e-8428-543b-bd85-0d9098718220', 'document_id': '9fbe403b-c11c-5aae-8ade-ef22980c3ad1', 'extraction_id': 'aeba6400-1bd0-5ee9-8925-04732d675434', 'fragment_id': 'f48bcdad-4155-52a4-8c9d-8ba06e996ba3' 'metadata': {'title': 'aristotle.txt', 'version': 'v0', 'chunk_order': 0}} }, ...] ``` The ID of the document to retrieve chunks for. ### Delete Documents Delete a document by its ID: ```javascript const deleteResponse = await client.delete({ document_id: "91662726-7271-51a5-a0ae-34818509e1fd" }); ``` The response from the R2R system after successfully deleting the documents. ```bash {'results': {}} ``` A list of logical filters to perform over input documents fields which identifies the unique set of documents to delete (e.g., `{"document_id": {"$eq": "9fbe403b-c11c-5aae-8ade-ef22980c3ad1"}}`). Logical operations might include variables such as `"user_id"` or `"title"` and filters like `neq`, `gte`, etc.