Version: 4.59.1

File system connector

The file system connector scrapes a file system and returns basic file system metadata for folders and files. The connector also stores the root directory.

Supported Features

Exporting content structure
Exporting binaries into the Content Store

Features Currently Not Supported

Permissions
Exporting users

Resumability

If the extraction stops for any reason, the connector can resume where it left off. Use the Trigger component labeled 'Resume crawl' to do this.

Extracting Only Top Level Folder Items

If only the items in the specified folder need to be extracted (and not those in its sub-folders), only the Document Retrieve component (im5187dn8) needs to be updated. Change the query from

{
  "migration.flags.retrievedChildren": false,
  "kind": "CONTAINER",
  "migration.origin": "%origin%"
}

to

{
  "migration.flags.retrievedChildren": null,
  "kind": "CONTAINER",
  "migration.origin": "%origin%"
}

Settings

roots

An array of paths to the directories and files that need to be scraped. These paths are used to construct the configuration of the File System Crawler component. The File System Crawler component allows for more configuration if needed; refer to the File System Crawler component documentation.

Example: configuration of root directories to scrape on the file system:

[
  "C:/Windows",
  "C:/Drivers"
]
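
The example above writes Windows paths with forward slashes. If a list of paths is available in native backslash form, a small script along these lines could produce the array in the same style. This is only a sketch: the helper name to_roots is hypothetical, and whether the connector also accepts backslashes is not stated here.

```python
import json
from pathlib import PureWindowsPath


def to_roots(paths):
    """Convert Windows-style paths to the forward-slash form used above."""
    return [PureWindowsPath(p).as_posix() for p in paths]


# Hypothetical input, matching the directories in the example above.
roots = to_roots([r"C:\Windows", r"C:\Drivers"])
roots_json = json.dumps(roots, indent=2)
print(roots_json)
```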

mongoConnection

The MongoDB connection string, including the name of the database to connect to.
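
Because the database name is part of the connection string, it can be worth checking that it is actually present before starting a crawl. The sketch below does this with the standard library only. The host, port, and database name (localhost, 27017, contentstore) are placeholder assumptions, and the helper database_name is hypothetical.

```python
from urllib.parse import urlparse


def database_name(connection_string):
    """Return the database name in a MongoDB connection string, or None."""
    parsed = urlparse(connection_string)
    if parsed.scheme not in ("mongodb", "mongodb+srv"):
        raise ValueError("not a MongoDB connection string")
    # The database name is the path segment after the host list.
    name = parsed.path.lstrip("/")
    return name or None


# Placeholder values for illustration only.
with_db = database_name("mongodb://localhost:27017/contentstore")
without_db = database_name("mongodb://localhost:27017")
print(with_db, without_db)
```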

retrieveBinaries

When this setting is enabled, the connector also stores the binary content of the files in MongoDB (GridFS).

origin

Specifies the origin of the document in the Content Store.
