File system connector
The file system connector scrapes a file system and returns basic metadata for its folders and files. The connector also stores the root directory.
Supported Features
- Exporting content structure
- Exporting binaries into the Content Store
Features Currently Not Supported
- Permissions
- Exporting users
Resumability
If the extraction stops for any reason, the connector can resume where it left off. To do so, use the Trigger component labeled 'Resume crawl'.
Extracting Only Top Level Folder Items
To extract only the items in the specified folder (and not the contents of its sub-folders), only the Document Retrieve component (im5187dn8) needs to be updated. Change its query from
{
    "migration.flags.retrievedChildren": false,
    "kind": "CONTAINER",
    "migration.origin": "%origin%"
}
to
{
    "migration.flags.retrievedChildren": null,
    "kind": "CONTAINER",
    "migration.origin": "%origin%"
}
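The two queries differ only in the value they match on migration.flags.retrievedChildren. In MongoDB, an equality match on false selects documents where the field is exactly false, while an equality match on null selects documents where the field is null or absent. The following is a minimal Python sketch that emulates this matching rule in memory so the difference can be seen without a database. The helper names, the sample documents, and the origin value "example-origin" are hypothetical, and the sketch makes no claim about when the connector itself sets or clears the flag.

```python
_MISSING = object()  # sentinel meaning "the field is not present"


def get_path(doc, dotted_path):
    """Follow a dotted path such as 'migration.flags.retrievedChildren'."""
    current = doc
    for key in dotted_path.split("."):
        if not isinstance(current, dict) or key not in current:
            return _MISSING
        current = current[key]
    return current


def matches(doc, query):
    """Simplified emulation of MongoDB equality matching (no arrays, no operators)."""
    for path, expected in query.items():
        actual = get_path(doc, path)
        if expected is None:
            # A null match accepts both an explicit null and a missing field.
            if actual is not _MISSING and actual is not None:
                return False
        elif (actual is _MISSING
              or type(actual) is not type(expected)
              or actual != expected):
            return False
    return True


# The two queries from the text, with a hypothetical origin filled in.
false_query = {
    "migration.flags.retrievedChildren": False,
    "kind": "CONTAINER",
    "migration.origin": "example-origin",
}
null_query = dict(false_query, **{"migration.flags.retrievedChildren": None})

# Hypothetical documents: one without the flag, one with the flag set to false.
unflagged = {"kind": "CONTAINER", "migration": {"origin": "example-origin"}}
pending = {
    "kind": "CONTAINER",
    "migration": {"origin": "example-origin", "flags": {"retrievedChildren": False}},
}

for name, doc in (("unflagged", unflagged), ("pending", pending)):
    print(name, matches(doc, false_query), matches(doc, null_query))
```

In this sketch the original query selects only the document whose flag is false, and the modified query selects only the document that has no flag at all.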
Settings
roots
An array of paths to the directories and files that need to be scraped. These paths are used to construct the configuration of the File System Crawler component. The File System Crawler component offers further configuration options if needed; refer to the File System Crawler component documentation for details.
Example: configuration of root directories to scrape on the file system:
[
    "C:/Windows",
    "C:/Drivers"
]
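Before starting a crawl it can be useful to confirm that every configured root is reachable from the machine that runs the connector. The following is a minimal sketch of such a pre-check in Python. It is not part of the connector; the function name check_roots is hypothetical, and it assumes the roots value is available as a JSON array of path strings.

```python
import json
from pathlib import Path


def check_roots(roots_json):
    """Return a mapping of each configured root to whether it exists on disk."""
    roots = json.loads(roots_json)
    if not isinstance(roots, list) or not all(isinstance(r, str) for r in roots):
        raise ValueError("roots must be a JSON array of path strings")
    return {root: Path(root).exists() for root in roots}


if __name__ == "__main__":
    # The example value from the text; results depend on the local machine.
    for root, exists in check_roots('["C:/Windows", "C:/Drivers"]').items():
        print(root, "found" if exists else "missing")
```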
mongoConnection
The MongoDB connection string, including the name of the database to connect to.
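In the standard MongoDB connection string format, the database name is the path segment that follows the host list, for example mongodb://localhost:27017/contentstore. The following is a minimal Python sketch that checks whether a connection string carries a database name. The host, port, and the database name "contentstore" are hypothetical values, and the helper database_name is an illustration rather than something the connector provides.

```python
from urllib.parse import urlsplit


def database_name(connection_string):
    """Return the database name embedded in a MongoDB connection string, or None."""
    parts = urlsplit(connection_string)
    if parts.scheme not in ("mongodb", "mongodb+srv"):
        raise ValueError("not a MongoDB connection string")
    # The database name is the path segment after the host list.
    name = parts.path.lstrip("/")
    return name or None


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    print(database_name("mongodb://localhost:27017/contentstore"))
    print(database_name("mongodb://localhost:27017"))
```

A result of None indicates that the database name is missing and the setting should be corrected before the connector is started.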
retrieveBinaries
When this setting is enabled, the connector also stores the binary content of the files in MongoDB (GridFS).
origin
Specifies the origin of the document in the Content Store.