You create content crawlers from the Collaboration Crawler content source to import documents from Oracle WebCenter Collaboration into the Knowledge Directory. You must use the Collaboration Crawler content source with Oracle WebCenter Collaboration content crawlers. The authentication settings for the Collaboration Crawler content source must match the authentication settings in the Oracle WebCenter Collaboration remote server object.
The API Service and Automation Service must be installed for Oracle WebCenter Collaboration documents to be imported into the Knowledge Directory. For more information about installing them, see the Installation Guide for Oracle WebCenter Interaction.
On the content crawler’s Main Settings page:
Click Browse next to the Project icon to choose the project that contains the folder that you want the content crawler to access. You can only select projects for which you are a Project Leader.
Click Browse next to the Folder icon to choose a folder. You must have Admin access to the folder. Additionally, you must make this folder content crawler-accessible by selecting Accessible to content crawlers in the folder’s properties. Content crawler accessibility settings are passed down to child folders.
By default, the maximum number of levels within the folder that the content crawler can access is unlimited. You can change this number using the Maximum number of levels to crawl drop-down list.
We recommend the following settings for content crawlers that import files into the Knowledge Directory:
On the content crawler’s Main Settings page, select the Mirror the source folder structure option.
On the content crawler’s Advanced Settings page:
Select the from this Content Source option.
Select the refresh them option.
Select the regenerate deleted links option.
Error information about crawler jobs can be found in:
The Oracle WebCenter Collaboration log, which can be found at:
The top of the Diagnostics page in the Collaboration Administration Utility (click the here link).
<install_dir>\<version_number>\settings\logs.
The job history for each crawler job.
For more information about creating content crawlers, see Administrator Guide for Oracle WebCenter Interaction.