GETTING STARTED
SearchAssist Overview
SearchAssist Introduction
Onboarding SearchAssist
Build your first App
Glossary
Release Notes
What's new in SearchAssist
Previous Versions

CONCEPTS
Managing Sources
Introduction
Files
Web Pages
FAQs
Structured Data 
Connectors
Introduction to Connectors
Azure Storage Connector
Confluence Cloud Connector
Confluence Server Connector
Custom Connector
DotCMS Connector
Dropbox Connector
Google Drive Connector
Oracle Knowledge Connector
Salesforce Connector
ServiceNow Connector
SharePoint Connector
Zendesk Connector
RACL
Virtual Assistants
Managing Indices
Introduction
Index Fields
Traits
Workbench
Introduction to Workbench
Field Mapping
Entity Extraction
Traits Extraction
Keyword Extraction
Exclude Document
Semantic Meaning
Snippet Extraction
Custom LLM Prompts
Index Settings
Index Languages
Managing Chunks
Chunk Browser
Managing Relevance
Introduction
Weights
Highlighting
Presentable
Synonyms
Stop Words
Search Relevance
Spell Correction
Prefix Search
Custom Configurations
Personalizing Results
Introduction
Answer Snippets
Introduction
Extractive Model
Generative Model
Enabling Both Models
Simulation and Testing
Debugging
Best Practices and Points to Remember
Troubleshooting Answers
Answer Snippets Support Across Content Sources
Result Ranking
Facets
Business Rules
Introduction
Contextual Rules
NLP Rules
Engagement
Small Talk
Bot Actions
Designing Search Experience
Introduction
Search Interface
Result Templates
Testing
Preview and Test
Debug Tool
Running Experiments
Introduction
Experiments
Analyzing Search Performance
Overview
Dashboard
User Engagement
Search Insights
Result Insights
Answer Insights

ADMINISTRATION
General Settings
Credentials
Channels
Team
Collaboration
Integrations
OpenAI Integration
Azure OpenAI Integration
Custom Integration
Billing and Usage
Plan Details
Usage Logs
Order and Invoices
Smart Hibernation

SearchAssist APIs
API Introduction
API List

SearchAssist SDK

HOW TOs
Use Custom Fields to Filter Search Results and Answers
Add Custom Metadata to Ingested Content
Write Painless Scripts
Configure Business Rules for Generative Answers

DotCMS Connector

DotCMS is a content management system specifically designed to manage documentation. dotCMS Connector enables you to easily integrate with the dotCMS application to ingest content and use it to answer user queries.

Type of Repository  Cloud/ On-prem
Supported API version v1
Search Results Support Yes
Extractive Model for Answer Generation Yes
Generative Model for Answer Generation Yes

Prerequisites

  • dotCMS user with the permissions to access the required resources. 

Supported Content Layout 

DotCMS allows various Content Types with multiple Content Type Fields. By default, SearchAssist supports searching the data available in the following fields:

  • Title 
  • Content 
  • HTML 

All the other information from dotCMS content is ingested as raw_data into the SearchAssist application. To make any other fields searchable, use the Workbench to process the content (copy/append) from those fields into the fields listed above. 

Configure dotCMS connector in SearchAssist

  • Go to Connectors under the Sources tab and select DotCMS.
  • The following page provides instructions for the integration. Click Proceed
  • On the Configuration and Authentication page, provide the following information to enable SearchAssist to establish a connection with dotCMS. 
Fields Description
Name  Name for the connector
Host URL  URL of the dotCMS application
Username dotCMS Credentials 
Password dotCMS Credentials 
  • Click Connect, This will initiate the authorization process for the connector.  
  • Once the authorization process is complete,  you can select the content to be ingested and initiate synchronization. Go to the Sync Settings tab and select the content. You can choose to sync all the content from the dotCMS account or select specific content.  Configure the content to be ingested and click Synchronize. Find more information on adding rules to filter specific content from dotCMS below. You can also set a scheduler here for automatic content synchronization at regular intervals.

Filtering content for Ingestion

DotCMS connector allows you to set up rules to selectively ingest content from the application. To define such rules, go to the Sync Settings tab and select Sync Specific Content. Click on the Configure link. The following page allows you to define rules for selecting the content. Each rule can be defined using a parameter, operator, and its values. 

The Parameter field refers to the system property used to filter the content. By default, SearchASsist offers the following parameters but you can add any of the properties that you use for your content. Refer to this for more information on available System Properties in dotCMS

  • Base Type: Base content type in dotCMS. There is a fixed set of base types in dotCMS. Refer to this for more info on the values that the field can take.  
  • Content Type: Variable name for the Content Type defined for your content.
  • Locked:  Refers to the locked content. This can take values as true or false. Suppose locked content is to be ingested, set this to true. Else, set it to false. 
  • Live: Refers to the published content. This can take values as true or false. If published content is to be ingested, set this to true. Else, set it to false. 
  • Language ID:  If content in a specific language is to be ingested, set this to the language ID defined by dotCMS.

The Operator field can take different values depending upon the parameter selected like equals to, not equals to, contains, etc. 

The value field is used for providing the value of the parameter.

For instance, you can ingest all the content of Base Type as ‘Page’ using the following rule.

Similarly, to selectively ingest only the live pages in a certain language, you can configure the rule as shown below.

Note:

  • You can define more than one condition to choose different types of content from the connector using the OR operator. 
  • Every rule can have one or more conditions to select a subset of content using the AND operator. 
  • You can use other System Properties defined by dotCMS to add rules. The properties listed by default are the commonly used ones. 
  • Ensure that the properties and their values should exactly match the expected values in dotCMS. Refer to this for details. 

DotCMS Connector

DotCMS is a content management system specifically designed to manage documentation. dotCMS Connector enables you to easily integrate with the dotCMS application to ingest content and use it to answer user queries.

Type of Repository  Cloud/ On-prem
Supported API version v1
Search Results Support Yes
Extractive Model for Answer Generation Yes
Generative Model for Answer Generation Yes

Prerequisites

  • dotCMS user with the permissions to access the required resources. 

Supported Content Layout 

DotCMS allows various Content Types with multiple Content Type Fields. By default, SearchAssist supports searching the data available in the following fields:

  • Title 
  • Content 
  • HTML 

All the other information from dotCMS content is ingested as raw_data into the SearchAssist application. To make any other fields searchable, use the Workbench to process the content (copy/append) from those fields into the fields listed above. 

Configure dotCMS connector in SearchAssist

  • Go to Connectors under the Sources tab and select DotCMS.
  • The following page provides instructions for the integration. Click Proceed
  • On the Configuration and Authentication page, provide the following information to enable SearchAssist to establish a connection with dotCMS. 
Fields Description
Name  Name for the connector
Host URL  URL of the dotCMS application
Username dotCMS Credentials 
Password dotCMS Credentials 
  • Click Connect, This will initiate the authorization process for the connector.  
  • Once the authorization process is complete,  you can select the content to be ingested and initiate synchronization. Go to the Sync Settings tab and select the content. You can choose to sync all the content from the dotCMS account or select specific content.  Configure the content to be ingested and click Synchronize. Find more information on adding rules to filter specific content from dotCMS below. You can also set a scheduler here for automatic content synchronization at regular intervals.

Filtering content for Ingestion

DotCMS connector allows you to set up rules to selectively ingest content from the application. To define such rules, go to the Sync Settings tab and select Sync Specific Content. Click on the Configure link. The following page allows you to define rules for selecting the content. Each rule can be defined using a parameter, operator, and its values. 

The Parameter field refers to the system property used to filter the content. By default, SearchASsist offers the following parameters but you can add any of the properties that you use for your content. Refer to this for more information on available System Properties in dotCMS

  • Base Type: Base content type in dotCMS. There is a fixed set of base types in dotCMS. Refer to this for more info on the values that the field can take.  
  • Content Type: Variable name for the Content Type defined for your content.
  • Locked:  Refers to the locked content. This can take values as true or false. Suppose locked content is to be ingested, set this to true. Else, set it to false. 
  • Live: Refers to the published content. This can take values as true or false. If published content is to be ingested, set this to true. Else, set it to false. 
  • Language ID:  If content in a specific language is to be ingested, set this to the language ID defined by dotCMS.

The Operator field can take different values depending upon the parameter selected like equals to, not equals to, contains, etc. 

The value field is used for providing the value of the parameter.

For instance, you can ingest all the content of Base Type as ‘Page’ using the following rule.

Similarly, to selectively ingest only the live pages in a certain language, you can configure the rule as shown below.

Note:

  • You can define more than one condition to choose different types of content from the connector using the OR operator. 
  • Every rule can have one or more conditions to select a subset of content using the AND operator. 
  • You can use other System Properties defined by dotCMS to add rules. The properties listed by default are the commonly used ones. 
  • Ensure that the properties and their values should exactly match the expected values in dotCMS. Refer to this for details.