Deploy a database connector

You can set up Google Cloud Search to discover and index data from your organization's databases by using the Cloud Search database connector.

Important considerations

You can install and run the Cloud Search database connector in almost any environment where Java apps can run, as long as the connector has access to both the internet and the database.

System requirements

System requirements
Operating system	Windows or Linux
SQL database	Any SQL database with a JDBC 4.0 or later compliant driver, including: MS SQL Server (2008, 2012, 2014, 2016) Oracle (11g, 12c) Google Cloud SQL MySQL
Software	JDBC driver (downloaded and installed separately)

Deploy the connector

These steps describe how to install the connector and configure it to index your databases and return results to Cloud Search users.

Prerequisites

Before you deploy the connector, gather this information:

Google Workspace private key (containing the service account ID). See Configure access to the Cloud Search API.
Google Workspace data source ID. See Add a data source to search.

Step 1. Download and build the database connector software

Clone the connector repository from GitHub.

$ git clone https://github.com/google-cloudsearch/database-connector.git
$ cd database-connector

Check out your selected version:
```
$ git checkout tags/v1-0.0.3
```
Build the connector:
```
$ mvn package
```
To skip tests, use mvn package -DskipTests.

Extract the connector zip file to your installation directory:

$ cp target/google-cloudsearch-database-connector-v1-0.0.3.zip installation-dir
$ cd installation-dir
$ unzip google-cloudsearch-database-connector-v1-0.0.3.zip
$ cd google-cloudsearch-database-connector-v1-0.0.3

Step 2. Configure the database connector

Create a text file named connector-config.properties (the default). Google recommends the .properties or .config extension. Keep it in the same directory as the connector.

Add parameters as key-value pairs. The file must specify data source access, database access, a full traversal SQL statement, a content field title, and column definitions.

# Data source access
api.sourceId=1234567890abcdef
api.identitySourceId=0987654321lmnopq
api.serviceAccountPrivateKeyFile=./PrivateKey.json

# Database access
db.url=jdbc:mysql://localhost:3306/mysql_test
db.user=root
db.password=passw0rd

# Full traversal SQL statement
db.allRecordsSql=select customer_id, first_name, last_name, phone from address_book

# Column definitions and URL format
db.allColumns=customer_id, first_name, last_name, phone
db.uniqueKeyColumns=customer_id
url.columns=customer_id

# Content field
contentTemplate.db.title=customer_id

# Optional: ACLs
defaultAcl.mode=fallback
defaultAcl.public=true

# Optional: traversal schedule
schedule.traversalIntervalSecs=36000
schedule.performTraversalOnStart=true

For database-specific parameters, see the Configuration parameters reference. For common parameters, see Google-supplied connector parameters.

Step 3. Run the database connector

Run the connector from the command line:

java
   -cp "google-cloudsearch-database-connector-v1-0.0.3.jar:mysql-connector-java-5.1.41-bin.jar"
   com.google.enterprise.cloudsearch.database.DatabaseFullTraversalConnector
   [-Dconfig=mysql.config]

The connector reports configuration and initialization errors. Other errors, such as invalid SQL syntax, appear when the connector first attempts to access the database.

Configuration parameters reference

This section lists parameters used in the database connector configuration file.

Data source access parameters

Setting	Parameter
Data source ID	`api.sourceId = source-ID` Required. The Cloud Search source ID.
Service account	`api.serviceAccountPrivateKeyFile = path` Required. The path to the service account key file.

Database access parameters

Setting	Parameter
Database URL	`db.url = database-URL` Required. The full path, e.g., `jdbc:mysql://127.0.0.1/dbname`.
Credentials	`db.user = username` `db.password = password` Required. Read access is necessary for the relevant records.

Traversal SQL query parameters

The connector uses SQL SELECT queries to traverse records.

Full traversal: Reads every configured record. Required for initial indexing and periodic re-indexing.
Incremental traversal: Reads only newly modified records. Requires timestamp fields in the database.

Setting	Parameter
Full traversal query	`db.allRecordsSql = SELECT columns FROM table` Required. Include all columns used for content, IDs, and ACLs.
Incremental traversal query	`db.incrementalUpdateSql = SELECT columns FROM table WHERE update_time > ?` Required for incremental schedules. The "?" is a mandatory timestamp placeholder.

Column definition parameters

Setting	Parameter
All columns	`db.allColumns = column-1, column-2, ...` Required. Lists all columns referenced in SQL queries.
Unique key columns	`db.uniqueKeyColumns = column-1` Required. Defines the unique ID for each record.
URL link column	`url.columns = column-1` Required. Specifies the column used for clickable search results.

Content fields

Setting	Parameter
Title column	`contentTemplate.db.title = column-name` Required. Highest priority for search indexing.
Prioritization	`contentTemplate.db.quality.high = column-1` Designate columns as high, medium, or low quality.