Run Bulk Operations from Your Cluster

The CockroachCloud tiers offer different levels of support for bulk operations such as BACKUP, RESTORE, IMPORT, EXPORT, and changefeeds. This page describes the availability of these operations in each CockroachCloud cluster tier and provides examples, including the storage options available with each tier.

Examples

For guidance on connecting to your CockroachCloud Free (beta) cluster, visit Connect to a CockroachCloud Free (beta) Cluster.

In CockroachCloud Free (beta) clusters, userfile, a per-user bulk file storage, is the only available storage option for BACKUP, RESTORE, and IMPORT operations.

Note:

userfile is only available as storage for BACKUP, RESTORE, and IMPORT operations on CockroachCloud Free (beta) after upgrading to v21.1.

For information on userfile commands (cockroach userfile upload, cockroach userfile list, cockroach userfile get, and cockroach userfile delete), visit their reference pages.
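For example, to make a local CSV file available to IMPORT on your cluster, upload it to your userfile space (the local file path here is an illustration):

cockroach userfile upload /path/to/local/test-data.csv /test-data.csv --url {CONNECTION STRING}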

Backup and restore with userfile

We recommend starting backups from a time at least 10 seconds in the past using AS OF SYSTEM TIME. Read our guidance in the Performance section on the BACKUP page.

Note:

Only database and table-level backups are possible when using userfile as storage. Restoring cluster-level backups will not work because userfile data is stored in the defaultdb database, and you cannot restore a cluster with existing table data.

Database and table

When working on the same cluster, userfile storage supports both database- and table-level backups.

First, run the following statement to back up a database to a directory in the default userfile space:

BACKUP DATABASE bank TO 'userfile://defaultdb.public.userfiles_$user/bank-backup' AS OF SYSTEM TIME '-10s';

This directory will hold the files that make up the backup, including the manifest file and data files.
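Table-level backups to userfile follow the same pattern. For example, to back up a single table (assuming the bank database contains a customers table):

BACKUP TABLE bank.customers TO 'userfile://defaultdb.public.userfiles_$user/customers-backup' AS OF SYSTEM TIME '-10s';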

Note:

When you back up a database or table to your userfile space and want to restore it to a different cluster, run cockroach userfile get to download the backup files to a local machine, then cockroach userfile upload --url {CONNECTION STRING} to upload them to the userfile space of the other cluster.
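A minimal sketch of this transfer, using the bank-backup directory from the example above (the glob pattern and the per-file loop are illustrations; cockroach userfile upload takes one file at a time):

cockroach userfile get 'bank-backup/*' --url {SOURCE CONNECTION STRING}
# then, for each downloaded file:
cockroach userfile upload bank-backup/{BACKUP FILE} bank-backup/{BACKUP FILE} --url {DESTINATION CONNECTION STRING}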

To restore the database, run the following:

RESTORE DATABASE bank FROM 'userfile://defaultdb.public.userfiles_$user/bank-backup';

You can also use userfile:///bank-backup, since userfile:/// refers to the default path userfile://defaultdb.public.userfiles_$user/.

Once the backup data is no longer needed, delete it from userfile storage:

cockroach userfile delete bank-backup --url {CONNECTION STRING}

If you use cockroach userfile delete {file}, the data will not be removed from disk until garbage collection runs.
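To verify which files remain in your userfile storage, list them:

cockroach userfile list --url {CONNECTION STRING}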

To resolve database or table naming conflicts during a restore, see Troubleshooting naming conflicts.

Import data into your CockroachCloud Free (beta) cluster

To import a table from userfile, use the following command:

IMPORT TABLE customers (
        id UUID PRIMARY KEY,
        name TEXT,
        INDEX name_idx (name)
)
CSV DATA ('userfile:///test-data.csv');

userfile:/// references the default path (userfile://defaultdb.public.userfiles_$user/). When the import completes, output similar to the following is returned:

        job_id       |  status   | fraction_completed |  rows  | index_entries |  bytes
---------------------+-----------+--------------------+--------+---------------+-----------
  599865027685613569 | succeeded |                  1 | 300024 |             0 | 13389972
(1 row)

For more import options, see IMPORT.

Stream data out of your CockroachCloud Free (beta) cluster

Core changefeeds stream row-level changes to a client until the underlying SQL connection is closed.

Note:

Only core changefeeds are available on CockroachCloud Free (beta). To create a changefeed into a configurable sink, like cloud storage or Kafka, use CockroachCloud, which has this feature enabled by default.

The following example sets up a core changefeed on your CockroachCloud Free (beta) cluster.

  1. As the root user, open the built-in SQL client:

    cockroach sql --url {CONNECTION STRING} --format=csv
    
    Note:

Because core changefeeds return results differently than other SQL statements, they require a dedicated database connection with specific settings around result buffering. In normal operation, CockroachDB improves performance by buffering results server-side before returning them to a client; however, result buffering is automatically turned off for core changefeeds. Core changefeeds also have different cancellation behavior than other queries: they can only be canceled by closing the underlying connection or by issuing a CANCEL QUERY statement on a separate connection (see the sketch after these steps). Combined, these attributes mean that applications should explicitly create dedicated connections to consume changefeed data, instead of using a connection pool as most client drivers do by default.

    Note:

    To determine how wide the columns need to be, the default table display format in cockroach sql buffers the results it receives from the server before printing them to the console. When consuming core changefeed data using cockroach sql, it's important to use a display format like csv that does not buffer its results. To set the display format, use the --format=csv flag when starting the built-in SQL client, or set the \set display_format=csv option once the SQL client is open.

  2. Enable the kv.rangefeed.enabled cluster setting:

    > SET CLUSTER SETTING kv.rangefeed.enabled = true;
    
  3. Create table foo:

    > CREATE TABLE foo (a INT PRIMARY KEY);
    
  4. Insert a row into the table:

    > INSERT INTO foo VALUES (0);
    
  5. Start the core changefeed:

    > EXPERIMENTAL CHANGEFEED FOR foo;
    
    table,key,value
    foo,[0],"{""after"": {""a"": 0}}"
    
  6. In a new terminal, add another row:

    cockroach sql --url {CONNECTION STRING} -e "INSERT INTO foo VALUES (1)"
    
Back in the terminal where the core changefeed is streaming, the following output will appear:

    foo,[1],"{""after"": {""a"": 1}}"
    

    Note that records may take a couple of seconds to display in the core changefeed.

  8. To stop streaming the changefeed, enter CTRL+C into the terminal where the changefeed is running.
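Alternatively, as noted in step 1, you can cancel a core changefeed from a separate connection with CANCEL QUERY. A minimal sketch (the LIKE filter on the query text is an illustration; adjust it to match your changefeed):

> SELECT query_id, query FROM [SHOW CLUSTER QUERIES] WHERE query LIKE '%CHANGEFEED%';
> CANCEL QUERY '{query_id}';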

For further information on changefeeds, read Stream Data Out of CockroachDB and CHANGEFEED FOR.

The remaining examples apply to CockroachCloud clusters. For guidance on connecting to your CockroachCloud cluster, visit Connect to Your CockroachCloud Cluster.

The examples below use Amazon S3 for demonstration purposes. For guidance on connecting to other storage options or using other authentication parameters, read Use Cloud Storage for Bulk Operations.

Backup and restore your CockroachCloud data

Cockroach Labs runs full backups daily and incremental backups hourly for every CockroachCloud cluster. The full backups are retained for 30 days, while incremental backups are retained for 7 days. For more information, read Restore Data From a Backup.

The following examples show how to run manual backups and restores:

Backup a cluster

To take a full backup of a cluster:

> BACKUP INTO \
's3://{BUCKET NAME}/{PATH}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}' \
AS OF SYSTEM TIME '-10s';

Backup a database

To take a full backup of a single database:

> BACKUP DATABASE bank \
INTO 's3://{BUCKET NAME}/{PATH}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}' \
AS OF SYSTEM TIME '-10s';

To take a full backup of multiple databases:

> BACKUP DATABASE bank, employees \
INTO 's3://{BUCKET NAME}/{PATH}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}' \
AS OF SYSTEM TIME '-10s';

Backup a table or view

To take a full backup of a single table or view:

> BACKUP bank.customers \
INTO 's3://{BUCKET NAME}/{PATH}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}' \
AS OF SYSTEM TIME '-10s';

To resolve database or table naming conflicts during a restore, see Troubleshooting naming conflicts.

View the backup subdirectories

New in v21.1: BACKUP ... INTO adds a backup to a collection within the backup destination. The path to the backup is created using a date-based naming scheme. To view the backup paths in a given destination, use SHOW BACKUPS:

> SHOW BACKUPS IN 's3://{BUCKET NAME}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}';

        path
------------------------
2021/03/23-213101.37
2021/03/24-172553.85
2021/03/24-210532.53
(3 rows)

When you restore a backup, add the backup's subdirectory path (e.g., 2021/03/23-213101.37) to the storage URL.
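For example, with the first subdirectory from the sample output above, the storage URL for a restore would look like the following (placeholders as in the examples below):

's3://{BUCKET NAME}/2021/03/23-213101.37?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}'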

Restore a cluster

To restore a full cluster:

> RESTORE FROM 's3://{BUCKET NAME}/{path/to/backup/subdirectory}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}';

To view the available subdirectories, use SHOW BACKUPS.

Restore a database

To restore a database:

> RESTORE DATABASE bank FROM 's3://{BUCKET NAME}/{path/to/backup/subdirectory}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}';

To view the available subdirectories, use SHOW BACKUPS.

Note:

RESTORE DATABASE can only be used if the entire database was backed up.

Restore a table

To restore a single table:

> RESTORE TABLE bank.customers FROM 's3://{BUCKET NAME}/{path/to/backup/subdirectory}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}';

To restore multiple tables:

> RESTORE TABLE bank.customers, bank.accounts FROM 's3://{BUCKET NAME}/{path/to/backup/subdirectory}?AWS_ACCESS_KEY_ID={KEY ID}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}';

To view the available subdirectories, use SHOW BACKUPS.

For more information on taking backups and restoring to your cluster, read the BACKUP and RESTORE pages.

Import data into your CockroachCloud cluster

To import a table into your cluster:

> IMPORT TABLE customers (
        id UUID PRIMARY KEY,
        name TEXT,
        INDEX name_idx (name)
)
CSV DATA ('s3://{BUCKET NAME}/{customer-data}?AWS_ACCESS_KEY_ID={ACCESS KEY}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}');

Read the IMPORT page for more examples and guidance.

Export data out of CockroachCloud

The following example exports the customers table from the bank database into a cloud storage bucket in CSV format:

EXPORT INTO CSV
  's3://{BUCKET NAME}/{customer-export-data}?AWS_ACCESS_KEY_ID={ACCESS KEY}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}'
  WITH delimiter = '|' FROM TABLE bank.customers;
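To load exported files back into a table later, you can run IMPORT INTO with the same delimiter. A sketch, assuming the target table exists and {EXPORT FILE} stands for one of the files EXPORT created:

> IMPORT INTO bank.customers
  CSV DATA ('s3://{BUCKET NAME}/{customer-export-data}/{EXPORT FILE}.csv?AWS_ACCESS_KEY_ID={ACCESS KEY}&AWS_SECRET_ACCESS_KEY={SECRET ACCESS KEY}')
  WITH delimiter = '|';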

Read the EXPORT page for more examples and guidance.

Stream data out of CockroachCloud

Change data capture (CDC) provides efficient, distributed, row-level changefeeds into a configurable sink for downstream processing such as reporting, caching, or full-text indexing.

A changefeed targets an allowlist of tables, called "watched rows". Each change to a watched row is emitted as a record to a configurable sink, such as Kafka or cloud storage. In this version of CockroachCloud, you can create, pause, resume, and cancel changefeeds.
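Once a changefeed is running, you manage it through its job ID. For example (using the job ID from the sample output below):

> SHOW JOBS;
> PAUSE JOB 360645287206223873;
> RESUME JOB 360645287206223873;
> CANCEL JOB 360645287206223873;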

Create a changefeed connected to Kafka

> CREATE CHANGEFEED FOR TABLE name, name2, name3
  INTO 'kafka://host:port'
  WITH updated, resolved;
+--------------------+
|       job_id       |
+--------------------+
| 360645287206223873 |
+--------------------+
(1 row)

Note:

Currently, changefeeds connected to Kafka versions < v1.0 are not supported in CockroachDB v21.1.

For more information on how to create a changefeed connected to Kafka, see Stream Data Out of CockroachDB Using Changefeeds and CREATE CHANGEFEED.

Create a changefeed connected to a cloud storage sink

Warning:

This is an experimental feature. The interface and output are subject to change.

> CREATE CHANGEFEED FOR TABLE name, name2, name3
  INTO 'experimental-s3://host?parameters'
  WITH updated, resolved;
+--------------------+
|       job_id       |
+--------------------+
| 360645287206223873 |
+--------------------+
(1 row)

For more information on how to create a changefeed connected to a cloud storage sink, see Stream Data Out of CockroachDB Using Changefeeds and CREATE CHANGEFEED.
