Streaming Inserts

PostgreSQL Only

Streaming inserts use PostgreSQL's COPY FROM STDIN protocol. This feature is not available for other databases.

Streaming inserts use PostgreSQL's COPY protocol to load data significantly faster than individual INSERT statements. Data is text-encoded in batches and streamed directly to the server, bypassing the overhead of prepared statements.

streamingInsert.of() returns an Operation<Long> that can be transacted like any other operation. The COPY participates in the current transaction, so it can be composed with other operations atomically.

Single Column

For simple cases, obtain the PgText encoder directly from the type:

// Insert a list of strings using COPY
fun insertNames(names: Iterator<String>, tx: Transactor): Long {
    return streamingInsert
        .of("COPY users(name) FROM STDIN", 1000, names, PgTypes.text.pgText())
        .transact(tx)
}
Parameter      Description
copyCommand    A PostgreSQL COPY ... FROM STDIN command
batchSize      Number of rows to buffer before flushing to the server
rows           An Iterator over your data
text           A PgText<T> encoder for your row type

Multi-Column Rows

For rows with multiple columns, derive a PgText encoder from a RowCodec:

// Define a RowCodec for your row type
val productCodec: RowCodec<ProductRow> = RowCodec.builder<ProductRow>()
    .field(PgTypes.text, ProductRow::name)
    .field(PgTypes.numeric, ProductRow::price)
    .field(PgTypes.int4, ProductRow::quantity)
    .build(::ProductRow)

// PgText.from() derives a text encoder from the RowCodec
val productText: PgText<ProductRow> = PgText.from(productCodec.underlying)

fun insertProducts(products: Iterator<ProductRow>, tx: Transactor): Long {
    return streamingInsert
        .of("COPY products(name, price, quantity) FROM STDIN", 1000, products, productText)
        .transact(tx)
}

PgText.from(rowCodec) uses each column's text encoder to produce tab-delimited COPY format. The same RowCodec you use for reading rows can drive bulk loading.
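For intuition, the framing that COPY's text mode expects can be sketched in plain Kotlin. The helpers below are illustrative only and are not part of the library; they show what PostgreSQL's COPY text format defines: tab-delimited fields, one row per line, `\N` for NULL, and backslash-escaping of tab, newline, carriage return, and backslash itself.

```kotlin
// Illustrative sketch of COPY text-format encoding (hypothetical helpers,
// not the library's PgText implementation).
fun encodeCopyField(value: String?): String {
    if (value == null) return "\\N" // NULL is written as \N
    val sb = StringBuilder(value.length)
    for (c in value) {
        when (c) {
            '\\' -> sb.append("\\\\")
            '\t' -> sb.append("\\t")
            '\n' -> sb.append("\\n")
            '\r' -> sb.append("\\r")
            else -> sb.append(c)
        }
    }
    return sb.toString()
}

// One row: fields joined by tabs, terminated by a newline.
fun encodeCopyRow(fields: List<String?>): String =
    fields.joinToString("\t", postfix = "\n") { encodeCopyField(it) }
```

For example, `encodeCopyRow(listOf("widget", "9.99", null))` yields the line `widget<TAB>9.99<TAB>\N` followed by a newline, which is what the server parses on the other end of the COPY stream.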

Supported Types

Most PostgreSQL types support text encoding for COPY. Types that don't (such as jsonb) will throw UnsupportedOperationException at encode time.

Batch Size

The batchSize parameter controls how many rows are buffered in memory before being flushed to PostgreSQL. A larger batch size reduces network round-trips but uses more memory. A value between 1,000 and 10,000 is a reasonable starting point.
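One way to pick a batchSize is to work backward from a memory budget. The helper below is a rough, hypothetical sketch, not a library API; the average encoded row size is whatever your rows measure in practice.

```kotlin
// Rough sizing sketch (illustrative, not part of the library):
// derive a batchSize from a per-batch memory budget and an average
// encoded row size, never going below a floor of one row.
fun batchSizeForBudget(budgetBytes: Long, avgEncodedRowBytes: Int, minRows: Int = 1): Int =
    maxOf(minRows, (budgetBytes / avgEncodedRowBytes).toInt())
```

With a 16 MiB per-batch budget and rows averaging 200 encoded bytes, this gives a batch of 83,886 rows; for such small rows you would likely cap it lower (e.g. 10,000) since the round-trip savings flatten out well before that.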