duckdb: Transient issue on 0.8.1: `duckdb.TransactionException: TransactionContext Error: Failed to commit: Input is invalid/unsupported GZIP stream`

What happens?

I run a daily pipeline that ingests multiple json.gz files into a DuckDB database, using code like:

insert into my_table by name
select *
from read_json_auto('/folder/my_file.json.gz', sample_size=10000)

This worked without problems on DuckDB 0.7, but since I recently upgraded to 0.8.1 I get transient errors like:

duckdb.TransactionException: TransactionContext Error: Failed to commit: Input is invalid/unsupported GZIP stream

Re-running the pipeline a second time usually fixes the problem, though.
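Since a second run usually succeeds, a stopgap is to retry the failing statement automatically. The following is only a minimal sketch, not a fix for the underlying bug: `run_with_retry` and its parameters are hypothetical names introduced here, and it uses the standard library only so that it can wrap any callable.

```python
import time


def run_with_retry(action, attempts=3, delay_seconds=1.0, retry_on=(Exception,)):
    """Call action() up to `attempts` times, sleeping between failed tries.

    Hypothetical helper: returns the first successful result, or re-raises
    the last error once every attempt has failed.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return action()
        except retry_on as exc:
            last_error = exc
            # Only wait if another attempt is still to come.
            if attempt < attempts:
                time.sleep(delay_seconds)
    raise last_error


# Possible usage with the Python client (assumes an open connection `con`
# and the insert statement stored in `insert_sql`):
#
#   import duckdb
#   run_with_retry(lambda: con.execute(insert_sql),
#                  retry_on=(duckdb.TransactionException,))
```

Restricting `retry_on` to the exception type seen in the traceback keeps unrelated failures, such as a missing file, from being silently retried.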

To Reproduce

I can’t reproduce the issue as it comes and goes.
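To rule out a truncated or partially written input file as the cause, each json.gz file could be fully decompressed before it is handed to DuckDB. This is a sketch under that assumption only; `is_valid_gzip` is a hypothetical helper built on Python's standard `gzip` module, and a file that passes this check while the insert still fails would point toward DuckDB rather than the data.

```python
import gzip
import zlib


def is_valid_gzip(path, chunk_size=1024 * 1024):
    """Return True if the whole file at `path` decompresses cleanly.

    Hypothetical helper: reads the file in chunks so large inputs are
    not loaded into memory at once.
    """
    try:
        with gzip.open(path, "rb") as handle:
            while handle.read(chunk_size):
                pass
        return True
    except (OSError, EOFError, zlib.error):
        # OSError covers "not a gzip file" (gzip.BadGzipFile) and I/O errors,
        # EOFError covers truncated streams, zlib.error covers corrupt data.
        return False
```

Logging the result for every file just before the insert would show whether failures line up with files that were still being written.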

OS:

x86_64

DuckDB Version:

0.8.1

DuckDB Client:

Python

Full Name:

Benoit Perigaud

Affiliation:

NA

Have you tried this on the latest master branch?

  • I agree

Have you tried the steps to reproduce? Do they include all relevant data and configuration? Does the issue you report still appear there?

  • I agree

About this issue

  • State: closed
  • Created a year ago
  • Comments: 15 (6 by maintainers)

Most upvoted comments

This should now be solved in #8943. It would be great if you could verify that the problem is fixed on the latest main.