
Slow blob sub_type 1 retrieval [CORE4601] #4916

Open
firebird-automations opened this issue Nov 10, 2014 · 5 comments
Comments

@firebird-automations

Submitted by: brickman (brickman)

Replaces JDBC369

Attachments:
TestBlob.zip

Votes: 4

We are currently trying to optimise a Java application that performs a high number of Blob Sub Type 1 reads.
The application is required to process hundreds of thousands of rows with Blob Sub Type 1 columns in a timely manner.
We recently discovered reading the blobs to be a significant bottleneck.

We initially raised an issue against the Jaybird JDBC driver under JDBC369.
However, the outcome of the Jaybird analysis indicated that there was not much room for optimisation without improving the core Firebird blob protocol.

To help us diagnose this issue, we developed a simple test application that populates a table with 100,000 rows containing a varchar and a blob sub_type 1 column, and then captures the timings to read the entire contents of the table while iterating through the java.sql.ResultSet. Both the blob and the varchar contained the same 9-character text.
The first iteration of the test captures the time when reading only the varchar column; the second iteration captures the time when reading the blob column.
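The measurement loop is essentially the following (a minimal sketch only, using hypothetical connection details and table/column names; the actual code is in the attached TestBlob.zip):

import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class BlobReadBenchmark {

    public static void main(String[] args) throws Exception {
        // Hypothetical connection details; the attached test uses its own configuration.
        Connection connection = DriverManager.getConnection(
                "jdbc:firebirdsql://localhost:3050/test.fdb", "sysdba", "masterkey");
        try {
            time(connection, "VARCHARCOL"); // pass 1: varchar column only
            time(connection, "BLOBCOL");    // pass 2: blob sub_type 1 column only
        } finally {
            connection.close();
        }
    }

    private static void time(Connection connection, String column) throws Exception {
        long columnReadMillis = 0; // time spent inside getString() alone
        long start = System.currentTimeMillis();
        Statement stmt = connection.createStatement();
        try {
            ResultSet rs = stmt.executeQuery("select " + column + " from TESTTABLE");
            while (rs.next()) {
                long before = System.currentTimeMillis();
                rs.getString(1); // materialize the column value
                columnReadMillis += System.currentTimeMillis() - before;
            }
        } finally {
            stmt.close();
        }
        long totalMillis = System.currentTimeMillis() - start;
        System.out.println(column + ": total " + totalMillis
                + " ms, of which column reads " + columnReadMillis + " ms");
    }
}

Each pass selects a single column, so only the time spent inside getString() is attributed to reading that column.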

We found that when only reading the varchar, it took 450 milliseconds to read the 100,000 rows, with 100 milliseconds spent in reading the varchar column.
When reading the blob column, it took 29700 milliseconds to read the 100,000 rows, with 29400 milliseconds spent in reading the blob column.

We appreciate that there is extra overhead in reading blob columns, but we felt that 30 seconds to retrieve 100,000 records may be excessive.

We ran our tests with the following setups. They all had similar results:
- Jaybird JDBC Driver 2.2.5 (JDK 1.6) against Firebird 2.5.2 Super Server hosted locally (i.e. no network)
- Jaybird JDBC Driver 2.1.6 (JDK 1.6) against Firebird 2.5.2 Super Server hosted locally
- Jaybird JDBC Driver 2.2.5 (JDK 1.6) against Firebird 2.5.3 Super Server hosted locally
- Jaybird JDBC Driver 2.2.5 (JDK 1.6) against Firebird 2.5.3 Super Classic hosted on a dedicated server (remote call)

To get a comparison, we ran the same test against MySQL and PostgreSQL. The results were a lot faster:
MySQL took 290 milliseconds to read 100,000 rows, with 100 milliseconds spent reading the blob (LONGTEXT in MySQL) column.
PostgreSQL took 1245 milliseconds to read 100,000 rows, with 473 milliseconds spent reading the blob (TEXT in PostgreSQL) column.

During the assessment of the Jaybird JDBC driver (JDBC369), it was advised that MySQL achieves this performance by returning the blob data inline with the ResultSet.
We would like to understand whether similar optimisations could be added to Firebird to enable faster blob processing.

Please advise if there is anything that can be done to optimise the blob retrieval, or if you require any additional information.

@firebird-automations

Commented by: brickman (brickman)

I have attached a simple Java test application that reproduces the issue.

@firebird-automations

Modified by: brickman (brickman)

Attachment: TestBlob.zip [ 12624 ]

@firebird-automations

Modified by: @mrotteveel

Link: This issue replaces JDBC369 [ JDBC369 ]

@firebird-automations

Commented by: Michał Ziemski (r_o_o_k)

If the blob data you store is generally shorter than 32 KB, you can do:

select cast(blob_column as varchar(32000))
from table

Right now, reading each blob is a separate round-trip to the server, so you are paying a latency fee for each row.
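For illustration, applying this workaround from JDBC could look roughly like the following (a sketch only, with hypothetical connection details and table/column names; it only helps while every blob fits within the 32,000-character cast):

import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

public class CastWorkaround {

    public static void main(String[] args) throws Exception {
        // Hypothetical connection details and table/column names.
        Connection connection = DriverManager.getConnection(
                "jdbc:firebirdsql://localhost:3050/test.fdb", "sysdba", "masterkey");
        try {
            Statement stmt = connection.createStatement();
            try {
                // The CAST is evaluated on the server, so the text arrives inline
                // with the row instead of requiring a separate blob open/read
                // round-trip per row. The query fails with a truncation error
                // if any blob is longer than 32,000 characters.
                ResultSet rs = stmt.executeQuery(
                        "select cast(BLOBCOL as varchar(32000)) from TESTTABLE");
                while (rs.next()) {
                    String content = rs.getString(1);
                    // ... process content ...
                }
            } finally {
                stmt.close();
            }
        } finally {
            connection.close();
        }
    }
}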

@firebird-automations

Commented by: @mrotteveel

The CAST workaround was also suggested in JDBC369; however, it is a specific solution that will not always work.
