Index of many unicode characters results in "internal gds software consistency check" [CORE1502] #1916

firebird-automations · 2007-09-11T12:42:59Z

Submitted by: Richard Wesley (hawkfish)

This looks a lot like CORE1049 et alia, but it is showing up in 2.0.3:

CREATE TABLE "Sheet1$" (
"Appearance" VARCHAR(6) CHARACTER SET UTF8 COLLATE UNICODE,
"Decimal" FLOAT(53),
"Name" VARCHAR(83) CHARACTER SET UTF8 COLLATE UNICODE,
"Position" VARCHAR(6) CHARACTER SET UTF8 COLLATE UNICODE
);

INSERT INTO "Sheet1$"
("Appearance", "Decimal", "Name", "Position")
VALUES(?, ?, ?, ?);

// At this point, I inserted a table of about 10600 rows with distinct
// single-character ucs2 code points in the "Appearance" column using the API.
// Many thanks to my sadistic test cr?e...
// We use UTF8 for the communication character set and convert the ucs2 to
// utf8 using the standard Windows character conversion routines.

// If I now say:

CREATE INDEX "_tidx_128_1a" ON "Sheet1$" ("Appearance");

// we log the following error in our application:

--- FB Error ---------------------------------------------------------------
File: db\firebirdprotocol.cpp, Line: 1921
Status: 335544333
internal gds software consistency check (index key too big (174), file: idx.cpp line: 448)
----------------------------------------------------------------------------

firebird-automations · 2007-09-11T12:46:22Z

Commented by: Richard Wesley (hawkfish)

This is a zip of a Firebird database. I think it has a .tde extension, but you can just change that to .fdb.

firebird-automations · 2007-09-11T12:46:22Z

Modified by: Richard Wesley (hawkfish)

Attachment: CREATETABLETEST.FDB.zip [ 10610 ]

firebird-automations · 2007-10-13T18:56:00Z

Commented by: Richard Wesley (hawkfish)

The issue does not appear in 2.1b1.

firebird-automations · 2008-01-28T15:17:19Z

Modified by: @pcisar

Workflow: jira [ 13260 ] => Firebird [ 13990 ]

firebird-automations · 2008-11-12T23:03:45Z

Commented by: @asfernandes

There is a record with the character U+FDFA. You can see it here: http://www.fileformat.info/info/unicode/char/fdfa/index.htm.

Note its decomposition into many other characters. When we request its sort key, ICU returns 55 bytes. How can we deal with it? With our current fixed-size buffers for keys, there is no way...
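To make the expansion concrete, here is a minimal Python sketch that uses only the standard `unicodedata` module. It shows how many code points U+FDFA turns into under compatibility decomposition (NFKD). It is an illustration of the decomposition only, not of how ICU or Firebird compute the sort key; the 55-byte figure above comes from ICU and is not reproduced here.

```python
import unicodedata

# U+FDFA: ARABIC LIGATURE SALLALLAHOU ALAYHE WASALLAM
ch = "\uFDFA"

# Compatibility decomposition (NFKD) expands the ligature into the
# individual letters and spaces it stands for.
decomposed = unicodedata.normalize("NFKD", ch)

# Size of the stored value versus the size of its expansion.
utf8_len = len(ch.encode("utf-8"))

print(unicodedata.name(ch))
print("UTF-8 bytes stored:", utf8_len)
print("code points after NFKD:", len(decomposed))
print(" ".join(f"U+{ord(c):04X}" for c in decomposed))
```

A single stored character of 3 UTF-8 bytes expands to 18 code points, each of which can contribute its own weights to a collation key, which is why a key buffer sized from the declared column length can be too small.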

The attached test case uses synthetic data, so this problem does not seem to affect our users; I believe it is low priority.

firebird-automations added affect-version: 2.0.3 priority: major component: charsets/collation component: engine type: bug labels Apr 25, 2021
