Skip to content

Error running crypto functions on Dictionary arrays such as md5 #13444

@alamb

Description

@alamb

Describe the bug

A regression appears to have been introduced in

T

To Reproduce

This used to work in DataFusion 42.0.0 but doesn't in DataFusion 43.0.0:

DataFusion CLI v43.0.0
> create table t as values (arrow_cast('Foo', 'Dictionary(Int32, LargeUtf8)'));
0 row(s) fetched.
Elapsed 0.013 seconds.

> select md5(column1) from t;
Arrow error: Compute error: Internal Error: Cannot cast Utf8View to StringArray of expected type

Expected behavior

Query should complete

Additional context

I found this while writing tests in #13443 -- Feel free to take those tests to work on this issue

Those tests fail like this:

External error: query failed: DataFusion error: Arrow error: Invalid argument error: column types must match schema types, expected LargeUtf8 but found Utf8 at column index 0
[SQL] select md5(ascii_1) from test_basic_operator;
at test_files/string/./string_query.slt.part:1381
at test_files/string/large_string.slt:93

Metadata

Metadata

Assignees

Labels

bugSomething isn't workingregressionSomething that used to work no longer does

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions