Skip to content

Support npy as input format #49293

@gingerwizard

Description

@gingerwizard

npy is often used to encode numpy data including embeddings as a arrays. Large embedding datasets are often distributed in this format. Propose we support as an input format.
A mapping to clickhouse rows needs to be discussed, but possibly first level is mapped to row and nested arrays become Array(Array.

Metadata

Metadata

Assignees

Labels

comp-formatsInput/output formats (CSV/JSON/Parquet/ORC/Arrow/Protobuf/etc.).feature

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions