Using the parsingLimitEntries parameter when importing vector embeddings causes the import to fail #2432

@odysseaspenta

Description

I am using ArcadeDB version 25.7.1.

If you attempt to import a set of embeddings while specifying the parsingLimitEntries parameter, the import fails.

import database file://./cc.en.300.small.vec.gz with distanceFunction = cosine, m = 16, ef = 128, efConstruction = 128, vertexType = Word, edgeType = Proximity, vectorProperty = vector, idProperty = name, parsingLimitEntries = 100

The stack trace in the error is shown below.

Caused by: com.arcadedb.integration.importer.ImportException: Error on importing Word2Vec datasource
at com.arcadedb.integration.importer.format.Word2VecImporterFormat.load(Word2VecImporterFormat.java:55)
at com.arcadedb.integration.importer.Importer.loadFromSource(Importer.java:107)
at com.arcadedb.integration.importer.Importer.load(Importer.java:53)
... 12 more
Caused by: java.lang.IllegalStateException: stream has already been operated upon or closed
at java.base/java.util.stream.AbstractPipeline.<init>(AbstractPipeline.java:203)
at java.base/java.util.stream.ReferencePipeline.<init>(ReferencePipeline.java:96)
at java.base/java.util.stream.ReferencePipeline$StatelessOp.<init>(ReferencePipeline.java:800)
at java.base/java.util.stream.ReferencePipeline$3.<init>(ReferencePipeline.java:191)
at java.base/java.util.stream.ReferencePipeline.map(ReferencePipeline.java:190)
at com.arcadedb.integration.importer.vector.TextEmbeddingsImporter.loadFromFile(TextEmbeddingsImporter.java:237)
at com.arcadedb.integration.importer.vector.TextEmbeddingsImporter.run(TextEmbeddingsImporter.java:116)
at com.arcadedb.integration.importer.format.Word2VecImporterFormat.load(Word2VecImporterFormat.java:52)
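
The innermost frames show `Stream.map` failing because the stream it was called on had already been chained or consumed. One plausible cause, which I have not confirmed against the ArcadeDB source, is that the code handling parsingLimitEntries calls `limit(...)` on the stream and discards the returned stream instead of reassigning it. The sketch below reproduces the same exception with plain JDK classes; `StreamReuseDemo`, `buggy`, and `fixed` are hypothetical names used only for illustration and are not part of ArcadeDB.

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical demo class; it only illustrates the suspected pattern.
class StreamReuseDemo {

  // Suspected pattern (an assumption): limit() returns a NEW stream and marks
  // the original as linked, so the later map() on the original throws.
  static List<String> buggy(List<String> lines, int limit) {
    Stream<String> parser = lines.stream();
    if (limit > 0)
      parser.limit(limit); // result discarded; 'parser' can no longer be chained
    return parser.map(String::toUpperCase).collect(Collectors.toList());
  }

  // Reassigning the result of limit() keeps a single pipeline.
  static List<String> fixed(List<String> lines, int limit) {
    Stream<String> parser = lines.stream();
    if (limit > 0)
      parser = parser.limit(limit);
    return parser.map(String::toUpperCase).collect(Collectors.toList());
  }

  public static void main(String[] args) {
    List<String> lines = List.of("hello", "world", "foo");

    try {
      buggy(lines, 2);
    } catch (IllegalStateException e) {
      // Same message as in the reported stack trace
      System.out.println("buggy: " + e.getMessage());
    }

    System.out.println("fixed: " + fixed(lines, 2));
  }
}
```

With a limit of 0 both methods behave the same, which would be consistent with the import only failing when parsingLimitEntries is set.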
