Skip to content

csv_example: Negative size passed to PyString_FromStringAndSize error #571

@suneetdewan

Description

@suneetdewan

Hi, I am just getting started with dedupe and tried to run the csv_example out of the box and ran into this error:

Traceback (most recent call last):
File "csv_example.py", line 161, in
threshold = deduper.threshold(data_d, recall_weight=2)
File "/Users/suneetdewan/Documents/projects/dedupe/dedupe/api.py", line 237, in threshold
return self.thresholdBlocks(blocked_pairs, recall_weight)
File "/Users/suneetdewan/Documents/projects/dedupe/dedupe/api.py", line 68, in thresholdBlocks
probability = core.scoreDuplicates(self._blockedPairs(blocks),
File "/Users/suneetdewan/Documents/projects/dedupe/dedupe/api.py", line 248, in _blockedPairs
block, blocks = core.peek(blocks)
File "/Users/suneetdewan/Documents/projects/dedupe/dedupe/core.py", line 279, in peek
record = next(records)
File "/Users/suneetdewan/Documents/projects/dedupe_env/lib/python2.7/site-packages/future/builtins/newnext.py", line 59, in newnext
return iterator.next()
File "/Users/suneetdewan/Documents/projects/dedupe/dedupe/api.py", line 281, in _blockData
for block in viewvalues(blocks):
File "/Users/suneetdewan/Documents/projects/dedupe_env/lib/python2.7/site-packages/future/utils/init.py", line 297, in viewvalues
return func(**kwargs)
File "/Users/suneetdewan/Documents/projects/dedupe_env/lib/python2.7/UserDict.py", line 120, in values
return [v for _, v in self.iteritems()]
File "/Users/suneetdewan/Documents/projects/dedupe_env/lib/python2.7/UserDict.py", line 110, in iteritems
for k in self:
File "/Users/suneetdewan/Documents/projects/dedupe_env/lib/python2.7/UserDict.py", line 97, in iter
for k in self.keys():
File "/Users/suneetdewan/anaconda/lib/python2.7/shelve.py", line 101, in keys
return self.dict.keys()
SystemError: Negative size passed to PyString_FromStringAndSize

running on python 2.7 with osx.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions