Skip to content

number of Users doesn't match, in Amazon-Book #7

@liulu1998

Description

@liulu1998

The number of Users in Amazon-Book isn't equal to that shown in your table.

I use Books (ratings only, 22,507,155 ratings), which is 873.81MB, download from Amazon Reviews 2014.

My source code :

df = pd.read_csv("./ratings_Books.csv", header=None)

df.columns = ["user", "item", "rating", "timestamp"]

print(f"number of user: {len(np.unique(df.user))}")
print(f"number of item: {len(np.unique(df.item))}")
print(f"number of interaction: {len(df)}")

the output is :

number of user: 8026324
number of item: 2330066
number of interaction: 22507155

As shown above, number of Users I count is 8,026,324, which isn't equal to 3,468,412 shown in your table., while other numbers match.

Am I using a wrong version of Amazon-Book ?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions