Skip to content

read_csv from S3 times out #1178

@hussainsultan

Description

@hussainsultan
data = dask.dataframe.read_csv('s3://test/test.csv')

times out with

ConnectTimeout: HTTPSConnectionPool(host='sts.amazonaws.com', port=443): Max retries exceeded with url: / (Caused by ConnectTimeoutError(<botocore.awsrequest.AWSHTTPSConnection object at 0x7f999ca2c790>, 'Connection to sts.amazonaws.com timed out. (connect timeout=60)'))

however, the following works:

import s3fs
fs = s3fs.S3FileSystem()
fs.ls('test')

I am on ec2 instance which has an IAM role attached with S3FullAccess

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions