Amazon Standard Identification Number (ASIN) by mimiflynn · Pull Request #104 · bee-san/pyWhat

mimiflynn · 2021-06-28T22:16:34Z

No description provided.

pywhat/Data/regex.json

ghost

Also, please format tests using black

pywhat/Data/regex.json

mimiflynn · 2021-07-04T14:47:42Z

I'm not quite sure whats happening: with the hardcoded boundaries ^((?:[/dp/]|$)([A-Z0-9]{10}))$, the tests fail and without them (?:[/dp/]|$)([A-Z0-9]{10}) they pass. I'm looking into it more, but if you have any quick insights, I'd appreciate the feedback.

Thanks!

amadejpapez · 2021-07-04T15:05:31Z

This is happening because with hardcoded boundaries the whole match needs to be in one line with nothing else around it.

http://www.amazon.com/Kindle-Wireless-Reading-Display-Generation/dp/B0015T963C

Your regex is matching to only /dp/B0015T963C, so it doesn't match as there is something before it. If you add only this part as a test it will pass. 😄

mimiflynn · 2021-07-04T15:46:28Z

This is happening because with hardcoded boundaries the whole match needs to be in one line with nothing else around it.

http://www.amazon.com/Kindle-Wireless-Reading-Display-Generation/dp/B0015T963C

Your regex is matching to only /dp/B0015T963C, so it doesn't match as there is something before it. If you add only this part as a test it will pass. smile

Oh, I see! I had lost sight of main use of pyWhat while using regex101 and was extracting ASINs from strings instead of checking if a string is an ASIN. Updating now with new regex removing URL specific aspects.

bee-san · 2021-07-07T09:58:35Z

This is happening because with hardcoded boundaries the whole match needs to be in one line with nothing else around it.
http://www.amazon.com/Kindle-Wireless-Reading-Display-Generation/dp/B0015T963C
Your regex is matching to only /dp/B0015T963C, so it doesn't match as there is something before it. If you add only this part as a test it will pass. smile

Oh, I see! I had lost sight of main use of pyWhat while using regex101 and was extracting ASINs from strings instead of checking if a string is an ASIN. Updating now with new regex removing URL specific aspects.

I think in bounardyless mode we do extract that string, but we need it to have boundaries in our database so we can take them away in our boundaryless mode :)

mimiflynn and others added 5 commits June 22, 2021 22:00

regex for ASIN

5d4e7f9

ASIN tests

29ebbb3

ASIN update rarity to 0.5

ec9b6cb

clean up

f360065

Merge branch 'main' into main

952823e

amadejpapez requested changes Jun 29, 2021

View reviewed changes

pywhat/Data/regex.json Outdated Show resolved Hide resolved

ghost suggested changes Jun 29, 2021

View reviewed changes

pywhat/Data/regex.json Outdated Show resolved Hide resolved

mimiflynn added 5 commits July 4, 2021 09:47

Merge branch 'main' of https://github.com/bee-san/pyWhat into main

b182e2a

PR review updates for ASIN

80891a0

Hardcoded boundaries for regexes

6da6fcb

format tests with black

6f84ad5

Remove hardcoded boundaries for ASIN

6ae32f2

identify standalone ASIN outside of url

04fbced

ghost approved these changes Jul 6, 2021

View reviewed changes

bee-san approved these changes Jul 7, 2021

View reviewed changes

bee-san requested a review from amadejpapez July 7, 2021 09:59

bee-san merged commit 72a894d into bee-san:main Jul 7, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Amazon Standard Identification Number (ASIN)#104

Amazon Standard Identification Number (ASIN)#104
bee-san merged 11 commits intobee-san:mainfrom
mimiflynn:main

mimiflynn commented Jun 28, 2021

Uh oh!

Uh oh!

ghost left a comment

Uh oh!

Uh oh!

mimiflynn commented Jul 4, 2021

Uh oh!

amadejpapez commented Jul 4, 2021

Uh oh!

mimiflynn commented Jul 4, 2021

Uh oh!

bee-san commented Jul 7, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

mimiflynn commented Jun 28, 2021

Uh oh!

Uh oh!

ghost left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mimiflynn commented Jul 4, 2021

Uh oh!

amadejpapez commented Jul 4, 2021

Uh oh!

mimiflynn commented Jul 4, 2021

Uh oh!

bee-san commented Jul 7, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants