[added] new search plugin for private tracker torrentday #1597

Merged

liiight merged 8 commits into Flexget:develop from zosky:zosky-td-1

Jan 6, 2017

Contributor

zosky commented Dec 31, 2016

I used torrentleech as a starting point. The main difference is that TL uses a username/password to log in, generate the cookies and access the search pages. TD's login page has a captcha, so instead I made the 3 cookies it needs required keys. This should be fine, because in my browser they all have an expiry date of 2038. Beyond that, the 2 sites have slightly different CSS, so I'm looking for some different classes and divs. (This is my first PR ever, so plz be gentle.)

Motivation for changes:

TD is my primary private tracker and TL secondary
i'd like to discover stuff. maybe others will too

Detailed changes:

new private tracker search plugin

Config usage if relevant (new plugin or updated schema):

    discover:
      from:
        - torrentday:
            uid: xxxxxxxxxxxxx  # required. NOT YOUR LOGIN. find this in your browser's cookies
            passkey: xxxxxxxxx  # required. NOT YOUR PASSWORD. see previous
            cfduid: xxxxxxxxxx  # required. AGAIN IN THE COOKIES
            rss_key: xxxxxxxxx  # required. get this from your profile page
            category: xxxxxxxx
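
As a rough illustration of how those config keys could end up as request cookies: the `pass` and `__cfduid` cookie names appear in the reviewed diff further down, while the `uid` cookie name and the `build_cookies` helper are assumptions made for this sketch only.

```python
def build_cookies(config):
    """Map the plugin config keys onto the cookies the site expects."""
    return {
        'uid': config['uid'],          # cookie name assumed, not shown in the diff
        'pass': config['passkey'],     # name taken from the reviewed diff
        '__cfduid': config['cfduid'],  # name taken from the reviewed diff
    }


# Placeholder values, mirroring the xxxx placeholders in the config above.
example_config = {
    'uid': 'xxxxxxxxxxxxx',
    'passkey': 'xxxxxxxxx',
    'cfduid': 'xxxxxxxxxx',
    'rss_key': 'xxxxxxxxx',
}
cookies = build_cookies(example_config)
```

The resulting dict is the kind of value that is passed as `cookies=cookies` in the `requests.get` call shown in the review below; `rss_key` is not a cookie and is only used later for the download link.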

Log and/or tests output (preferably both):

https://dl.dropboxusercontent.com/u/28529352/flexget-torrentday-test.log

zosky added 3 commits

December 30, 2016 23:49


          new search plugin for private tracker torrentday

5e6337f



          [added] new search plugin for torrentday

d96255d

sorted out tabs and spaces


          cleanup

b87f320

removed 1 garbage line

cvium reviewed

flexget/plugins/sites/torrentday.py Outdated

+                      if 'url' not in entry:
+                          log.error("Didn't actually get a URL...")
+                      else:
+                          log.debug("Got the URL: %s" % entry['url'])

Contributor

cvium Dec 31, 2016

You can pass the args to the logger and let it do the string formatting, i.e. a comma instead of %
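
A minimal sketch of the difference, using Python's standard `logging` module. It assumes the plugin's `log` object behaves like a standard logger; the logger name, the in-memory handler and the `entry` value are made up for illustration.

```python
import io
import logging

# Stand-in logger that writes to an in-memory buffer so the output can be inspected.
buffer = io.StringIO()
log = logging.getLogger('torrentday_example')  # hypothetical logger name
log.setLevel(logging.DEBUG)
log.propagate = False
handler = logging.StreamHandler(buffer)
handler.setFormatter(logging.Formatter('%(levelname)s %(message)s'))
log.addHandler(handler)

entry = {'url': 'https://example.invalid/download.php/1/file.torrent'}  # made-up entry

# Eager: the string is built with % before the logger ever sees it.
log.debug('Got the URL: %s' % entry['url'])

# Lazy: pass the argument after a comma; the logger formats it
# only if the record is actually emitted.
log.debug('Got the URL: %s', entry['url'])

output_lines = buffer.getvalue().splitlines()
```

Both calls produce the same text when debug logging is enabled; the second form simply skips the formatting work when it is not.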

cvium requested changes

Contributor

cvium left a comment

Pass the arguments to the logger instead of doing explicit string formatting and handle the requests exceptions. Seems fine otherwise.

flexget/plugins/sites/torrentday.py Outdated

+                          cookies["pass"] = config['passkey']
+                          cookies["__cfduid"] = config['cfduid']
+                          page = requests.get(url, cookies=cookies).content

Contributor

cvium Dec 31, 2016

You need more exception handling


          added suggested changes

317fbe4

exception handling & better debug logging

zosky changed the title ~~new search plugin for private tracker torrentday~~ [added] new search plugin for private tracker torrentday

cvium requested changes

flexget/plugins/sites/torrentday.py Outdated

+                          try:
+                              page = requests.get(url, cookies=cookies).content
+                          except RequestException as e:
+                              raise PluginError('Could not connect to torrentday: %s', str(e))

Contributor

cvium Jan 1, 2017

PluginError only takes one argument. You have to do the string formatting here.
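
To illustrate the point: exceptions do not apply `%`-style formatting to extra arguments the way loggers do, so the message has to be formatted before it is passed in. The sketch below uses a hypothetical single-argument `PluginError` stand-in (not FlexGet's real class), catches `OSError` as a stand-in for requests' `RequestException`, and simulates the network failure.

```python
class PluginError(Exception):
    """Hypothetical stand-in for the plugin error type: takes one message."""

    def __init__(self, value):
        super().__init__(value)
        self.value = value


def fetch_page(get, url):
    """Call get(url) and convert connection failures into a PluginError."""
    try:
        return get(url)
    except OSError as e:  # stand-in for requests' RequestException
        # Format the message here; the exception will not do it for us.
        raise PluginError('Could not connect to torrentday: %s' % e)


def failing_get(url):
    """Simulated request that always fails."""
    raise ConnectionError('connection refused')


try:
    fetch_page(failing_get, 'https://www.torrentday.com/browse.php')
except PluginError as err:
    message = err.value

# With a plain Exception, extra arguments are stored, not formatted into the message.
unformatted = str(Exception('Could not connect: %s', 'boom'))
```

Here `message` carries the underlying error text, whereas `unformatted` is just the string form of the argument tuple.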

flexget/plugins/sites/torrentday.py Outdated

+                      if not isinstance(config, dict):
+                          config = {}
+                          # sort = SORT.get(config.get('sort_by', 'seeds'))

Contributor

cvium Jan 1, 2017

You should remove any useless comments

flexget/plugins/sites/torrentday.py Outdated

+                              # find the torrent names
+                              title = tr.find("a", { "class": "torrentName" })
+                              entry['title'] = title.contents[0]
+                              log.debug('title: %s' % title.contents[0])

Contributor

cvium Jan 1, 2017

String formatting

flexget/plugins/sites/torrentday.py Outdated

+                              # construct download URL
+                              torrent_url = ( "https://www.torrentday.com/" + torrent_url + '?torrent_pass=' + config['rss_key'] )
+                              log.debug('RSS-ified download link: %s' % torrent_url)

Contributor

cvium Jan 1, 2017

String formatting

flexget/plugins/sites/torrentday.py Outdated

+                          # urllib.quote will crash if the unicode string has non ascii characters, so encode in utf-8 beforehand
+                          url = ('https://www.torrentday.com/browse.php?search=' +
+                                 quote(query.encode('utf-8')) + filter_url)
+                          log.debug('Using %s as torrentday search url' % url)

Contributor

cvium Jan 1, 2017

String formatting


          additional suggested changes

028f758

cvium reviewed

flexget/plugins/sites/torrentday.py Outdated

+                          try:
+                              page = requests.get(url, cookies=cookies).content
+                          except RequestException as e:
+                              raise PluginError('Could not connect to torrentday')

Contributor

cvium Jan 1, 2017 (edited)

You could've changed it to raise PluginError('Could not connect to torrentday: %s' % e)


          another suggested change

184036c

liiight approved these changes

flexget/plugins/sites/torrentday.py Outdated

+                      Search for name from torrentday.
+                      """
+                      if not isinstance(config, dict):

Member

liiight Jan 2, 2017

No need for this, your schema means config cannot be anything other than a dict

flexget/plugins/sites/torrentday.py Outdated

+                          categories = [categories]
+                      # If there are any text categories, turn them into their id number
+                      categories = [c if isinstance(c, int) else CATEGORIES[c] for c in categories]
+                      filter_url = '&cata=yes&c%s=1&clear-new=1' % ','.join(str(c) for c in categories)

Member

liiight Jan 2, 2017

Not mandatory, but I prefer passing URL params as a dict to requests, makes it more readable:

params = { 'cata': 'yes', 'c%s' % ','.join(str(c) for c in categories): 1, 'clear-new': 1}

Then add it with params=params in the requests call. Just a suggestion
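
A small sketch of that suggestion. The `CATEGORIES` ids, the `build_params` helper and the idea of sending `search` as a parameter are assumptions for illustration; `urllib.parse.urlencode` is used only to preview the resulting query string without making a request.

```python
from urllib.parse import urlencode

# Hypothetical name-to-id mapping; the real ids live in the plugin's CATEGORIES.
CATEGORIES = {'movies': 11, 'tv': 7}


def build_params(query, categories):
    """Return the search query parameters as a plain dict."""
    # Turn text categories into their id number, leave ints untouched.
    ids = [c if isinstance(c, int) else CATEGORIES[c] for c in categories]
    params = {'search': query, 'cata': 'yes', 'clear-new': 1}
    params['c%s' % ','.join(str(i) for i in ids)] = 1
    return params


params = build_params('some show', ['tv'])
query_string = urlencode(params)  # preview only; requests does its own encoding
```

With requests, the dict would be handed over as `requests.get(url, params=params, cookies=cookies)`, which keeps the URL itself free of hand-built query strings.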


          make a better search request w/ params

878266c

use params rather than putting it all in the url
also removed check for 'config is dict' not necessary, schema mandates it
and fixed crash in scraping seed/leech by stripping number formatting
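
The kind of fix described in the last line can be sketched as follows, assuming the site renders large counts with comma thousands separators (e.g. `1,234`), which makes a bare `int()` call raise `ValueError`. The `parse_count` helper is hypothetical and not taken from the plugin.

```python
def parse_count(text):
    """Convert a scraped count such as '1,234' to an int."""
    # Drop surrounding whitespace and thousands separators before converting.
    return int(text.strip().replace(',', ''))


seeds = parse_count(' 1,234 ')
leeches = parse_count('57')
```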

Member

paranoidi commented Jan 3, 2017

Is it possible to get the cookie data by logging into the site? Grabbing them from cookies manually sucks.

paranoidi added the Enhancement label

Member

paranoidi commented Jan 3, 2017

Looks fine to me besides cumbersome cookie usage.

Contributor Author

zosky commented Jan 3, 2017

I don't like it either, but it works. Their login page has reCAPTCHA, so I can't go through the front door and catch their cookies. Any suggestions?

Contributor

cvium commented Jan 5, 2017

One final change I'd like to see is cleaning up your inconsistent use of quotes regarding strings. Sometimes you use double quotes, other times you use single quotes. It has to be one or the other. Single quotes would probably be more in line with the rest of the code.


          changed double-qoute to single-qoute

c854505

as requested to match the rest of the project

liiight merged commit f0d01df into Flexget:develop

zosky deleted the zosky-td-1 branch

January 7, 2017 06:12
