release 2017.02.14

[ChangeLog] Actualize
[zdf] Fix extraction (closes #12117)
2017-02-14 01:09:18 +07:00 · 2017-02-14 01:07:35 +07:00 · 2017-02-14 01:00:06 +07:00 · 2017-02-13 23:44:43 +07:00 · 2017-02-13 23:34:14 +07:00 · 2017-02-13 23:17:48 +07:00
69 changed files with 2125 additions and 547 deletions
--- a/.github/ISSUE_TEMPLATE.md
+++ b/.github/ISSUE_TEMPLATE.md
@ -6,8 +6,8 @@
 ---
-### Make sure you are using the *latest* version: run `youtube-dl --version` and ensure your version is *2017.02.01*. If it's not read [this FAQ entry](https://github.com/rg3/youtube-dl/blob/master/README.md#how-do-i-update-youtube-dl) and update. Issues with outdated version will be rejected.
+### Make sure you are using the *latest* version: run `youtube-dl --version` and ensure your version is *2017.02.14*. If it's not read [this FAQ entry](https://github.com/rg3/youtube-dl/blob/master/README.md#how-do-i-update-youtube-dl) and update. Issues with outdated version will be rejected.
-- [ ] I've **verified** and **I assure** that I'm running youtube-dl **2017.02.01**
+- [ ] I've **verified** and **I assure** that I'm running youtube-dl **2017.02.14**
 ### Before submitting an *issue* make sure you have:
 - [ ] At least skimmed through [README](https://github.com/rg3/youtube-dl/blob/master/README.md) and **most notably** [FAQ](https://github.com/rg3/youtube-dl#faq) and [BUGS](https://github.com/rg3/youtube-dl#bugs) sections
@ -35,7 +35,7 @@ $ youtube-dl -v <your command line>
 [debug] User config: []
 [debug] Command-line args: [u'-v', u'http://www.youtube.com/watch?v=BaW_jenozKcj']
 [debug] Encodings: locale cp1251, fs mbcs, out cp866, pref cp1251
-[debug] youtube-dl version 2017.02.01
+[debug] youtube-dl version 2017.02.14
 [debug] Python version 2.7.11 - Windows-2003Server-5.2.3790-SP2
 [debug] exe versions: ffmpeg N-75573-g1d0487f, ffprobe N-75573-g1d0487f, rtmpdump 2.4
 [debug] Proxy map: {}
--- a/.travis.yml
+++ b/.travis.yml
@ -6,8 +6,14 @@ python:
  - "3.3"
  - "3.4"
  - "3.5"
  - "3.6"
 sudo: false
-script: nosetests test --verbose
+env:
+  - YTDL_TEST_SET=core
+  - YTDL_TEST_SET=download
+before_script:
+  - chmod +x ./devscripts/run_tests.sh
+script: ./devscripts/run_tests.sh
 notifications:
  email:
    - filippo.valsorda@gmail.com
--- a/1
+++ b/1
@ -201,3 +201,4 @@ Stephen Chen
 Fabian Stahl
 Bagira
 Odd Stråbø
 Philip Herzog
--- a/109
+++ b/109
@ -1,3 +1,112 @@
 version 2017.02.14
 Core
 * TypeError is fixed with Python 2.7.13 on Windows (#11540, #12085)
 Extractor
 * [zdf] Fix extraction (#12117)
 * [xtube] Fix extraction for both kinds of video id (#12088)
 * [xtube] Improve title extraction (#12088)
 + [lemonde] Fallback delegate extraction to generic extractor (#12115, #12116)
 * [bellmedia] Allow video id longer than 6 characters (#12114)
 + [limelight] Add support for referer protected videos
 * [disney] Improve extraction (#4975, #11000, #11882, #11936)
 * [hotstar] Improve extraction (#12096)
 * [einthusan] Fix extraction (#11416)
 + [aenetworks] Add support for lifetimemovieclub.com (#12097)
 * [youtube] Fix parsing codecs (#12091)
 version 2017.02.11
 Core
 + [utils] Introduce get_elements_by_class and get_elements_by_attribute
  utility functions
 + [extractor/common] Skip m3u8 manifests protected with Adobe Flash Access
 Extractor
 * [pluralsight:course] Fix extraction (#12075)
 + [bbc] Extract m3u8 formats with 320k audio
 * [facebook] Relax video id matching (#11017, #12055, #12056)
 + [corus] Add support for Corus Entertainment sites (#12060, #9164)
 + [pluralsight] Detect blocked account error message (#12070)
 + [bloomberg] Add another video id pattern (#12062)
 * [extractor/commonmistakes] Restrict URL regular expression (#12050)
 + [tvplayer] Add support for tvplayer.com
 version 2017.02.10
 Extractors
 * [xtube] Fix extraction (#12023)
 * [pornhub] Fix extraction (#12007, #12018)
 * [facebook] Improve JS data regular expression (#12042)
 * [kaltura] Improve embed partner id extraction (#12041)
 + [sprout] Add support for sproutonline.com
 * [6play] Improve extraction
 + [scrippsnetworks:watch] Add support for Scripps Networks sites (#10765)
 + [go] Add support for Adobe Pass authentication (#11468, #10831)
 * [6play] Fix extraction (#12011)
 + [nbc] Add support for Adobe Pass authentication (#12006)
 version 2017.02.07
 Core
 * [extractor/common] Fix audio only with audio group in m3u8 (#11995)
 + [downloader/fragment] Respect --no-part
 * [extractor/common] Speed-up HTML5 media entries extraction (#11979)
 Extractors
 * [pornhub] Fix extraction (#11997)
 + [canalplus] Add support for cstar.fr (#11990)
 + [extractor/generic] Improve RTMP support (#11993)
 + [gaskrank] Add support for gaskrank.tv (#11685)
 * [bandcamp] Fix extraction for incomplete albums (#11727)
 * [iwara] Fix extraction (#11781)
 * [googledrive] Fix extraction on Python 3.6
 + [videopress] Add support for videopress.com
 + [afreecatv] Extract RTMP formats
 version 2017.02.04.1
 Extractors
 + [twitch:stream] Add support for player.twitch.tv (#11971)
 * [radiocanada] Fix extraction for toutv rtmp formats
 version 2017.02.04
 Core
 + Add --playlist-random to shuffle playlists (#11889, #11901)
 * [utils] Improve comments processing in js_to_json (#11947)
 * [utils] Handle single-line comments in js_to_json
 * [downloader/external:ffmpeg] Minimize the use of aac_adtstoasc filter
 Extractors
 + [piksel] Add another app token pattern (#11969)
 + [vk] Capture and output author blocked error message (#11965)
 + [turner] Fix secure HLS formats downloading with ffmpeg (#11358, #11373,
  #11800)
 + [drtv] Add support for live and radio sections (#1827, #3427)
 * [myspace] Fix extraction and extract HLS and HTTP formats
 + [youtube] Add format info for itag 325 and 328
 * [vine] Fix extraction (#11955)
 - [sportbox] Remove extractor (#11954)
 + [filmon] Add support for filmon.com (#11187)
 + [infoq] Add audio only formats (#11565)
 * [douyutv] Improve room id regular expression (#11931)
 * [iprima] Fix extraction (#11920, #11896)
 * [youtube] Fix ytsearch when cookies are provided (#11924)
 * [go] Relax video id regular expression (#11937)
 * [facebook] Fix title extraction (#11941)
 + [youtube:playlist] Recognize TL playlists (#11945)
 + [bilibili] Support new Bangumi URLs (#11845)
 + [cbc:watch] Extract audio codec for audio only formats (#11893)
 + [elpais] Fix extraction for some URLs (#11765)
 version 2017.02.01
 Extractors
--- a/README.md
+++ b/README.md
@ -182,6 +182,7 @@ Alternatively, refer to the [developer instructions](#developer-instructions) fo
                                     automatically resized from an initial value
                                     of SIZE.
    --playlist-reverse               Download playlist videos in reverse order
    --playlist-random                Download playlist videos in random order
    --xattr-set-filesize             Set file xattribute ytdl.filesize with
                                     expected file size (experimental)
    --hls-prefer-native              Use the native HLS downloader instead of
--- a/devscripts/run_tests.sh
+++ b/devscripts/run_tests.sh
@ -0,0 +1,19 @@
 #!/bin/bash
 DOWNLOAD_TESTS="age_restriction|download|subtitles|write_annotations|iqiyi_sdk_interpreter"
 test_set=""
 case "$YTDL_TEST_SET" in
    core)
        test_set="-I test_($DOWNLOAD_TESTS)\.py"
    ;;
    download)
        test_set="-I test_(?!$DOWNLOAD_TESTS).+\.py"
    ;;
    *)
        break
    ;;
 esac
 nosetests test --verbose $test_set
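Note on the two `-I` patterns in `devscripts/run_tests.sh`: the sketch below shows roughly how they partition the test files. It assumes that nose's `-I` (`--ignore-files`) option skips any file whose base name the regex matches with search semantics. The `selected` helper and the sample file list are hypothetical and not part of this commit.

```python
import re

# Alternation copied from DOWNLOAD_TESTS in devscripts/run_tests.sh.
DOWNLOAD_TESTS = 'age_restriction|download|subtitles|write_annotations|iqiyi_sdk_interpreter'

# Each pattern names the files to IGNORE for the given YTDL_TEST_SET value.
IGNORE = {
    'core': r'test_(%s)\.py' % DOWNLOAD_TESTS,
    'download': r'test_(?!%s).+\.py' % DOWNLOAD_TESTS,
}


def selected(test_set, filenames):
    """Return the files that would still run (hypothetical helper)."""
    pattern = re.compile(IGNORE[test_set])
    return [name for name in filenames if not pattern.search(name)]


# Sample file names, chosen for illustration only.
files = ['test_utils.py', 'test_download.py', 'test_subtitles.py', 'test_compat.py']
print(selected('core', files))
print(selected('download', files))
```

Under that assumption, the `core` set keeps everything except the download-style tests, and the negative lookahead in the `download` pattern keeps only those tests.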
--- a/docs/supportedsites.md
+++ b/docs/supportedsites.md
@ -11,6 +11,7 @@
 - **4tube**
 - **56.com**
 - **5min**
 - **6play**
 - **8tracks**
 - **91porn**
 - **9c9media**
@ -84,6 +85,7 @@
 - **bambuser:channel**
 - **Bandcamp**
 - **Bandcamp:album**
 - **bangumi.bilibili.com**: BiliBili番剧
 - **bbc**: BBC
 - **bbc.co.uk**: BBC iPlayer
 - **bbc.co.uk:article**: BBC articles
@ -167,6 +169,7 @@
 - **ComedyCentralShortname**
 - **ComedyCentralTV**
 - **CondeNast**: Condé Nast media group: Allure, Architectural Digest, Ars Technica, Bon Appétit, Brides, Condé Nast, Condé Nast Traveler, Details, Epicurious, GQ, Glamour, Golf Digest, SELF, Teen Vogue, The New Yorker, Vanity Fair, Vogue, W Magazine, WIRED
 - **Corus**
 - **Coub**
 - **Cracked**
 - **Crackle**
@ -211,7 +214,8 @@
 - **DRBonanza**
 - **Dropbox**
 - **DrTuber**
- - **DRTV**
+ - **drtv**
+ - **drtv:live**
 - **Dumpert**
 - **dvtv**: http://video.aktualne.cz/
 - **dw**
@ -247,6 +251,8 @@
 - **fc2:embed**
 - **Fczenit**
 - **fernsehkritik.tv**
 - **filmon**
 - **filmon:channel**
 - **Firstpost**
 - **FiveTV**
 - **Flickr**
@ -278,6 +284,7 @@
 - **Gamersyde**
 - **GameSpot**
 - **GameStar**
 - **Gaskrank**
 - **Gazeta**
 - **GDCVault**
 - **generic**: Generic downloader that works on some sites
@ -303,7 +310,6 @@
 - **HellPorno**
 - **Helsinki**: helsinki.fi
 - **HentaiStigma**
 - **HGTV**
 - **hgtv.com:show**
 - **HistoricFilms**
 - **history:topic**: History.com Topic
@ -662,6 +668,7 @@
 - **screen.yahoo:search**: Yahoo screen search
 - **Screencast**
 - **ScreencastOMatic**
 - **scrippsnetworks:watch**
 - **Seeker**
 - **SenateISVP**
 - **SendtoNews**
@ -671,7 +678,6 @@
 - **Shared**: shared.sx
 - **ShowRoomLive**
 - **Sina**
 - **SixPlay**
 - **skynewsarabia:article**
 - **skynewsarabia:video**
 - **SkySports**
@ -703,10 +709,10 @@
 - **Spiegeltv**
 - **Spike**
 - **Sport5**
 - **SportBox**
 - **SportBoxEmbed**
 - **SportDeutschland**
 - **Sportschau**
 - **Sprout**
 - **sr:mediathek**: Saarländischer Rundfunk
 - **SRGSSR**
 - **SRGSSRPlay**: srf.ch, rts.ch, rsi.ch, rtr.ch and swissinfo.ch play sites
@ -800,6 +806,7 @@
 - **tvp**: Telewizja Polska
 - **tvp:embed**: Telewizja Polska
 - **tvp:series**
 - **TVPlayer**
 - **Tweakers**
 - **twitch:chapter**
 - **twitch:clips**
@ -856,6 +863,7 @@
 - **videomore:season**
 - **videomore:video**
 - **VideoPremium**
 - **VideoPress**
 - **videoweed**: VideoWeed
 - **Vidio**
 - **vidme**
--- a/test/test_utils.py
+++ b/test/test_utils.py
@ -34,6 +34,9 @@ from youtube_dl.utils import (
    find_xpath_attr,
    fix_xml_ampersands,
    get_element_by_class,
    get_element_by_attribute,
    get_elements_by_class,
    get_elements_by_attribute,
    InAdvancePagedList,
    intlist_to_bytes,
    is_html,
@ -785,12 +788,27 @@ class TestUtil(unittest.TestCase):
        on = js_to_json('["abc", "def",]')
        self.assertEqual(json.loads(on), ['abc', 'def'])
        on = js_to_json('[/*comment\n*/"abc"/*comment\n*/,/*comment\n*/"def",/*comment\n*/]')
        self.assertEqual(json.loads(on), ['abc', 'def'])
        on = js_to_json('[//comment\n"abc" //comment\n,//comment\n"def",//comment\n]')
        self.assertEqual(json.loads(on), ['abc', 'def'])
        on = js_to_json('{"abc": "def",}')
        self.assertEqual(json.loads(on), {'abc': 'def'})
        on = js_to_json('{/*comment\n*/"abc"/*comment\n*/:/*comment\n*/"def"/*comment\n*/,/*comment\n*/}')
        self.assertEqual(json.loads(on), {'abc': 'def'})
        on = js_to_json('{ 0: /* " \n */ ",]" , }')
        self.assertEqual(json.loads(on), {'0': ',]'})
        on = js_to_json('{ /*comment\n*/0/*comment\n*/: /* " \n */ ",]" , }')
        self.assertEqual(json.loads(on), {'0': ',]'})
        on = js_to_json('{ 0: // comment\n1 }')
        self.assertEqual(json.loads(on), {'0': 1})
        on = js_to_json(r'["<p>x<\/p>"]')
        self.assertEqual(json.loads(on), ['<p>x</p>'])
@ -800,15 +818,27 @@ class TestUtil(unittest.TestCase):
        on = js_to_json("['a\\\nb']")
        self.assertEqual(json.loads(on), ['ab'])
        on = js_to_json("/*comment\n*/[/*comment\n*/'a\\\nb'/*comment\n*/]/*comment\n*/")
        self.assertEqual(json.loads(on), ['ab'])
        on = js_to_json('{0xff:0xff}')
        self.assertEqual(json.loads(on), {'255': 255})
        on = js_to_json('{/*comment\n*/0xff/*comment\n*/:/*comment\n*/0xff/*comment\n*/}')
        self.assertEqual(json.loads(on), {'255': 255})
        on = js_to_json('{077:077}')
        self.assertEqual(json.loads(on), {'63': 63})
        on = js_to_json('{/*comment\n*/077/*comment\n*/:/*comment\n*/077/*comment\n*/}')
        self.assertEqual(json.loads(on), {'63': 63})
        on = js_to_json('{42:42}')
        self.assertEqual(json.loads(on), {'42': 42})
        on = js_to_json('{/*comment\n*/42/*comment\n*/:/*comment\n*/42/*comment\n*/}')
        self.assertEqual(json.loads(on), {'42': 42})
    def test_extract_attributes(self):
        self.assertEqual(extract_attributes('<e x="y">'), {'x': 'y'})
        self.assertEqual(extract_attributes("<e x='y'>"), {'x': 'y'})
@ -1097,6 +1127,32 @@ The first line
        self.assertEqual(get_element_by_class('foo', html), 'nice')
        self.assertEqual(get_element_by_class('no-such-class', html), None)
    def test_get_element_by_attribute(self):
        html = '''
            <span class="foo bar">nice</span>
        '''
        self.assertEqual(get_element_by_attribute('class', 'foo bar', html), 'nice')
        self.assertEqual(get_element_by_attribute('class', 'foo', html), None)
        self.assertEqual(get_element_by_attribute('class', 'no-such-foo', html), None)
    def test_get_elements_by_class(self):
        html = '''
            <span class="foo bar">nice</span><span class="foo bar">also nice</span>
        '''
        self.assertEqual(get_elements_by_class('foo', html), ['nice', 'also nice'])
        self.assertEqual(get_elements_by_class('no-such-class', html), [])
    def test_get_elements_by_attribute(self):
        html = '''
            <span class="foo bar">nice</span><span class="foo bar">also nice</span>
        '''
        self.assertEqual(get_elements_by_attribute('class', 'foo bar', html), ['nice', 'also nice'])
        self.assertEqual(get_elements_by_attribute('class', 'foo', html), [])
        self.assertEqual(get_elements_by_attribute('class', 'no-such-foo', html), [])
 if __name__ == '__main__':
    unittest.main()
--- a/youtube_dl/YoutubeDL.py
+++ b/youtube_dl/YoutubeDL.py
@ -24,6 +24,7 @@ import sys
 import time
 import tokenize
 import traceback
 import random
 from .compat import (
    compat_basestring,
@ -159,6 +160,7 @@ class YoutubeDL(object):
    playlistend:       Playlist item to end at.
    playlist_items:    Specific indices of playlist to download.
    playlistreverse:   Download playlist items in reverse order.
    playlistrandom:    Download playlist items in random order.
    matchtitle:        Download only matching titles.
    rejecttitle:       Reject downloads for matching titles.
    logger:            Log messages to a logging.Logger instance.
@ -842,6 +844,9 @@ class YoutubeDL(object):
            if self.params.get('playlistreverse', False):
                entries = entries[::-1]
            if self.params.get('playlistrandom', False):
                random.shuffle(entries)
            for i, entry in enumerate(entries, 1):
                self.to_screen('[download] Downloading video %s of %s' % (i, n_entries))
                extra = {
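Note on the ordering logic in the hunk above: a minimal standalone sketch, assuming the entries are already a plain list. The `order_entries` function and its `rng` parameter are hypothetical names used for illustration; the real code reads the `playlistreverse` and `playlistrandom` params and shuffles in place.

```python
import random


def order_entries(entries, reverse=False, shuffle=False, rng=random):
    """Apply the reverse step, then the shuffle step, to a copy of entries."""
    entries = list(entries)
    if reverse:
        entries = entries[::-1]
    if shuffle:
        # The shuffle runs after the reversal, so when both flags are set
        # the reversal has no visible effect on the final order.
        rng.shuffle(entries)
    return entries


print(order_entries(['a', 'b', 'c'], reverse=True))
print(order_entries(['a', 'b', 'c'], shuffle=True, rng=random.Random(0)))
```

Passing a seeded `random.Random` instance keeps the shuffled order reproducible in tests, while the default uses the module-level generator, as the diff does.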
--- a/youtube_dl/__init__.py
+++ b/youtube_dl/__init__.py
@ -344,6 +344,7 @@ def _real_main(argv=None):
        'playliststart': opts.playliststart,
        'playlistend': opts.playlistend,
        'playlistreverse': opts.playlist_reverse,
        'playlistrandom': opts.playlist_random,
        'noplaylist': opts.noplaylist,
        'logtostderr': opts.outtmpl == '-',
        'consoletitle': opts.consoletitle,
--- a/youtube_dl/compat.py
+++ b/youtube_dl/compat.py
@ -2883,6 +2883,7 @@ __all__ = [
    'compat_cookiejar',
    'compat_cookies',
    'compat_etree_fromstring',
    'compat_etree_register_namespace',
    'compat_expanduser',
    'compat_get_terminal_size',
    'compat_getenv',
--- a/youtube_dl/downloader/external.py
+++ b/youtube_dl/downloader/external.py
@ -17,6 +17,7 @@ from ..utils import (
    encodeArgument,
    handle_youtubedl_headers,
    check_executable,
    is_outdated_version,
 )
@ -198,6 +199,15 @@ class FFmpegFD(ExternalFD):
        args = [ffpp.executable, '-y']
        seekable = info_dict.get('_seekable')
        if seekable is not None:
            # setting -seekable prevents ffmpeg from guessing if the server
            # supports seeking(by adding the header `Range: bytes=0-`), which
            # can cause problems in some cases
            # https://github.com/rg3/youtube-dl/issues/11800#issuecomment-275037127
            # http://trac.ffmpeg.org/ticket/6125#comment:10
            args += ['-seekable', '1' if seekable else '0']
        args += self._configuration_args()
        # start_time = info_dict.get('start_time') or 0
@ -264,7 +274,9 @@ class FFmpegFD(ExternalFD):
            if self.params.get('hls_use_mpegts', False) or tmpfilename == '-':
                args += ['-f', 'mpegts']
            else:
-                args += ['-f', 'mp4', '-bsf:a', 'aac_adtstoasc']
+                args += ['-f', 'mp4']
+                if (ffpp.basename == 'ffmpeg' and is_outdated_version(ffpp._versions['ffmpeg'], '3.2', False)) and (not info_dict.get('acodec') or info_dict['acodec'].split('.')[0] in ('aac', 'mp4a')):
+                    args += ['-bsf:a', 'aac_adtstoasc']
        elif protocol == 'rtmp':
            args += ['-f', 'flv']
        else:
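Note on the new `aac_adtstoasc` condition: the sketch below restates it as a standalone predicate. It is a simplification under stated assumptions. `is_outdated` and `version_tuple` are reduced stand-ins for the helpers in `youtube_dl/utils.py`, `needs_adtstoasc` is a hypothetical name, and the `ffpp.basename == 'ffmpeg'` check is left out.

```python
def version_tuple(version):
    # Simplified: only handles dotted numeric versions such as '3.1.4'.
    return tuple(int(part) for part in version.split('.'))


def is_outdated(version, limit, assume_new=True):
    """Return True if version is older than limit (simplified stand-in)."""
    if not version:
        return not assume_new
    try:
        return version_tuple(version) < version_tuple(limit)
    except ValueError:
        # Unparseable versions fall back to the assume_new default.
        return not assume_new


def needs_adtstoasc(ffmpeg_version, acodec):
    """Decide whether '-bsf:a aac_adtstoasc' would be added (hypothetical)."""
    old_ffmpeg = is_outdated(ffmpeg_version, '3.2', assume_new=False)
    aac_like = not acodec or acodec.split('.')[0] in ('aac', 'mp4a')
    return old_ffmpeg and aac_like


print(needs_adtstoasc('3.1.4', 'mp4a.40.2'))
print(needs_adtstoasc('3.2', 'mp4a.40.2'))
```

Because the third argument is `False`, an unknown or unparseable ffmpeg version is treated as outdated, so the filter is still applied in that case.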
--- a/youtube_dl/downloader/fragment.py
+++ b/youtube_dl/downloader/fragment.py
@ -61,6 +61,7 @@ class FragmentFD(FileDownloader):
                'noprogress': True,
                'ratelimit': self.params.get('ratelimit'),
                'retries': self.params.get('retries', 0),
                'nopart': self.params.get('nopart', False),
                'test': self.params.get('test', False),
            }
        )
--- a/youtube_dl/extractor/aenetworks.py
+++ b/youtube_dl/extractor/aenetworks.py
@ -23,7 +23,7 @@ class AENetworksBaseIE(ThePlatformIE):
 class AENetworksIE(AENetworksBaseIE):
    IE_NAME = 'aenetworks'
    IE_DESC = 'A+E Networks: A&E, Lifetime, History.com, FYI Network'
-    _VALID_URL = r'https?://(?:www\.)?(?P<domain>(?:history|aetv|mylifetime)\.com|fyi\.tv)/(?:shows/(?P<show_path>[^/]+(?:/[^/]+){0,2})|movies/(?P<movie_display_id>[^/]+)/full-movie)'
+    _VALID_URL = r'https?://(?:www\.)?(?P<domain>(?:history|aetv|mylifetime|lifetimemovieclub)\.com|fyi\.tv)/(?:shows/(?P<show_path>[^/]+(?:/[^/]+){0,2})|movies/(?P<movie_display_id>[^/]+)(?:/full-movie)?)'
    _TESTS = [{
        'url': 'http://www.history.com/shows/mountain-men/season-1/episode-1',
        'md5': 'a97a65f7e823ae10e9244bc5433d5fe6',
@ -62,11 +62,15 @@ class AENetworksIE(AENetworksBaseIE):
    }, {
        'url': 'http://www.mylifetime.com/movies/center-stage-on-pointe/full-movie',
        'only_matching': True
    }, {
        'url': 'https://www.lifetimemovieclub.com/movies/a-killer-among-us',
        'only_matching': True
    }]
    _DOMAIN_TO_REQUESTOR_ID = {
        'history.com': 'HISTORY',
        'aetv.com': 'AETV',
        'mylifetime.com': 'LIFETIME',
        'lifetimemovieclub.com': 'LIFETIMEMOVIECLUB',
        'fyi.tv': 'FYI',
    }
--- a/youtube_dl/extractor/afreecatv.py
+++ b/youtube_dl/extractor/afreecatv.py
@ -221,10 +221,23 @@ class AfreecaTVGlobalIE(AfreecaTVIE):
                s_url = s.get('purl')
                if not s_url:
                    continue
-                # TODO: extract rtmp formats
+                stype = s.get('stype')
-                if s.get('stype') == 'HLS':
+                if stype == 'HLS':
                    formats.extend(self._extract_m3u8_formats(
-                        s_url, channel_id, 'mp4', fatal=False))
+                        s_url, channel_id, 'mp4', m3u8_id=stype, fatal=False))
+                elif stype == 'RTMP':
+                    format_id = [stype]
+                    label = s.get('label')
+                    if label:
+                        format_id.append(label)
+                    formats.append({
+                        'format_id': '-'.join(format_id),
+                        'url': s_url,
+                        'tbr': int_or_none(s.get('bps')),
+                        'height': int_or_none(s.get('brt')),
+                        'ext': 'flv',
+                        'rtmp_live': True,
+                    })
            self._sort_formats(formats)
            info.update({
--- a/youtube_dl/extractor/bandcamp.py
+++ b/youtube_dl/extractor/bandcamp.py
@ -209,6 +209,15 @@ class BandcampAlbumIE(InfoExtractor):
            'id': 'entropy-ep',
        },
        'playlist_mincount': 3,
    }, {
        # not all tracks have songs
        'url': 'https://insulters.bandcamp.com/album/we-are-the-plague',
        'info_dict': {
            'id': 'we-are-the-plague',
            'title': 'WE ARE THE PLAGUE',
            'uploader_id': 'insulters',
        },
        'playlist_count': 2,
    }]
    def _real_extract(self, url):
@ -217,12 +226,16 @@ class BandcampAlbumIE(InfoExtractor):
        album_id = mobj.group('album_id')
        playlist_id = album_id or uploader_id
        webpage = self._download_webpage(url, playlist_id)
-        tracks_paths = re.findall(r'<a href="(.*?)" itemprop="url">', webpage)
-        if not tracks_paths:
+        track_elements = re.findall(
+            r'(?s)<div[^>]*>(.*?<a[^>]+href="([^"]+?)"[^>]+itemprop="url"[^>]*>.*?)</div>', webpage)
+        if not track_elements:
            raise ExtractorError('The page doesn\'t contain any tracks')
+        # Only tracks with duration info have songs
        entries = [
            self.url_result(compat_urlparse.urljoin(url, t_path), ie=BandcampIE.ie_key())
-            for t_path in tracks_paths]
+            for elem_content, t_path in track_elements
+            if self._html_search_meta('duration', elem_content, default=None)]
        title = self._html_search_regex(
            r'album_title\s*:\s*"((?:\\.|[^"\\])+?)"',
            webpage, 'title', fatal=False)
--- a/youtube_dl/extractor/bbc.py
+++ b/youtube_dl/extractor/bbc.py
@ -225,6 +225,8 @@ class BBCCoUkIE(InfoExtractor):
        }
    ]
    _USP_RE = r'/([^/]+?)\.ism(?:\.hlsv2\.ism)?/[^/]+\.m3u8'
    class MediaSelectionError(Exception):
        def __init__(self, id):
            self.id = id
@ -336,6 +338,15 @@ class BBCCoUkIE(InfoExtractor):
                        formats.extend(self._extract_m3u8_formats(
                            href, programme_id, ext='mp4', entry_protocol='m3u8_native',
                            m3u8_id=format_id, fatal=False))
                        if re.search(self._USP_RE, href):
                            usp_formats = self._extract_m3u8_formats(
                                re.sub(self._USP_RE, r'/\1.ism/\1.m3u8', href),
                                programme_id, ext='mp4', entry_protocol='m3u8_native',
                                m3u8_id=format_id, fatal=False)
                            for f in usp_formats:
                                if f.get('height') and f['height'] > 720:
                                    continue
                                formats.append(f)
                    elif transfer_format == 'hds':
                        formats.extend(self._extract_f4m_formats(
                            href, programme_id, f4m_id=format_id, fatal=False))
--- a/youtube_dl/extractor/bellmedia.py
+++ b/youtube_dl/extractor/bellmedia.py
@ -24,7 +24,7 @@ class BellMediaIE(InfoExtractor):
                space
            )\.ca|
            much\.com
-        )/.*?(?:\bvid=|-vid|~|%7E|/(?:episode)?)(?P<id>[0-9]{6})'''
+        )/.*?(?:\bvid=|-vid|~|%7E|/(?:episode)?)(?P<id>[0-9]{6,})'''
    _TESTS = [{
        'url': 'http://www.ctv.ca/video/player?vid=706966',
        'md5': 'ff2ebbeae0aa2dcc32a830c3fd69b7b0',
@ -55,6 +55,9 @@ class BellMediaIE(InfoExtractor):
    }, {
        'url': 'http://www.much.com/shows/the-almost-impossible-gameshow/928979/episode-6',
        'only_matching': True,
    }, {
        'url': 'http://www.ctv.ca/DCs-Legends-of-Tomorrow/Video/S2E11-Turncoat-vid1051430',
        'only_matching': True,
    }]
    _DOMAINS = {
        'thecomedynetwork': 'comedy',
--- a/youtube_dl/extractor/bilibili.py
+++ b/youtube_dl/extractor/bilibili.py
@ -5,19 +5,27 @@ import hashlib
 import re
 from .common import InfoExtractor
-from ..compat import compat_parse_qs
+from ..compat import (
+    compat_parse_qs,
+    compat_urlparse,
+)
 from ..utils import (
+    ExtractorError,
    int_or_none,
    float_or_none,
+    parse_iso8601,
+    smuggle_url,
+    strip_jsonp,
    unified_timestamp,
+    unsmuggle_url,
    urlencode_postdata,
 )
 class BiliBiliIE(InfoExtractor):
-    _VALID_URL = r'https?://(?:www\.|bangumi\.|)bilibili\.(?:tv|com)/(?:video/av|anime/v/)(?P<id>\d+)'
+    _VALID_URL = r'https?://(?:www\.|bangumi\.|)bilibili\.(?:tv|com)/(?:video/av|anime/(?P<anime_id>\d+)/play#)(?P<id>\d+)'
-    _TEST = {
+    _TESTS = [{
        'url': 'http://www.bilibili.tv/video/av1074402/',
        'md5': '9fa226fe2b8a9a4d5a69b4c6a183417e',
        'info_dict': {
@ -32,25 +40,61 @@ class BiliBiliIE(InfoExtractor):
            'uploader': '菊子桑',
            'uploader_id': '156160',
        },
-    }
+    }, {
+        # Tested in BiliBiliBangumiIE
+        'url': 'http://bangumi.bilibili.com/anime/1869/play#40062',
+        'only_matching': True,
+    }, {
+        'url': 'http://bangumi.bilibili.com/anime/5802/play#100643',
+        'md5': '3f721ad1e75030cc06faf73587cfec57',
+        'info_dict': {
+            'id': '100643',
+            'ext': 'mp4',
+            'title': 'CHAOS;CHILD',
+            'description': '如果你是神明，并且能够让妄想成为现实。那你会进行怎么样的妄想？是淫靡的世界？独裁社会？毁灭性的制裁？还是……2015年，涩谷。从6年前发生的大灾害“涩谷地震”之后复兴了的这个街区里新设立的私立高中...',
+        },
+        'skip': 'Geo-restricted to China',
+    }]
    _APP_KEY = '84956560bc028eb7'
    _BILIBILI_KEY = '94aba54af9065f71de72f5508f1cd42e'
+    def _report_error(self, result):
+        if 'message' in result:
+            raise ExtractorError('%s said: %s' % (self.IE_NAME, result['message']), expected=True)
+        elif 'code' in result:
+            raise ExtractorError('%s returns error %d' % (self.IE_NAME, result['code']), expected=True)
+        else:
+            raise ExtractorError('Can\'t extract Bangumi episode ID')
    def _real_extract(self, url):
-        video_id = self._match_id(url)
+        url, smuggled_data = unsmuggle_url(url, {})
+        mobj = re.match(self._VALID_URL, url)
+        video_id = mobj.group('id')
+        anime_id = mobj.group('anime_id')
        webpage = self._download_webpage(url, video_id)
-        if 'anime/v' not in url:
+        if 'anime/' not in url:
            cid = compat_parse_qs(self._search_regex(
                [r'EmbedPlayer\([^)]+,\s*"([^"]+)"\)',
                 r'<iframe[^>]+src="https://secure\.bilibili\.com/secure,([^"]+)"'],
                webpage, 'player parameters'))['cid'][0]
        else:
+            if 'no_bangumi_tip' not in smuggled_data:
+                self.to_screen('Downloading episode %s. To download all videos in anime %s, re-run youtube-dl with %s' % (
+                    video_id, anime_id, compat_urlparse.urljoin(url, '//bangumi.bilibili.com/anime/%s' % anime_id)))
+            headers = {
+                'Content-Type': 'application/x-www-form-urlencoded; charset=UTF-8',
+            }
+            headers.update(self.geo_verification_headers())
            js = self._download_json(
                'http://bangumi.bilibili.com/web_api/get_source', video_id,
                data=urlencode_postdata({'episode_id': video_id}),
-                headers={'Content-Type': 'application/x-www-form-urlencoded; charset=UTF-8'})
+                headers=headers)
+            if 'result' not in js:
+                self._report_error(js)
            cid = js['result']['cid']
        payload = 'appkey=%s&cid=%s&otype=json&quality=2&type=mp4' % (self._APP_KEY, cid)
@ -58,7 +102,11 @@ class BiliBiliIE(InfoExtractor):
        video_info = self._download_json(
            'http://interface.bilibili.com/playurl?%s&sign=%s' % (payload, sign),
-            video_id, note='Downloading video info page')
+            video_id, note='Downloading video info page',
+            headers=self.geo_verification_headers())
+        if 'durl' not in video_info:
+            self._report_error(video_info)
        entries = []
@ -85,7 +133,7 @@ class BiliBiliIE(InfoExtractor):
        title = self._html_search_regex('<h1[^>]+title="([^"]+)">', webpage, 'title')
        description = self._html_search_meta('description', webpage)
        timestamp = unified_timestamp(self._html_search_regex(
-            r'<time[^>]+datetime="([^"]+)"', webpage, 'upload time', fatal=False))
+            r'<time[^>]+datetime="([^"]+)"', webpage, 'upload time', default=None))
        thumbnail = self._html_search_meta(['og:image', 'thumbnailUrl'], webpage)
        # TODO 'view_count' requires deobfuscating Javascript
@ -99,7 +147,7 @@ class BiliBiliIE(InfoExtractor):
        }
        uploader_mobj = re.search(
-            r'<a[^>]+href="https?://space\.bilibili\.com/(?P<id>\d+)"[^>]+title="(?P<name>[^"]+)"',
+            r'<a[^>]+href="(?:https?:)?//space\.bilibili\.com/(?P<id>\d+)"[^>]+title="(?P<name>[^"]+)"',
            webpage)
        if uploader_mobj:
            info.update({
@ -123,3 +171,70 @@ class BiliBiliIE(InfoExtractor):
                'description': description,
                'entries': entries,
            }
 class BiliBiliBangumiIE(InfoExtractor):
    _VALID_URL = r'https?://bangumi\.bilibili\.com/anime/(?P<id>\d+)'
    IE_NAME = 'bangumi.bilibili.com'
    IE_DESC = 'BiliBili番剧'
    _TESTS = [{
        'url': 'http://bangumi.bilibili.com/anime/1869',
        'info_dict': {
            'id': '1869',
            'title': '混沌武士',
            'description': 'md5:6a9622b911565794c11f25f81d6a97d2',
        },
        'playlist_count': 26,
    }, {
        'url': 'http://bangumi.bilibili.com/anime/1869',
        'info_dict': {
            'id': '1869',
            'title': '混沌武士',
            'description': 'md5:6a9622b911565794c11f25f81d6a97d2',
        },
        'playlist': [{
            'md5': '91da8621454dd58316851c27c68b0c13',
            'info_dict': {
                'id': '40062',
                'ext': 'mp4',
                'title': '混沌武士',
                'description': '故事发生在日本的江户时代。风是一个小酒馆的打工女。一日，酒馆里来了一群恶霸，虽然他们的举动令风十分不满，但是毕竟风只是一届女流，无法对他们采取什么行动，只能在心里嘟哝。这时，酒家里又进来了个“不良份子...',
                'timestamp': 1414538739,
                'upload_date': '20141028',
                'episode': '疾风怒涛 Tempestuous Temperaments',
                'episode_number': 1,
            },
        }],
        'params': {
            'playlist_items': '1',
        },
    }]
    @classmethod
    def suitable(cls, url):
        return False if BiliBiliIE.suitable(url) else super(BiliBiliBangumiIE, cls).suitable(url)
    def _real_extract(self, url):
        bangumi_id = self._match_id(url)
        # Sometimes this API returns a JSONP response
        season_info = self._download_json(
            'http://bangumi.bilibili.com/jsonp/seasoninfo/%s.ver' % bangumi_id,
            bangumi_id, transform_source=strip_jsonp)['result']
        entries = [{
            '_type': 'url_transparent',
            'url': smuggle_url(episode['webplay_url'], {'no_bangumi_tip': 1}),
            'ie_key': BiliBiliIE.ie_key(),
            'timestamp': parse_iso8601(episode.get('update_time'), delimiter=' '),
            'episode': episode.get('index_title'),
            'episode_number': int_or_none(episode.get('index')),
        } for episode in season_info['episodes']]
        entries = sorted(entries, key=lambda entry: entry.get('episode_number'))
        return self.playlist_result(
            entries, bangumi_id,
            season_info.get('bangumi_title'), season_info.get('evaluate'))
--- a/youtube_dl/extractor/bloomberg.py
+++ b/youtube_dl/extractor/bloomberg.py
@ -33,6 +33,10 @@ class BloombergIE(InfoExtractor):
        'params': {
            'format': 'best[format_id^=hds]',
        },
    }, {
        # data-bmmrid=
        'url': 'https://www.bloomberg.com/politics/articles/2017-02-08/le-pen-aide-briefed-french-central-banker-on-plan-to-print-money',
        'only_matching': True,
    }, {
        'url': 'http://www.bloomberg.com/news/articles/2015-11-12/five-strange-things-that-have-been-happening-in-financial-markets',
        'only_matching': True,
@ -45,9 +49,10 @@ class BloombergIE(InfoExtractor):
        name = self._match_id(url)
        webpage = self._download_webpage(url, name)
        video_id = self._search_regex(
-            (r'["\']bmmrId["\']\s*:\s*(["\'])(?P<url>(?:(?!\1).)+)\1',
+            (r'["\']bmmrId["\']\s*:\s*(["\'])(?P<id>(?:(?!\1).)+)\1',
-             r'videoId\s*:\s*(["\'])(?P<url>(?:(?!\1).)+)\1'),
+             r'videoId\s*:\s*(["\'])(?P<id>(?:(?!\1).)+)\1',
-            webpage, 'id', group='url', default=None)
+             r'data-bmmrid=(["\'])(?P<id>(?:(?!\1).)+)\1'),
            webpage, 'id', group='id', default=None)
        if not video_id:
            bplayer_data = self._parse_json(self._search_regex(
                r'BPlayer\(null,\s*({[^;]+})\);', webpage, 'id'), name)
--- a/youtube_dl/extractor/canalplus.py
+++ b/youtube_dl/extractor/canalplus.py
@ -27,6 +27,7 @@ class CanalplusIE(InfoExtractor):
                                    (?:www\.)?d8\.tv|
                                    (?:www\.)?c8\.fr|
                                    (?:www\.)?d17\.tv|
                                    (?:(?:football|www)\.)?cstar\.fr|
                                    (?:www\.)?itele\.fr
                                )/(?:(?:[^/]+/)*(?P<display_id>[^/?#&]+))?(?:\?.*\bvid=(?P<vid>\d+))?|
                                player\.canalplus\.fr/#/(?P<id>\d+)
@ -40,6 +41,7 @@ class CanalplusIE(InfoExtractor):
        'd8': 'd8',
        'c8': 'd8',
        'd17': 'd17',
        'cstar': 'd17',
        'itele': 'itele',
    }
@ -86,6 +88,19 @@ class CanalplusIE(InfoExtractor):
            'description': 'Chaque matin du lundi au vendredi, Michaël Darmon reçoit un invité politique à 8h25.',
            'upload_date': '20161014',
        },
    }, {
        'url': 'http://football.cstar.fr/cstar-minisite-foot/pid7566-feminines-videos.html?vid=1416769',
        'info_dict': {
            'id': '1416769',
            'display_id': 'pid7566-feminines-videos',
            'ext': 'mp4',
            'title': 'France - Albanie : les temps forts de la soirée - 20/09/2016',
            'description': 'md5:c3f30f2aaac294c1c969b3294de6904e',
            'upload_date': '20160921',
        },
        'params': {
            'skip_download': True,
        },
    }, {
        'url': 'http://m.canalplus.fr/?vid=1398231',
        'only_matching': True,
--- a/youtube_dl/extractor/cbc.py
+++ b/youtube_dl/extractor/cbc.py
@ -296,6 +296,12 @@ class CBCWatchVideoIE(CBCWatchBaseIE):
        formats = self._extract_m3u8_formats(re.sub(r'/([^/]+)/[^/?]+\.m3u8', r'/\1/\1.m3u8', m3u8_url), video_id, 'mp4', fatal=False)
        if len(formats) < 2:
            formats = self._extract_m3u8_formats(m3u8_url, video_id, 'mp4')
+        for f in formats:
+            format_id = f.get('format_id')
+            if format_id.startswith('AAC'):
+                f['acodec'] = 'aac'
+            elif format_id.startswith('AC3'):
+                f['acodec'] = 'ac-3'
        self._sort_formats(formats)
        info = {
--- a/youtube_dl/extractor/common.py
+++ b/youtube_dl/extractor/common.py
@ -1025,13 +1025,13 @@ class InfoExtractor(object):
                unique_formats.append(f)
        formats[:] = unique_formats
-    def _is_valid_url(self, url, video_id, item='video'):
+    def _is_valid_url(self, url, video_id, item='video', headers={}):
        url = self._proto_relative_url(url, scheme='http:')
        # For now assume non HTTP(S) URLs always valid
        if not (url.startswith('http://') or url.startswith('https://')):
            return True
        try:
-            self._request_webpage(url, video_id, 'Checking %s URL' % item)
+            self._request_webpage(url, video_id, 'Checking %s URL' % item, headers=headers)
            return True
        except ExtractorError as e:
            if isinstance(e.cause, compat_urllib_error.URLError):
@ -1208,6 +1208,9 @@ class InfoExtractor(object):
        m3u8_doc, urlh = res
        m3u8_url = urlh.geturl()
+        if '#EXT-X-FAXS-CM:' in m3u8_doc:  # Adobe Flash Access
+            return []
        formats = [self._m3u8_meta_format(m3u8_url, ext, preference, m3u8_id)]
        format_url = lambda u: (
@ -1315,8 +1318,8 @@ class InfoExtractor(object):
                        'abr': abr,
                    })
                f.update(parse_codecs(last_info.get('CODECS')))
-                if audio_in_video_stream.get(last_info.get('AUDIO')) is False:
+                if audio_in_video_stream.get(last_info.get('AUDIO')) is False and f['vcodec'] != 'none':
-                    # TODO: update acodec for for audio only formats with the same GROUP-ID
+                    # TODO: update acodec for audio only formats with the same GROUP-ID
                    f['acodec'] = 'none'
                formats.append(f)
                last_info = {}
@ -1959,7 +1962,12 @@ class InfoExtractor(object):
        media_tags = [(media_tag, media_type, '')
                      for media_tag, media_type
                      in re.findall(r'(?s)(<(video|audio)[^>]*/>)', webpage)]
-        media_tags.extend(re.findall(r'(?s)(<(?P<tag>video|audio)[^>]*>)(.*?)</(?P=tag)>', webpage))
+        media_tags.extend(re.findall(
+            # We only allow video|audio followed by a whitespace or '>'.
+            # Allowing more characters may end up in significant slow down (see
+            # https://github.com/rg3/youtube-dl/issues/11979, example URL:
+            # http://www.porntrex.com/maps/videositemap.xml).
+            r'(?s)(<(?P<tag>video|audio)(?:\s+[^>]*)?>)(.*?)</(?P=tag)>', webpage))
        for media_tag, media_type, media_content in media_tags:
            media_info = {
                'formats': [],
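Note on the media tag pattern in the common.py hunk: the sketch below runs the new regex outside youtube-dl using only `re`, to show which opening tags it accepts. Both sample strings are invented for illustration and are not taken from any real page or sitemap.

```python
import re

# New pattern from the hunk: the tag name must be followed by whitespace or '>'.
NEW_MEDIA_TAG_RE = r'(?s)(<(?P<tag>video|audio)(?:\s+[^>]*)?>)(.*?)</(?P=tag)>'

# Hypothetical inputs (assumptions for this sketch).
html_sample = '<video src="a.mp4"><source src="b.mp4"></video>'
sitemap_sample = '<video:title>clip</video:title>' * 3

# An ordinary HTML5 tag yields (opening tag, tag name, inner content).
html_tags = re.findall(NEW_MEDIA_TAG_RE, html_sample)

# Namespaced sitemap tags such as <video:title> are no longer treated as
# opening <video> tags, so no search for a closing </video> is started.
sitemap_tags = re.findall(NEW_MEDIA_TAG_RE, sitemap_sample)

print(html_tags)
print(sitemap_tags)
```

With the previous `[^>]*` form, every `<video:...>` element counted as a candidate opening tag and triggered a scan for `</video>` to the end of the document, which is the likely source of the slowdown described in the linked issue.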
--- a/youtube_dl/extractor/commonmistakes.py
+++ b/youtube_dl/extractor/commonmistakes.py
@ -7,7 +7,7 @@ from ..utils import ExtractorError
 class CommonMistakesIE(InfoExtractor):
    IE_DESC = False  # Do not list
    _VALID_URL = r'''(?x)
-        (?:url|URL)
+        (?:url|URL)$
    '''
    _TESTS = [{
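Note on the `$` anchor added to `_VALID_URL` above: a minimal sketch of its effect, assuming the pattern is applied with `re.match` (anchored at the start only), which is how the base extractor class is understood to test URLs. The helper `matches` and the sample input `urlaub.example/clip` are hypothetical.

```python
import re

OLD_VALID_URL = r'''(?x)
    (?:url|URL)
'''
NEW_VALID_URL = r'''(?x)
    (?:url|URL)$
'''


def matches(pattern, url):
    # re.match anchors at the start of the string only, so without '$'
    # any input that merely begins with "url" is accepted.
    return re.match(pattern, url) is not None


print(matches(OLD_VALID_URL, 'url'))
print(matches(OLD_VALID_URL, 'urlaub.example/clip'))
print(matches(NEW_VALID_URL, 'url'))
print(matches(NEW_VALID_URL, 'urlaub.example/clip'))
```

Under that assumption, the new pattern keeps catching the literal placeholder `url` or `URL` while no longer claiming real inputs that happen to start with those letters.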
--- a/youtube_dl/extractor/corus.py
+++ b/youtube_dl/extractor/corus.py
@ -0,0 +1,72 @@
 # coding: utf-8
 from __future__ import unicode_literals
 import re
 from .theplatform import ThePlatformFeedIE
 from ..utils import int_or_none
 class CorusIE(ThePlatformFeedIE):
    _VALID_URL = r'https?://(?:www\.)?(?P<domain>(?:globaltv|etcanada)\.com|(?:hgtv|foodnetwork|slice)\.ca)/(?:video/|(?:[^/]+/)+(?:videos/[a-z0-9-]+-|video\.html\?.*?\bv=))(?P<id>\d+)'
    _TESTS = [{
        'url': 'http://www.hgtv.ca/shows/bryan-inc/videos/movie-night-popcorn-with-bryan-870923331648/',
        'md5': '05dcbca777bf1e58c2acbb57168ad3a6',
        'info_dict': {
            'id': '870923331648',
            'ext': 'mp4',
            'title': 'Movie Night Popcorn with Bryan',
            'description': 'Bryan whips up homemade popcorn, the old fashion way for Jojo and Lincoln.',
            'uploader': 'SHWM-NEW',
            'upload_date': '20170206',
            'timestamp': 1486392197,
        },
    }, {
        'url': 'http://www.foodnetwork.ca/shows/chopped/video/episode/chocolate-obsession/video.html?v=872683587753',
        'only_matching': True,
    }, {
        'url': 'http://etcanada.com/video/873675331955/meet-the-survivor-game-changers-castaways-part-2/',
        'only_matching': True,
    }]
    _TP_FEEDS = {
        'globaltv': {
            'feed_id': 'ChQqrem0lNUp',
            'account_id': 2269680845,
        },
        'etcanada': {
            'feed_id': 'ChQqrem0lNUp',
            'account_id': 2269680845,
        },
        'hgtv': {
            'feed_id': 'L0BMHXi2no43',
            'account_id': 2414428465,
        },
        'foodnetwork': {
            'feed_id': 'ukK8o58zbRmJ',
            'account_id': 2414429569,
        },
        'slice': {
            'feed_id': '5tUJLgV2YNJ5',
            'account_id': 2414427935,
        },
    }
    def _real_extract(self, url):
        domain, video_id = re.match(self._VALID_URL, url).groups()
        feed_info = self._TP_FEEDS[domain.split('.')[0]]
        return self._extract_feed_info('dtjsEC', feed_info['feed_id'], 'byId=' + video_id, video_id, lambda e: {
            'episode_number': int_or_none(e.get('pl1$episode')),
            'season_number': int_or_none(e.get('pl1$season')),
            'series': e.get('pl1$show'),
        }, {
            'HLS': {
                'manifest': 'm3u',
            },
            'DesktopHLS Default': {
                'manifest': 'm3u',
            },
            'MP4 MBR': {
                'manifest': 'm3u',
            },
        }, feed_info['account_id'])
--- a/youtube_dl/extractor/disney.py
+++ b/youtube_dl/extractor/disney.py
@ -9,13 +9,15 @@ from ..utils import (
    unified_strdate,
    compat_str,
    determine_ext,
    ExtractorError,
 )
 class DisneyIE(InfoExtractor):
    _VALID_URL = r'''(?x)
-        https?://(?P<domain>(?:[^/]+\.)?(?:disney\.[a-z]{2,3}(?:\.[a-z]{2})?|disney(?:(?:me|latino)\.com|turkiye\.com\.tr)|starwars\.com))/(?:embed/|(?:[^/]+/)+[\w-]+-)(?P<id>[a-z0-9]{24})'''
+        https?://(?P<domain>(?:[^/]+\.)?(?:disney\.[a-z]{2,3}(?:\.[a-z]{2})?|disney(?:(?:me|latino)\.com|turkiye\.com\.tr)|(?:starwars|marvelkids)\.com))/(?:(?:embed/|(?:[^/]+/)+[\w-]+-)(?P<id>[a-z0-9]{24})|(?:[^/]+/)?(?P<display_id>[^/?#]+))'''
    _TESTS = [{
        # Disney.EmbedVideo
        'url': 'http://video.disney.com/watch/moana-trailer-545ed1857afee5a0ec239977',
        'info_dict': {
            'id': '545ed1857afee5a0ec239977',
@ -28,6 +30,20 @@ class DisneyIE(InfoExtractor):
            # m3u8 download
            'skip_download': True,
        }
    }, {
        # Grill.burger
        'url': 'http://www.starwars.com/video/rogue-one-a-star-wars-story-intro-featurette',
        'info_dict': {
            'id': '5454e9f4e9804a552e3524c8',
            'ext': 'mp4',
            'title': '"Intro" Featurette: Rogue One: A Star Wars Story',
            'upload_date': '20170104',
            'description': 'Go behind-the-scenes of Rogue One: A Star Wars Story in this featurette with Director Gareth Edwards and the cast of the film.',
        },
        'params': {
            # m3u8 download
            'skip_download': True,
        }
    }, {
        'url': 'http://videos.disneylatino.com/ver/spider-man-de-regreso-a-casa-primer-adelanto-543a33a1850bdcfcca13bae2',
        'only_matching': True,
@ -43,31 +59,55 @@ class DisneyIE(InfoExtractor):
    }, {
        'url': 'http://www.starwars.com/embed/54690d1e6c42e5f09a0fb097',
        'only_matching': True,
    }, {
        'url': 'http://spiderman.marvelkids.com/embed/522900d2ced3c565e4cc0677',
        'only_matching': True,
    }, {
        'url': 'http://spiderman.marvelkids.com/videos/contest-of-champions-part-four-clip-1',
        'only_matching': True,
    }, {
        'url': 'http://disneyjunior.en.disneyme.com/dj/watch-my-friends-tigger-and-pooh-promo',
        'only_matching': True,
    }, {
        'url': 'http://disneyjunior.disney.com/galactech-the-galactech-grab-galactech-an-admiral-rescue',
        'only_matching': True,
    }]
    def _real_extract(self, url):
-        domain, video_id = re.match(self._VALID_URL, url).groups()
+        domain, video_id, display_id = re.match(self._VALID_URL, url).groups()
        if not video_id:
            webpage = self._download_webpage(url, display_id)
            grill = re.sub(r'"\s*\+\s*"', '', self._search_regex(
                r'Grill\.burger\s*=\s*({.+})\s*:',
                webpage, 'grill data'))
            page_data = next(s for s in self._parse_json(grill, display_id)['stack'] if s.get('type') == 'video')
            video_data = page_data['data'][0]
        else:
            webpage = self._download_webpage(
                'http://%s/embed/%s' % (domain, video_id), video_id)
-        video_data = self._parse_json(self._search_regex(
+            page_data = self._parse_json(self._search_regex(
-            r'Disney\.EmbedVideo=({.+});', webpage, 'embed data'), video_id)['video']
+                r'Disney\.EmbedVideo\s*=\s*({.+});',
                webpage, 'embed data'), video_id)
            video_data = page_data['video']
        for external in video_data.get('externals', []):
            if external.get('source') == 'vevo':
                return self.url_result('vevo:' + external['data_id'], 'Vevo')
        video_id = video_data['id']
        title = video_data['title']
        formats = []
        for flavor in video_data.get('flavors', []):
            flavor_format = flavor.get('format')
            flavor_url = flavor.get('url')
-            if not flavor_url or not re.match(r'https?://', flavor_url):
+            if not flavor_url or not re.match(r'https?://', flavor_url) or flavor_format == 'mp4_access':
                continue
            tbr = int_or_none(flavor.get('bitrate'))
            if tbr == 99999:
                formats.extend(self._extract_m3u8_formats(
-                    flavor_url, video_id, 'mp4', m3u8_id=flavor_format, fatal=False))
+                    flavor_url, video_id, 'mp4',
                    m3u8_id=flavor_format, fatal=False))
                continue
            format_id = []
            if flavor_format:
@ -88,6 +128,10 @@ class DisneyIE(InfoExtractor):
                'ext': ext,
                'vcodec': 'none' if (width == 0 and height == 0) else None,
            })
        if not formats and video_data.get('expired'):
            raise ExtractorError(
                '%s said: %s' % (self.IE_NAME, page_data['translations']['video_expired']),
                expected=True)
        self._sort_formats(formats)
        subtitles = {}
--- a/youtube_dl/extractor/douyutv.py
+++ b/youtube_dl/extractor/douyutv.py
@ -18,7 +18,7 @@ from ..utils import (
 class DouyuTVIE(InfoExtractor):
    IE_DESC = '斗鱼'
-    _VALID_URL = r'https?://(?:www\.)?douyu(?:tv)?\.com/(?P<id>[A-Za-z0-9]+)'
+    _VALID_URL = r'https?://(?:www\.)?douyu(?:tv)?\.com/(?:[^/]+/)*(?P<id>[A-Za-z0-9]+)'
    _TESTS = [{
        'url': 'http://www.douyutv.com/iseven',
        'info_dict': {
@ -68,6 +68,10 @@ class DouyuTVIE(InfoExtractor):
    }, {
        'url': 'http://www.douyu.com/xiaocang',
        'only_matching': True,
    }, {
        # \"room_id\"
        'url': 'http://www.douyu.com/t/lpl',
        'only_matching': True,
    }]
    # Decompile core.swf in webpage by ffdec "Search SWFs in memory". core.swf
@ -82,7 +86,7 @@ class DouyuTVIE(InfoExtractor):
        else:
            page = self._download_webpage(url, video_id)
            room_id = self._html_search_regex(
-                r'"room_id"\s*:\s*(\d+),', page, 'room id')
+                r'"room_id\\?"\s*:\s*(\d+),', page, 'room id')
        room = self._download_json(
            'http://m.douyu.com/html5/live?roomId=%s' % room_id, video_id,
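Note on the `room_id` pattern in the douyutv.py hunk: the added `\\?` lets the regex accept a key whose closing quote is backslash-escaped, as happens when JSON is embedded inside a string. The sketch below uses plain `re.search` rather than `_html_search_regex`, and both page snippets and the helper `find_room_id` are made up for illustration.

```python
import re

OLD_ROOM_ID_RE = r'"room_id"\s*:\s*(\d+),'
NEW_ROOM_ID_RE = r'"room_id\\?"\s*:\s*(\d+),'

# Hypothetical page fragments (assumptions for this sketch).
plain_page = 'var room = {"room_id":12345,"name":"x"};'
escaped_page = r'data = "{\"room_id\":67890,\"name\":\"x\"}";'


def find_room_id(pattern, page):
    # Return the captured digits, or None when the pattern does not match.
    m = re.search(pattern, page)
    return m.group(1) if m else None


print(find_room_id(NEW_ROOM_ID_RE, plain_page))
print(find_room_id(OLD_ROOM_ID_RE, escaped_page))
print(find_room_id(NEW_ROOM_ID_RE, escaped_page))
```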
--- a/youtube_dl/extractor/drtv.py
+++ b/youtube_dl/extractor/drtv.py
@ -9,12 +9,13 @@ from ..utils import (
    mimetype2ext,
    parse_iso8601,
    remove_end,
    update_url_query,
 )
 class DRTVIE(InfoExtractor):
-    _VALID_URL = r'https?://(?:www\.)?dr\.dk/(?:tv/se|nyheder)/(?:[^/]+/)*(?P<id>[\da-z-]+)(?:[/#?]|$)'
+    _VALID_URL = r'https?://(?:www\.)?dr\.dk/(?:tv/se|nyheder|radio/ondemand)/(?:[^/]+/)*(?P<id>[\da-z-]+)(?:[/#?]|$)'
-
+    IE_NAME = 'drtv'
    _TESTS = [{
        'url': 'https://www.dr.dk/tv/se/boern/ultra/klassen-ultra/klassen-darlig-taber-10',
        'md5': '25e659cccc9a2ed956110a299fdf5983',
@ -79,9 +80,10 @@ class DRTVIE(InfoExtractor):
        subtitles = {}
        for asset in data['Assets']:
-            if asset.get('Kind') == 'Image':
+            kind = asset.get('Kind')
            if kind == 'Image':
                thumbnail = asset.get('Uri')
-            elif asset.get('Kind') == 'VideoResource':
+            elif kind in ('VideoResource', 'AudioResource'):
                duration = float_or_none(asset.get('DurationInMilliseconds'), 1000)
                restricted_to_denmark = asset.get('RestrictedToDenmark')
                spoken_subtitles = asset.get('Target') == 'SpokenSubtitles'
@ -96,9 +98,13 @@ class DRTVIE(InfoExtractor):
                        preference = -1
                        format_id += '-spoken-subtitles'
                    if target == 'HDS':
-                        formats.extend(self._extract_f4m_formats(
+                        f4m_formats = self._extract_f4m_formats(
                            uri + '?hdcore=3.3.0&plugin=aasp-3.3.0.99.43',
-                            video_id, preference, f4m_id=format_id))
+                            video_id, preference, f4m_id=format_id)
                        if kind == 'AudioResource':
                            for f in f4m_formats:
                                f['vcodec'] = 'none'
                        formats.extend(f4m_formats)
                    elif target == 'HLS':
                        formats.extend(self._extract_m3u8_formats(
                            uri, video_id, 'mp4', entry_protocol='m3u8_native',
@ -112,6 +118,7 @@ class DRTVIE(InfoExtractor):
                            'format_id': format_id,
                            'tbr': int_or_none(bitrate),
                            'ext': link.get('FileFormat'),
                            'vcodec': 'none' if kind == 'AudioResource' else None,
                        })
                subtitles_list = asset.get('SubtitlesList')
                if isinstance(subtitles_list, list):
@ -144,3 +151,58 @@ class DRTVIE(InfoExtractor):
            'formats': formats,
            'subtitles': subtitles,
        }
 class DRTVLiveIE(InfoExtractor):
    IE_NAME = 'drtv:live'
    _VALID_URL = r'https?://(?:www\.)?dr\.dk/(?:tv|TV)/live/(?P<id>[\da-z-]+)'
    _TEST = {
        'url': 'https://www.dr.dk/tv/live/dr1',
        'info_dict': {
            'id': 'dr1',
            'ext': 'mp4',
            'title': 're:^DR1 [0-9]{4}-[0-9]{2}-[0-9]{2} [0-9]{2}:[0-9]{2}$',
        },
        'params': {
            # m3u8 download
            'skip_download': True,
        },
    }
    def _real_extract(self, url):
        channel_id = self._match_id(url)
        channel_data = self._download_json(
            'https://www.dr.dk/mu-online/api/1.0/channel/' + channel_id,
            channel_id)
        title = self._live_title(channel_data['Title'])
        formats = []
        for streaming_server in channel_data.get('StreamingServers', []):
            server = streaming_server.get('Server')
            if not server:
                continue
            link_type = streaming_server.get('LinkType')
            for quality in streaming_server.get('Qualities', []):
                for stream in quality.get('Streams', []):
                    stream_path = stream.get('Stream')
                    if not stream_path:
                        continue
                    stream_url = update_url_query(
                        '%s/%s' % (server, stream_path), {'b': ''})
                    if link_type == 'HLS':
                        formats.extend(self._extract_m3u8_formats(
                            stream_url, channel_id, 'mp4',
                            m3u8_id=link_type, fatal=False, live=True))
                    elif link_type == 'HDS':
                        formats.extend(self._extract_f4m_formats(update_url_query(
                            '%s/%s' % (server, stream_path), {'hdcore': '3.7.0'}),
                            channel_id, f4m_id=link_type, fatal=False))
        self._sort_formats(formats)
        return {
            'id': channel_id,
            'title': title,
            'thumbnail': channel_data.get('PrimaryImageUri'),
            'formats': formats,
            'is_live': True,
        }
--- a/youtube_dl/extractor/einthusan.py
+++ b/youtube_dl/extractor/einthusan.py
@ -1,67 +1,94 @@
 # coding: utf-8
 from __future__ import unicode_literals
 import base64
 import json
 from .common import InfoExtractor
-from ..compat import compat_urlparse
+from ..compat import (
    compat_urlparse,
    compat_str,
 )
 from ..utils import (
-    remove_start,
+    extract_attributes,
-    sanitized_Request,
+    ExtractorError,
    get_elements_by_class,
    urlencode_postdata,
 )
 class EinthusanIE(InfoExtractor):
-    _VALID_URL = r'https?://(?:www\.)?einthusan\.com/movies/watch.php\?([^#]*?)id=(?P<id>[0-9]+)'
+    _VALID_URL = r'https?://einthusan\.tv/movie/watch/(?P<id>[0-9]+)'
-    _TESTS = [
+    _TEST = {
-        {
+        'url': 'https://einthusan.tv/movie/watch/9097/',
-            'url': 'http://www.einthusan.com/movies/watch.php?id=2447',
+        'md5': 'ff0f7f2065031b8a2cf13a933731c035',
            'md5': 'd71379996ff5b7f217eca034c34e3461',
        'info_dict': {
-                'id': '2447',
+            'id': '9097',
            'ext': 'mp4',
-                'title': 'Ek Villain',
+            'title': 'Ae Dil Hai Mushkil',
            'description': 'md5:33ef934c82a671a94652a9b4e54d931b',
            'thumbnail': r're:^https?://.*\.jpg$',
                'description': 'md5:9d29fc91a7abadd4591fb862fa560d93',
        }
        },
        {
            'url': 'http://www.einthusan.com/movies/watch.php?id=1671',
            'md5': 'b16a6fd3c67c06eb7c79c8a8615f4213',
            'info_dict': {
                'id': '1671',
                'ext': 'mp4',
                'title': 'Soodhu Kavvuum',
                'thumbnail': r're:^https?://.*\.jpg$',
                'description': 'md5:b40f2bf7320b4f9414f3780817b2af8c',
    }
-        },
+
-    ]
+    # reversed from jsoncrypto.prototype.decrypt() in einthusan-PGMovieWatcher.js
    def _decrypt(self, encrypted_data, video_id):
        return self._parse_json(base64.b64decode((
            encrypted_data[:10] + encrypted_data[-1] + encrypted_data[12:-1]
        ).encode('ascii')).decode('utf-8'), video_id)
    def _real_extract(self, url):
        video_id = self._match_id(url)
-        request = sanitized_Request(url)
+        webpage = self._download_webpage(url, video_id)
        request.add_header('User-Agent', 'Mozilla/5.0 (Windows NT 5.2; WOW64; rv:43.0) Gecko/20100101 Firefox/43.0')
        webpage = self._download_webpage(request, video_id)
-        title = self._html_search_regex(
+        title = self._html_search_regex(r'<h3>([^<]+)</h3>', webpage, 'title')
            r'<h1><a[^>]+class=["\']movie-title["\'][^>]*>(.+?)</a></h1>',
            webpage, 'title')
-        video_id = self._search_regex(
+        player_params = extract_attributes(self._search_regex(
-            r'data-movieid=["\'](\d+)', webpage, 'video id', default=video_id)
+            r'(<section[^>]+id="UIVideoPlayer"[^>]+>)', webpage, 'player parameters'))
-        m3u8_url = self._download_webpage(
+        page_id = self._html_search_regex(
-            'http://cdn.einthusan.com/geturl/%s/hd/London,Washington,Toronto,Dallas,San,Sydney/'
+            '<html[^>]+data-pageid="([^"]+)"', webpage, 'page ID')
-            % video_id, video_id, headers={'Referer': url})
+        video_data = self._download_json(
-        formats = self._extract_m3u8_formats(
+            'https://einthusan.tv/ajax/movie/watch/%s/' % video_id, video_id,
-            m3u8_url, video_id, ext='mp4', entry_protocol='m3u8_native')
+            data=urlencode_postdata({
                'xEvent': 'UIVideoPlayer.PingOutcome',
                'xJson': json.dumps({
                    'EJOutcomes': player_params['data-ejpingables'],
                    'NativeHLS': False
                }),
                'arcVersion': 3,
                'appVersion': 59,
                'gorilla.csrf.Token': page_id,
            }))['Data']
-        description = self._html_search_meta('description', webpage)
+        if isinstance(video_data, compat_str) and video_data.startswith('/ratelimited/'):
            raise ExtractorError(
                'Download rate reached. Please try again later.', expected=True)
        ej_links = self._decrypt(video_data['EJLinks'], video_id)
        formats = []
        m3u8_url = ej_links.get('HLSLink')
        if m3u8_url:
            formats.extend(self._extract_m3u8_formats(
                m3u8_url, video_id, ext='mp4', entry_protocol='m3u8_native'))
        mp4_url = ej_links.get('MP4Link')
        if mp4_url:
            formats.append({
                'url': mp4_url,
            })
        self._sort_formats(formats)
        description = get_elements_by_class('synopsis', webpage)[0]
        thumbnail = self._html_search_regex(
-            r'''<a class="movie-cover-wrapper".*?><img src=["'](.*?)["'].*?/></a>''',
+            r'''<img[^>]+src=(["'])(?P<url>(?!\1).+?/moviecovers/(?!\1).+?)\1''',
-            webpage, "thumbnail url", fatal=False)
+            webpage, 'thumbnail url', fatal=False, group='url')
        if thumbnail is not None:
-            thumbnail = compat_urlparse.urljoin(url, remove_start(thumbnail, '..'))
+            thumbnail = compat_urlparse.urljoin(url, thumbnail)
        return {
            'id': video_id,
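Note on `_decrypt` in the einthusan.py hunk: the slicing can be tried in isolation. The sketch below mirrors the expression from the diff but uses `json.loads` instead of `self._parse_json`. The function `encrypt_for_demo` is a hypothetical inverse written only to produce test input; it is not part of the site or of youtube-dl, and the link values are placeholders.

```python
import base64
import json


def decrypt(encrypted_data):
    # Same slicing as _decrypt in the hunk: keep the first 10 characters,
    # take the last character as character 10, skip positions 10 and 11.
    b64 = encrypted_data[:10] + encrypted_data[-1] + encrypted_data[12:-1]
    return json.loads(base64.b64decode(b64.encode('ascii')).decode('utf-8'))


def encrypt_for_demo(obj):
    # Hypothetical inverse (assumption): move character 10 to the end and
    # put two filler characters in its place.
    b64 = base64.b64encode(json.dumps(obj).encode('utf-8')).decode('ascii')
    return b64[:10] + 'zz' + b64[11:] + b64[10]


links = {'HLSLink': 'https://example.invalid/master.m3u8', 'MP4Link': None}
roundtrip = decrypt(encrypt_for_demo(links))
print(roundtrip == links)
```

The two filler characters at positions 10 and 11 are ignored by `decrypt`, so any values work there in this demo.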
--- a/youtube_dl/extractor/elpais.py
+++ b/youtube_dl/extractor/elpais.py
@ -2,7 +2,7 @@
 from __future__ import unicode_literals
 from .common import InfoExtractor
-from ..utils import unified_strdate
+from ..utils import strip_jsonp, unified_strdate
 class ElPaisIE(InfoExtractor):
@ -29,6 +29,16 @@ class ElPaisIE(InfoExtractor):
            'description': 'Que sí, que las cápsulas son cómodas. Pero si le pides algo más a la vida, quizá deberías aprender a usar bien la cafetera italiana. No tienes más que ver este vídeo y seguir sus siete normas básicas.',
            'upload_date': '20160303',
        }
    }, {
        'url': 'http://elpais.com/elpais/2017/01/26/ciencia/1485456786_417876.html',
        'md5': '9c79923a118a067e1a45789e1e0b0f9c',
        'info_dict': {
            'id': '1485456786_417876',
            'ext': 'mp4',
            'title': 'Hallado un barco de la antigua Roma que naufragó en Baleares hace 1.800 años',
            'description': 'La nave portaba cientos de ánforas y se hundió cerca de la isla de Cabrera por razones desconocidas',
            'upload_date': '20170127',
        },
    }]
    def _real_extract(self, url):
@ -37,6 +47,13 @@ class ElPaisIE(InfoExtractor):
        prefix = self._html_search_regex(
            r'var\s+url_cache\s*=\s*"([^"]+)";', webpage, 'URL prefix')
        id_multimedia = self._search_regex(
            r"id_multimedia\s*=\s*'([^']+)'", webpage, 'ID multimedia', default=None)
        if id_multimedia:
            url_info = self._download_json(
                'http://elpais.com/vdpep/1/?pepid=' + id_multimedia, video_id, transform_source=strip_jsonp)
            video_suffix = url_info['mp4']
        else:
            video_suffix = self._search_regex(
                r"(?:URLMediaFile|urlVideo_\d+)\s*=\s*url_cache\s*\+\s*'([^']+)'", webpage, 'video URL')
        video_url = prefix + video_suffix
--- a/youtube_dl/extractor/extractors.py
+++ b/youtube_dl/extractor/extractors.py
@ -103,7 +103,10 @@ from .beatport import BeatportIE
 from .bet import BetIE
 from .bigflix import BigflixIE
 from .bild import BildIE
-from .bilibili import BiliBiliIE
+from .bilibili import (
    BiliBiliIE,
    BiliBiliBangumiIE,
 )
 from .biobiochiletv import BioBioChileTVIE
 from .biqle import BIQLEIE
 from .bleacherreport import (
@ -199,6 +202,7 @@ from .commonprotocols import (
    RtmpIE,
 )
 from .condenast import CondeNastIE
 from .corus import CorusIE
 from .cracked import CrackedIE
 from .crackle import CrackleIE
 from .criterion import CriterionIE
@ -245,7 +249,10 @@ from .dramafever import (
 from .dreisat import DreiSatIE
 from .drbonanza import DRBonanzaIE
 from .drtuber import DrTuberIE
-from .drtv import DRTVIE
+from .drtv import (
    DRTVIE,
    DRTVLiveIE,
 )
 from .dvtv import DVTVIE
 from .dumpert import DumpertIE
 from .defense import DefenseGouvFrIE
@ -296,6 +303,10 @@ from .fc2 import (
    FC2EmbedIE,
 )
 from .fczenit import FczenitIE
 from .filmon import (
    FilmOnIE,
    FilmOnChannelIE,
 )
 from .firstpost import FirstpostIE
 from .firsttv import FirstTVIE
 from .fivemin import FiveMinIE
@ -339,6 +350,7 @@ from .gameone import (
 from .gamersyde import GamersydeIE
 from .gamespot import GameSpotIE
 from .gamestar import GameStarIE
 from .gaskrank import GaskrankIE
 from .gazeta import GazetaIE
 from .gdcvault import GDCVaultIE
 from .generic import GenericIE
@ -370,10 +382,7 @@ from .heise import HeiseIE
 from .hellporno import HellPornoIE
 from .helsinki import HelsinkiIE
 from .hentaistigma import HentaiStigmaIE
-from .hgtv import (
+from .hgtv import HGTVComShowIE
    HGTVIE,
    HGTVComShowIE,
 )
 from .historicfilms import HistoricFilmsIE
 from .hitbox import HitboxIE, HitboxLiveIE
 from .hitrecord import HitRecordIE
@ -827,6 +836,7 @@ from .sbs import SBSIE
 from .scivee import SciVeeIE
 from .screencast import ScreencastIE
 from .screencastomatic import ScreencastOMaticIE
 from .scrippsnetworks import ScrippsNetworksWatchIE
 from .seeker import SeekerIE
 from .senateisvp import SenateISVPIE
 from .sendtonews import SendtoNewsIE
@ -881,12 +891,10 @@ from .spiegeltv import SpiegeltvIE
 from .spike import SpikeIE
 from .stitcher import StitcherIE
 from .sport5 import Sport5IE
-from .sportbox import (
+from .sportbox import SportBoxEmbedIE
    SportBoxIE,
    SportBoxEmbedIE,
 )
 from .sportdeutschland import SportDeutschlandIE
 from .sportschau import SportschauIE
 from .sprout import SproutIE
 from .srgssr import (
    SRGSSRIE,
    SRGSSRPlayIE,
@ -1009,6 +1017,7 @@ from .tvplay import (
    TVPlayIE,
    ViafreeIE,
 )
 from .tvplayer import TVPlayerIE
 from .tweakers import TweakersIE
 from .twentyfourvideo import TwentyFourVideoIE
 from .twentymin import TwentyMinutenIE
@ -1088,6 +1097,7 @@ from .videomore import (
    VideomoreSeasonIE,
 )
 from .videopremium import VideoPremiumIE
 from .videopress import VideoPressIE
 from .vidio import VidioIE
 from .vidme import (
    VidmeIE,
--- a/youtube_dl/extractor/facebook.py
+++ b/youtube_dl/extractor/facebook.py
@ -1,3 +1,4 @@
 # coding: utf-8
 from __future__ import unicode_literals
 import re
@ -73,7 +74,7 @@ class FacebookIE(InfoExtractor):
        'info_dict': {
            'id': '274175099429670',
            'ext': 'mp4',
-            'title': 'Facebook video #274175099429670',
+            'title': 'Asif Nawab Butt posted a video to his Timeline.',
            'uploader': 'Asif Nawab Butt',
            'upload_date': '20140506',
            'timestamp': 1399398998,
@ -134,6 +135,46 @@ class FacebookIE(InfoExtractor):
            'upload_date': '20161030',
            'uploader': 'CNN',
        },
    }, {
        # bigPipe.onPageletArrive ... onPageletArrive pagelet_group_mall
        'url': 'https://www.facebook.com/yaroslav.korpan/videos/1417995061575415/',
        'info_dict': {
            'id': '1417995061575415',
            'ext': 'mp4',
            'title': 'md5:a7b86ca673f51800cd54687b7f4012fe',
            'timestamp': 1486648217,
            'upload_date': '20170209',
            'uploader': 'Yaroslav Korpan',
        },
        'params': {
            'skip_download': True,
        },
    }, {
        'url': 'https://www.facebook.com/LaGuiaDelVaron/posts/1072691702860471',
        'info_dict': {
            'id': '1072691702860471',
            'ext': 'mp4',
            'title': 'md5:ae2d22a93fbb12dad20dc393a869739d',
            'timestamp': 1477305000,
            'upload_date': '20161024',
            'uploader': 'La Guía Del Varón',
        },
        'params': {
            'skip_download': True,
        },
    }, {
        'url': 'https://www.facebook.com/groups/1024490957622648/permalink/1396382447100162/',
        'info_dict': {
            'id': '1396382447100162',
            'ext': 'mp4',
            'title': 'md5:e2d2700afdf84e121f5d0f999bad13a3',
            'timestamp': 1486035494,
            'upload_date': '20170202',
            'uploader': 'Elisabeth Ahtn',
        },
        'params': {
            'skip_download': True,
        },
    }, {
        'url': 'https://www.facebook.com/video.php?v=10204634152394104',
        'only_matching': True,
@ -249,7 +290,7 @@ class FacebookIE(InfoExtractor):
            for item in instances:
                if item[1][0] == 'VideoConfig':
                    video_item = item[2][0]
-                    if video_item.get('video_id') == video_id:
+                    if video_item.get('video_id'):
                        return video_item['videoData']
        server_js_data = self._parse_json(self._search_regex(
@ -262,7 +303,7 @@ class FacebookIE(InfoExtractor):
        if not video_data:
            server_js_data = self._parse_json(
                self._search_regex(
-                    r'bigPipe\.onPageletArrive\(({.+?})\)\s*;\s*}\s*\)\s*,\s*["\']onPageletArrive\s+stream_pagelet',
+                    r'bigPipe\.onPageletArrive\(({.+?})\)\s*;\s*}\s*\)\s*,\s*["\']onPageletArrive\s+(?:stream_pagelet|pagelet_group_mall)',
                    webpage, 'js data', default='{}'),
                video_id, transform_source=js_to_json, fatal=False)
            if server_js_data:
@ -318,10 +359,16 @@ class FacebookIE(InfoExtractor):
            video_title = self._html_search_regex(
                r'(?s)<span class="fbPhotosPhotoCaption".*?id="fbPhotoPageCaption"><span class="hasCaption">(.*?)</span>',
                webpage, 'alternative title', default=None)
            video_title = limit_length(video_title, 80)
        if not video_title:
            video_title = self._html_search_meta(
                'description', webpage, 'title')
        if video_title:
            video_title = limit_length(video_title, 80)
        else:
            video_title = 'Facebook video #%s' % video_id
-        uploader = clean_html(get_element_by_id('fbPhotoPageAuthorName', webpage))
+        uploader = clean_html(get_element_by_id(
            'fbPhotoPageAuthorName', webpage)) or self._search_regex(
            r'ownerName\s*:\s*"([^"]+)"', webpage, 'uploader', fatal=False)
        timestamp = int_or_none(self._search_regex(
            r'<abbr[^>]+data-utime=["\'](\d+)', webpage,
            'timestamp', default=None))
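The widened `onPageletArrive` pattern in the hunk above can be checked in isolation. This is a minimal sketch, assuming made-up page fragments: the `SAMPLE_*` strings and the `extract_js_data` helper are hypothetical and far simpler than real Facebook markup.

```python
import re

# Pattern copied from the hunk above (split across two literals for width).
PAGELET_RE = (
    r'bigPipe\.onPageletArrive\(({.+?})\)\s*;\s*}\s*\)\s*,\s*'
    r'["\']onPageletArrive\s+(?:stream_pagelet|pagelet_group_mall)')

# Hypothetical fragments, one per pagelet name plus one that should not match.
SAMPLE_STREAM = 'bigPipe.onPageletArrive({"id": 1});}), "onPageletArrive stream_pagelet"'
SAMPLE_GROUP = 'bigPipe.onPageletArrive({"id": 2});}), "onPageletArrive pagelet_group_mall"'
SAMPLE_OTHER = 'bigPipe.onPageletArrive({"id": 3});}), "onPageletArrive some_other_pagelet"'


def extract_js_data(page):
    """Return the captured JS object text, or None if the pattern does not match."""
    mobj = re.search(PAGELET_RE, page)
    return mobj.group(1) if mobj else None
```

With the alternation in place, both the timeline and the group pagelet fragments yield a payload, while any other pagelet name still falls through to the `default='{}'` branch of the extractor.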
--- a/youtube_dl/extractor/filmon.py
+++ b/youtube_dl/extractor/filmon.py
@ -0,0 +1,178 @@
 # coding: utf-8
 from __future__ import unicode_literals
 from .common import InfoExtractor
 from ..compat import (
    compat_str,
    compat_HTTPError,
 )
 from ..utils import (
    qualities,
    strip_or_none,
    int_or_none,
    ExtractorError,
 )
 class FilmOnIE(InfoExtractor):
    IE_NAME = 'filmon'
    _VALID_URL = r'(?:https?://(?:www\.)?filmon\.com/vod/view/|filmon:)(?P<id>\d+)'
    _TESTS = [{
        'url': 'https://www.filmon.com/vod/view/24869-0-plan-9-from-outer-space',
        'info_dict': {
            'id': '24869',
            'ext': 'mp4',
            'title': 'Plan 9 From Outer Space',
            'description': 'Dead human, zombies and vampires',
        },
    }, {
        'url': 'https://www.filmon.com/vod/view/2825-1-popeye-series-1',
        'info_dict': {
            'id': '2825',
            'title': 'Popeye Series 1',
            'description': 'The original series of Popeye.',
        },
        'playlist_mincount': 8,
    }]
    def _real_extract(self, url):
        video_id = self._match_id(url)
        try:
            response = self._download_json(
                'https://www.filmon.com/api/vod/movie?id=%s' % video_id,
                video_id)['response']
        except ExtractorError as e:
            if isinstance(e.cause, compat_HTTPError):
                errmsg = self._parse_json(e.cause.read().decode(), video_id)['reason']
                raise ExtractorError('%s said: %s' % (self.IE_NAME, errmsg), expected=True)
            raise
        title = response['title']
        description = strip_or_none(response.get('description'))
        if response.get('type_id') == 1:
            entries = [self.url_result('filmon:' + episode_id) for episode_id in response.get('episodes', [])]
            return self.playlist_result(entries, video_id, title, description)
        QUALITY = qualities(('low', 'high'))
        formats = []
        for format_id, stream in response.get('streams', {}).items():
            stream_url = stream.get('url')
            if not stream_url:
                continue
            formats.append({
                'format_id': format_id,
                'url': stream_url,
                'ext': 'mp4',
                'quality': QUALITY(stream.get('quality')),
                'protocol': 'm3u8_native',
            })
        self._sort_formats(formats)
        thumbnails = []
        poster = response.get('poster', {})
        thumbs = poster.get('thumbs', {})
        thumbs['poster'] = poster
        for thumb_id, thumb in thumbs.items():
            thumb_url = thumb.get('url')
            if not thumb_url:
                continue
            thumbnails.append({
                'id': thumb_id,
                'url': thumb_url,
                'width': int_or_none(thumb.get('width')),
                'height': int_or_none(thumb.get('height')),
            })
        return {
            'id': video_id,
            'title': title,
            'formats': formats,
            'description': description,
            'thumbnails': thumbnails,
        }
 class FilmOnChannelIE(InfoExtractor):
    IE_NAME = 'filmon:channel'
    _VALID_URL = r'https?://(?:www\.)?filmon\.com/(?:tv|channel)/(?P<id>[a-z0-9-]+)'
    _TESTS = [{
        # VOD
        'url': 'http://www.filmon.com/tv/sports-haters',
        'info_dict': {
            'id': '4190',
            'ext': 'mp4',
            'title': 'Sports Haters',
            'description': 'md5:dabcb4c1d9cfc77085612f1a85f8275d',
        },
    }, {
        # LIVE
        'url': 'https://www.filmon.com/channel/filmon-sports',
        'only_matching': True,
    }, {
        'url': 'https://www.filmon.com/tv/2894',
        'only_matching': True,
    }]
    _THUMBNAIL_RES = [
        ('logo', 56, 28),
        ('big_logo', 106, 106),
        ('extra_big_logo', 300, 300),
    ]
    def _real_extract(self, url):
        channel_id = self._match_id(url)
        try:
            channel_data = self._download_json(
                'http://www.filmon.com/api-v2/channel/' + channel_id, channel_id)['data']
        except ExtractorError as e:
            if isinstance(e.cause, compat_HTTPError):
                errmsg = self._parse_json(e.cause.read().decode(), channel_id)['message']
                raise ExtractorError('%s said: %s' % (self.IE_NAME, errmsg), expected=True)
            raise
        channel_id = compat_str(channel_data['id'])
        is_live = not channel_data.get('is_vod') and not channel_data.get('is_vox')
        title = channel_data['title']
        QUALITY = qualities(('low', 'high'))
        formats = []
        for stream in channel_data.get('streams', []):
            stream_url = stream.get('url')
            if not stream_url:
                continue
            if not is_live:
                formats.extend(self._extract_wowza_formats(
                    stream_url, channel_id, skip_protocols=['dash', 'rtmp', 'rtsp']))
                continue
            quality = stream.get('quality')
            formats.append({
                'format_id': quality,
                # this is an m3u8 stream, but we are deliberately not using _extract_m3u8_formats
                # because it doesn't have bitrate variants anyway
                'url': stream_url,
                'ext': 'mp4',
                'quality': QUALITY(quality),
            })
        self._sort_formats(formats)
        thumbnails = []
        for name, width, height in self._THUMBNAIL_RES:
            thumbnails.append({
                'id': name,
                'url': 'http://static.filmon.com/assets/channels/%s/%s.png' % (channel_id, name),
                'width': width,
                'height': height,
            })
        return {
            'id': channel_id,
            'display_id': channel_data.get('alias'),
            'title': self._live_title(title) if is_live else title,
            'description': channel_data.get('description'),
            'thumbnails': thumbnails,
            'formats': formats,
            'is_live': is_live,
        }
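Both FilmOn extractors rank streams with `qualities(('low', 'high'))`. The sketch below is a stand-in written for illustration, not the library source. It assumes the helper maps a label to its position in the tuple and maps anything unknown to -1, which is how it is used in the file above.

```python
def qualities(quality_ids):
    """Build a ranking function: later entries in quality_ids rank higher."""
    def q(qid):
        try:
            return quality_ids.index(qid)
        except ValueError:
            # Unknown or missing quality labels sort below all known ones.
            return -1
    return q


QUALITY = qualities(('low', 'high'))

# A stream without a 'quality' key yields None, which ranks lowest.
ranked = sorted(['high', None, 'low'], key=QUALITY)
```

Under that assumption, a stream with no `quality` field is still kept, it simply sorts below the labelled ones.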
--- a/youtube_dl/extractor/gaskrank.py
+++ b/youtube_dl/extractor/gaskrank.py
@ -0,0 +1,123 @@
 # coding: utf-8
 from __future__ import unicode_literals
 import re
 from .common import InfoExtractor
 from ..utils import (
    float_or_none,
    int_or_none,
    js_to_json,
    unified_strdate,
 )
 class GaskrankIE(InfoExtractor):
    """InfoExtractor for gaskrank.tv"""
    _VALID_URL = r'https?://(?:www\.)?gaskrank\.tv/tv/(?P<categories>[^/]+)/(?P<id>[^/]+)\.html?'
    _TESTS = [
        {
            'url': 'http://www.gaskrank.tv/tv/motorrad-fun/strike-einparken-durch-anfaenger-crash-mit-groesserem-flurschaden.htm',
            'md5': '1ae88dbac97887d85ebd1157a95fc4f9',
            'info_dict': {
                'id': '201601/26955',
                'ext': 'mp4',
                'title': 'Strike! Einparken können nur Männer - Flurschaden hält sich in Grenzen *lol*',
                'thumbnail': r're:^https?://.*\.jpg$',
                'categories': ['motorrad-fun'],
                'display_id': 'strike-einparken-durch-anfaenger-crash-mit-groesserem-flurschaden',
                'uploader_id': 'Bikefun',
                'upload_date': '20170110',
                'uploader_url': None,
            }
        },
        {
            'url': 'http://www.gaskrank.tv/tv/racing/isle-of-man-tt-2011-michael-du-15920.htm',
            'md5': 'c33ee32c711bc6c8224bfcbe62b23095',
            'info_dict': {
                'id': '201106/15920',
                'ext': 'mp4',
                'title': 'Isle of Man - Michael Dunlop vs Guy Martin - schwindelig kucken',
                'thumbnail': r're:^https?://.*\.jpg$',
                'categories': ['racing'],
                'display_id': 'isle-of-man-tt-2011-michael-du-15920',
                'uploader_id': 'IOM',
                'upload_date': '20160506',
                'uploader_url': 'www.iomtt.com',
            }
        }
    ]
    def _real_extract(self, url):
        """extract information from gaskrank.tv"""
        def fix_json(code):
            """Removes trailing comma in json: {{},} --> {{}}"""
            return re.sub(r',\s*}', r'}', js_to_json(code))
        display_id = self._match_id(url)
        webpage = self._download_webpage(url, display_id)
        categories = [re.match(self._VALID_URL, url).group('categories')]
        title = self._search_regex(
            r'movieName\s*:\s*\'([^\']*)\'',
            webpage, 'title')
        thumbnail = self._search_regex(
            r'poster\s*:\s*\'([^\']*)\'',
            webpage, 'thumbnail', default=None)
        mobj = re.search(
            r'Video von:\s*(?P<uploader_id>[^|]*?)\s*\|\s*vom:\s*(?P<upload_date>[0-9][0-9]\.[0-9][0-9]\.[0-9][0-9][0-9][0-9])',
            webpage)
        if mobj is not None:
            uploader_id = mobj.groupdict().get('uploader_id')
            upload_date = unified_strdate(mobj.groupdict().get('upload_date'))
        uploader_url = self._search_regex(
            r'Homepage:\s*<[^>]*>(?P<uploader_url>[^<]*)',
            webpage, 'uploader_url', default=None)
        tags = re.findall(
            r'/tv/tags/[^/]+/"\s*>(?P<tag>[^<]*?)<',
            webpage)
        view_count = self._search_regex(
            r'class\s*=\s*"gkRight"(?:[^>]*>\s*<[^>]*)*icon-eye-open(?:[^>]*>\s*<[^>]*)*>\s*(?P<view_count>[0-9\.]*)',
            webpage, 'view_count', default=None)
        if view_count:
            view_count = int_or_none(view_count.replace('.', ''))
        average_rating = self._search_regex(
            r'itemprop\s*=\s*"ratingValue"[^>]*>\s*(?P<average_rating>[0-9,]+)',
            webpage, 'average_rating')
        if average_rating:
            average_rating = float_or_none(average_rating.replace(',', '.'))
        playlist = self._parse_json(
            self._search_regex(
                r'playlist\s*:\s*\[([^\]]*)\]',
                webpage, 'playlist', default='{}'),
            display_id, transform_source=fix_json, fatal=False)
        video_id = self._search_regex(
            r'https?://movies\.gaskrank\.tv/([^-]*?)(-[^\.]*)?\.mp4',
            playlist.get('0').get('src'), 'video id')
        formats = []
        for key in playlist:
            formats.append({
                'url': playlist[key]['src'],
                'format_id': key,
                'quality': playlist[key].get('quality')})
        self._sort_formats(formats, field_preference=['format_id'])
        return {
            'id': video_id,
            'title': title,
            'formats': formats,
            'thumbnail': thumbnail,
            'categories': categories,
            'display_id': display_id,
            'uploader_id': uploader_id,
            'upload_date': upload_date,
            'uploader_url': uploader_url,
            'tags': tags,
            'view_count': view_count,
            'average_rating': average_rating,
        }
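The Gaskrank extractor converts German-formatted numbers: a dot is the thousands separator in the view count and a comma is the decimal mark in the rating. A minimal sketch of that conversion, using hypothetical helper names (`parse_view_count`, `parse_rating`) and plain built-ins rather than the `int_or_none` and `float_or_none` utilities:

```python
def parse_view_count(text):
    """Convert '12.345' (dot as thousands separator) to 12345, or None."""
    if not text:
        return None
    digits = text.replace('.', '')
    return int(digits) if digits.isdigit() else None


def parse_rating(text):
    """Convert '4,5' (comma as decimal mark) to 4.5, or None."""
    if not text:
        return None
    try:
        return float(text.replace(',', '.'))
    except ValueError:
        return None
```

Returning `None` for empty or malformed input keeps optional metadata from aborting the extraction.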
--- a/youtube_dl/extractor/generic.py
+++ b/youtube_dl/extractor/generic.py
@ -29,6 +29,7 @@ from ..utils import (
    UnsupportedError,
    xpath_text,
 )
 from .commonprotocols import RtmpIE
 from .brightcove import (
    BrightcoveLegacyIE,
    BrightcoveNewIE,
@ -81,6 +82,7 @@ from .videa import VideaIE
 from .twentymin import TwentyMinutenIE
 from .ustream import UstreamIE
 from .openload import OpenloadIE
 from .videopress import VideoPressIE
 class GenericIE(InfoExtractor):
@ -946,6 +948,19 @@ class GenericIE(InfoExtractor):
                'title': 'Webinar: Using Discovery, The National Archives’ online catalogue',
            },
        },
        # jwplayer rtmp
        {
            'url': 'http://www.suffolk.edu/sjc/',
            'info_dict': {
                'id': 'sjclive',
                'ext': 'flv',
                'title': 'Massachusetts Supreme Judicial Court Oral Arguments',
                'uploader': 'www.suffolk.edu',
            },
            'params': {
                'skip_download': True,
            }
        },
        # rtl.nl embed
        {
            'url': 'http://www.rtlnieuws.nl/nieuws/buitenland/aanslagen-kopenhagen',
@ -976,19 +991,6 @@ class GenericIE(InfoExtractor):
                'title': 'Os Guinness // Is It Fools Talk? // Unbelievable? Conference 2014',
            },
        },
        # Kaltura embed protected with referrer
        {
            'url': 'http://www.disney.nl/disney-channel/filmpjes/achter-de-schermen#/videoId/violetta-achter-de-schermen-ruggero',
            'info_dict': {
                'id': '1_g4fbemnq',
                'ext': 'mp4',
                'title': 'Violetta - Achter De Schermen - Ruggero',
                'description': 'Achter de schermen met Ruggero',
                'timestamp': 1435133761,
                'upload_date': '20150624',
                'uploader_id': 'echojecka',
            },
        },
        # Kaltura embed with single quotes
        {
            'url': 'http://fod.infobase.com/p_ViewPlaylist.aspx?AssignmentID=NUN8ZY',
@ -1473,6 +1475,21 @@ class GenericIE(InfoExtractor):
                'skip_download': True,
            },
            'add_ie': [TwentyMinutenIE.ie_key()],
        },
        {
            # VideoPress embed
            'url': 'https://en.support.wordpress.com/videopress/',
            'info_dict': {
                'id': 'OcobLTqC',
                'ext': 'm4v',
                'title': 'IMG_5786',
                'timestamp': 1435711927,
                'upload_date': '20150701',
            },
            'params': {
                'skip_download': True,
            },
            'add_ie': [VideoPressIE.ie_key()],
        }
        # {
        #     # TODO: find another test
@ -2320,8 +2337,9 @@ class GenericIE(InfoExtractor):
                'Channel': 'channel',
                'ChannelList': 'channel_list',
            }
-            return self.url_result('limelight:%s:%s' % (
-                lm[mobj.group(1)], mobj.group(2)), 'Limelight%s' % mobj.group(1), mobj.group(2))
+            return self.url_result(smuggle_url('limelight:%s:%s' % (
+                lm[mobj.group(1)], mobj.group(2)), {'source_url': url}),
+                'Limelight%s' % mobj.group(1), mobj.group(2))

        mobj = re.search(
            r'''(?sx)
@ -2331,7 +2349,9 @@ class GenericIE(InfoExtractor):
                        value=(["\'])(?:(?!\3).)*mediaId=(?P<id>[a-z0-9]{32})
            ''', webpage)
        if mobj:
-            return self.url_result('limelight:media:%s' % mobj.group('id'))
+            return self.url_result(smuggle_url(
                'limelight:media:%s' % mobj.group('id'),
                {'source_url': url}), 'LimelightMedia', mobj.group('id'))
        # Look for AdobeTVVideo embeds
        mobj = re.search(
@ -2438,6 +2458,12 @@ class GenericIE(InfoExtractor):
            return _playlist_from_matches(
                openload_urls, ie=OpenloadIE.ie_key())
        # Look for VideoPress embeds
        videopress_urls = VideoPressIE._extract_urls(webpage)
        if videopress_urls:
            return _playlist_from_matches(
                videopress_urls, ie=VideoPressIE.ie_key())
        # Looking for http://schema.org/VideoObject
        json_ld = self._search_json_ld(
            webpage, video_id, default={}, expected_type='VideoObject')
@ -2465,6 +2491,8 @@ class GenericIE(InfoExtractor):
        def check_video(vurl):
            if YoutubeIE.suitable(vurl):
                return True
            if RtmpIE.suitable(vurl):
                return True
            vpath = compat_urlparse.urlparse(vurl).path
            vext = determine_ext(vpath)
            return '.' in vpath and vext not in ('swf', 'png', 'jpg', 'srt', 'sbv', 'sub', 'vtt', 'ttml', 'js')
@ -2572,6 +2600,15 @@ class GenericIE(InfoExtractor):
                'age_limit': age_limit,
            }
            if RtmpIE.suitable(video_url):
                entry_info_dict.update({
                    '_type': 'url_transparent',
                    'ie_key': RtmpIE.ie_key(),
                    'url': video_url,
                })
                entries.append(entry_info_dict)
                continue
            ext = determine_ext(video_url)
            if ext == 'smil':
                entry_info_dict['formats'] = self._extract_smil_formats(video_url, video_id)
--- a/youtube_dl/extractor/go.py
+++ b/youtube_dl/extractor/go.py
@ -3,7 +3,7 @@ from __future__ import unicode_literals
 import re
-from .common import InfoExtractor
+from .adobepass import AdobePassIE
 from ..utils import (
    int_or_none,
    determine_ext,
@ -13,15 +13,30 @@ from ..utils import (
 )
-class GoIE(InfoExtractor):
+class GoIE(AdobePassIE):
-    _BRANDS = {
+    _SITE_INFO = {
-        'abc': '001',
+        'abc': {
-        'freeform': '002',
+            'brand': '001',
-        'watchdisneychannel': '004',
+            'requestor_id': 'ABC',
-        'watchdisneyjunior': '008',
+        },
-        'watchdisneyxd': '009',
+        'freeform': {
            'brand': '002',
            'requestor_id': 'ABCFamily',
        },
        'watchdisneychannel': {
            'brand': '004',
            'requestor_id': 'Disney',
        },
        'watchdisneyjunior': {
            'brand': '008',
            'requestor_id': 'DisneyJunior',
        },
        'watchdisneyxd': {
            'brand': '009',
            'requestor_id': 'DisneyXD',
        }
-    _VALID_URL = r'https?://(?:(?P<sub_domain>%s)\.)?go\.com/(?:[^/]+/)*(?:vdka(?P<id>\w+)|season-\d+/\d+-(?P<display_id>[^/?#]+))' % '|'.join(_BRANDS.keys())
+    }
    _VALID_URL = r'https?://(?:(?P<sub_domain>%s)\.)?go\.com/(?:[^/]+/)*(?:vdka(?P<id>\w+)|season-\d+/\d+-(?P<display_id>[^/?#]+))' % '|'.join(_SITE_INFO.keys())
    _TESTS = [{
        'url': 'http://abc.go.com/shows/castle/video/most-recent/vdka0_g86w5onx',
        'info_dict': {
@ -43,8 +58,12 @@ class GoIE(InfoExtractor):
        sub_domain, video_id, display_id = re.match(self._VALID_URL, url).groups()
        if not video_id:
            webpage = self._download_webpage(url, display_id)
-            video_id = self._search_regex(r'data-video-id=["\']VDKA(\w+)', webpage, 'video id')
-        brand = self._BRANDS[sub_domain]
+            video_id = self._search_regex(
+                # There may be inner quotes, e.g. data-video-id="'VDKA3609139'"
+                # from http://freeform.go.com/shows/shadowhunters/episodes/season-2/1-this-guilty-blood
+                r'data-video-id=["\']*VDKA(\w+)', webpage, 'video id')
+        site_info = self._SITE_INFO[sub_domain]
+        brand = site_info['brand']
        video_data = self._download_json(
            'http://api.contents.watchabc.go.com/vp2/ws/contents/3000/videos/%s/001/-1/-1/-1/%s/-1/-1.json' % (brand, video_id),
            video_id)['video'][0]
@ -60,14 +79,26 @@ class GoIE(InfoExtractor):
            if ext == 'm3u8':
                video_type = video_data.get('type')
                if video_type == 'lf':
-                    entitlement = self._download_json(
+                    data = {
                        'https://api.entitlement.watchabc.go.com/vp2/ws-secure/entitlement/2020/authorize.json',
                        video_id, data=urlencode_postdata({
                        'video_id': video_data['id'],
                        'video_type': video_type,
                        'brand': brand,
                        'device': '001',
-                        }))
+                    }
                    if video_data.get('accesslevel') == '1':
                        requestor_id = site_info['requestor_id']
                        resource = self._get_mvpd_resource(
                            requestor_id, title, video_id, None)
                        auth = self._extract_mvpd_auth(
                            url, video_id, requestor_id, resource)
                        data.update({
                            'token': auth,
                            'token_type': 'ap',
                            'adobe_requestor_id': requestor_id,
                        })
                    entitlement = self._download_json(
                        'https://api.entitlement.watchabc.go.com/vp2/ws-secure/entitlement/2020/authorize.json',
                        video_id, data=urlencode_postdata(data), headers=self.geo_verification_headers())
                    errors = entitlement.get('errors', {}).get('errors', [])
                    if errors:
                        error_message = ', '.join([error['message'] for error in errors])
--- a/youtube_dl/extractor/googledrive.py
+++ b/youtube_dl/extractor/googledrive.py
@ -6,6 +6,7 @@ from .common import InfoExtractor
 from ..utils import (
    ExtractorError,
    int_or_none,
    lowercase_escape,
 )
@ -13,12 +14,12 @@ class GoogleDriveIE(InfoExtractor):
    _VALID_URL = r'https?://(?:(?:docs|drive)\.google\.com/(?:uc\?.*?id=|file/d/)|video\.google\.com/get_player\?.*?docid=)(?P<id>[a-zA-Z0-9_-]{28,})'
    _TESTS = [{
        'url': 'https://drive.google.com/file/d/0ByeS4oOUV-49Zzh4R1J6R09zazQ/edit?pli=1',
-        'md5': '881f7700aec4f538571fa1e0eed4a7b6',
+        'md5': 'd109872761f7e7ecf353fa108c0dbe1e',
        'info_dict': {
            'id': '0ByeS4oOUV-49Zzh4R1J6R09zazQ',
            'ext': 'mp4',
            'title': 'Big Buck Bunny.mp4',
-            'duration': 46,
+            'duration': 45,
        }
    }, {
        # video id is longer than 28 characters
@ -55,7 +56,7 @@ class GoogleDriveIE(InfoExtractor):
    def _real_extract(self, url):
        video_id = self._match_id(url)
        webpage = self._download_webpage(
-            'http://docs.google.com/file/d/%s' % video_id, video_id, encoding='unicode_escape')
+            'http://docs.google.com/file/d/%s' % video_id, video_id)
        reason = self._search_regex(r'"reason"\s*,\s*"([^"]+)', webpage, 'reason', default=None)
        if reason:
@ -74,7 +75,7 @@ class GoogleDriveIE(InfoExtractor):
            resolution = fmt.split('/')[1]
            width, height = resolution.split('x')
            formats.append({
-                'url': fmt_url,
+                'url': lowercase_escape(fmt_url),
                'format_id': fmt_id,
                'resolution': resolution,
                'width': int_or_none(width),
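The Google Drive hunks stop decoding the whole page with `encoding='unicode_escape'` and instead pass each format URL through `lowercase_escape`. The sketch below illustrates that idea with a hypothetical stand-in, `decode_lowercase_u_escapes`. It is not the library implementation and assumes only literal `\uXXXX` sequences need decoding.

```python
import re


def decode_lowercase_u_escapes(s):
    """Replace literal \\uXXXX sequences with the characters they encode."""
    return re.sub(
        r'\\u([0-9a-fA-F]{4})',
        lambda m: chr(int(m.group(1), 16)),
        s)


# Hypothetical escaped URL of the kind embedded in a JavaScript string.
escaped = 'https://example.com/v?a\\u003d1\\u0026b\\u003d2'
decoded = decode_lowercase_u_escapes(escaped)
```

Decoding only the extracted URL leaves the rest of the page, including non-ASCII titles, untouched.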
--- a/youtube_dl/extractor/hgtv.py
+++ b/youtube_dl/extractor/hgtv.py
@ -2,50 +2,6 @@
 from __future__ import unicode_literals
 from .common import InfoExtractor
 from ..utils import (
    int_or_none,
    js_to_json,
    smuggle_url,
 )
 class HGTVIE(InfoExtractor):
    _VALID_URL = r'https?://(?:www\.)?hgtv\.ca/[^/]+/video/(?P<id>[^/]+)/video.html'
    _TEST = {
        'url': 'http://www.hgtv.ca/homefree/video/overnight-success/video.html?v=738081859718&p=1&s=da#video',
        'md5': '',
        'info_dict': {
            'id': 'aFH__I_5FBOX',
            'ext': 'mp4',
            'title': 'Overnight Success',
            'description': 'After weeks of hard work, high stakes, breakdowns and pep talks, the final 2 contestants compete to win the ultimate dream.',
            'uploader': 'SHWM-NEW',
            'timestamp': 1470320034,
            'upload_date': '20160804',
        },
        'params': {
            # m3u8 download
            'skip_download': True,
        },
    }
    def _real_extract(self, url):
        display_id = self._match_id(url)
        webpage = self._download_webpage(url, display_id)
        embed_vars = self._parse_json(self._search_regex(
            r'(?s)embed_vars\s*=\s*({.*?});',
            webpage, 'embed vars'), display_id, js_to_json)
        return {
            '_type': 'url_transparent',
            'url': smuggle_url(
                'http://link.theplatform.com/s/dtjsEC/%s?mbr=true&manifest=m3u' % embed_vars['pid'], {
                    'force_smil_url': True
                }),
            'series': embed_vars.get('show'),
            'season_number': int_or_none(embed_vars.get('season')),
            'episode_number': int_or_none(embed_vars.get('episode')),
            'ie_key': 'ThePlatform',
        }
 class HGTVComShowIE(InfoExtractor):
--- a/youtube_dl/extractor/hotstar.py
+++ b/youtube_dl/extractor/hotstar.py
@ -34,11 +34,9 @@ class HotStarIE(InfoExtractor):
        'only_matching': True,
    }]
-    _GET_CONTENT_TEMPLATE = 'http://account.hotstar.com/AVS/besc?action=GetAggregatedContentDetails&channel=PCTV&contentId=%s'
-    _GET_CDN_TEMPLATE = 'http://getcdn.hotstar.com/AVS/besc?action=GetCDN&asJson=Y&channel=%s&id=%s&type=%s'
-
-    def _download_json(self, url_or_request, video_id, note='Downloading JSON metadata', fatal=True):
-        json_data = super(HotStarIE, self)._download_json(url_or_request, video_id, note, fatal=fatal)
+    def _download_json(self, url_or_request, video_id, note='Downloading JSON metadata', fatal=True, query=None):
+        json_data = super(HotStarIE, self)._download_json(
+            url_or_request, video_id, note, fatal=fatal, query=query)
        if json_data['resultCode'] != 'OK':
            if fatal:
                raise ExtractorError(json_data['errorDescription'])
@ -48,20 +46,37 @@ class HotStarIE(InfoExtractor):
    def _real_extract(self, url):
        video_id = self._match_id(url)
        video_data = self._download_json(
-            self._GET_CONTENT_TEMPLATE % video_id,
-            video_id)['contentInfo'][0]
+            'http://account.hotstar.com/AVS/besc', video_id, query={
+                'action': 'GetAggregatedContentDetails',
+                'channel': 'PCTV',
+                'contentId': video_id,
+            })['contentInfo'][0]
        title = video_data['episodeTitle']
        if video_data.get('encrypted') == 'Y':
            raise ExtractorError('This video is DRM protected.', expected=True)
        formats = []
-        # PCTV for extracting f4m manifest
-        for f in ('TABLET',):
+        for f in ('JIO',):
            format_data = self._download_json(
-                self._GET_CDN_TEMPLATE % (f, video_id, 'VOD'),
-                video_id, 'Downloading %s JSON metadata' % f, fatal=False)
+                'http://getcdn.hotstar.com/AVS/besc',
+                video_id, 'Downloading %s JSON metadata' % f,
+                fatal=False, query={
+                    'action': 'GetCDN',
+                    'asJson': 'Y',
+                    'channel': f,
+                    'id': video_id,
+                    'type': 'VOD',
+                })
            if format_data:
-                format_url = format_data['src']
+                format_url = format_data.get('src')
                if not format_url:
                    continue
                ext = determine_ext(format_url)
                if ext == 'm3u8':
-                    formats.extend(self._extract_m3u8_formats(format_url, video_id, 'mp4', m3u8_id='hls', fatal=False))
+                    formats.extend(self._extract_m3u8_formats(
                        format_url, video_id, 'mp4',
                        m3u8_id='hls', fatal=False))
                elif ext == 'f4m':
                    # produce broken files
                    continue
@ -75,9 +90,12 @@ class HotStarIE(InfoExtractor):
        return {
            'id': video_id,
-            'title': video_data['episodeTitle'],
+            'title': title,
            'description': video_data.get('description'),
            'duration': int_or_none(video_data.get('duration')),
            'timestamp': int_or_none(video_data.get('broadcastDate')),
            'formats': formats,
            'episode': title,
            'episode_number': int_or_none(video_data.get('episodeNumber')),
            'series': video_data.get('contentTitle'),
        }
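The HotStar change replaces hand-formatted URL templates with a `query` dict. As a rough illustration, the sketch below builds the same URL with the Python 3 standard library. `build_url` and the sample content ID are hypothetical, and nothing here is meant to describe how `_download_json` handles `query` internally.

```python
from urllib.parse import urlencode


def build_url(base, query):
    """Append a URL-encoded query string to base."""
    return base + '?' + urlencode(query)


# Removed template from the diff above, kept here only for comparison.
OLD_TEMPLATE = (
    'http://account.hotstar.com/AVS/besc'
    '?action=GetAggregatedContentDetails&channel=PCTV&contentId=%s')

video_id = '1000000515'  # hypothetical content ID
content_url = build_url('http://account.hotstar.com/AVS/besc', {
    'action': 'GetAggregatedContentDetails',
    'channel': 'PCTV',
    'contentId': video_id,
})
```

The dict form yields the same URL as the old template for plain IDs and also escapes any value containing reserved characters, which the `%s` templates did not.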
--- a/youtube_dl/extractor/infoq.py
+++ b/youtube_dl/extractor/infoq.py
@ -4,7 +4,10 @@ from __future__ import unicode_literals
 import base64
-from ..compat import compat_urllib_parse_unquote
+from ..compat import (
    compat_urllib_parse_unquote,
    compat_urlparse,
 )
 from ..utils import determine_ext
 from .bokecc import BokeCCBaseIE
@ -33,9 +36,21 @@ class InfoQIE(BokeCCBaseIE):
            'ext': 'flv',
            'description': 'md5:308d981fb28fa42f49f9568322c683ff',
        },
    }, {
        'url': 'https://www.infoq.com/presentations/Simple-Made-Easy',
        'md5': '0e34642d4d9ef44bf86f66f6399672db',
        'info_dict': {
            'id': 'Simple-Made-Easy',
            'title': 'Simple Made Easy',
            'ext': 'mp3',
            'description': 'md5:3e0e213a8bbd074796ef89ea35ada25b',
        },
        'params': {
            'format': 'bestaudio',
        },
    }]
-    def _extract_rtmp_videos(self, webpage):
+    def _extract_rtmp_video(self, webpage):
        # The server URL is hardcoded
        video_url = 'rtmpe://video.infoq.com/cfx/st/'
@ -47,28 +62,53 @@ class InfoQIE(BokeCCBaseIE):
        playpath = 'mp4:' + real_id
        return [{
-            'format_id': 'rtmp',
+            'format_id': 'rtmp_video',
            'url': video_url,
            'ext': determine_ext(playpath),
            'play_path': playpath,
        }]
-    def _extract_http_videos(self, webpage):
+    def _extract_cookies(self, webpage):
        http_video_url = self._search_regex(r'P\.s\s*=\s*\'([^\']+)\'', webpage, 'video URL')
        policy = self._search_regex(r'InfoQConstants.scp\s*=\s*\'([^\']+)\'', webpage, 'policy')
        signature = self._search_regex(r'InfoQConstants.scs\s*=\s*\'([^\']+)\'', webpage, 'signature')
        key_pair_id = self._search_regex(r'InfoQConstants.sck\s*=\s*\'([^\']+)\'', webpage, 'key-pair-id')
        return 'CloudFront-Policy=%s; CloudFront-Signature=%s; CloudFront-Key-Pair-Id=%s' % (
            policy, signature, key_pair_id)
    def _extract_http_video(self, webpage):
        http_video_url = self._search_regex(r'P\.s\s*=\s*\'([^\']+)\'', webpage, 'video URL')
        return [{
-            'format_id': 'http',
+            'format_id': 'http_video',
            'url': http_video_url,
            'http_headers': {
-                'Cookie': 'CloudFront-Policy=%s; CloudFront-Signature=%s; CloudFront-Key-Pair-Id=%s' % (
+                'Cookie': self._extract_cookies(webpage)
-                    policy, signature, key_pair_id),
            },
        }]
    def _extract_http_audio(self, webpage, video_id):
        fields = self._hidden_inputs(webpage)
        http_audio_url = fields['filename']
        if http_audio_url is None:
            return []
        cookies_header = {'Cookie': self._extract_cookies(webpage)}
        # base URL is found in the Location header in the response returned by
        # GET https://www.infoq.com/mp3download.action?filename=... when logged in.
        http_audio_url = compat_urlparse.urljoin('http://res.infoq.com/downloads/mp3downloads/', http_audio_url)
        # audio file seem to be missing some times even if there is a download link
        # so probe URL to make sure
        if not self._is_valid_url(http_audio_url, video_id, headers=cookies_header):
            return []
        return [{
            'format_id': 'http_audio',
            'url': http_audio_url,
            'vcodec': 'none',
            'http_headers': cookies_header,
        }]
    def _real_extract(self, url):
        video_id = self._match_id(url)
        webpage = self._download_webpage(url, video_id)
@ -80,7 +120,10 @@ class InfoQIE(BokeCCBaseIE):
            # for China videos, HTTP video URL exists but always fails with 403
            formats = self._extract_bokecc_formats(webpage, video_id)
        else:
-            formats = self._extract_rtmp_videos(webpage) + self._extract_http_videos(webpage)
+            formats = (
                self._extract_rtmp_video(webpage) +
                self._extract_http_video(webpage) +
                self._extract_http_audio(webpage, video_id))
        self._sort_formats(formats)
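
Note on the InfoQ hunks above: the new `_extract_cookies` helper collects three `InfoQConstants.*` values from the page and joins them into one `Cookie` header, which both the HTTP video and HTTP audio formats then reuse. A minimal standalone sketch of that idea follows, assuming a plain `re` search in place of the extractor's `_search_regex`; the function name `extract_cloudfront_cookie` and the sample page are hypothetical.

```python
import re


def extract_cloudfront_cookie(webpage):
    """Build a Cookie header value from the InfoQConstants.* assignments.

    Hypothetical helper: it mirrors the regexes in the diff but is not
    the extractor's own code.
    """
    def find(name):
        m = re.search(r"InfoQConstants\.%s\s*=\s*'([^']+)'" % name, webpage)
        if m is None:
            raise ValueError('%s not found in page' % name)
        return m.group(1)

    return 'CloudFront-Policy=%s; CloudFront-Signature=%s; CloudFront-Key-Pair-Id=%s' % (
        find('scp'), find('scs'), find('sck'))


# Made-up page fragment, for illustration only.
sample_page = (
    "InfoQConstants.scp = 'POL'; "
    "InfoQConstants.scs = 'SIG'; "
    "InfoQConstants.sck = 'KEY';")
cookie = extract_cloudfront_cookie(sample_page)
```

Building the header in one place means the video and audio formats always send identical credentials.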
--- a/youtube_dl/extractor/iprima.py
+++ b/youtube_dl/extractor/iprima.py
@ -65,7 +65,7 @@ class IPrimaIE(InfoExtractor):
        options = self._parse_json(
            self._search_regex(
-                r'(?s)var\s+playerOptions\s*=\s*({.+?});',
+                r'(?s)(?:TDIPlayerOptions|playerOptions)\s*=\s*({.+?});\s*\]\]',
                playerpage, 'player options', default='{}'),
            video_id, transform_source=js_to_json, fatal=False)
        if options:
--- a/youtube_dl/extractor/iqiyi.py
+++ b/youtube_dl/extractor/iqiyi.py
@ -173,11 +173,12 @@ class IqiyiIE(InfoExtractor):
        }
    }, {
        'url': 'http://www.iqiyi.com/v_19rrhnnclk.html',
-        'md5': '667171934041350c5de3f5015f7f1152',
+        'md5': 'b7dc800a4004b1b57749d9abae0472da',
        'info_dict': {
            'id': 'e3f585b550a280af23c98b6cb2be19fb',
            'ext': 'mp4',
-            'title': '名侦探柯南 国语版：第752集 迫近灰原秘密的黑影 下篇',
+            # This can be either Simplified Chinese or Traditional Chinese
            'title': r're:^(?:名侦探柯南 国语版：第752集 迫近灰原秘密的黑影 下篇|名偵探柯南 國語版：第752集 迫近灰原秘密的黑影 下篇)$',
        },
        'skip': 'Geo-restricted to China',
    }, {
--- a/youtube_dl/extractor/iwara.py
+++ b/youtube_dl/extractor/iwara.py
@ -3,14 +3,18 @@ from __future__ import unicode_literals
 from .common import InfoExtractor
 from ..compat import compat_urllib_parse_urlparse
-from ..utils import remove_end
+from ..utils import (
    int_or_none,
    mimetype2ext,
    remove_end,
 )
 class IwaraIE(InfoExtractor):
    _VALID_URL = r'https?://(?:www\.|ecchi\.)?iwara\.tv/videos/(?P<id>[a-zA-Z0-9]+)'
    _TESTS = [{
        'url': 'http://iwara.tv/videos/amVwUl1EHpAD9RD',
-        'md5': '1d53866b2c514b23ed69e4352fdc9839',
+        # md5 is unstable
        'info_dict': {
            'id': 'amVwUl1EHpAD9RD',
            'ext': 'mp4',
@ -23,17 +27,17 @@ class IwaraIE(InfoExtractor):
        'info_dict': {
            'id': '0B1LvuHnL-sRFNXB1WHNqbGw4SXc',
            'ext': 'mp4',
-            'title': '[3D Hentai] Kyonyu Ã\x97 Genkai Ã\x97 Emaki Shinobi Girls.mp4',
+            'title': '[3D Hentai] Kyonyu × Genkai × Emaki Shinobi Girls.mp4',
            'age_limit': 18,
        },
        'add_ie': ['GoogleDrive'],
    }, {
        'url': 'http://www.iwara.tv/videos/nawkaumd6ilezzgq',
-        'md5': '1d85f1e5217d2791626cff5ec83bb189',
+        # md5 is unstable
        'info_dict': {
            'id': '6liAP9s2Ojc',
            'ext': 'mp4',
-            'age_limit': 0,
+            'age_limit': 18,
            'title': '[MMD] Do It Again Ver.2 [1080p 60FPS] (Motion,Camera,Wav+DL)',
            'description': 'md5:590c12c0df1443d833fbebe05da8c47a',
            'upload_date': '20160910',
@ -52,9 +56,9 @@ class IwaraIE(InfoExtractor):
        # ecchi is 'sexy' in Japanese
        age_limit = 18 if hostname.split('.')[0] == 'ecchi' else 0
-        entries = self._parse_html5_media_entries(url, webpage, video_id)
+        video_data = self._download_json('http://www.iwara.tv/api/video/%s' % video_id, video_id)
-        if not entries:
+        if not video_data:
            iframe_url = self._html_search_regex(
                r'<iframe[^>]+src=([\'"])(?P<url>[^\'"]+)\1',
                webpage, 'iframe URL', group='url')
@ -67,11 +71,25 @@ class IwaraIE(InfoExtractor):
        title = remove_end(self._html_search_regex(
            r'<title>([^<]+)</title>', webpage, 'title'), ' | Iwara')
-        info_dict = entries[0]
+        formats = []
-        info_dict.update({
+        for a_format in video_data:
            format_id = a_format.get('resolution')
            height = int_or_none(self._search_regex(
                r'(\d+)p', format_id, 'height', default=None))
            formats.append({
                'url': a_format['uri'],
                'format_id': format_id,
                'ext': mimetype2ext(a_format.get('mime')) or 'mp4',
                'height': height,
                'width': int_or_none(height / 9.0 * 16.0 if height else None),
                'quality': 1 if format_id == 'Source' else 0,
            })
        self._sort_formats(formats)
        return {
            'id': video_id,
            'title': title,
            'age_limit': age_limit,
-        })
+            'formats': formats,
-
+        }
-        return info_dict
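
Note on the Iwara hunks above: the extractor now builds one format per entry of the JSON API response, deriving the height from the `resolution` label and estimating the width with a 16:9 ratio. The sketch below reproduces that loop outside youtube-dl, assuming the response is a list of dicts with `resolution` and `uri` keys as the diff suggests. The helper name `build_formats` and the sample data are hypothetical, and unlike the original the sketch tolerates a missing `resolution`.

```python
import re


def build_formats(video_data):
    """Turn API entries into format dicts (illustrative sketch only)."""
    formats = []
    for entry in video_data:
        format_id = entry.get('resolution')
        # Labels such as '540p' carry the height; 'Source' does not.
        m = re.search(r'(\d+)p', format_id or '')
        height = int(m.group(1)) if m else None
        formats.append({
            'url': entry['uri'],
            'format_id': format_id,
            'height': height,
            # The 16:9 aspect ratio is an assumption taken from the diff.
            'width': int(height / 9.0 * 16.0) if height else None,
            # Prefer the original upload over transcodes.
            'quality': 1 if format_id == 'Source' else 0,
        })
    return formats


# Made-up response, for illustration only.
sample = [
    {'resolution': 'Source', 'uri': 'http://example.com/source.mp4'},
    {'resolution': '540p', 'uri': 'http://example.com/540.mp4'},
]
formats = build_formats(sample)
```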
--- a/youtube_dl/extractor/kaltura.py
+++ b/youtube_dl/extractor/kaltura.py
@ -23,11 +23,11 @@ class KalturaIE(InfoExtractor):
                (?:
                    kaltura:(?P<partner_id>\d+):(?P<id>[0-9a-z_]+)|
                    https?://
-                        (:?(?:www|cdnapi(?:sec)?)\.)?kaltura\.com/
+                        (:?(?:www|cdnapi(?:sec)?)\.)?kaltura\.com(?::\d+)?/
                        (?:
                            (?:
                                # flash player
-                                index\.php/kwidget|
+                                index\.php/(?:kwidget|extwidget/preview)|
                                # html5 player
                                html5/html5lib/[^/]+/mwEmbedFrame\.php
                            )
@ -94,6 +94,14 @@ class KalturaIE(InfoExtractor):
            'params': {
                'skip_download': True,
            },
        },
        {
            'url': 'https://www.kaltura.com/index.php/extwidget/preview/partner_id/1770401/uiconf_id/37307382/entry_id/0_58u8kme7/embed/iframe?&flashvars[streamerType]=auto',
            'only_matching': True,
        },
        {
            'url': 'https://www.kaltura.com:443/index.php/extwidget/preview/partner_id/1770401/uiconf_id/37307382/entry_id/0_58u8kme7/embed/iframe?&flashvars[streamerType]=auto',
            'only_matching': True,
        }
    ]
@ -112,7 +120,7 @@ class KalturaIE(InfoExtractor):
            re.search(
                r'''(?xs)
                    (?P<q1>["\'])
-                        (?:https?:)?//cdnapi(?:sec)?\.kaltura\.com/(?:(?!(?P=q1)).)*(?:p|partner_id)/(?P<partner_id>\d+)(?:(?!(?P=q1)).)*
+                        (?:https?:)?//cdnapi(?:sec)?\.kaltura\.com(?::\d+)?/(?:(?!(?P=q1)).)*\b(?:p|partner_id)/(?P<partner_id>\d+)(?:(?!(?P=q1)).)*
                    (?P=q1).*?
                    (?:
                        entry_?[Ii]d|
@ -209,6 +217,8 @@ class KalturaIE(InfoExtractor):
                partner_id = params['wid'][0][1:]
            elif 'p' in params:
                partner_id = params['p'][0]
            elif 'partner_id' in params:
                partner_id = params['partner_id'][0]
            else:
                raise ExtractorError('Invalid URL', expected=True)
            if 'entry_id' in params:
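
Note on the Kaltura hunks above: the URL pattern now accepts an explicit port after `kaltura.com` and the `extwidget/preview` path in addition to `kwidget`. The sketch below checks a simplified pattern against the two new test URLs. It covers only the flash-player branch, uses a non-capturing group where the source has `(:?`, and the `parse_path_params` helper is hypothetical rather than the extractor's own parsing.

```python
import re

# Simplified pattern: only the flash-player branch of the real _VALID_URL.
KALTURA_FLASH_URL = re.compile(r'''(?x)
    https?://
    (?:(?:www|cdnapi(?:sec)?)\.)?kaltura\.com(?::\d+)?/   # optional explicit port
    index\.php/(?:kwidget|extwidget/preview)/
    (?P<params>[^?]+)
''')


def parse_path_params(url):
    """Return the /key/value/ path pairs as a dict, or None if no match."""
    m = KALTURA_FLASH_URL.match(url)
    if m is None:
        return None
    parts = m.group('params').strip('/').split('/')
    return dict(zip(parts[::2], parts[1::2]))


with_port = parse_path_params(
    'https://www.kaltura.com:443/index.php/extwidget/preview/partner_id/1770401'
    '/uiconf_id/37307382/entry_id/0_58u8kme7/embed/iframe'
    '?&flashvars[streamerType]=auto')
without_port = parse_path_params(
    'https://www.kaltura.com/index.php/extwidget/preview/partner_id/1770401'
    '/uiconf_id/37307382/entry_id/0_58u8kme7/embed/iframe'
    '?&flashvars[streamerType]=auto')
```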
--- a/youtube_dl/extractor/lemonde.py
+++ b/youtube_dl/extractor/lemonde.py
@ -7,20 +7,40 @@ class LemondeIE(InfoExtractor):
    _VALID_URL = r'https?://(?:.+?\.)?lemonde\.fr/(?:[^/]+/)*(?P<id>[^/]+)\.html'
    _TESTS = [{
        'url': 'http://www.lemonde.fr/police-justice/video/2016/01/19/comprendre-l-affaire-bygmalion-en-cinq-minutes_4849702_1653578.html',
-        'md5': '01fb3c92de4c12c573343d63e163d302',
+        'md5': 'da120c8722d8632eec6ced937536cc98',
        'info_dict': {
            'id': 'lqm3kl',
            'ext': 'mp4',
            'title': "Comprendre l'affaire Bygmalion en 5 minutes",
            'thumbnail': r're:^https?://.*\.jpg',
-            'duration': 320,
+            'duration': 309,
            'upload_date': '20160119',
            'timestamp': 1453194778,
            'uploader_id': '3pmkp',
        },
    }, {
        # standard iframe embed
        'url': 'http://www.lemonde.fr/les-decodeurs/article/2016/10/18/tout-comprendre-du-ceta-le-petit-cousin-du-traite-transatlantique_5015920_4355770.html',
        'info_dict': {
            'id': 'uzsxms',
            'ext': 'mp4',
            'title': "CETA : quelles suites pour l'accord commercial entre l'Europe et le Canada ?",
            'thumbnail': r're:^https?://.*\.jpg',
            'duration': 325,
            'upload_date': '20161021',
            'timestamp': 1477044540,
            'uploader_id': '3pmkp',
        },
        'params': {
            'skip_download': True,
        },
    }, {
        'url': 'http://redaction.actu.lemonde.fr/societe/video/2016/01/18/calais-debut-des-travaux-de-defrichement-dans-la-jungle_4849233_3224.html',
        'only_matching': True,
    }, {
        # YouTube embeds
        'url': 'http://www.lemonde.fr/pixels/article/2016/12/09/pourquoi-pewdiepie-superstar-de-youtube-a-menace-de-fermer-sa-chaine_5046649_4408996.html',
        'only_matching': True,
    }]
    def _real_extract(self, url):
@ -30,5 +50,9 @@ class LemondeIE(InfoExtractor):
        digiteka_url = self._proto_relative_url(self._search_regex(
            r'url\s*:\s*(["\'])(?P<url>(?:https?://)?//(?:www\.)?(?:digiteka\.net|ultimedia\.com)/deliver/.+?)\1',
-            webpage, 'digiteka url', group='url'))
+            webpage, 'digiteka url', group='url', default=None))
        if digiteka_url:
            return self.url_result(digiteka_url, 'Digiteka')
        return self.url_result(url, 'Generic')
--- a/youtube_dl/extractor/limelight.py
+++ b/youtube_dl/extractor/limelight.py
@ -8,6 +8,7 @@ from ..utils import (
    determine_ext,
    float_or_none,
    int_or_none,
    unsmuggle_url,
 )
@ -15,20 +16,23 @@ class LimelightBaseIE(InfoExtractor):
    _PLAYLIST_SERVICE_URL = 'http://production-ps.lvp.llnw.net/r/PlaylistService/%s/%s/%s'
    _API_URL = 'http://api.video.limelight.com/rest/organizations/%s/%s/%s/%s.json'
-    def _call_playlist_service(self, item_id, method, fatal=True):
+    def _call_playlist_service(self, item_id, method, fatal=True, referer=None):
        headers = {}
        if referer:
            headers['Referer'] = referer
        return self._download_json(
            self._PLAYLIST_SERVICE_URL % (self._PLAYLIST_SERVICE_PATH, item_id, method),
-            item_id, 'Downloading PlaylistService %s JSON' % method, fatal=fatal)
+            item_id, 'Downloading PlaylistService %s JSON' % method, fatal=fatal, headers=headers)
    def _call_api(self, organization_id, item_id, method):
        return self._download_json(
            self._API_URL % (organization_id, self._API_PATH, item_id, method),
            item_id, 'Downloading API %s JSON' % method)
-    def _extract(self, item_id, pc_method, mobile_method, meta_method):
+    def _extract(self, item_id, pc_method, mobile_method, meta_method, referer=None):
-        pc = self._call_playlist_service(item_id, pc_method)
+        pc = self._call_playlist_service(item_id, pc_method, referer=referer)
        metadata = self._call_api(pc['orgId'], item_id, meta_method)
-        mobile = self._call_playlist_service(item_id, mobile_method, fatal=False)
+        mobile = self._call_playlist_service(item_id, mobile_method, fatal=False, referer=referer)
        return pc, mobile, metadata
    def _extract_info(self, streams, mobile_urls, properties):
@ -207,10 +211,13 @@ class LimelightMediaIE(LimelightBaseIE):
    _API_PATH = 'media'
    def _real_extract(self, url):
        url, smuggled_data = unsmuggle_url(url, {})
        video_id = self._match_id(url)
        pc, mobile, metadata = self._extract(
-            video_id, 'getPlaylistByMediaId', 'getMobilePlaylistByMediaId', 'properties')
+            video_id, 'getPlaylistByMediaId',
            'getMobilePlaylistByMediaId', 'properties',
            smuggled_data.get('source_url'))
        return self._extract_info(
            pc['playlistItems'][0].get('streams', []),
@ -247,11 +254,13 @@ class LimelightChannelIE(LimelightBaseIE):
    _API_PATH = 'channels'
    def _real_extract(self, url):
        url, smuggled_data = unsmuggle_url(url, {})
        channel_id = self._match_id(url)
        pc, mobile, medias = self._extract(
            channel_id, 'getPlaylistByChannelId',
-            'getMobilePlaylistWithNItemsByChannelId?begin=0&count=-1', 'media')
+            'getMobilePlaylistWithNItemsByChannelId?begin=0&count=-1',
            'media', smuggled_data.get('source_url'))
        entries = [
            self._extract_info(
--- a/youtube_dl/extractor/myspace.py
+++ b/youtube_dl/extractor/myspace.py
@ -17,9 +17,10 @@ class MySpaceIE(InfoExtractor):
    _TESTS = [
        {
            'url': 'https://myspace.com/fiveminutestothestage/video/little-big-town/109594919',
            'md5': '9c1483c106f4a695c47d2911feed50a7',
            'info_dict': {
                'id': '109594919',
-                'ext': 'flv',
+                'ext': 'mp4',
                'title': 'Little Big Town',
                'description': 'This country quartet was all smiles while playing a sold out show at the Pacific Amphitheatre in Orange County, California.',
                'uploader': 'Five Minutes to the Stage',
@ -27,37 +28,30 @@ class MySpaceIE(InfoExtractor):
                'timestamp': 1414108751,
                'upload_date': '20141023',
            },
            'params': {
                # rtmp download
                'skip_download': True,
            },
        },
        # songs
        {
            'url': 'https://myspace.com/killsorrow/music/song/of-weakened-soul...-93388656-103880681',
            'md5': '1d7ee4604a3da226dd69a123f748b262',
            'info_dict': {
                'id': '93388656',
-                'ext': 'flv',
+                'ext': 'm4a',
                'title': 'Of weakened soul...',
                'uploader': 'Killsorrow',
                'uploader_id': 'killsorrow',
            },
            'params': {
                # rtmp download
                'skip_download': True,
            },
        }, {
-            'add_ie': ['Vevo'],
+            'add_ie': ['Youtube'],
            'url': 'https://myspace.com/threedaysgrace/music/song/animal-i-have-become-28400208-28218041',
            'info_dict': {
-                'id': 'USZM20600099',
+                'id': 'xqds0B_meys',
-                'ext': 'mp4',
+                'ext': 'webm',
-                'title': 'Animal I Have Become',
+                'title': 'Three Days Grace - Animal I Have Become',
-                'uploader': 'Three Days Grace',
+                'description': 'md5:8bd86b3693e72a077cf863a8530c54bb',
-                'timestamp': int,
+                'uploader': 'ThreeDaysGraceVEVO',
-                'upload_date': '20060502',
+                'uploader_id': 'ThreeDaysGraceVEVO',
                'upload_date': '20091002',
            },
            'skip': 'VEVO is only available in some countries',
        }, {
            'add_ie': ['Youtube'],
            'url': 'https://myspace.com/starset2/music/song/first-light-95799905-106964426',
@ -76,13 +70,25 @@ class MySpaceIE(InfoExtractor):
    def _real_extract(self, url):
        mobj = re.match(self._VALID_URL, url)
        video_id = mobj.group('id')
        is_song = mobj.group('mediatype').startswith('music/song')
        webpage = self._download_webpage(url, video_id)
        player_url = self._search_regex(
-            r'playerSwf":"([^"?]*)', webpage, 'player URL')
+            r'videoSwf":"([^"?]*)', webpage, 'player URL', fatal=False)
-        def rtmp_format_from_stream_url(stream_url, width=None, height=None):
+        def formats_from_stream_urls(stream_url, hls_stream_url, http_stream_url, width=None, height=None):
            formats = []
            vcodec = 'none' if is_song else None
            if hls_stream_url:
                formats.append({
                    'format_id': 'hls',
                    'url': hls_stream_url,
                    'protocol': 'm3u8_native',
                    'ext': 'm4a' if is_song else 'mp4',
                    'vcodec': vcodec,
                })
            if stream_url and player_url:
                rtmp_url, play_path = stream_url.split(';', 1)
-            return {
+                formats.append({
                    'format_id': 'rtmp',
                    'url': rtmp_url,
                    'play_path': play_path,
@ -91,9 +97,19 @@ class MySpaceIE(InfoExtractor):
                    'ext': 'flv',
                    'width': width,
                    'height': height,
-            }
+                    'vcodec': vcodec,
                })
            if http_stream_url:
                formats.append({
                    'format_id': 'http',
                    'url': http_stream_url,
                    'width': width,
                    'height': height,
                    'vcodec': vcodec,
                })
            return formats
-        if mobj.group('mediatype').startswith('music/song'):
+        if is_song:
            # songs don't store any useful info in the 'context' variable
            song_data = self._search_regex(
                r'''<button.*data-song-id=(["\'])%s\1.*''' % video_id,
@ -108,8 +124,10 @@ class MySpaceIE(InfoExtractor):
                return self._search_regex(
                    r'''data-%s=([\'"])(?P<data>.*?)\1''' % name,
                    song_data, name, default='', group='data')
-            stream_url = search_data('stream-url')
+            formats = formats_from_stream_urls(
-            if not stream_url:
+                search_data('stream-url'), search_data('hls-stream-url'),
                search_data('http-stream-url'))
            if not formats:
                vevo_id = search_data('vevo-id')
                youtube_id = search_data('youtube-id')
                if vevo_id:
@ -121,6 +139,7 @@ class MySpaceIE(InfoExtractor):
                else:
                    raise ExtractorError(
                        'Found song but don\'t know how to download it')
            self._sort_formats(formats)
            return {
                'id': video_id,
                'title': self._og_search_title(webpage),
@ -128,27 +147,16 @@ class MySpaceIE(InfoExtractor):
                'uploader_id': search_data('artist-username'),
                'thumbnail': self._og_search_thumbnail(webpage),
                'duration': int_or_none(search_data('duration')),
-                'formats': [rtmp_format_from_stream_url(stream_url)]
+                'formats': formats,
            }
        else:
            video = self._parse_json(self._search_regex(
                r'context = ({.*?});', webpage, 'context'),
                video_id)['video']
-            formats = []
+            formats = formats_from_stream_urls(
-            hls_stream_url = video.get('hlsStreamUrl')
+                video.get('streamUrl'), video.get('hlsStreamUrl'),
-            if hls_stream_url:
+                video.get('mp4StreamUrl'), int_or_none(video.get('width')),
-                formats.append({
+                int_or_none(video.get('height')))
-                    'format_id': 'hls',
-                    'url': hls_stream_url,
-                    'protocol': 'm3u8_native',
-                    'ext': 'mp4',
-                })
-            stream_url = video.get('streamUrl')
-            if stream_url:
-                formats.append(rtmp_format_from_stream_url(
-                    stream_url,
-                    int_or_none(video.get('width')),
-                    int_or_none(video.get('height'))))
            self._sort_formats(formats)
            return {
                'id': video_id,
--- a/youtube_dl/extractor/nbc.py
+++ b/youtube_dl/extractor/nbc.py
@ -4,23 +4,26 @@ import re
 from .common import InfoExtractor
 from .theplatform import ThePlatformIE
 from .adobepass import AdobePassIE
 from ..compat import compat_urllib_parse_urlparse
 from ..utils import (
    find_xpath_attr,
    lowercase_escape,
    smuggle_url,
    unescapeHTML,
    update_url_query,
    int_or_none,
 )
-class NBCIE(InfoExtractor):
+class NBCIE(AdobePassIE):
    _VALID_URL = r'https?://(?:www\.)?nbc\.com/(?:[^/]+/)+(?P<id>n?\d+)'
    _TESTS = [
        {
-            'url': 'http://www.nbc.com/the-tonight-show/segments/112966',
+            'url': 'http://www.nbc.com/the-tonight-show/video/jimmy-fallon-surprises-fans-at-ben-jerrys/2848237',
            'info_dict': {
-                'id': '112966',
+                'id': '2848237',
                'ext': 'mp4',
                'title': 'Jimmy Fallon Surprises Fans at Ben & Jerry\'s',
                'description': 'Jimmy gives out free scoops of his new "Tonight Dough" ice cream flavor by surprising customers at the Ben & Jerry\'s scoop shop.',
@ -69,7 +72,7 @@ class NBCIE(InfoExtractor):
            # HLS streams requires the 'hdnea3' cookie
            'url': 'http://www.nbc.com/Kings/video/goliath/n1806',
            'info_dict': {
-                'id': 'n1806',
+                'id': '101528f5a9e8127b107e98c5e6ce4638',
                'ext': 'mp4',
                'title': 'Goliath',
                'description': 'When an unknown soldier saves the life of the King\'s son in battle, he\'s thrust into the limelight and politics of the kingdom.',
@ -87,6 +90,46 @@ class NBCIE(InfoExtractor):
    def _real_extract(self, url):
        video_id = self._match_id(url)
        webpage = self._download_webpage(url, video_id)
        info = {
            '_type': 'url_transparent',
            'ie_key': 'ThePlatform',
            'id': video_id,
        }
        video_data = None
        preload = self._search_regex(
            r'PRELOAD\s*=\s*({.+})', webpage, 'preload data', default=None)
        if preload:
            preload_data = self._parse_json(preload, video_id)
            path = compat_urllib_parse_urlparse(url).path.rstrip('/')
            entity_id = preload_data.get('xref', {}).get(path)
            video_data = preload_data.get('entities', {}).get(entity_id)
        if video_data:
            query = {
                'mbr': 'true',
                'manifest': 'm3u',
            }
            video_id = video_data['guid']
            title = video_data['title']
            if video_data.get('entitlement') == 'auth':
                resource = self._get_mvpd_resource(
                    'nbcentertainment', title, video_id,
                    video_data.get('vChipRating'))
                query['auth'] = self._extract_mvpd_auth(
                    url, video_id, 'nbcentertainment', resource)
            theplatform_url = smuggle_url(update_url_query(
                'http://link.theplatform.com/s/NnzsPC/media/guid/2410887629/' + video_id,
                query), {'force_smil_url': True})
            info.update({
                'id': video_id,
                'title': title,
                'url': theplatform_url,
                'description': video_data.get('description'),
                'keywords': video_data.get('keywords'),
                'season_number': int_or_none(video_data.get('seasonNumber')),
                'episode_number': int_or_none(video_data.get('episodeNumber')),
                'series': video_data.get('showName'),
            })
        else:
            theplatform_url = unescapeHTML(lowercase_escape(self._html_search_regex(
                [
                    r'(?:class="video-player video-player-full" data-mpx-url|class="player" src)="(.*?)"',
@ -96,12 +139,8 @@ class NBCIE(InfoExtractor):
                webpage, 'theplatform url').replace('_no_endcard', '').replace('\\/', '/')))
            if theplatform_url.startswith('//'):
                theplatform_url = 'http:' + theplatform_url
-        return {
+            info['url'] = smuggle_url(theplatform_url, {'source_url': url})
-            '_type': 'url_transparent',
+        return info
-            'ie_key': 'ThePlatform',
-            'url': smuggle_url(theplatform_url, {'source_url': url}),
-            'id': video_id,
-        }
 class NBCSportsVPlayerIE(InfoExtractor):
--- a/youtube_dl/extractor/piksel.py
+++ b/youtube_dl/extractor/piksel.py
@ -16,7 +16,8 @@ from ..utils import (
 class PikselIE(InfoExtractor):
    _VALID_URL = r'https?://player\.piksel\.com/v/(?P<id>[a-z0-9]+)'
-    _TEST = {
+    _TESTS = [
        {
            'url': 'http://player.piksel.com/v/nv60p12f',
            'md5': 'd9c17bbe9c3386344f9cfd32fad8d235',
            'info_dict': {
@ -27,7 +28,21 @@ class PikselIE(InfoExtractor):
                'timestamp': 1465231790,
                'upload_date': '20160606',
            }
        },
        {
            # Original source: http://www.uscourts.gov/cameras-courts/state-washington-vs-donald-j-trump-et-al
            'url': 'https://player.piksel.com/v/v80kqp41',
            'md5': '753ddcd8cc8e4fa2dda4b7be0e77744d',
            'info_dict': {
                'id': 'v80kqp41',
                'ext': 'mp4',
                'title': 'WAW- State of Washington vs. Donald J. Trump, et al',
                'description': 'State of Washington vs. Donald J. Trump, et al, Case Number 17-CV-00141-JLR, TRO Hearing, Civil Rights Case, 02/3/2017, 1:00 PM (PST), Seattle Federal Courthouse, Seattle, WA, Judge James L. Robart presiding.',
                'timestamp': 1486171129,
                'upload_date': '20170204',
            }
        }
    ]
    @staticmethod
    def _extract_url(webpage):
@ -40,8 +55,10 @@ class PikselIE(InfoExtractor):
    def _real_extract(self, url):
        video_id = self._match_id(url)
        webpage = self._download_webpage(url, video_id)
-        app_token = self._search_regex(
+        app_token = self._search_regex([
-            r'clientAPI\s*:\s*"([^"]+)"', webpage, 'app token')
+            r'clientAPI\s*:\s*"([^"]+)"',
            r'data-de-api-key\s*=\s*"([^"]+)"'
        ], webpage, 'app token')
        response = self._download_json(
            'http://player.piksel.com/ws/ws_program/api/%s/mode/json/apiv/5' % app_token,
            video_id, query={
--- a/youtube_dl/extractor/pluralsight.py
+++ b/youtube_dl/extractor/pluralsight.py
@ -18,6 +18,7 @@ from ..utils import (
    parse_duration,
    qualities,
    srt_subtitles_timecode,
    update_url_query,
    urlencode_postdata,
 )
@ -92,6 +93,10 @@ class PluralsightIE(PluralsightBaseIE):
            raise ExtractorError('Unable to login: %s' % error, expected=True)
        if all(p not in response for p in ('__INITIAL_STATE__', '"currentUser"')):
            BLOCKED = 'Your account has been blocked due to suspicious activity'
            if BLOCKED in response:
                raise ExtractorError(
                    'Unable to login: %s' % BLOCKED, expected=True)
            raise ExtractorError('Unable to log in')
    def _get_subtitles(self, author, clip_id, lang, name, duration, video_id):
@ -327,25 +332,44 @@ class PluralsightCourseIE(PluralsightBaseIE):
        # TODO: PSM cookie
        course = self._download_json(
-            '%s/data/course/%s' % (self._API_BASE, course_id),
+            '%s/player/functions/rpc' % self._API_BASE, course_id,
-            course_id, 'Downloading course JSON')
+            'Downloading course JSON',
            data=json.dumps({
                'fn': 'bootstrapPlayer',
                'payload': {
                    'courseId': course_id,
                }
            }).encode('utf-8'),
            headers={
                'Content-Type': 'application/json;charset=utf-8'
            })['payload']['course']
        title = course['title']
        course_name = course['name']
        course_data = course['modules']
        description = course.get('description') or course.get('shortDescription')
-        course_data = self._download_json(
-            '%s/data/course/content/%s' % (self._API_BASE, course_id),
-            course_id, 'Downloading course data JSON')
        entries = []
        for num, module in enumerate(course_data, 1):
-            for clip in module.get('clips', []):
+            author = module.get('author')
-                player_parameters = clip.get('playerParameters')
+            module_name = module.get('name')
-                if not player_parameters:
+            if not author or not module_name:
                continue
            for clip in module.get('clips', []):
                clip_index = int_or_none(clip.get('index'))
                if clip_index is None:
                    continue
                clip_url = update_url_query(
                    '%s/player' % self._API_BASE, query={
                        'mode': 'live',
                        'course': course_name,
                        'author': author,
                        'name': module_name,
                        'clip': clip_index,
                    })
                entries.append({
                    '_type': 'url_transparent',
-                    'url': '%s/training/player?%s' % (self._API_BASE, player_parameters),
+                    'url': clip_url,
                    'ie_key': PluralsightIE.ie_key(),
                    'chapter': module.get('title'),
                    'chapter_number': num,
--- a/youtube_dl/extractor/pornhub.py
+++ b/youtube_dl/extractor/pornhub.py
@ -156,7 +156,18 @@ class PornHubIE(InfoExtractor):
        comment_count = self._extract_count(
            r'All Comments\s*<span>\(([\d,.]+)\)', webpage, 'comment')
-        video_urls = list(map(compat_urllib_parse_unquote, re.findall(r"player_quality_[0-9]{3}p\s*=\s*'([^']+)'", webpage)))
+        video_variables = {}
        for video_variablename, quote, video_variable in re.findall(
                r'(player_quality_[0-9]{3,4}p\w+)\s*=\s*(["\'])(.+?)\2;', webpage):
            video_variables[video_variablename] = video_variable
        video_urls = []
        for encoded_video_url in re.findall(
                r'player_quality_[0-9]{3,4}p\s*=(.+?);', webpage):
            for varname, varval in video_variables.items():
                encoded_video_url = encoded_video_url.replace(varname, varval)
            video_urls.append(re.sub(r'[\s+]', '', encoded_video_url))
        if webpage.find('"encrypted":true') != -1:
            password = compat_urllib_parse_unquote_plus(
                self._search_regex(r'"video_title":"([^"]+)', webpage, 'password'))
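
Note on the PornHub hunk above: the page apparently now assembles each video URL by concatenating helper variables in JavaScript, so the extractor first records every helper variable and then substitutes the values into the concatenation expression before stripping whitespace and `+` signs. The sketch below runs the same two regexes on a made-up page fragment; the variable names and URL in `sample_page` are assumptions for illustration.

```python
import re


def resolve_video_urls(webpage):
    """Substitute helper variables into player_quality_NNNp expressions."""
    # Step 1: collect helper variables such as player_quality_720p_a = '...';
    video_variables = {}
    for name, _quote, value in re.findall(
            r'(player_quality_[0-9]{3,4}p\w+)\s*=\s*(["\'])(.+?)\2;', webpage):
        video_variables[name] = value

    # Step 2: expand each player_quality_NNNp = a + b; expression.
    video_urls = []
    for expression in re.findall(
            r'player_quality_[0-9]{3,4}p\s*=(.+?);', webpage):
        for varname, varval in video_variables.items():
            expression = expression.replace(varname, varval)
        # Drop the whitespace and '+' operators left over from the JS.
        video_urls.append(re.sub(r'[\s+]', '', expression))
    return video_urls


# Made-up page fragment, for illustration only.
sample_page = '''
var player_quality_720p_a = 'http://example.com/';
var player_quality_720p_b = 'video.mp4';
var player_quality_720p = player_quality_720p_a + player_quality_720p_b;
'''
urls = resolve_video_urls(sample_page)
```

Plain string replacement is order-sensitive when one variable name is a prefix of another; the sample avoids that case.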
--- a/youtube_dl/extractor/radiocanada.py
+++ b/youtube_dl/extractor/radiocanada.py
@ -54,9 +54,8 @@ class RadioCanadaIE(InfoExtractor):
            raise ExtractorError('This video is DRM protected.', expected=True)
        device_types = ['ipad']
-        if app_code != 'toutv':
-            device_types.append('flash')
        if not smuggled_data:
+            device_types.append('flash')
            device_types.append('android')
        formats = []
@ -103,7 +102,7 @@ class RadioCanadaIE(InfoExtractor):
                        continue
                    f_url = re.sub(r'\d+\.%s' % ext, '%d.%s' % (tbr, ext), v_url)
                    protocol = determine_protocol({'url': f_url})
-                    formats.append({
+                    f = {
                        'format_id': '%s-%d' % (protocol, tbr),
                        'url': f_url,
                        'ext': 'flv' if protocol == 'rtmp' else ext,
@ -111,7 +110,14 @@ class RadioCanadaIE(InfoExtractor):
                        'width': int_or_none(url_e.get('width')),
                        'height': int_or_none(url_e.get('height')),
                        'tbr': tbr,
-                    })
+                    }
+                    mobj = re.match(r'(?P<url>rtmp://[^/]+/[^/]+)/(?P<playpath>[^?]+)(?P<auth>\?.+)', f_url)
+                    if mobj:
+                        f.update({
+                            'url': mobj.group('url') + mobj.group('auth'),
+                            'play_path': mobj.group('playpath'),
+                        })
+                    formats.append(f)
                    if protocol == 'rtsp':
                        base_url = self._search_regex(
                            r'rtsp://([^?]+)', f_url, 'base url', default=None)
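The second radiocanada hunk splits an RTMP URL into a connection URL (host, application, and auth query) and a separate play path. The sketch below runs the same regular expression on its own. The sample URL and the helper name `split_rtmp_url` are assumptions for illustration only.

```python
import re

# Same pattern as in the hunk: host + app, then play path, then auth query.
RTMP_RE = r'(?P<url>rtmp://[^/]+/[^/]+)/(?P<playpath>[^?]+)(?P<auth>\?.+)'


def split_rtmp_url(f_url):
    """Return the fields the hunk writes into the format dict, or None."""
    mobj = re.match(RTMP_RE, f_url)
    if not mobj:
        # URLs without an auth query (or non-RTMP URLs) are left untouched.
        return None
    return {
        'url': mobj.group('url') + mobj.group('auth'),
        'play_path': mobj.group('playpath'),
    }


# Hypothetical input, not taken from the site.
sample = 'rtmp://media.example.com/ondemand/mp4:shows/ep1_1200.mp4?auth=abc123'
print(split_rtmp_url(sample))
```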
--- a/youtube_dl/extractor/scrippsnetworks.py
+++ b/youtube_dl/extractor/scrippsnetworks.py
@ -0,0 +1,60 @@
 # coding: utf-8
 from __future__ import unicode_literals
 from .adobepass import AdobePassIE
 from ..utils import (
    int_or_none,
    smuggle_url,
    update_url_query,
 )
 class ScrippsNetworksWatchIE(AdobePassIE):
    IE_NAME = 'scrippsnetworks:watch'
    _VALID_URL = r'https?://watch\.(?:hgtv|foodnetwork|travelchannel|diynetwork|cookingchanneltv)\.com/player\.[A-Z0-9]+\.html#(?P<id>\d+)'
    _TEST = {
        'url': 'http://watch.hgtv.com/player.HNT.html#0256538',
        'md5': '26545fd676d939954c6808274bdb905a',
        'info_dict': {
            'id': '0256538',
            'ext': 'mp4',
            'title': 'Seeking a Wow House',
            'description': 'Buyers retiring in Palm Springs, California, want a modern house with major wow factor. They\'re also looking for a pool and a large, open floorplan with tall windows looking out at the views.',
            'uploader': 'SCNI',
            'upload_date': '20170207',
            'timestamp': 1486450493,
        },
        'skip': 'requires TV provider authentication',
    }
    def _real_extract(self, url):
        video_id = self._match_id(url)
        webpage = self._download_webpage(url, video_id)
        channel = self._parse_json(self._search_regex(
            r'"channels"\s*:\s*(\[.+\])',
            webpage, 'channels'), video_id)[0]
        video_data = next(v for v in channel['videos'] if v.get('nlvid') == video_id)
        title = video_data['title']
        release_url = video_data['releaseUrl']
        if video_data.get('restricted'):
            requestor_id = self._search_regex(
                r'requestorId\s*=\s*"([^"]+)";', webpage, 'requestor id')
            resource = self._get_mvpd_resource(
                requestor_id, title, video_id,
                video_data.get('ratings', [{}])[0].get('rating'))
            auth = self._extract_mvpd_auth(
                url, video_id, requestor_id, resource)
            release_url = update_url_query(release_url, {'auth': auth})
        return {
            '_type': 'url_transparent',
            'id': video_id,
            'title': title,
            'url': smuggle_url(release_url, {'force_smil_url': True}),
            'description': video_data.get('description'),
            'thumbnail': video_data.get('thumbnailUrl'),
            'series': video_data.get('showTitle'),
            'season_number': int_or_none(video_data.get('season')),
            'episode_number': int_or_none(video_data.get('episodeNumber')),
            'ie_key': 'ThePlatform',
        }
--- a/youtube_dl/extractor/sixplay.py
+++ b/youtube_dl/extractor/sixplay.py
@ -1,64 +1,101 @@
 # coding: utf-8
 from __future__ import unicode_literals
 import re
 from .common import InfoExtractor
 from ..compat import compat_str
 from ..utils import (
-    qualities,
-    int_or_none,
-    mimetype2ext,
    determine_ext,
+    int_or_none,
+    try_get,
+    qualities,
 )
 class SixPlayIE(InfoExtractor):
    IE_NAME = '6play'
    _VALID_URL = r'(?:6play:|https?://(?:www\.)?6play\.fr/.+?-c_)(?P<id>[0-9]+)'
    _TEST = {
-        'url': 'http://www.6play.fr/jamel-et-ses-amis-au-marrakech-du-rire-p_1316/jamel-et-ses-amis-au-marrakech-du-rire-2015-c_11495320',
+        'url': 'http://www.6play.fr/le-meilleur-patissier-p_1807/le-meilleur-patissier-special-fetes-mercredi-a-21-00-sur-m6-c_11638450',
        'md5': '42310bffe4ba3982db112b9cd3467328',
        'info_dict': {
-            'id': '11495320',
+            'id': '11638450',
            'ext': 'mp4',
-            'title': 'Jamel et ses amis au Marrakech du rire 2015',
+            'title': 'Le Meilleur Pâtissier, spécial fêtes mercredi à 21:00 sur M6',
-            'description': 'md5:ba2149d5c321d5201b78070ee839d872',
+            'description': 'md5:308853f6a5f9e2d55a30fc0654de415f',
            'duration': 39,
            'series': 'Le meilleur pâtissier',
        },
        'params': {
            'skip_download': True,
        },
    }
    def _real_extract(self, url):
        video_id = self._match_id(url)
        clip_data = self._download_json(
            'https://player.m6web.fr/v2/video/config/6play-auth/FR/%s.json' % video_id,
            video_id)
        video_data = clip_data['videoInfo']
        data = self._download_json(
            'https://pc.middleware.6play.fr/6play/v2/platforms/m6group_web/services/6play/videos/clip_%s' % video_id,
            video_id, query={
                'csa': 5,
                'with': 'clips',
            })
        clip_data = data['clips'][0]
        title = clip_data['title']
        urls = []
        quality_key = qualities(['lq', 'sd', 'hq', 'hd'])
        formats = []
-        for source in clip_data['sources']:
+        for asset in clip_data['assets']:
-            source_type, source_url = source.get('type'), source.get('src')
+            asset_url = asset.get('full_physical_path')
-            if not source_url or source_type == 'hls/primetime':
+            protocol = asset.get('protocol')
            if not asset_url or protocol == 'primetime' or asset_url in urls:
                continue
-            ext = mimetype2ext(source_type) or determine_ext(source_url)
+            urls.append(asset_url)
-            if ext == 'm3u8':
+            container = asset.get('video_container')
            ext = determine_ext(asset_url)
            if container == 'm3u8' or ext == 'm3u8':
                if protocol == 'usp':
                    asset_url = re.sub(r'/([^/]+)\.ism/[^/]*\.m3u8', r'/\1.ism/\1.m3u8', asset_url)
                    formats.extend(self._extract_m3u8_formats(
-                    source_url, video_id, 'mp4', 'm3u8_native',
+                        asset_url, video_id, 'mp4', 'm3u8_native',
                        m3u8_id='hls', fatal=False))
                    formats.extend(self._extract_f4m_formats(
-                    source_url.replace('.m3u8', '.f4m'),
+                        asset_url.replace('.m3u8', '.f4m'),
                        video_id, f4m_id='hds', fatal=False))
-            elif ext == 'mp4':
+                    formats.extend(self._extract_mpd_formats(
-                quality = source.get('quality')
+                        asset_url.replace('.m3u8', '.mpd'),
                        video_id, mpd_id='dash', fatal=False))
                    formats.extend(self._extract_ism_formats(
                        re.sub(r'/[^/]+\.m3u8', '/Manifest', asset_url),
                        video_id, ism_id='mss', fatal=False))
                else:
                    formats.extend(self._extract_m3u8_formats(
                        asset_url, video_id, 'mp4', 'm3u8_native',
                        m3u8_id='hls', fatal=False))
            elif container == 'mp4' or ext == 'mp4':
                quality = asset.get('video_quality')
                formats.append({
-                    'url': source_url,
+                    'url': asset_url,
                    'format_id': quality,
                    'quality': quality_key(quality),
                    'ext': ext,
                })
        self._sort_formats(formats)
        def get(getter):
            for src in (data, clip_data):
                v = try_get(src, getter, compat_str)
                if v:
                    return v
        return {
            'id': video_id,
-            'title': video_data['title'].strip(),
+            'title': title,
-            'description': video_data.get('description'),
+            'description': get(lambda x: x['description']),
-            'duration': int_or_none(video_data.get('duration')),
+            'duration': int_or_none(clip_data.get('duration')),
-            'series': video_data.get('titlePgm'),
+            'series': get(lambda x: x['program']['title']),
            'formats': formats,
        }
--- a/youtube_dl/extractor/sportbox.py
+++ b/youtube_dl/extractor/sportbox.py
@ -4,65 +4,7 @@ from __future__ import unicode_literals
 import re
 from .common import InfoExtractor
-from ..compat import compat_urlparse
-from ..utils import (
-    js_to_json,
-    unified_strdate,
-)
+from ..utils import js_to_json
-class SportBoxIE(InfoExtractor):
-    _VALID_URL = r'https?://news\.sportbox\.ru/(?:[^/]+/)+spbvideo_NI\d+_(?P<display_id>.+)'
-    _TESTS = [{
-        'url': 'http://news.sportbox.ru/Vidy_sporta/Avtosport/Rossijskij/spbvideo_NI483529_Gonka-2-zaezd-Obyedinenniy-2000-klassi-Turing-i-S',
-        'md5': 'ff56a598c2cf411a9a38a69709e97079',
-        'info_dict': {
-            'id': '80822',
-            'ext': 'mp4',
-            'title': 'Гонка 2  заезд ««Объединенный 2000»: классы Туринг и Супер-продакшн',
-            'description': 'md5:3d72dc4a006ab6805d82f037fdc637ad',
-            'thumbnail': r're:^https?://.*\.jpg$',
-            'upload_date': '20140928',
-        },
-        'params': {
-            # m3u8 download
-            'skip_download': True,
-        },
-    }, {
-        'url': 'http://news.sportbox.ru/Vidy_sporta/billiard/spbvideo_NI486287_CHempionat-mira-po-dinamichnoy-piramide-4',
-        'only_matching': True,
-    }, {
-        'url': 'http://news.sportbox.ru/video/no_ads/spbvideo_NI536574_V_Novorossijske_proshel_detskij_turnir_Pole_slavy_bojevoj?ci=211355',
-        'only_matching': True,
-    }]
-    def _real_extract(self, url):
-        mobj = re.match(self._VALID_URL, url)
-        display_id = mobj.group('display_id')
-        webpage = self._download_webpage(url, display_id)
-        player = self._search_regex(
-            r'src="/?(vdl/player/[^"]+)"', webpage, 'player')
-        title = self._html_search_regex(
-            [r'"nodetitle"\s*:\s*"([^"]+)"', r'class="node-header_{1,2}title">([^<]+)'],
-            webpage, 'title')
-        description = self._og_search_description(webpage) or self._html_search_meta(
-            'description', webpage, 'description')
-        thumbnail = self._og_search_thumbnail(webpage)
-        upload_date = unified_strdate(self._html_search_meta(
-            'dateCreated', webpage, 'upload date'))
-        return {
-            '_type': 'url_transparent',
-            'url': compat_urlparse.urljoin(url, '/%s' % player),
-            'display_id': display_id,
-            'title': title,
-            'description': description,
-            'thumbnail': thumbnail,
-            'upload_date': upload_date,
-        }
 class SportBoxEmbedIE(InfoExtractor):
--- a/youtube_dl/extractor/sprout.py
+++ b/youtube_dl/extractor/sprout.py
@ -0,0 +1,52 @@
 # coding: utf-8
 from __future__ import unicode_literals
 from .adobepass import AdobePassIE
 from ..utils import (
    extract_attributes,
    update_url_query,
    smuggle_url,
 )
 class SproutIE(AdobePassIE):
    _VALID_URL = r'https?://(?:www\.)?sproutonline\.com/watch/(?P<id>[^/?#]+)'
    _TEST = {
        'url': 'http://www.sproutonline.com/watch/cowboy-adventure',
        'md5': '74bf14128578d1e040c3ebc82088f45f',
        'info_dict': {
            'id': '9dexnwtmh8_X',
            'ext': 'mp4',
            'title': 'A Cowboy Adventure',
            'description': 'Ruff-Ruff, Tweet and Dave get to be cowboys for the day at Six Cow Corral.',
            'timestamp': 1437758640,
            'upload_date': '20150724',
            'uploader': 'NBCU-SPROUT-NEW',
        }
    }
    def _real_extract(self, url):
        video_id = self._match_id(url)
        webpage = self._download_webpage(url, video_id)
        video_component = self._search_regex(
            r'(?s)(<div[^>]+data-component="video"[^>]*?>)',
            webpage, 'video component', default=None)
        if video_component:
            options = self._parse_json(extract_attributes(
                video_component)['data-options'], video_id)
            theplatform_url = options['video']
            query = {
                'mbr': 'true',
                'manifest': 'm3u',
            }
            if options.get('protected'):
                query['auth'] = self._extract_mvpd_auth(url, options['pid'], 'sprout', 'sprout')
            theplatform_url = smuggle_url(update_url_query(
                theplatform_url, query), {'force_smil_url': True})
        else:
            iframe = self._search_regex(
                r'(<iframe[^>]+id="sproutVideoIframe"[^>]*?>)',
                webpage, 'iframe')
            theplatform_url = extract_attributes(iframe)['src']
        return self.url_result(theplatform_url, 'ThePlatform')
--- a/youtube_dl/extractor/theplatform.py
+++ b/youtube_dl/extractor/theplatform.py
@ -306,9 +306,10 @@ class ThePlatformFeedIE(ThePlatformBaseIE):
        },
    }]
-    def _extract_feed_info(self, provider_id, feed_id, filter_query, video_id, custom_fields=None, asset_types_query={}):
+    def _extract_feed_info(self, provider_id, feed_id, filter_query, video_id, custom_fields=None, asset_types_query={}, account_id=None):
        real_url = self._URL_TEMPLATE % (self.http_scheme(), provider_id, feed_id, filter_query)
        entry = self._download_json(real_url, video_id)['entries'][0]
+        main_smil_url = 'http://link.theplatform.com/s/%s/media/guid/%d/%s' % (provider_id, account_id, entry['guid']) if account_id else None
        formats = []
        subtitles = {}
@ -333,7 +334,7 @@ class ThePlatformFeedIE(ThePlatformBaseIE):
                if asset_type in asset_types_query:
                    query.update(asset_types_query[asset_type])
                cur_formats, cur_subtitles = self._extract_theplatform_smil(update_url_query(
-                    smil_url, query), video_id, 'Downloading SMIL data for %s' % asset_type)
+                    main_smil_url or smil_url, query), video_id, 'Downloading SMIL data for %s' % asset_type)
                formats.extend(cur_formats)
                subtitles = self._merge_subtitles(subtitles, cur_subtitles)
--- a/youtube_dl/extractor/turner.py
+++ b/youtube_dl/extractor/turner.py
@ -100,9 +100,13 @@ class TurnerBaseIE(AdobePassIE):
                formats.extend(self._extract_smil_formats(
                    video_url, video_id, fatal=False))
            elif ext == 'm3u8':
-                formats.extend(self._extract_m3u8_formats(
+                m3u8_formats = self._extract_m3u8_formats(
                    video_url, video_id, 'mp4',
-                    m3u8_id=format_id or 'hls', fatal=False))
+                    m3u8_id=format_id or 'hls', fatal=False)
+                if '/secure/' in video_url and '?hdnea=' in video_url:
+                    for f in m3u8_formats:
+                        f['_seekable'] = False
+                formats.extend(m3u8_formats)
            elif ext == 'f4m':
                formats.extend(self._extract_f4m_formats(
                    update_url_query(video_url, {'hdcore': '3.7.0'}),
--- a/youtube_dl/extractor/tvplayer.py
+++ b/youtube_dl/extractor/tvplayer.py
@ -0,0 +1,75 @@
 # coding: utf-8
 from __future__ import unicode_literals
 from .common import InfoExtractor
 from ..compat import compat_HTTPError
 from ..utils import (
    extract_attributes,
    urlencode_postdata,
    ExtractorError,
 )
 class TVPlayerIE(InfoExtractor):
    _VALID_URL = r'https?://(?:www\.)?tvplayer\.com/watch/(?P<id>[^/?#]+)'
    _TEST = {
        'url': 'http://tvplayer.com/watch/bbcone',
        'info_dict': {
            'id': '89',
            'ext': 'mp4',
            'title': r're:^BBC One [0-9]{4}-[0-9]{2}-[0-9]{2} [0-9]{2}:[0-9]{2}$',
        },
        'params': {
            # m3u8 download
            'skip_download': True,
        }
    }
    def _real_extract(self, url):
        display_id = self._match_id(url)
        webpage = self._download_webpage(url, display_id)
        current_channel = extract_attributes(self._search_regex(
            r'(<div[^>]+class="[^"]*current-channel[^"]*"[^>]*>)',
            webpage, 'channel element'))
        title = current_channel['data-name']
        resource_id = self._search_regex(
            r'resourceId\s*=\s*"(\d+)"', webpage, 'resource id')
        platform = self._search_regex(
            r'platform\s*=\s*"([^"]+)"', webpage, 'platform')
        token = self._search_regex(
            r'token\s*=\s*"([^"]+)"', webpage, 'token', default='null')
        validate = self._search_regex(
            r'validate\s*=\s*"([^"]+)"', webpage, 'validate', default='null')
        try:
            response = self._download_json(
                'http://api.tvplayer.com/api/v2/stream/live',
                resource_id, headers={
                    'Content-Type': 'application/x-www-form-urlencoded; charset=UTF-8',
                }, data=urlencode_postdata({
                    'service': 1,
                    'platform': platform,
                    'id': resource_id,
                    'token': token,
                    'validate': validate,
                }))['tvplayer']['response']
        except ExtractorError as e:
            if isinstance(e.cause, compat_HTTPError):
                response = self._parse_json(
                    e.cause.read().decode(), resource_id)['tvplayer']['response']
                raise ExtractorError(
                    '%s said: %s' % (self.IE_NAME, response['error']), expected=True)
            raise
        formats = self._extract_m3u8_formats(response['stream'], resource_id, 'mp4')
        self._sort_formats(formats)
        return {
            'id': resource_id,
            'display_id': display_id,
            'title': self._live_title(title),
            'formats': formats,
            'is_live': True,
        }
--- a/youtube_dl/extractor/twitch.py
+++ b/youtube_dl/extractor/twitch.py
@ -447,7 +447,14 @@ class TwitchHighlightsIE(TwitchVideosBaseIE):
 class TwitchStreamIE(TwitchBaseIE):
    IE_NAME = 'twitch:stream'
-    _VALID_URL = r'%s/(?P<id>[^/#?]+)/?(?:\#.*)?$' % TwitchBaseIE._VALID_URL_BASE
+    _VALID_URL = r'''(?x)
+                    https?://
+                        (?:
+                            (?:www\.)?twitch\.tv/|
+                            player\.twitch\.tv/\?.*?\bchannel=
+                        )
+                        (?P<id>[^/#?]+)
+                    '''
    _TESTS = [{
        'url': 'http://www.twitch.tv/shroomztv',
@ -471,8 +478,25 @@ class TwitchStreamIE(TwitchBaseIE):
    }, {
        'url': 'http://www.twitch.tv/miracle_doto#profile-0',
        'only_matching': True,
+    }, {
+        'url': 'https://player.twitch.tv/?channel=lotsofs',
+        'only_matching': True,
    }]
+    @classmethod
+    def suitable(cls, url):
+        return (False
+                if any(ie.suitable(url) for ie in (
+                    TwitchVideoIE,
+                    TwitchChapterIE,
+                    TwitchVodIE,
+                    TwitchProfileIE,
+                    TwitchAllVideosIE,
+                    TwitchUploadsIE,
+                    TwitchPastBroadcastsIE,
+                    TwitchHighlightsIE))
+                else super(TwitchStreamIE, cls).suitable(url))
    def _real_extract(self, url):
        channel_id = self._match_id(url)
--- a/youtube_dl/extractor/videopress.py
+++ b/youtube_dl/extractor/videopress.py
@ -0,0 +1,99 @@
 # coding: utf-8
 from __future__ import unicode_literals
 import random
 import re
 from .common import InfoExtractor
 from ..compat import compat_str
 from ..utils import (
    determine_ext,
    float_or_none,
    parse_age_limit,
    qualities,
    try_get,
    unified_timestamp,
    urljoin,
 )
 class VideoPressIE(InfoExtractor):
    _VALID_URL = r'https?://videopress\.com/embed/(?P<id>[\da-zA-Z]+)'
    _TESTS = [{
        'url': 'https://videopress.com/embed/kUJmAcSf',
        'md5': '706956a6c875873d51010921310e4bc6',
        'info_dict': {
            'id': 'kUJmAcSf',
            'ext': 'mp4',
            'title': 'VideoPress Demo',
            'thumbnail': r're:^https?://.*\.jpg',
            'duration': 634.6,
            'timestamp': 1434983935,
            'upload_date': '20150622',
            'age_limit': 0,
        },
    }, {
        # 17+, requires birth_* params
        'url': 'https://videopress.com/embed/iH3gstfZ',
        'only_matching': True,
    }]
    @staticmethod
    def _extract_urls(webpage):
        return re.findall(
            r'<iframe[^>]+src=["\']((?:https?://)?videopress\.com/embed/[\da-zA-Z]+)',
            webpage)
    def _real_extract(self, url):
        video_id = self._match_id(url)
        video = self._download_json(
            'https://public-api.wordpress.com/rest/v1.1/videos/%s' % video_id,
            video_id, query={
                'birth_month': random.randint(1, 12),
                'birth_day': random.randint(1, 31),
                'birth_year': random.randint(1950, 1995),
            })
        title = video['title']
        def base_url(scheme):
            return try_get(
                video, lambda x: x['file_url_base'][scheme], compat_str)
        base_url = base_url('https') or base_url('http')
        QUALITIES = ('std', 'dvd', 'hd')
        quality = qualities(QUALITIES)
        formats = []
        for format_id, f in video['files'].items():
            if not isinstance(f, dict):
                continue
            for ext, path in f.items():
                if ext in ('mp4', 'ogg'):
                    formats.append({
                        'url': urljoin(base_url, path),
                        'format_id': '%s-%s' % (format_id, ext),
                        'ext': determine_ext(path, ext),
                        'quality': quality(format_id),
                    })
        original_url = try_get(video, lambda x: x['original'], compat_str)
        if original_url:
            formats.append({
                'url': original_url,
                'format_id': 'original',
                'quality': len(QUALITIES),
            })
        self._sort_formats(formats)
        return {
            'id': video_id,
            'title': title,
            'description': video.get('description'),
            'thumbnail': video.get('poster'),
            'duration': float_or_none(video.get('duration'), 1000),
            'timestamp': unified_timestamp(video.get('upload_date')),
            'age_limit': parse_age_limit(video.get('rating')),
            'formats': formats,
        }
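The VideoPress extractor above ranks its formats with `qualities(QUALITIES)` and gives the `original` file the rank `len(QUALITIES)`, so it sorts above every named quality. The sketch below shows that ordering in isolation. The local `qualities` function is a stand-in written for this example, assuming the real helper returns an id's position in the list and -1 for unknown ids. `rank_formats` is a hypothetical name.

```python
def qualities(quality_ids):
    """Stand-in (assumption) for the helper imported from ..utils."""
    def q(qid):
        try:
            return quality_ids.index(qid)
        except ValueError:
            # Unknown ids rank below every known one.
            return -1
    return q


QUALITIES = ('std', 'dvd', 'hd')
quality = qualities(QUALITIES)


def rank_formats(format_ids, has_original=False):
    """Return (format_id, rank) pairs sorted from worst to best."""
    ranked = [(fid, quality(fid)) for fid in format_ids]
    if has_original:
        # The original upload gets a rank one past the last named quality.
        ranked.append(('original', len(QUALITIES)))
    return sorted(ranked, key=lambda item: item[1])


print(rank_formats(['hd', 'std', 'unknown'], has_original=True))
```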
--- a/youtube_dl/extractor/vine.py
+++ b/youtube_dl/extractor/vine.py
@ -6,8 +6,9 @@ import itertools
 from .common import InfoExtractor
 from ..utils import (
    determine_ext,
    int_or_none,
-    unified_strdate,
+    unified_timestamp,
 )
@ -20,50 +21,16 @@ class VineIE(InfoExtractor):
            'id': 'b9KOOWX7HUx',
            'ext': 'mp4',
            'title': 'Chicken.',
-            'alt_title': 'Vine by Jack Dorsey',
+            'alt_title': 'Vine by Jack',
            'timestamp': 1368997951,
            'upload_date': '20130519',
-            'uploader': 'Jack Dorsey',
+            'uploader': 'Jack',
            'uploader_id': '76',
            'view_count': int,
            'like_count': int,
            'comment_count': int,
            'repost_count': int,
        },
    }, {
        'url': 'https://vine.co/v/MYxVapFvz2z',
        'md5': '7b9a7cbc76734424ff942eb52c8f1065',
        'info_dict': {
            'id': 'MYxVapFvz2z',
            'ext': 'mp4',
            'title': 'Fuck Da Police #Mikebrown #justice #ferguson #prayforferguson #protesting #NMOS14',
            'alt_title': 'Vine by Mars Ruiz',
            'upload_date': '20140815',
            'uploader': 'Mars Ruiz',
            'uploader_id': '1102363502380728320',
            'view_count': int,
            'like_count': int,
            'comment_count': int,
            'repost_count': int,
        },
    }, {
        'url': 'https://vine.co/v/bxVjBbZlPUH',
        'md5': 'ea27decea3fa670625aac92771a96b73',
        'info_dict': {
            'id': 'bxVjBbZlPUH',
            'ext': 'mp4',
            'title': '#mw3 #ac130 #killcam #angelofdeath',
            'alt_title': 'Vine by Z3k3',
            'upload_date': '20130430',
            'uploader': 'Z3k3',
            'uploader_id': '936470460173008896',
            'view_count': int,
            'like_count': int,
            'comment_count': int,
            'repost_count': int,
        },
    }, {
        'url': 'https://vine.co/oembed/MYxVapFvz2z.json',
        'only_matching': True,
    }, {
        'url': 'https://vine.co/v/e192BnZnZ9V',
        'info_dict': {
@ -71,6 +38,7 @@ class VineIE(InfoExtractor):
            'ext': 'mp4',
            'title': 'ยิ้ม~ เขิน~ อาย~ น่าร้ากอ้ะ >//< @n_whitewo @orlameena #lovesicktheseries  #lovesickseason2',
            'alt_title': 'Vine by Pimry_zaa',
            'timestamp': 1436057405,
            'upload_date': '20150705',
            'uploader': 'Pimry_zaa',
            'uploader_id': '1135760698325307392',
@ -82,43 +50,60 @@ class VineIE(InfoExtractor):
        'params': {
            'skip_download': True,
        },
    }, {
        'url': 'https://vine.co/v/MYxVapFvz2z',
        'only_matching': True,
    }, {
        'url': 'https://vine.co/v/bxVjBbZlPUH',
        'only_matching': True,
    }, {
        'url': 'https://vine.co/oembed/MYxVapFvz2z.json',
        'only_matching': True,
    }]
    def _real_extract(self, url):
        video_id = self._match_id(url)
        webpage = self._download_webpage('https://vine.co/v/' + video_id, video_id)
-        data = self._parse_json(
-            self._search_regex(
-                r'window\.POST_DATA\s*=\s*({.+?});\s*</script>',
-                webpage, 'vine data'),
-            video_id)
-        data = data[list(data.keys())[0]]
-
-        formats = [{
-            'format_id': '%(format)s-%(rate)s' % f,
-            'vcodec': f.get('format'),
-            'quality': f.get('rate'),
-            'url': f['videoUrl'],
-        } for f in data['videoUrls'] if f.get('videoUrl')]
+        data = self._download_json(
+            'https://archive.vine.co/posts/%s.json' % video_id, video_id)
+        def video_url(kind):
+            for url_suffix in ('Url', 'URL'):
+                format_url = data.get('video%s%s' % (kind, url_suffix))
+                if format_url:
+                    return format_url
+        formats = []
+        for quality, format_id in enumerate(('low', '', 'dash')):
+            format_url = video_url(format_id.capitalize())
+            if not format_url:
+                continue
+            # DASH link returns plain mp4
+            if format_id == 'dash' and determine_ext(format_url) == 'mpd':
+                formats.extend(self._extract_mpd_formats(
+                    format_url, video_id, mpd_id='dash', fatal=False))
+            else:
+                formats.append({
+                    'url': format_url,
+                    'format_id': format_id or 'standard',
+                    'quality': quality,
+                })
+        self._sort_formats(formats)
        username = data.get('username')
        return {
            'id': video_id,
-            'title': data.get('description') or self._og_search_title(webpage),
+            'title': data.get('description'),
-            'alt_title': 'Vine by %s' % username if username else self._og_search_description(webpage, default=None),
+            'alt_title': 'Vine by %s' % username if username else None,
            'thumbnail': data.get('thumbnailUrl'),
-            'upload_date': unified_strdate(data.get('created')),
+            'timestamp': unified_timestamp(data.get('created')),
            'uploader': username,
            'uploader_id': data.get('userIdStr'),
-            'view_count': int_or_none(data.get('loops', {}).get('count')),
+            'view_count': int_or_none(data.get('loops')),
-            'like_count': int_or_none(data.get('likes', {}).get('count')),
+            'like_count': int_or_none(data.get('likes')),
-            'comment_count': int_or_none(data.get('comments', {}).get('count')),
+            'comment_count': int_or_none(data.get('comments')),
-            'repost_count': int_or_none(data.get('reposts', {}).get('count')),
+            'repost_count': int_or_none(data.get('reposts')),
            'formats': formats,
        }
--- a/youtube_dl/extractor/vk.py
+++ b/youtube_dl/extractor/vk.py
@ -281,6 +281,11 @@ class VKIE(VKBaseIE):
        {
            'url': 'http://new.vk.com/video205387401_165548505',
            'only_matching': True,
        },
        {
            # This video is no longer available, because its author has been blocked.
            'url': 'https://vk.com/video-10639516_456240611',
            'only_matching': True,
        }
    ]
@ -328,6 +333,12 @@ class VKIE(VKBaseIE):
            r'<!>Access denied':
            'Access denied to video %s.',
            r'<!>Видеозапись недоступна, так как её автор был заблокирован.':
            'Video %s is no longer available, because its author has been blocked.',
            r'<!>This video is no longer available, because its author has been blocked.':
            'Video %s is no longer available, because its author has been blocked.',
        }
        for error_re, error_msg in ERRORS.items():
--- a/youtube_dl/extractor/xtube.py
+++ b/youtube_dl/extractor/xtube.py
@ -44,6 +44,9 @@ class XTubeIE(InfoExtractor):
    }, {
        'url': 'xtube:625837',
        'only_matching': True,
    }, {
        'url': 'xtube:kVTUy_G222_',
        'only_matching': True,
    }]
    def _real_extract(self, url):
@ -53,14 +56,20 @@ class XTubeIE(InfoExtractor):
        if not display_id:
            display_id = video_id
            url = 'http://www.xtube.com/watch.php?v=%s' % video_id
-        req = sanitized_Request(url)
-        req.add_header('Cookie', 'age_verified=1; cookiesAccepted=1')
-        webpage = self._download_webpage(req, display_id)
+        if video_id.isdigit() and len(video_id) < 11:
+            url_pattern = 'http://www.xtube.com/video-watch/-%s'
+        else:
+            url_pattern = 'http://www.xtube.com/watch.php?v=%s'
+        webpage = self._download_webpage(
+            url_pattern % video_id, display_id, headers={
+                'Cookie': 'age_verified=1; cookiesAccepted=1',
+            })
        sources = self._parse_json(self._search_regex(
-            r'sources\s*:\s*({.+?}),', webpage, 'sources'), video_id)
+            r'(["\'])sources\1\s*:\s*(?P<sources>{.+?}),',
+            webpage, 'sources', group='sources'), video_id)
        formats = []
        for format_id, format_url in sources.items():
@ -72,7 +81,7 @@ class XTubeIE(InfoExtractor):
        self._sort_formats(formats)
        title = self._search_regex(
-            (r'<h1>(?P<title>[^<]+)</h1>', r'videoTitle\s*:\s*(["\'])(?P<title>.+?)\1'),
+            (r'<h1>\s*(?P<title>[^<]+?)\s*</h1>', r'videoTitle\s*:\s*(["\'])(?P<title>.+?)\1'),
            webpage, 'title', group='title')
        description = self._search_regex(
            r'</h1>\s*<p>([^<]+)', webpage, 'description', fatal=False)
@ -81,10 +90,10 @@ class XTubeIE(InfoExtractor):
             r'<span[^>]+class="nickname"[^>]*>([^<]+)'),
            webpage, 'uploader', fatal=False)
        duration = parse_duration(self._search_regex(
-            r'<dt>Runtime:</dt>\s*<dd>([^<]+)</dd>',
+            r'<dt>Runtime:?</dt>\s*<dd>([^<]+)</dd>',
            webpage, 'duration', fatal=False))
        view_count = str_to_int(self._search_regex(
-            r'<dt>Views:</dt>\s*<dd>([\d,\.]+)</dd>',
+            r'<dt>Views:?</dt>\s*<dd>([\d,\.]+)</dd>',
            webpage, 'view count', fatal=False))
        comment_count = str_to_int(self._html_search_regex(
            r'>Comments? \(([\d,\.]+)\)<',
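Review note: a minimal sketch of the two behaviours this file's hunks introduce, using only the standard library. It picks the watch URL from the shape of the video id and pulls the quoted `sources` object out of a page. `build_watch_url`, `extract_sources`, and the sample page in the usage are hypothetical names for illustration; they are not part of the extractor.

```python
import json
import re

# Same pattern as the added line in the hunk: the key must be quoted, and
# the backreference \1 requires the closing quote to match the opening one.
SOURCES_RE = r'(["\'])sources\1\s*:\s*(?P<sources>{.+?}),'


def build_watch_url(video_id):
    """Hypothetical helper mirroring the id-based URL choice in the hunk."""
    # Short all-digit ids use the video-watch path; anything else
    # falls back to watch.php.
    if video_id.isdigit() and len(video_id) < 11:
        url_pattern = 'http://www.xtube.com/video-watch/-%s'
    else:
        url_pattern = 'http://www.xtube.com/watch.php?v=%s'
    return url_pattern % video_id


def extract_sources(webpage):
    """Hypothetical helper: return the parsed sources dict, or None."""
    m = re.search(SOURCES_RE, webpage)
    if m is None:
        return None
    return json.loads(m.group('sources'))
```

With the two ids from the test list, `build_watch_url('625837')` takes the first branch and `build_watch_url('kVTUy_G222_')` the second. An unquoted `sources:` key no longer matches, which is the behavioural difference from the removed pattern.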
--- a/youtube_dl/extractor/youtube.py
+++ b/youtube_dl/extractor/youtube.py
@ -34,6 +34,7 @@ from ..utils import (
    int_or_none,
    mimetype2ext,
    orderedSet,
+    parse_codecs,
    parse_duration,
    remove_quotes,
    remove_start,
@ -329,6 +330,8 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
        '141': {'ext': 'm4a', 'format_note': 'DASH audio', 'acodec': 'aac', 'abr': 256, 'preference': -50, 'container': 'm4a_dash'},
        '256': {'ext': 'm4a', 'format_note': 'DASH audio', 'acodec': 'aac', 'preference': -50, 'container': 'm4a_dash'},
        '258': {'ext': 'm4a', 'format_note': 'DASH audio', 'acodec': 'aac', 'preference': -50, 'container': 'm4a_dash'},
+        '325': {'ext': 'm4a', 'format_note': 'DASH audio', 'acodec': 'dtse', 'preference': -50, 'container': 'm4a_dash'},
+        '328': {'ext': 'm4a', 'format_note': 'DASH audio', 'acodec': 'ec-3', 'preference': -50, 'container': 'm4a_dash'},
        # Dash webm
        '167': {'ext': 'webm', 'height': 360, 'width': 640, 'format_note': 'DASH video', 'container': 'webm', 'vcodec': 'vp8', 'preference': -40},
@ -1694,15 +1697,7 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
                                    codecs = mobj.group('val')
                                    break
                            if codecs:
-                                codecs = codecs.split(',')
-                                if len(codecs) == 2:
-                                    acodec, vcodec = codecs[1], codecs[0]
-                                else:
-                                    acodec, vcodec = (codecs[0], 'none') if kind == 'audio' else ('none', codecs[0])
-                                dct.update({
-                                    'acodec': acodec,
-                                    'vcodec': vcodec,
-                                })
+                                dct.update(parse_codecs(codecs))
                formats.append(dct)
        elif video_info.get('hlsvp'):
            manifest_url = video_info['hlsvp'][0]
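Review note: the removed block assigned `vcodec` and `acodec` purely by position in the comma-separated list. The sketch below shows the alternative idea of classifying each entry by its codec family name. It is a simplified stand-in written for this note, not the `parse_codecs` function from `utils.py`; `classify_codecs` and the two family tuples are assumptions and deliberately incomplete.

```python
# Illustrative subsets only; the real utility recognises more families.
VIDEO_FAMILIES = ('avc1', 'vp8', 'vp9', 'hev1')
AUDIO_FAMILIES = ('mp4a', 'opus', 'vorbis', 'ec-3', 'dtse')


def classify_codecs(codecs_str):
    """Hypothetical helper: map a codecs string to vcodec/acodec fields."""
    result = {}
    for full_codec in (c.strip() for c in codecs_str.split(',')):
        if not full_codec:
            continue
        # 'avc1.64001f' -> 'avc1', 'mp4a.40.2' -> 'mp4a'
        family = full_codec.split('.')[0]
        if family in VIDEO_FAMILIES:
            result.setdefault('vcodec', full_codec)
        elif family in AUDIO_FAMILIES:
            result.setdefault('acodec', full_codec)
    if result:
        # A stream with only one kind of codec marks the other as absent.
        result.setdefault('vcodec', 'none')
        result.setdefault('acodec', 'none')
    return result
```

Because the lookup is by name, the order of entries in the string no longer matters, and an unrecognised string yields an empty dict instead of a wrong guess.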
@ -1857,13 +1852,13 @@ class YoutubePlaylistIE(YoutubePlaylistBaseInfoExtractor):
                            youtu\.be/[0-9A-Za-z_-]{11}\?.*?\blist=
                        )
                        (
-                            (?:PL|LL|EC|UU|FL|RD|UL)?[0-9A-Za-z-_]{10,}
+                            (?:PL|LL|EC|UU|FL|RD|UL|TL)?[0-9A-Za-z-_]{10,}
                            # Top tracks, they can also include dots
                            |(?:MC)[\w\.]*
                        )
                        .*
                     |
-                        ((?:PL|LL|EC|UU|FL|RD|UL)[0-9A-Za-z-_]{10,})
+                        ((?:PL|LL|EC|UU|FL|RD|UL|TL)[0-9A-Za-z-_]{10,})
                     )"""
    _TEMPLATE_URL = 'https://www.youtube.com/playlist?list=%s&disable_polymer=true'
    _VIDEO_RE = r'href="\s*/watch\?v=(?P<id>[0-9A-Za-z_-]{11})&amp;[^"]*?index=(?P<index>\d+)(?:[^>]+>(?P<title>[^<]+))?'
@ -1985,6 +1980,9 @@ class YoutubePlaylistIE(YoutubePlaylistBaseInfoExtractor):
    }, {
        'url': 'https://youtu.be/uWyaPkt-VOI?list=PL9D9FC436B881BA21',
        'only_matching': True,
+    }, {
+        'url': 'TLGGrESM50VT6acwMjAyMjAxNw',
+        'only_matching': True,
    }]
    def _real_initialize(self):
@ -2345,18 +2343,18 @@ class YoutubeSearchIE(SearchInfoExtractor, YoutubePlaylistIE):
        videos = []
        limit = n
-        for pagenum in itertools.count(1):
-            url_query = {
-                'search_query': query.encode('utf-8'),
-                'page': pagenum,
-                'spf': 'navigate',
-            }
-            url_query.update(self._EXTRA_QUERY_ARGS)
-            result_url = 'https://www.youtube.com/results?' + compat_urllib_parse_urlencode(url_query)
+        url_query = {
+            'search_query': query.encode('utf-8'),
+        }
+        url_query.update(self._EXTRA_QUERY_ARGS)
+        result_url = 'https://www.youtube.com/results?' + compat_urllib_parse_urlencode(url_query)
+        for pagenum in itertools.count(1):
            data = self._download_json(
                result_url, video_id='query "%s"' % query,
                note='Downloading page %s' % pagenum,
-                errnote='Unable to download API page')
+                errnote='Unable to download API page',
+                query={'spf': 'navigate'})
            html_content = data[1]['body']['content']
            if 'class="search-message' in html_content:
@ -2368,6 +2366,12 @@ class YoutubeSearchIE(SearchInfoExtractor, YoutubePlaylistIE):
            videos += new_videos
            if not new_videos or len(videos) > limit:
                break
+            next_link = self._html_search_regex(
+                r'href="(/results\?[^"]*\bsp=[^"]+)"[^>]*>\s*<span[^>]+class="[^"]*\byt-uix-button-content\b[^"]*"[^>]*>Next',
+                html_content, 'next link', default=None)
+            if next_link is None:
+                break
+            result_url = compat_urlparse.urljoin('https://www.youtube.com/', next_link)
        if len(videos) > n:
            videos = videos[:n]
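Review note: the search hunks replace page-number URLs with following the "Next" link found in each result page. A minimal standalone sketch of that step, assuming Python 3 and the standard library only. The regex is copied from the hunk; `next_results_url` is a hypothetical helper, and `html.unescape` stands in for the entity handling that the extractor's own search helper performs.

```python
import re
from html import unescape
from urllib.parse import urljoin

NEXT_LINK_RE = (
    r'href="(/results\?[^"]*\bsp=[^"]+)"[^>]*>\s*<span[^>]+'
    r'class="[^"]*\byt-uix-button-content\b[^"]*"[^>]*>Next')


def next_results_url(html_content):
    """Hypothetical helper: absolute URL of the next page, or None."""
    m = re.search(NEXT_LINK_RE, html_content)
    if m is None:
        # No "Next" button: the caller should stop paginating.
        return None
    # The href is HTML-escaped inside the attribute, so decode entities
    # before resolving it against the site root.
    return urljoin('https://www.youtube.com/', unescape(m.group(1)))
```

A caller would loop, replacing its current URL with the returned value until the helper returns `None` or enough results have been collected.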
--- a/youtube_dl/extractor/zdf.py
+++ b/youtube_dl/extractor/zdf.py
@ -20,9 +20,9 @@ from ..utils import (
 class ZDFBaseIE(InfoExtractor):
-    def _call_api(self, url, player, referrer, video_id):
+    def _call_api(self, url, player, referrer, video_id, item):
        return self._download_json(
-            url, video_id, 'Downloading JSON content',
+            url, video_id, 'Downloading JSON %s' % item,
            headers={
                'Referer': referrer,
                'Api-Auth': 'Bearer %s' % player['apiToken'],
@ -104,7 +104,7 @@ class ZDFIE(ZDFBaseIE):
            })
            formats.append(f)
-    def _extract_entry(self, url, content, video_id):
+    def _extract_entry(self, url, player, content, video_id):
        title = content.get('title') or content['teaserHeadline']
        t = content['mainVideoContent']['http://zdf.de/rels/target']
@ -116,7 +116,8 @@ class ZDFIE(ZDFBaseIE):
                'http://zdf.de/rels/streams/ptmd-template'].replace(
                '{playerId}', 'portal')
-        ptmd = self._download_json(urljoin(url, ptmd_path), video_id)
+        ptmd = self._call_api(
+            urljoin(url, ptmd_path), player, url, video_id, 'metadata')
        formats = []
        track_uris = set()
@ -174,8 +175,9 @@ class ZDFIE(ZDFBaseIE):
        }
    def _extract_regular(self, url, player, video_id):
-        content = self._call_api(player['content'], player, url, video_id)
-        return self._extract_entry(player['content'], content, video_id)
+        content = self._call_api(
+            player['content'], player, url, video_id, 'content')
+        return self._extract_entry(player['content'], player, content, video_id)
    def _extract_mobile(self, video_id):
        document = self._download_json(
--- a/youtube_dl/options.py
+++ b/youtube_dl/options.py
@ -470,6 +470,10 @@ def parseOpts(overrideArguments=None):
        '--playlist-reverse',
        action='store_true',
        help='Download playlist videos in reverse order')
+    downloader.add_option(
+        '--playlist-random',
+        action='store_true',
+        help='Download playlist videos in random order')
    downloader.add_option(
        '--xattr-set-filesize',
        dest='xattr_set_filesize', action='store_true',
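Review note: only the option definition appears in this hunk. The sketch below shows how a `store_true` flag like `--playlist-random` could drive the entry order, using a bare `optparse.OptionParser` instead of the project's option groups. `build_parser`, `order_entries`, and the `rng` parameter are hypothetical; how the downloader actually consumes the flag is not shown in this diff.

```python
import optparse
import random


def build_parser():
    """Hypothetical stand-in for the downloader option group."""
    parser = optparse.OptionParser()
    parser.add_option(
        '--playlist-reverse',
        action='store_true',
        help='Download playlist videos in reverse order')
    parser.add_option(
        '--playlist-random',
        action='store_true',
        help='Download playlist videos in random order')
    return parser


def order_entries(entries, opts, rng=random):
    """Return a new list ordered according to the parsed flags."""
    entries = list(entries)  # never mutate the caller's list
    if opts.playlist_reverse:
        entries.reverse()
    if opts.playlist_random:
        # optparse derives the attribute name from the long option,
        # so '--playlist-random' is read as opts.playlist_random.
        rng.shuffle(entries)
    return entries
```

Passing a seeded `random.Random` as `rng` keeps the shuffle reproducible in tests while the default uses the module-level generator.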
--- a/youtube_dl/utils.py
+++ b/youtube_dl/utils.py
@ -337,17 +337,30 @@ def get_element_by_id(id, html):
 def get_element_by_class(class_name, html):
-    return get_element_by_attribute(
+    """Return the content of the first tag with the specified class in the passed HTML document"""
+    retval = get_elements_by_class(class_name, html)
+    return retval[0] if retval else None
+def get_element_by_attribute(attribute, value, html, escape_value=True):
+    retval = get_elements_by_attribute(attribute, value, html, escape_value)
+    return retval[0] if retval else None
+def get_elements_by_class(class_name, html):
+    """Return the content of all tags with the specified class in the passed HTML document as a list"""
+    return get_elements_by_attribute(
        'class', r'[^\'"]*\b%s\b[^\'"]*' % re.escape(class_name),
        html, escape_value=False)
-def get_element_by_attribute(attribute, value, html, escape_value=True):
+def get_elements_by_attribute(attribute, value, html, escape_value=True):
    """Return the content of the tag with the specified attribute in the passed HTML document"""
    value = re.escape(value) if escape_value else value
-    m = re.search(r'''(?xs)
+    retlist = []
+    for m in re.finditer(r'''(?xs)
        <([a-zA-Z0-9:._-]+)
         (?:\s+[a-zA-Z0-9:._-]+(?:=[a-zA-Z0-9:._-]*|="[^"]*"|='[^']*'))*?
         \s+%s=['"]?%s['"]?
@ -355,16 +368,15 @@ def get_element_by_attribute(attribute, value, html, escape_value=True):
        \s*>
        (?P<content>.*?)
        </\1>
-    ''' % (re.escape(attribute), value), html)
-    if not m:
-        return None
-    res = m.group('content')
-    if res.startswith('"') or res.startswith("'"):
-        res = res[1:-1]
-    return unescapeHTML(res)
+    ''' % (re.escape(attribute), value), html):
+        res = m.group('content')
+        if res.startswith('"') or res.startswith("'"):
+            res = res[1:-1]
+        retlist.append(unescapeHTML(res))
+    return retlist
 class HTMLAttributeParser(compat_HTMLParser):
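Review note: the refactoring above turns the single-match helpers into thin wrappers around list-returning ones (`re.search` becomes `re.finditer`, and the singular function returns the first element or `None`). A simplified sketch of that shape, assuming Python 3. The regex here is deliberately shorter than the one in `utils.py`, and `find_elements_by_class` / `find_element_by_class` are hypothetical names, not the library functions.

```python
import re
from html import unescape


def find_elements_by_class(class_name, html):
    """Return the inner content of every tag carrying the given class."""
    pattern = r'''(?xs)
        <([a-zA-Z0-9:._-]+)                        # tag name
        [^>]*?\s+class=['"][^'"]*\b%s\b[^'"]*['"]  # class attribute
        [^>]*>
        (?P<content>.*?)
        </\1>
    ''' % re.escape(class_name)
    return [unescape(m.group('content'))
            for m in re.finditer(pattern, html)]


def find_element_by_class(class_name, html):
    """Singular variant: first match or None, built on the plural one."""
    retval = find_elements_by_class(class_name, html)
    return retval[0] if retval else None
```

Keeping the singular function as a wrapper means existing callers see the same return type as before, while new callers can iterate over all matches.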
@ -1672,6 +1684,11 @@ def setproctitle(title):
        libc = ctypes.cdll.LoadLibrary('libc.so.6')
    except OSError:
        return
+    except TypeError:
+        # LoadLibrary in Windows Python 2.7.13 only expects
+        # a bytestring, but since unicode_literals turns
+        # every string into a unicode string, it fails.
+        return
    title_bytes = title.encode('utf-8')
    buf = ctypes.create_string_buffer(len(title_bytes))
    buf.value = title_bytes
@ -2103,11 +2120,18 @@ def strip_jsonp(code):
 def js_to_json(code):
+    COMMENT_RE = r'/\*(?:(?!\*/).)*?\*/|//[^\n]*'
+    SKIP_RE = r'\s*(?:{comment})?\s*'.format(comment=COMMENT_RE)
+    INTEGER_TABLE = (
+        (r'(?s)^(0[xX][0-9a-fA-F]+){skip}:?$'.format(skip=SKIP_RE), 16),
+        (r'(?s)^(0+[0-7]+){skip}:?$'.format(skip=SKIP_RE), 8),
+    )
    def fix_kv(m):
        v = m.group(0)
        if v in ('true', 'false', 'null'):
            return v
-        elif v.startswith('/*') or v == ',':
+        elif v.startswith('/*') or v.startswith('//') or v == ',':
            return ""
        if v[0] in ("'", '"'):
@ -2118,11 +2142,6 @@ def js_to_json(code):
                '\\x': '\\u00',
            }.get(m.group(0), m.group(0)), v[1:-1])
-        INTEGER_TABLE = (
-            (r'^(0[xX][0-9a-fA-F]+)\s*:?$', 16),
-            (r'^(0+[0-7]+)\s*:?$', 8),
-        )
        for regex, base in INTEGER_TABLE:
            im = re.match(regex, v)
            if im:
@ -2134,11 +2153,11 @@ def js_to_json(code):
    return re.sub(r'''(?sx)
        "(?:[^"\\]*(?:\\\\|\\['"nurtbfx/\n]))*[^"\\]*"|
        '(?:[^'\\]*(?:\\\\|\\['"nurtbfx/\n]))*[^'\\]*'|
-        /\*.*?\*/|,(?=\s*[\]}])|
+        {comment}|,(?={skip}[\]}}])|
        [a-zA-Z_][.a-zA-Z_0-9]*|
-        \b(?:0[xX][0-9a-fA-F]+|0+[0-7]+)(?:\s*:)?|
+        \b(?:0[xX][0-9a-fA-F]+|0+[0-7]+)(?:{skip}:)?|
-        [0-9]+(?=\s*:)
+        [0-9]+(?={skip}:)
-        ''', fix_kv, code)
+        '''.format(comment=COMMENT_RE, skip=SKIP_RE), fix_kv, code)
 def qualities(quality_ids):
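Review note: the `js_to_json` hunks let a comment sit between a hex or octal literal and a following colon by threading `SKIP_RE` through the integer patterns. The sketch below isolates that step. The three constants are copied from the added lines; `convert_integer` is a hypothetical helper written for this note and covers only the integer branch, not the full token rewriting.

```python
import re

COMMENT_RE = r'/\*(?:(?!\*/).)*?\*/|//[^\n]*'
SKIP_RE = r'\s*(?:{comment})?\s*'.format(comment=COMMENT_RE)
INTEGER_TABLE = (
    (r'(?s)^(0[xX][0-9a-fA-F]+){skip}:?$'.format(skip=SKIP_RE), 16),
    (r'(?s)^(0+[0-7]+){skip}:?$'.format(skip=SKIP_RE), 8),
)


def convert_integer(token):
    """Return the decimal JSON form of a hex/octal token, else None."""
    for regex, base in INTEGER_TABLE:
        m = re.match(regex, token)
        if m:
            value = int(m.group(1), base)
            # A trailing colon means the literal was used as an object
            # key, which JSON requires to be a quoted string.
            return '"%d":' % value if token.endswith(':') else '%d' % value
    return None
```

A token such as `010 /* c */:` is matched by the octal pattern even though a block comment separates the digits from the colon, which the removed `\s*`-only patterns would not accept.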
--- a/youtube_dl/version.py
+++ b/youtube_dl/version.py
@ -1,3 +1,3 @@
 from __future__ import unicode_literals
-__version__ = '2017.02.01'
+__version__ = '2017.02.14'
Author	SHA1	Message	Date
Sergey M․	58a65ba852	release 2017.02.14	2017-02-14 01:09:18 +07:00
Sergey M․	cedf08ff54	[ChangeLog] Actualize	2017-02-14 01:07:35 +07:00
Sergey M․	50de3dbad3	[zdf] Fix extraction (closes #12117 )	2017-02-14 01:00:06 +07:00
Sergey M․	085f169ffe	[xtube] Fix extraction for both kinds of video id (closes #12088 )	2017-02-13 23:44:43 +07:00
Vobe	f6d6ca1db3	[xtube] Improve title extraction	2017-02-13 23:34:14 +07:00
Sergey M․	6e5956e6ba	[lemonde] Fallback delegate extraction to generic extractor (closes #12115 , closes #12116 )	2017-02-13 23:17:48 +07:00
Sergey M․	50fd3c2c69	Merge branch 'master' of github.com:rg3/youtube-dl	2017-02-13 22:58:50 +07:00
Remita Amine	89c6691f9d	[bellmedia] accept longer video id(closes #12114 )	2017-02-13 15:08:48 +01:00
Remita Amine	454e5cdb17	[limelight] add support referer protected videos	2017-02-13 14:29:05 +01:00
Sergey M	1de9f78e71	[travis] Separate builds for core and download	2017-02-13 18:56:05 +08:00
Remita Amine	9dad941853	[disney] improve extraction - add support for more urls - detect expired videos - skip Adobe Flash Access protected videos closes #4975 closes #11000 closes #11882 closes #11936	2017-02-13 11:43:20 +01:00
Sergey M․	1e2c3f61fc	[travis] Separate builds for core and download	2017-02-13 17:36:13 +07:00
Remita Amine	0dac7cbb09	[hotstar] improve extraction(closes #12096 ) - extract all qualities - detect drm protected videos - extract more metadata	2017-02-12 17:35:24 +01:00
Yen Chi Hsuan	f8514630db	[einthusan] Fix extraction (closes #11416 ) The old test URLs are no longer valid, so I replace them with the one from #11416	2017-02-12 20:53:55 +08:00
Aniruddh-J	459818e280	[aenetworks] Add support for lifetimemovieclub.com	2017-02-12 20:18:11 +08:00
Sergey M․	6310acf512	[youtube] Fix parsing codecs (closes #12091 )	2017-02-12 18:09:53 +07:00
Yen Chi Hsuan	8d38dafbbf	ChangeLog: update after #12085	2017-02-12 00:45:37 +08:00
Yen Chi Hsuan	f3915452de	Merge pull request #12085 from wiiaboo/python2 utils.py: Workaround TypeError with Python 2.7.13 in Windows	2017-02-12 00:42:43 +08:00
Ricardo Constantino	2f49bcd690	utils.py: Workaround TypeError with Python 2.7.13 in Windows Fixes #11540 Tested with Windows Python 2.7.12 and 2.7.13.	2017-02-11 14:51:28 +00:00
Yen Chi Hsuan	68c22c4c15	[iqiyi] Update _TESTS	2017-02-11 22:27:45 +08:00
Sergey M․	9b92a5917b	release 2017.02.11	2017-02-11 03:24:00 +07:00
Sergey M․	3e2274c8b7	[ChangeLog] Actualize	2017-02-11 17:08:22 +07:00
Sergey M․	3d7e3aaa0e	[pluralsight:course] Fix extraction (closes #12075 )	2017-02-11 17:00:52 +07:00
Sergey M․	624c4b92ff	[facebook] Add coding cookie	2017-02-11 16:18:45 +07:00
Thomas Christlieb	2af12ad9d2	Introduce get_elements_by_class and get_elements_by_attribute utility functions	2017-02-11 17:16:54 +08:00
Remita Amine	97eb9bd2ac	[bbc] extract m3u8 formats with 320k audio	2017-02-10 19:46:15 +01:00
Sergey M․	71cdd75628	[facebook] Relax video id matching (closes #11017 , closes #12055 , closes #12056 )	2017-02-11 01:05:22 +07:00
Remita Amine	c7d6f614f3	[corus] Add new extractor(closes #12060 )(#9164 )	2017-02-10 17:00:09 +01:00
Remita Amine	08a00eef79	[extractor/common] skip m3u8 manifests protected with Adobe Flash Access	2017-02-10 17:00:09 +01:00
Sergey M․	9dd5408c99	[pluralsight] Detect blocked account error message (#12070 )	2017-02-10 22:48:11 +07:00
Sergey M․	9510709575	[bloomberg] Add another video id regex (closes #12062 )	2017-02-10 22:16:20 +07:00
Remita Amine	5abcca9060	[sixplay] use raw string for regex	2017-02-10 09:34:59 +01:00
Sergey M․	e01bfc19c3	[extractor/commonmistakes] Restrict _VALID_URL (closes #12050 )	2017-02-10 09:39:24 +07:00
Remita Amine	4d32b63851	[tvplayer] Add new extractor	2017-02-09 23:09:21 +01:00
Sergey M․	55d4de2283	release 2017.02.10	2017-02-10 01:27:33 +07:00
Sergey M․	61ee556aea	[ChangeLog] Actualize	2017-02-10 01:26:00 +07:00
Sergey M․	ff24261ba0	[kaltura] Add explicit port to regexes They should not match e.g. cdnapi.kaltura.computernetworks.com/...	2017-02-10 01:24:14 +07:00
Sergey M․	fbc6dc525e	[xtube] Fix shortcuts	2017-02-10 01:06:23 +07:00
Sergey M․	9150d1eb69	[xtube] Fix extraction (closes #12023 )	2017-02-10 01:03:35 +07:00
Sergey M․	b7f9843bec	[pornhub] Simplify (closes #12018 )	2017-02-10 00:57:44 +07:00
Thomas Christlieb	e64b0fca14	[pornhub] Fix extraction (closes #12007 )	2017-02-10 00:56:12 +07:00
Sergey M․	78ef214d2d	[facebook] Improve JS data regex (closes #12042 )	2017-02-09 23:42:40 +07:00
Remita Amine	be670b8e8f	[external:ffmpeg] do not assume that ffmpeg unknown version format is new	2017-02-09 17:36:59 +01:00
Remita Amine	37084f6641	[kaltura] improve embed partner id extraction(fixes #12041 )	2017-02-09 16:24:54 +01:00
Remita Amine	b04975733c	[sprout] Add new extractor	2017-02-09 09:13:29 +01:00
Remita Amine	c8b8fb0a99	[sixplay] improve extraction - skip drm protected formats - extract more and better formats - skip duplicate asset urls	2017-02-08 22:56:10 +01:00
Remita Amine	8298018273	[scrippsnetworks:watch] Add new extractor(closes #10765 )	2017-02-08 20:44:23 +01:00
Remita Amine	ae8d5a5c59	[go] add support for adobe pass auth(closes #11468 )(closes #10831 )	2017-02-08 18:57:07 +01:00
Sergey M․	b9c9cb5f79	[6play] Fix extraction (closes #12011 )	2017-02-08 23:15:39 +07:00
Remita Amine	fdf9b959bc	[nbc] add support adobe pass auth(closes #12006 )	2017-02-08 16:23:42 +01:00
Sergey M․	013877298d	release 2017.02.07	2017-02-07 02:04:50 +07:00
Sergey M․	c87f95f991	[ChangeLog] Actualize	2017-02-07 01:58:57 +07:00
Sergey M․	f28aeff264	[pornhub] Fix extraction (closes #11997 )	2017-02-07 01:52:59 +07:00
Sergey M․	242a14a1f6	[extractor/common] Fix audio only with audio group in m3u8 (closes #11995 )	2017-02-07 00:22:16 +07:00
Sergey M․	d5d904ff7d	[canalplus] Add support for cstar.fr (#11990 )	2017-02-06 23:53:42 +07:00
Sergey M․	5620f840f6	[extractor/generic] Add test for #11993 and more metadata for rtmp	2017-02-06 23:31:58 +07:00
Sergey M․	b7a8c1bcfa	[extractor/generic] Improve rtmp support (closes #11993 )	2017-02-06 23:23:40 +07:00
Sergey M․	7097bffba6	[downloader/fragment] Respect --no-part	2017-02-06 23:07:59 +07:00
Sergey M․	2aec7256ae	[extractor/common] Speed-up media tags regex (closes #11979 )	2017-02-06 00:20:30 +07:00
Yen Chi Hsuan	815482d4eb	Credit @motophil for gaskrank.py (#11685 )	2017-02-06 00:38:22 +08:00
Yen Chi Hsuan	9c14fe9681	[gaskrank] Minor change and update ChangeLog after #11685	2017-02-06 00:25:28 +08:00
motophil	e705755739	[gaskrank] Add new extractor (#11685 ) * [gaskrank] Add new extractor * [gaskrank] Add new extractor - fixes as requested * [gaskrank] Add new extractor - style fix * [Gaskrank] Add new extractor - requested fixes * [Gaskrank] Add new extractor - fix md5 checksum * [gaskrank] Add new extractor - more requested fixes * [Gaskrank] Add new extractor - fixed all but one quantified code issues * [Gaskrank] add new extractor - more fields extracted, added second test * [Gaskrank] Add new extractor - requested fixes. * [Gaskrank] Add new extractor - requested changes. * [Gaskrank] Add new extractor - final(?) fixes.	2017-02-06 00:19:37 +08:00
Yen Chi Hsuan	019f4c0371	[bandcamp] Fix extraction for incomplete albums Closes #11727	2017-02-05 22:47:04 +08:00
Yen Chi Hsuan	2ab2c0d1f5	[iwara] Add width (closes #11724 ) The heuristic is from #11724	2017-02-05 22:30:13 +08:00
Yen Chi Hsuan	caf0f5f8b7	[iwara] Fix extraction (closes #11781 )	2017-02-05 21:48:13 +08:00
Yen Chi Hsuan	e4e50f60b1	[googledrive] Fix extraction on Python 3.6 Since Python 3.6, invalid escape sequences are deprecated. It's likely that there are invalid escape sequences somewhere on the webpage, so instead of unescaping the whole webpage, just unescape the URL. See https://bugs.python.org/issue27364. That change was designed for string literals, while it affects the 'unicode_escape' encoding as well. The code path is: str.decode('unicode_escape') codecs.unicode_escape_decode() PyUnicode_DecodeUnicodeEscape()	2017-02-05 21:41:08 +08:00
Sergey M․	6ef3e65a7b	[videopress] Add extractor	2017-02-05 13:37:27 +07:00
Sergey M․	6fd138bed8	[sportbox] PEP 8	2017-02-05 13:36:52 +07:00
Sergey M․	49bd8d5e2e	[travis] Add python 3.6	2017-02-05 02:41:22 +07:00
Remita Amine	3d2c2752c5	[afreecatv] extract rtmp formats	2017-02-04 18:18:28 +01:00
Sergey M․	a713a86755	release 2017.02.04.1	2017-02-04 23:26:39 +07:00
Sergey M․	7bccd5fc8a	[ChangeLog] Actualize	2017-02-04 23:23:38 +07:00
Sergey M․	3144eccf55	[ChangeLog] Actualize	2017-02-04 23:22:28 +07:00
Sergey M․	9db8f6c540	[twitch:stream] Improve _VALID_URL (closes #11971 )	2017-02-04 23:21:07 +07:00
Remita Amine	8e4041cf3f	[radiocanada] fix extraction for toutv rtmp formats	2017-02-04 17:05:35 +01:00
Sergey M․	31487eb974	release 2017.02.04	2017-02-04 22:57:48 +07:00
John Hawkinson	c2521c1ac6	[Piksel] Add another app token regex	2017-02-04 23:23:14 +08:00
A Connecticut Princess	643dc0fcfe	[vk] Catch author blocked error message Example link (video in blocked group): https://vk.com/search?c%5Bq%5D=%D0%9F%D1%80%D1%8B%D0%B6%D0%BE%D0%BA%20c%20%D0%BA%D1%80%D0%B0%D0%BD%D0%B0%20%D0%B2%20%D1%81%D1%82%D0%B8%D0%BB%D0%B5%20%D0%A7%D0%B5%D0%BB%D0%BE%D0%B2%D0%B5%D0%BA%D0%B0-%D0%BF%D0%B0%D1%83%D0%BA%D0%B0&c%5Bsection%5D=video&c%5Bsort%5D=2&z=video-10639516_456240611	2017-02-04 22:21:09 +07:00
Remita Amine	36fce54816	[turner] fix downloading of secure hls formats using ffmpeg(closes #11358 )(closes #11373 )(closes #11800 )	2017-02-04 15:23:46 +01:00
Remita Amine	2c15db829c	[drtv] add support for live and radio sections(closes #1827 )(closes #3427 )	2017-02-04 08:38:28 +01:00
Remita Amine	f65dba7cdb	[myspace] fix extraction and extract hls and http formats	2017-02-03 22:25:19 +01:00
Remita Amine	605fd6392f	[youtube] add format info for itag 325 and 328	2017-02-03 17:59:48 +01:00
Sergey M․	f962790ee5	[vine] Fix extraction (closes #11955 )	2017-02-03 21:56:48 +07:00
Sergey M․	b7cc5f078e	[extractors] Remove remnants of sportbox extractor (#11954 )	2017-02-03 21:56:10 +07:00
Sergey M․	f7a10d8cd6	[sportbox] Remove extractor (closes #11954 ) Covered by generic extractor	2017-02-03 21:25:44 +07:00
Yen Chi Hsuan	daac118bf4	[ChangeLog] Update after #11901	2017-02-03 18:56:40 +08:00
Yen Chi Hsuan	8939f784d9	Merge pull request #11901 from ThomasChr/randonplaylistorder New parameter --playlist-random to randomize playlist download order. Fixes #11889	2017-02-03 18:53:14 +08:00
Remita Amine	df0588a31f	Merge branch 'fstirlitz-filmon'	2017-02-03 10:15:52 +01:00
Remita Amine	4ce3407d08	[filmon] improve extraction	2017-02-03 10:15:03 +01:00
Yen Chi Hsuan	d7f9242e30	[ChangeLog] Update after #11565	2017-02-03 12:13:24 +08:00
Mattias Wadman	45024183ae	[infoq] Add audio only format if available (#11565 ) * [infoq] Add audio only format if available Refactor cookie code into a function. Renamed formats to http_video, http_audio, rtmp_video Renamed extract functions to video instead of videos as they return one or no video. * [infoq] Rename to _extract_cookies as it more than one * [infoq] Remove redundant determine_ext * [infoq] Add comment about hardcoded URL * [infoq] Use _hidden_inputs instead of messy regex * [infoq] Probe if audio URL is valid Make it possible to pass headers to _is_valid_url * [infoq] Add audio only test	2017-02-03 12:10:13 +08:00
Justsoos	33da98f493	[douyutv] Improve room id regex http://www.douyu.com/t/lpl source get extra '\' with "room_id\" (from js coding)	2017-02-03 03:26:41 +07:00
Sergey M․	4195096ea8	[utils] Improve comments processing in js_to_json (closes #11947 )	2017-02-03 03:04:33 +07:00
Michal Čihař	0bbcc8a10a	[iprima] Fix extraction (closes #11920 , closes #11896 )	2017-02-03 03:04:33 +07:00
Michal Čihař	b3ee552e4b	[utils] Handle single-line comments in js_to_json	2017-02-03 03:04:33 +07:00
Yen Chi Hsuan	a22b2fd19b	[youtube] Fix ytsearch* when cookies are provided Closes #11924 The API with `page` is no longer used in browsers, and YouTube always returns {'reload': 'now'} when cookies are provided. See http://youtube.github.io/spfjs/documentation/start/ for how SPF works. Basically appending static link with a `spf` parameter yields the corresponding dynamic link.	2017-02-03 01:28:24 +08:00
Sergey M․	c54c01f82d	[go] Relax video id regex (closes #11937 )	2017-02-02 23:04:46 +07:00
Sergey M․	5a116e1302	[facebook] Fix title extraction (closes #11941 )	2017-02-02 22:45:18 +07:00
Sergey M․	a685751051	[youtube:playlist] Recognize TL playlists (closes #11945 )	2017-02-02 22:01:11 +07:00
Yen Chi Hsuan	bd8f48c78b	[bilibili] Support new Bangumi URLs (closes #11845 ) To reduce complexity, I don't support old Bangumi URLs directly via _VALID_URL. Instead, I choose to let it go to generic redirection. An example can be found in #10190: http://bangumi.bilibili.com/anime/v/40062	2017-02-02 21:51:31 +08:00
Remita Amine	81aeafeb44	[cbc:watch] extract audio codec for audion only formats(fixes #11893 )	2017-02-02 08:07:28 +01:00
Remita Amine	8bdc149441	[downloader/external:ffmpeg] minimize the use of aac_adtstoasc filter	2017-02-02 08:07:28 +01:00
Jaime Marquínez Ferrándiz	020c5df52d	[elpais] Fix extraction for some URLs (closes #11765 )	2017-02-01 23:48:34 +01:00
Remita Amine	da162c1135	[compat] add compat_etree_register_namespace to __all__ list	2017-02-01 20:15:59 +01:00
Thomas Christlieb	75822ca790	New parameter --playlist-random to randomize playlist download order. Fixes #11889	2017-01-31 10:03:31 +01:00
felix	a0758dfa1a	[filmon] new extractor	2016-11-13 17:28:17 +01:00