Help to download short video from ITV news
Hey,
As the title says, I would like to download a video from ITV webpage (itv.com) that is a short interview with an old gentleman from the UK. I have spent hours wrestling with the page code, usually I can get these things done, but no such luck here. Has anyone ever downloaded from their page before?
Here is the link in case someone can find some hole in the system. And yes, I have searched for this video on their youtube page, is not there and in a web search I found a facebook video page with only a 30seconds excerpt.
Thanks for anyone who can help.
I downloaded it with yt-dlp:
yt-dlp https://www.itv.com/news/wales/2021-10-19/ill-see-you-next-year-wales-youngest-looking-centenarian-turns-102
It saved as an .mp4 video in the directory I was in.
Edit: Here, I've uploaded it to archive.org if you need to watch it there: https://archive.org/details/wls-owen-filer-for-web-11.06.47.mp-4-6277820441001
Man thank you so much! I was now able to get the file from archive.org
Thank you so much! It was a great help!
Btw, yt-dlp is the same as youtube-dl? I can't find yt-dlp in the trisquel repos.
>"Btw, yt-dlp is the same as youtube-dl? I can't find yt-dlp in the trisquel repos."
No, yt-dlp is the improved version of youtube-dl. And it's in the Trisquel repos that I'm subscribed to:
$ apt policy yt-dlp
yt-dlp:
Installed: 2023.07.06-1~bpo22.04.1
Candidate: 2023.07.06-1~bpo22.04.1
Version table:
*** 2023.07.06-1~bpo22.04.1 500
500 https://archive.trisquel.info/trisquel aramo-backports/main amd64 Packages
500 https://archive.trisquel.info/trisquel aramo-backports/main i386 Packages
100 /var/lib/dpkg/status
2022.04.08-1 500
500 https://archive.trisquel.info/trisquel aramo/main amd64 Packages
500 https://archive.trisquel.info/trisquel aramo/main i386 Packages
Are you using an old version of Trisquel perhaps?
I'm on trisquel 10. You are probably on 11.
Anyway, I thank you for your help again Andy.
Ahh, ok, well leave a message if you need more videos in the future, I'll be happy to help, as I'm sure others would as well.
Thanks for that.
I do not wish to bother other people though. I usually can find ways to extract videos from webpages, but I admit ITV has some kind of blobing in place that I was unable to overcome.
Good thing youtube-dl / yt-dlp are such good tools at this sort of thing.
If you don't mind me asking two questions:
1. does yt-dlp have socks5 proxy option to be used over Tor? I remember youtube-dl had it but was experimental.
2. Which would be the easiest way to install yt-dlp in Trisquel 10 in case I choose to go that route?
Thanks again for your help.
sudo wget https://github.com/yt-dlp/yt-dlp/releases/latest/download/yt-dlp -O /usr/local/bin/yt-dlp
sudo chmod a+rx /usr/local/bin/yt-dlp
That was simple enough :P
Thanks, I didn't thought it would be so simple.
I do see it has a socks proxy option, though it doesn't mention if DNS is also passed through the proxy or not. I would rather not leak DNS requests constantly when using Tor.
Also, if anyone has ever tried it, is there a way to download only the audio of an mp4 file? say, the video that Andy uploaded on the archive.org If I wanted to download only the audio (suppose he had it uploaded in 4k and I only wanted to get the audio, there would be no point in downloading gigabytes of data when I would later remove only a few megabytes which were the audio track). Is that possible? Even if with other tool that is not yt-dlp, I would be interested in that option.
Thanks!
You may run the help like this:
yt-dlp --help
To list the available formats (like audio-only or video-only) write
yt-dlp -F https://youtube.com/some-video
and then download the format which you like like by its ID; like this:
yt-dlp -f ID https://youtube.com/some-video
Audio only (m4a, size about 4MB)
yt-dlp -x https://www.itv.com/news/wales/2021-10-19/ill-see-you-next-year-wales-youngest-looking-centenarian-turns-102
It seems that only works if the webpage itself has an audio file available.
What I meant was to extract audio from a mp4 file.
For example, I tried:
yt-dlp --proxy socks5://127.0.0.1:9150 -x https://archive.org/download/wls-owen-filer-for-web-11.06.47.mp-4-6277820441001/WLS_OWEN_FILER_FOR_WEB_11.06.47.mp4%20%5B6277820441001%5D.mp4
And it started downloading the entire mp4 video file.
On the other hand, my experiences show that the proxy also applies to DNS resolution.
I blocked port 53 in ufw, and tried running yt-dlp without proxy, it gives this result in lsof -i
yt-dlp 8586 trisquel 3u IPv4 240272 0t0 UDP localhost:47884->localhost:domain
And yt-dlp failed to resolve the hostname:
ERROR: [generic] None: Unable to download webpage: (caused by URLError(gaierror(-3, 'Temporary failure in name resolution')))
However, even with port 53 blocked, the proxy allows for hostname resolution. So I assume that the DNS request is being profixied. Not sure if some leak might happen, but from the above output, I think not.
Still, the audio extraction option doesn't work. yt-dlp only downloads the entire video file (maybe it would later locally extract audio from the file, but what I want is to not having to download the entire file).
What I meant was to extract audio from a mp4 file. For example, I tried (...). And it started downloading the entire mp4 video file.
I believe no Web server extracts audio from a video file. It either serves you a video or an audio, if such a file was created.
Yes, that was my understanding too.
However, and this was my hope, when you open VLC or SMplayer and give it a link (like the one in the archive that Andy provided), the player will start downloading data orderly from the beginning of the file to the end (that's why we can stream the file, unlike torrent). And the player will, even at the start of the downloading/streaming process, be able to look at the index of the file and state "there are 3 audio tracks for X Y Z languages, 4 subtitles tracks for X Y Z W languages, so on and so forth".
So basically, could it be done so that the player (or any other software) could selectively only download the data that pertains for that track? I have done some testing with VLC for example but no such luck.
If anyone knows a way to do this, it would be great.
Thanks!
https://superuser.com/questions/1753260/what-makes-a-mp4-streamable
"Additionally (as I just found out), the audio track data can be either interleaved with video data, or not. If it's not interleaved for some reason, the player again needs to wait for the entire audio stream to be downloaded before it starts receiving video data (again unless it can seek back/forward)."
Could this mean that there may or may not be a way to download the audio only, depending on the way tracks are stored in the mp4 container file?
>"Could this mean that there may or may not be a way to download the audio only, depending on the way tracks are stored in the mp4 container file?"
I believe that what it means is that this universe may or may not exist, depending on whether Schrödinger's cat is alive or dead when we open the box.
Why did you kill the paradox with such a simplistic interpretation? I believe paradoxicide is punished by fragmentation, segmentation, splitting and interleaving.
I never thought of this. Perhaps... I always believed that since in a MKV file for example, there are separate tracks (and in MP4 files it seems to be likewise) they would not be interdependent. Seems I might be wrong.
Thanks anyway :)
1- Using torsocks, you can simply do this
tor &
torsocks yt-dlp https://youtube.com/some-video
2- Or you may use "proxychains" (or "proxychains4"). This option supports tor or any other socks5 services.
sudo apt-get install proxychains proxychains4
# change the corresponding line to (tor's default port is 9050)
# socks5 127.0.0.1 9050
sudo vim /etc/proxychains4.conf
proxychains yt-dlp https://youtube.com/some-video
And to install yt-dlp, you can get the appimage directly from github (download yt-dlp). Then use it like this:
chmod +x /home/GNUser/downloads/yt-dlp
./yt-dlp
# or you can add it to your path (for example /usr/local/bin) and simply run it without "./"