Capture youtube subtitles of clips

sayhello
Banned
Posts: 95
Joined: Sat Nov 10, 2012 1:48 pm

Post by sayhello »

Hello dear community,

I have seen that many YouTube clips have subtitles,

e.g. this one: https://www.youtube.com/watch?v=BJ-VvGyQxho&t=133s

How can I capture the subtitle text of this YouTube video?

I have heard about this procedure:
Go to your Video Manager by clicking your account in the top right
> Creator Studio > Video Manager > Videos.

Next to the video you want to add captions or subtitles to,
click the drop-down menu next to the Edit button.

Select Subtitles and CC.

Well, I am not sure how to do this. Can you help here?
Zizi64
Volunteer
Posts: 11352
Joined: Wed May 26, 2010 7:55 am
Location: Budapest, Hungary

Re: how to capture youtube subtitles of clips that have got

Post by Zizi64 »

How is this task related to AOO or LO?
Tibor Kovacs, Hungary; LO7.5.8 /Win7-10 x64Prof.
PortableApps/winPenPack: LO3.3.0-7.6.2;AOO4.1.14
Please, edit the initial post in the topic: add the word [Solved] at the beginning of the subject line - if your problem has been solved.
erbsenzahl
Volunteer
Posts: 254
Joined: Tue Apr 18, 2017 8:23 am
Location: Germany

Re: how to capture youtube subtitles of clips that have got

Post by erbsenzahl »

@sayhello

Google2SRT is a suitable program for that. For further editing of the .srt file you can use OpenOffice/LibreOffice Writer, though a simple text editor is often the first choice.

Cheers
LibreOffice current versions 7 and OpenOffice 4.1.15
on LinuxMint 20 - 21 Mate, W10-64 pro
Robert Tucker
Volunteer
Posts: 1250
Joined: Mon Oct 08, 2007 1:34 am
Location: Manchester UK

Re: Capture youtube subtitles of clips

Post by Robert Tucker »

There is youtube-dl (https://ytdl-org.github.io/youtube-dl/index.html). However, it usually produces overlapping subtitles, so you'll probably find it easier to use one of the online sites that do the job.
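
For anyone who wants to try youtube-dl from a script anyway, here is a minimal sketch using its Python module. The helper names `subtitle_options` and `download_subtitles` are made up for illustration, and whether the download currently works depends on the installed youtube-dl version and on YouTube itself, so treat it as a starting point and not a guaranteed recipe.

```python
def subtitle_options(lang="en", out_dir="."):
    """Build youtube-dl options that fetch subtitles only (hypothetical helper)."""
    return {
        "skip_download": True,        # do not download the video itself
        "writesubtitles": True,       # subtitles uploaded by the video owner
        "writeautomaticsub": True,    # auto-generated captions, if any
        "subtitleslangs": [lang],     # which language(s) to request
        "subtitlesformat": "vtt",     # WebVTT, the format shown in this thread
        "outtmpl": f"{out_dir}/%(id)s.%(ext)s",  # file name pattern
    }


def download_subtitles(url, lang="en", out_dir="."):
    """Download the subtitle file(s) for one video URL (needs network access)."""
    # Imported here so the options helper works without the package installed.
    import youtube_dl  # third-party: pip install youtube-dl

    with youtube_dl.YoutubeDL(subtitle_options(lang, out_dir)) as ydl:
        ydl.download([url])


if __name__ == "__main__":
    download_subtitles("https://www.youtube.com/watch?v=BJ-VvGyQxho")
```

The result should be a .vtt file named after the video id in the chosen folder. The overlapping-lines problem mentioned above still applies to auto-generated captions.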

If you just want the script, you can click on the three dots under the video and select "Open transcript".

Either way, if you want sensibly broken up subtitles, you'll probably want to just work with the text and go through it manually. You could use Writer to do this, though any text editor should do. When you've got them broken up sensibly, you'll want to use Aeneas (https://aeneasweb.org/login) to align them again with the spoken dialogue.

Writer is indispensable if you want to translate the subtitles using DeepL (www.deepl.com/en/translator), since DeepL requires them in .xml format. (Office Open XML is just under the text options under "File type" when you go to Save.)
LibreOffice 7.x.x on Arch and Fedora.
CooloKelley
Posts: 2
Joined: Sun Apr 04, 2021 6:47 pm

Re: Capture youtube subtitles of clips

Post by CooloKelley »

Subtitles on YouTube are added automatically, unless I misunderstood your problem. For example, this video is subtitled in English: https://www.youtube.com/watch?v=rq8cL2X ... reySchafer
OpenOffice 4.1.9
Robert Tucker
Volunteer
Posts: 1250
Joined: Mon Oct 08, 2007 1:34 am
Location: Manchester UK

Re: Capture youtube subtitles of clips

Post by Robert Tucker »

If the video has a soundtrack suitable for voice recognition (rarely the case for songs), YouTube will generate AutoCaptions. The trouble is, if you download them, they look like this:

Code:

00:00:00.060 --> 00:00:02.000 align:start position:0%
 
hey<00:00:00.359><c> everybody</c><00:00:00.690><c> how's</c><00:00:00.780><c> it</c><00:00:00.930><c> going</c><00:00:01.260><c> in</c><00:00:01.530><c> our</c><00:00:01.860><c> last</c>

00:00:02.000 --> 00:00:02.010 align:start position:0%
hey everybody how's it going in our last
 

00:00:02.010 --> 00:00:03.110 align:start position:0%
hey everybody how's it going in our last
video<00:00:02.190><c> we</c><00:00:02.460><c> looked</c><00:00:02.639><c> at</c><00:00:02.700><c> the</c><00:00:02.760><c> difference</c>

00:00:03.110 --> 00:00:03.120 align:start position:0%
video we looked at the difference
 
00:00:03.120 --> 00:00:04.849 align:start position:0%
video we looked at the difference
between<00:00:03.270><c> instance</c><00:00:03.840><c> variables</c><00:00:04.259><c> and</c><00:00:04.380><c> class</c>

00:00:04.849 --> 00:00:04.859 align:start position:0%
between instance variables and class
 

00:00:04.859 --> 00:00:06.499 align:start position:0%
between instance variables and class
variables<00:00:05.310><c> and</c><00:00:05.490><c> in</c><00:00:05.879><c> this</c><00:00:05.970><c> video</c><00:00:06.180><c> we'll</c><00:00:06.450><c> be</c>

So the easiest way to get subtitles to work with is to download the transcript, as I described before.

YouTube also allows the video owner to upload subtitles and translations of them, and these can usually be downloaded in a useful format.
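
If you have already downloaded an auto-caption file like the one shown above, a rough Python sketch along these lines can reduce it to plain text. It assumes the file follows the pattern in the sample (timing lines containing "-->", inline tags such as <c> and <00:00:00.359>, and each text line repeated in the following cues); the function name `vtt_to_text_lines` is my own invention, not part of any tool mentioned here.

```python
import re
from typing import List

# Matches inline tags such as <c>, </c> and <00:00:00.359>.
TAG_RE = re.compile(r"<[^>]+>")


def vtt_to_text_lines(vtt: str) -> List[str]:
    """Return the caption text without timings, tags or repeated lines."""
    kept: List[str] = []
    for raw in vtt.splitlines():
        # Skip cue timing lines like "00:00:00.060 --> 00:00:02.000 ...".
        if "-->" in raw:
            continue
        line = TAG_RE.sub("", raw).strip()
        # Skip blank lines and the header lines at the top of the file.
        if not line or line == "WEBVTT":
            continue
        if line.startswith(("Kind:", "Language:", "NOTE")):
            continue
        # Auto-captions repeat the previous line in the next cue; keep one copy.
        if kept and kept[-1] == line:
            continue
        kept.append(line)
    return kept


if __name__ == "__main__":
    import sys

    # Usage: python clean_vtt.py captions.vtt > captions.txt
    with open(sys.argv[1], encoding="utf-8") as handle:
        print("\n".join(vtt_to_text_lines(handle.read())))
```

This only removes consecutive repeats, so a line that is genuinely spoken twice in a row would also be collapsed. The output is plain text without timings, which you can then break up in Writer or any text editor.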
LibreOffice 7.x.x on Arch and Fedora.