Page 1 of 7 HOW TO EXTRACT SUBTITLES FROM TV.JW.ORG VIDEOS This is a very basic tutorial on how to extract some subtitles from TV.JW.ORG videos.. Choose Open with Internet Explorer from
Trang 1Page 1 of 7
HOW TO EXTRACT SUBTITLES FROM TV.JW.ORG VIDEOS
This is a very basic tutorial on how to extract some subtitles from TV.JW.ORG videos There are different types of subtitles:
Manual
.srt
.vtt
I am going to explain vtt only The newer videos that have subtitles seem to be using this method
Programs and items you will need
Desktop/laptop with Internet Explorer If you are using Windows 10 you can search for
it or you can get to it using Edge
In Edge, click on … in the right corner Choose Open with Internet Explorer from the list
You will also need Excel by Microsoft and Word by Microsoft
Website: TV.JW.ORG
Trang 2Page 2 of 7
Following are the steps you need to take
Open Internet Explorer Press F12 on your keyboard F12 is the Developer Tools Choose the Network tab on the top row Minimize this screen
Go to TV.JW.ORG Find the video you would like to see if it has subtitles Go into settings first and choose Video Subtitles and check Display subtitles when available Save your choice and X out of this screen
Trang 3Page 3 of 7
Play the video now and see if there are subtitles
If your video has subtitles, the next step is to see if they are vtt subtitles Stop the video Toggle back to the Developer Tools screen Now you will see a list of files down the left side
Trang 4Page 4 of 7
Scroll down the list and look for a vtt file on the left side It is normally close to the bottom of the list You can type vtt in the search box on the right side right under the black box at the top to see if there is a vtt file If it shows one, then you can scroll down the left side to find it
If the file exists, right click on it and choose Copy URL Paste the URL into the address bar and hit enter A box will popup Choose open The file will open in Notepad
When you open the file in Notepad, it will look like the picture below
Trang 5Page 5 of 7
In Notepad, do Control + A (selects all the text) and then do Control + C (copies all the text)
Now open Excel Paste the text you just copied into cell B1 Format the cell so the text fits into it You do this by highlighting the column, go to Format in the ribbon, choose AutoFit Column Width In column A, create a number list from 1 to whatever to the end
of the text
You will notice that there are time stamps in the text You need to get rid of them and you will also need to get rid of all the blank lines To do this, highlight column B Click
on Data on the top of the ribbon Now sort by A to Z In the Sort Warning, choose Expand the selection, sort
Trang 6Page 6 of 7
Highlight all the time stamps and delete them Now highlight all the blank lines that have numbers by them and delete them Now highlight column A, go to Data, sort A to
Z Your remaining text in column B should be back in the correct order now with no times stamps or blank lines
Trang 7Page 7 of 7
The text will now need to be formatted Highlight column B Right click and choose copy Open a Word document Choose the down arrow under Paste in the top left corner Choose the last option, Keep Text Only (T) It will look like the picture below You now need to format the text
If you click on the Paragraph mark in the Paragraph section under the Home tab, you can see where all the returns are This will help you to format the text You can also use find and replace (Control + H) to fix some things
There are other ways to do this, but this was the easiest for me to do with the
knowledge I have