Page 1 of 1

Dumping all subtitles files

Posted: Sun Feb 28, 2016 12:09 am
by pkh
Hi,

I'm working on a large (open-source) software trying to support all kind of subtitles formats. So far, I think it supports every format allowed on opensubtitles.org. But from time to time, we get bug reports with subtitles files broken in various ways, or simply with unsupported features.

Opensubtitles being the largest subtitles database, I was wondering if the API (and the bandwidth rules or whatever) would allow me to dump every single file of the DB (it doesn't need to keep the original name, media reference or whatever, just the file content matters) in order to test that software and detect any unsupported content.

Of course, if someone already has such database dump, it's even better.

Re: Dumping all subtitles files

Posted: Sun Feb 28, 2016 7:49 am
by oss
You can write me pm. Btw in which language you are doind this, what is the name of software?

Re: Dumping all subtitles files

Posted: Mon Feb 29, 2016 5:36 pm
by pkh
Software is FFmpeg (mainly/exclusively C for the subtitles part). If I need to use the API, I'll probably go for Python or shell scripting with curl & friends.

Re: Dumping all subtitles files

Posted: Sun Mar 06, 2016 1:10 am
by wolfpack
I have a similar request for English files, I'd like to write a python script so I can search for keywords so I can find soundbites.

Re: Dumping all subtitles files

Posted: Sun Mar 06, 2016 6:38 am
by oss
with similar requests always write me PM