Forum rules
Under no circumstances is spamming or advertising of any kind allowed. Do not post any abusive, obscene, vulgar, slanderous, hateful, threatening, sexually-orientated or any other material that may violate others security. Profanity or any kind of insolent behavior to other members (regardless of rank) will not be tolerated. Remember, what you don’t find offensive can be offensive to other members. Please treat each other with the kind of reverence you’d expect from other members.
Failure to comply with any of the above will result in users being banned without notice. If any further details are needed, contact: “The team” using the link at the bottom of the forum page. Thank you.
TZOTZIOY
Posts: 25
Joined: Mon Dec 18, 2006 10:26 am
Location: Athens (the original one)
Contact: ICQ Website

Tue Jan 12, 2010 3:14 am

btw, I'm one of those guys that use the *episode* IMDB id, apart from adding redundant information in the release field in the format S#EE (season#episode). Thus my uploads already have the correct information, even though indirectly. I can provide a list correlating episode id to series id, just by analyzing imdb series episode lists using Python and BeautifulSoup.
I always thought that all that's needed was a "groupimdbid" column next to "imdbid". For movies, those fields would be the same. For TV series episodes, "imdbid" would be the episode imdbid and "groupimdbid" would be the series imdbid. "imdbid" could be a foreign key to another table providing "series", "episode", "episodetitle" fields (normalization, guys). That table won't be very large; I guesstimate around 100k-200k rows.
--
Just an earthbound misfit, I.

User avatar
oss
Site Admin
Posts: 5879
Joined: Sat Feb 25, 2006 11:26 pm
Contact: Website

Thu Jan 14, 2010 4:43 am

you are right, there is no so big deal to make TV series support (in tables), but there is more problem to make it visually, and download. Anyway, I need like 2 weeks to code it, and I still can not find time for some reason (last, I got accident 2.5 months ago on motobike, and I am still not recovered)

User avatar
eduo
Posts: 716
Joined: Sat Feb 10, 2007 1:40 am
Location: Information Technology
Contact: ICQ Website Yahoo Messenger

Tue Apr 13, 2010 1:37 am

Gentle reminder on this.

I'm willing to bet 2/3 of the subtitle downloads are for TV Series. Having a way to search (just search) by season and episode in the web or API would greatly help providing proper subs to users.

No need to change anything else in the meantime. Just parsing subtitle names and adding season and episode (or day for dailies, or episodes for multiepisode) and providing them in the results or in a special method would be enough.

I can't implement non-hash searches without this, because the hits for using fulltext search on the database would be unjustifiably big.
http://eduo.info/
[url=http://eduo.info/soleol/]OpenSubtitles from your desktop: SolEol for Mac/Windows/Linux[/url]
[url=http://forums.plexapp.com/index.php?showtopic=325&st=0&p=2480&#entry2480]My current episode processing work flow[/url].

User avatar
oss
Site Admin
Posts: 5879
Joined: Sat Feb 25, 2006 11:26 pm
Contact: Website

Fri Apr 16, 2010 4:51 am

I get it, yes, maybe this should be the first thing, what I should do. Problem is, when we got lets say 10 subtitles uploaded as one (TV Series set subtitles).

User avatar
eduo
Posts: 716
Joined: Sat Feb 10, 2007 1:40 am
Location: Information Technology
Contact: ICQ Website Yahoo Messenger

Fri Apr 16, 2010 10:50 pm

The easiest part would be the subtitlers and uploaders and hashers to assign the metadata themselve. Either explicitly upon upload (optional) or inferred from the name of the files recorded (moviename or subtitle name).

What could be done:

1.-To modify subtitle entries
1.1.-If they belong to a tv series imdbid flag them as such.
1.2.-Parse their name, removing the tv series name from the file name, for season/episode(s).
1.3.-To allow a method to search by IMDBID (for the series), season and episode(s).

2.-To modify moviehash entries
2.1.-Similar to 1.1
2.2.-Similar to 1.2
2.3.-Similar to SeenCount for MovieHash IMDB Identification of movie hashes, store the possible Season/Episode combinations for a given moviehash.

After this the metadata would start to flow into the database. It wouldn't be infallible but would at least be as accurate as the current information is (that is, the margin of error would be similar to what today we see for hashes matching the wrong movie ID or having the wrong language defined). Names that don't fit in the parsers wouldn't be considered valid and wouldn't be counted.

After this "Season" and "Episode(s)" can be parameters of the SearchSubtitles method and fields in the web search. Also with this it's easy to link to either IMDB or TheTVDB or TVRage for the rest of the information in the subtitle page, if you wanted to.

Currently the search methods are optimized for movies but are counterproductive for tv episodes. And the changes above are not that big (the initial harvest and parsing of existing subtitles would be a big task, of course).

For my own programs, what I've done as parsers of tv shows is:

Show can either be:
-EPI - 1 episode, regular show - Regular episodes
-EP2 - 2 episodes, regular show - Season Starters, Season Enders
-DAY - Daily Shows - i.e. Talk Shows, News Casts
-SPC - Special - i.e. Documentaries, behind-the-scenes, mid-seasons
-PRT - Multi-parters - i.e. Miniseries

The parsers I use are (they can probably be simplified, but I have had no need to do so yet):

EP2:
"^(.*) S([0-9][0-9]*)E([0-9][0-9]*)-E([0-9][0-9]*).*$"
"^(.*) S([0-9][0-9]*)E([0-9][0-9]*) E([0-9][0-9]*).*$"
"^(.*) S([0-9][0-9]*)E([0-9][0-9]*)E([0-9][0-9]*).*$"
"^(.*) ([0-9][0-9]*)x([0-9][0-9]*)-([0-9][0-9]*).*$"
"^(.*) season ([0-9][0-9]*) ep ([0-9][0-9]*)-([0-9][0-9]*).*$"

Season \2
Episodes \3-\4

EPI:
"^(.*) season ([0-9][0-9]*) ep ([0-9][0-9]*).*$"
"^(.*) S([0-9][0-9]*)E([0-9][0-9]*).*$"
"^(.*) ([0-9][0-9]*)x([0-9][0-9]*).*$"

Season \2
Episode \3

DAY:
"^(.*) (19[0-9]{2}|20[0-1][0-9]) (0[0-9]|1[0-2]) ([0-2][0-9]|3[0-1]).*$"
"^(.*) (0[0-9]|1[0-2]) ([0-2][0-9]|3[0-1]) (19[0-9]{2}|20[0-1][0-9]).*$"

Year: \2
Month: \3
Day: \4

SPC:
"^(.*) S([0-9][0-9]*).*$"
"^(.*) E([0-9][0-9]*).*$"

Special ID: \2

PRT:
"^(.*) Part ([0-9][0-9]*)[ ]*[Oo][Ff][ ]*([0-9][0-9]*).*$"
"^(.*) ([0-9][0-9]*)[ ]*[Oo][Ff][ ]*([0-9][0-9]*).*$"
"^(.*) Part[ ]*([0-9][0-9]*).*$"

Part #: \2
Parts: \3
http://eduo.info/
[url=http://eduo.info/soleol/]OpenSubtitles from your desktop: SolEol for Mac/Windows/Linux[/url]
[url=http://forums.plexapp.com/index.php?showtopic=325&st=0&p=2480&#entry2480]My current episode processing work flow[/url].

User avatar
oss
Site Admin
Posts: 5879
Joined: Sat Feb 25, 2006 11:26 pm
Contact: Website

Fri Apr 30, 2010 8:16 am

thanks a lot for these regexpes. I will try to implement it very simply (so at least something will be online)

User avatar
eduo
Posts: 716
Joined: Sat Feb 10, 2007 1:40 am
Location: Information Technology
Contact: ICQ Website Yahoo Messenger

Fri Apr 30, 2010 4:37 pm

These regex patterns can very easily be extended for other formats but they already cover 95% of what's out there.

In the name search thread I posted this screenshot:

http://grab.by/44M9

I can tell when a file is a tv episode by it's returned name in opensubtitles (if it has a double quote in the name) so I stop searching by imdb and search by name. But even that is prone to errors.
http://eduo.info/
[url=http://eduo.info/soleol/]OpenSubtitles from your desktop: SolEol for Mac/Windows/Linux[/url]
[url=http://forums.plexapp.com/index.php?showtopic=325&st=0&p=2480&#entry2480]My current episode processing work flow[/url].

User avatar
oss
Site Admin
Posts: 5879
Joined: Sat Feb 25, 2006 11:26 pm
Contact: Website

Sun May 02, 2010 4:21 am

thats really nice screenshot! it seems, I have to find one day free to implement all of this :)

I am doing much more work, when I dont have internet. Please disconnect me!!! :)

User avatar
eduo
Posts: 716
Joined: Sat Feb 10, 2007 1:40 am
Location: Information Technology
Contact: ICQ Website Yahoo Messenger

Re: TV Series support preview

Sat Jun 05, 2010 9:55 pm

Did I hear a bump?

*BUMP*

Yes, I DID!
http://eduo.info/
[url=http://eduo.info/soleol/]OpenSubtitles from your desktop: SolEol for Mac/Windows/Linux[/url]
[url=http://forums.plexapp.com/index.php?showtopic=325&st=0&p=2480&#entry2480]My current episode processing work flow[/url].

Wonderm00n
Posts: 15
Joined: Tue Feb 05, 2008 7:39 pm

Re: TV Series support preview

Thu Jul 01, 2010 1:05 am

So there's still no way to search by season/episode yet?

User avatar
eduo
Posts: 716
Joined: Sat Feb 10, 2007 1:40 am
Location: Information Technology
Contact: ICQ Website Yahoo Messenger

Re: TV Series support preview

Thu Jul 01, 2010 9:27 am

Nope
http://eduo.info/
[url=http://eduo.info/soleol/]OpenSubtitles from your desktop: SolEol for Mac/Windows/Linux[/url]
[url=http://forums.plexapp.com/index.php?showtopic=325&st=0&p=2480&#entry2480]My current episode processing work flow[/url].

Wonderm00n
Posts: 15
Joined: Tue Feb 05, 2008 7:39 pm

Re: TV Series support preview

Wed Jul 07, 2010 7:10 pm

Is it even planned?

I'm making this season/episode filter myself using regex on the "SearchSubtitles" results, but it's not a good way to do it...

THE PROBLEM:
Take this example:
The show "24" has 5000+ subtitles but when searching by imdbID I'm only getting 500 (limit by api), and after I filter for duplicates using "IDSubtitle" (which I don't even know why I'm getting them...) I now have only 207 subtitles. In this 207 subtitles I don't have ANY season 8 subtitle.

THE SOLUTION (?):
Wouldn't it be easy for you guys to implement two new parameters on the "SearchSubtitles": season and episode, and the use regex BEFORE trimming the results to 500?
Off course this would be not 100% accurate but this could be mentioned on the documentation and each developer could chose to use it or not as they pleased...

If this is not feasible, please comment on why. I'm curious about this problem...

User avatar
eduo
Posts: 716
Joined: Sat Feb 10, 2007 1:40 am
Location: Information Technology
Contact: ICQ Website Yahoo Messenger

Re: TV Series support preview

Wed Jul 07, 2010 7:17 pm

Or, conversely, the fulltext search could also work, if it allowed for searching something like *5*12*

Ideally, having a field for season and another for episode would go a long way at making this useable.

Problem is, as everything in this life, it's not priority and other things are, so it gets bumped back and back.
http://eduo.info/
[url=http://eduo.info/soleol/]OpenSubtitles from your desktop: SolEol for Mac/Windows/Linux[/url]
[url=http://forums.plexapp.com/index.php?showtopic=325&st=0&p=2480&#entry2480]My current episode processing work flow[/url].

Wonderm00n
Posts: 15
Joined: Tue Feb 05, 2008 7:39 pm

Re: TV Series support preview

Wed Jul 07, 2010 7:35 pm

eduo: I've just noticed that I suggested almost exactly the same thing you did on a previous post (only you explained everything :wink:)

I think the opensubtitles.org developers aren't seeing this as a priority...

User avatar
eduo
Posts: 716
Joined: Sat Feb 10, 2007 1:40 am
Location: Information Technology
Contact: ICQ Website Yahoo Messenger

Re: TV Series support preview

Wed Jul 07, 2010 7:40 pm

eduo: I've just noticed that I suggested almost exactly the same thing you did on a previous post (only you explained everything :wink:)
I bump this thread periodically because it saves me retyping it all over again :)
http://eduo.info/
[url=http://eduo.info/soleol/]OpenSubtitles from your desktop: SolEol for Mac/Windows/Linux[/url]
[url=http://forums.plexapp.com/index.php?showtopic=325&st=0&p=2480&#entry2480]My current episode processing work flow[/url].

Return to “Developing”

Who is online

Users browsing this forum: No registered users and 26 guests