Page 1 of 1

GetIMDBMovieDetails

Posted: Sun Aug 12, 2012 11:59 pm
by sarathkcm
Just wanted to report that GetIMDBMovieDetails works well (thanks), but it contains encoded HTML symbols such as

Code: Select all

" &Ã
etc...
Also in some cases, in the plot of movie returned, the last sentence is incomplete which is followed by few dots and then" See Full Summary"..... (for example, this Movie http://www.imdb.com/title/tt1258197/...)

Re: GetIMDBMovieDetails

Posted: Tue Aug 14, 2012 11:59 am
by oss
send me IMDB where it contains encoded HTML symbols, so I can check it.

For See Full Summary - it is given by perl script which we use for parsing, if you want there full summary, one more page must be retrieved...

Re: GetIMDBMovieDetails

Posted: Tue Aug 14, 2012 10:06 pm
by sarathkcm
send me IMDB where it contains encoded HTML symbols, so I can check it.
As an example : Movie Fast & Furious
IMDB : 1013752

Here is the Code i used for calling GetIMDBDetails from online debugger on this site

Code: Select all

<param> <value><string>fkl840r5vl4apv0n0l3pnmrdj1</string></value> </param> <param> <value><string>1013752</string></value> </param>
and for the title of the movie it returns

Code: Select all

<member> <name>title</name> <value> <string>Fast & Furious</string> </value> </member>

The following have this problem too, mostly in the "plot" part.... (I didn't checked the following from the online debugger, these were shown by my software, I've checked the above movie from online debugger, to verify that its not the problem of my xml parser).

IMDB IDs:

0101764
0395677
0342965
0163025
0822847
0449088
0367479
0454848
0364970
1655442
1258197
0406759
0928154
0119654
1270798
0258463


Most of these have encoded symbols in the "plot", while some have in the "cast"

For See Full Summary - it is given by perl script which we use for parsing, if you want there full summary, one more page must be retrieved...
Is there any way I can retrieve full summary through opensubtitles api?


Also,
I have made a list of all movies from my harddisk (around 140) using my piece of software, which uses opensubtitles api to identify movie files. when I look across that list, i see many film titles that I do not have on my computer.... I hope something is being done to clean the database from wrong hash entries....:)

Re: GetIMDBMovieDetails

Posted: Sun Nov 04, 2012 5:15 pm
by eduo
Quick note: As I recall, the ampersand is forbidden in most cases for XML (and I believe XMLRPC is one of them). This may be the reason it shows by default encoded and why it may be an "error" if in the API it ever shows decoded.

Re: GetIMDBMovieDetails

Posted: Wed Nov 07, 2012 7:02 am
by sarathkcm
hm... I thought ampersand is written as "&" in xml.... My xml parser doesn't decode "&" to "&" but it does read "&" correctly as "&"...
Anyway I replaced the occurrence of all those encoded characters with actual character before displaying it... :)

Re: GetIMDBMovieDetails

Posted: Wed Nov 07, 2012 10:27 pm
by eduo
hm... I thought ampersand is written as "&" in xml.... My xml parser doesn't decode "&" to "&" but it does read "&" correctly as "&"...
Anyway I replaced the occurrence of all those encoded characters with actual character before displaying it... :)
http://en.wikipedia.org/wiki/List%5Fof% ... 5Fin%5FXML
http://stackoverflow.com/posts/4940233/revisions
http://stackoverflow.com/posts/4940233/revisions

The unicode code point is as valid as the html entity, but it's more resilient to parsing errors.

Code: Select all

they must be escaped using either numeric character references or the strings " & " and " < " respectively.

Re: GetIMDBMovieDetails

Posted: Thu Nov 08, 2012 8:15 pm
by sarathkcm
I got what you said... Thanks for the reply... :wink: