Hello.
Today I found an error in the XML-RPC interface when searching for subtitles.
In the Sub dataset there's a SubHash (the md5sum of the subtitle), plus both a GZIP and a ZIP version of the file to download.
SolEol has always downloaded the GZIP version because it's the smallest and easiest to manage. ZIP files contain a directory and an NFO file that make it harder to put files in their places.
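To show what handling the ZIP involves, here is a minimal Python sketch that pulls the subtitle out of the archive while skipping the directory entry and the NFO file. The function name `subtitle_from_zip` is made up for this post, and the rule "first file that isn't an .nfo" is my assumption about the archive layout, not something the API documents.

```python
import io
import zipfile


def subtitle_from_zip(zip_bytes):
    """Return (name, data) for the first non-NFO file in a ZIP archive.

    Assumption: the archive holds one subtitle file next to an .nfo file,
    possibly inside a directory.
    """
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            # Skip directory entries and the NFO file.
            if info.is_dir() or info.filename.lower().endswith(".nfo"):
                continue
            return info.filename, archive.read(info)
    raise ValueError("no subtitle file found in archive")
```

The returned name still includes the directory part, so a client would have to strip it before saving the file.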
For some time now, the SubHash has not matched the GZIP subtitle, although it does match the subtitle inside the ZIP file. I believe this is incorrect.
For example:
subtitle 3538948
URL: http://www.opensubtitles.org/en/downloa ... 2161815.gz
Reported Subhash: b7a82a41c90693e49493527d14f9728c
Real Subhash: a08873deb9cd1137dca7760e6b7265ff
Reported Size: 25221
Real Size: 25303
The ZIP version http://www.opensubtitles.org/en/download/sub/3538948 matches both in SubHash and in size.
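For anyone who wants to reproduce the check, this is roughly how the "real" values above can be computed. It is only a sketch: `check_gzip_subtitle` is a name I made up, and it assumes the reported hash and size describe the decompressed subtitle.

```python
import gzip
import hashlib


def check_gzip_subtitle(gz_bytes, reported_hash, reported_size):
    """Decompress a gzipped subtitle and compare it with the reported metadata."""
    data = gzip.decompress(gz_bytes)
    real_hash = hashlib.md5(data).hexdigest()
    return {
        "real_hash": real_hash,
        "real_size": len(data),
        # Hex digests are compared case-insensitively.
        "hash_matches": real_hash == reported_hash.lower(),
        "size_matches": len(data) == reported_size,
    }
```

Calling it with the downloaded .gz bytes, the reported SubHash and the reported size shows at a glance whether the hash, the size, or both disagree.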
Comparing the two files shows that the subtitles are different: the GZ version has an extra line at the beginning (subtitle line 2) and an extra line at the end (subtitle 326). With different content, the md5sums can't match.
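The comparison itself can be done with a few lines of Python. This is a sketch using the standard `difflib` module; the helper name `extra_lines` is hypothetical, and it assumes both versions have already been decoded to text.

```python
import difflib


def extra_lines(zip_text, gz_text):
    """Return the lines that appear in the GZIP version but not in the ZIP version."""
    diff = difflib.ndiff(zip_text.splitlines(), gz_text.splitlines())
    # ndiff marks lines unique to the second sequence with a "+ " prefix.
    return [line[2:] for line in diff if line.startswith("+ ")]
```

An empty result would mean the GZIP version adds nothing over the ZIP version, which is what I would expect if the two files were generated from the same subtitle.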
This seems to happen for all subtitles.
What solution is there for this? I can only see one of the following:
-Discontinue Gzip support (I'd rather this didn't happen)
-Match the zipped subtitle to the gzipped subtitle (or vice versa)
In the meantime, SolEol (and, I assume, other clients like it) is re-downloading subtitles when it shouldn't, adding unnecessary load to the servers.