Hi,
I noticed that similar sites like OpenSubtitles use different hashing algorithms.
For example,TheSubDB just does an MD5 hash over the first and last 64 KB. Filesize is not used. Their Python example is misleading.
http://thesubdb.com/api/
Is there an advantage of the OpenSubtitles hashing, which does a CRC64 (?) ?
Were there many hash collisions in the past, or why the additional "moviebytesize" ?
Moviebytesize is already added to the hash, why is it needed for the search?
Do you think the Hashing of the SubDB has advantages or is it flawed to create hash collisions in the future?
Sublight also does the hashing differently: http://www.sublight.si/Article/6/How-to ... -hash.aspx