(I started this message as a reply to an admin complaining about some duplicate, and as the reply gets bigger I decided to post it here to enable discussion and further viewing).
Sometimes I downloaded some subtitle directly from legendas.tv* and uploaded here WITHOUT any change. And very often situations like these happened: I upload it latter than expected, so someone else download it there, edit the original to add some of his "credits" and upload it here first. And when I upload the *original* translation here, the hash differ so opensubtitles accept it.
(* legendas.tv is the current Brazilian hot site for the major subtitle teams, and yes, I invited some of the teams to join the opensubtitles
So, I understand the huge problem you guys have with duplicates. This common situation lead us to some important issue: the "hash" algorithm maybe isn't the best way to deal with duplicates. Maybe the OS admins should think about applying a diff on the subtitles for the same release and maybe ask the user for "are you really sure it isn't a duplicate?" and show the diff lines or something like that.
Sometimes I have to correct only 3 or 5 misspelled words and bad-translated ones and how neat it will be if I could *update* the subtitle instead of re-uploading a new one?
Diff will be nice, but a better feature to implement is a way to collaborate and manage version controlling of the subtitle production, and THAT will be very welcome. I'm thinking about something like the Rosetta web-translation on Launchpad.
Another reason why the file-size "hash" isn't so good: When I re-sync some subtitle for another release, I *must* change some char or add a new line break because the "hash" is based on simple file-size: the re-sync'd and the subtitle for another release are different but the hash tells me they are not. And it gets worse: sometimes when I try to upload a subtitle for some TV show, the "hash" tells me that I can't because it is supposedly the same subtitle for some unrelated obscure 60's movie.
I like to say thank you all for the hard work and dedication on building and maintaining the so important initiative that Opensubtitles are. I'm a registered user since 2007 and I'm uploading every subtitle I made, re-sync or download from another source for Brazilian Portuguese. And I have to tell you: Opensubtitles was *very* important on the begin of 2009, when police rages shut down the major Brazilian subtitle sites. And after that, Opensubtitle grow big on the Brazilian hearts and collaborators come more and more everyday!
BTW, I use Subdownloader to easily upload the subtitles I get from another sources.