Re: Duplicate removal

Nick Lamb (njl98r nospam at ecs.soton.ac.uk)
Fri, 10 Dec 1999 09:28:26 +0000

On Thu, Dec 09, 1999 at 10:39:12AM -0800, robert nospam at moon.eorbit.net wrote:
> On 2 Dec, Nick Lamb wrote:
>
> > If someone runs this on a DB with more than a few dozen CDs in it and
> > gets good results I'd like to hear about it, and hopefully provide
> > the extra four or so lines to do the merge for real.
>
> I ran it on a staging server that contains all the data as of a few
> days ago. The results are:
>
> Would have merged two albums called Damn The Torpedoes
> Would have merged two albums called Dream Dance Vol. 12 - cd 1
> Would have merged two albums called feeling strangely fine
> Would have merged two albums called Freak Out!
> Would have merged two albums called Fumbling Towards Ecstasy
<snip>
> 109 possible duplicates considered...
> 30 duplicate albums would have been merged in database...
>
> Roughly what you were expecting?

Yes, excellent. When I have a moment later today I will send you the
finished version of this script and you can then do whatever you think
is appropriate to arrange for dupes to be removed at regular intervals

Nick.