Thread:Kirkburn/@comment-4694083-20131126232913/@comment-126761-20140121165814

That's a fair point - I guess the big question is whether or not there will be significantly more identically-named-but-different images or identically-named-and-identical images.

Here's a chart of the duplicates by namespace:

That is, 2851 articles with identical names, 1930 images with identical names, 149 templates with identical names. Checking 149 templates probably isn't too hard, but checking 1930 images does sound like rather more work.

We could just skip duplicate images then? i.e. ...
 * Import the Main and Template namespace. Duplicate pagenames get "/import" added to the end. Place Template:Import at the top of all imported pages.
 * Import the File namespace. Duplicate filenames are not imported.