Thanks.

I have bisected the initial file and now have a chunk of 15k of the initial docs that contains the 6k that are dropped. That is a bit more manageable, but I still do not see anything obvious in them.

I will take a look at the library before resuming the painstaking bisecting exercise.