We moved metric, component and plugin out of the meta field since keeping it in there would create way too many buckets and slow down queries.
What we are seeing on the timeseries database is that while CPU, memory, networking etc. is at okay levels qwrites seem to fill up from time to time which completely blocks the collection. The TS collection with the format seems to receive from 100k to 300k requests per second. The non-TS collection needs about 7k upsert per second and seems to be much more stable but slightly slower when reading.
Are we doing something wrong, how can we make sure that we are getting more stable inserting experience?
We are now running a 3 member replicaset with 128gb ram, nvme drives and 32 cores per machine. CPU usage is between 10-20%.