Unfortunately, just knowing your CPU is an "i7" doesn't mean much anymore.
For example, the Intel Core i7-11700K @ 3.60GHz is around four times faster than the Intel Core i7-620UM @ 1.07GHz for single-threaded tasks and 30 times faster for multi-threaded ones.
I know you said that RAM usage was only 60%, but 16GB of RAM probably isn't enough if you are playing around with 28GB files. You can buy an additional 16GB for just $80, so it is well worth it if it saves you a few hours. If you end up even a little short on RAM, the O/S will start swapping memory pages to disk. This incurs something like a 500x performance hit.
Mechanical drives are useless for this type of work. Even SATA SSDs are pretty rubbish. For $50 you can get a small M.2 (NVMe) SSD that will be around 100x faster for random access. If this only saves you 1 hour, it is a great investment.
DOJ asked for the emails to be submitted as tiff files with concordance load files
I've added it to our list of things to have a look at in the future. In the meantime, you might need to use PDF and then convert to Concordance as a second step.
In my opinion, the whole plan doesn't make any sense. In a 28GB file there must be hundreds of thousands of emails. What could any lawyer do with 300,000 random TIFF files? You would need to rebuild them into some type of structured index (i.e. exactly what a PST is to start with). But by then you have lost all the attachments, metadata, etc. TIFF is also a rubbish format from a storage efficiency point of view, so your 28GB file might end up being 500GB of TIFFs. Someone might be stupid enough to then attempt an OCR job on the TIFFs, which might take weeks.
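To put rough numbers on that, here is a quick back-of-the-envelope sketch in Python. It only uses the guesses above (300,000 emails, 500GB of TIFFs, a 28GB PST) and assumes one TIFF per email; none of these are measured values, and the function name is just made up for illustration.

```python
def tiff_estimate(email_count, total_tiff_gb, pst_gb):
    """Return (average MB per TIFF, size growth factor over the PST).

    All inputs are rough guesses, not measurements.
    Assumes one TIFF per email and 1 GB = 1000 MB.
    """
    mb_per_tiff = total_tiff_gb * 1000 / email_count
    growth = total_tiff_gb / pst_gb
    return mb_per_tiff, growth


# Figures taken from the estimate in the post: 300,000 emails,
# 500GB of TIFF output, 28GB source PST.
mb_per_tiff, growth = tiff_estimate(email_count=300_000, total_tiff_gb=500, pst_gb=28)
print(f"{mb_per_tiff:.2f} MB per TIFF, {growth:.1f}x the PST size")
```

So the 500GB guess works out to an average of under 2MB per email image, and roughly 18 times the size of the original file, before anyone has built an index or run OCR over it.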