
Over 40 early case assessment reports from YOUR data
The following is a brief introduction to the reports available in earlyCASE.
1. Assumptions – Outlines the assumptions about the data, the cost to process & review it. The assumptions directly impact many of the reports. If you are going to conduct what if scenarios with different sets of assumptions, make sure you print the assumptions report and attach it to the reports generated based on those assumptions.
2. Matter – This report is the information you entered on the first screen when you created a new matter. This information can be updated and does affect the other reports. The amount of anticipated raw data (ESI) and how long you have to review it are pivotal to the other reports.
3. File Type Summary – This report will show you all of the file types seen in this ESI (including email attachments), a description of the file type, how many there are, the aggregate size of all of the files of this type, and the date range (last modified) of these files. The report is sorted by the count with the largest (highest #) first. This report should also be used in conjunction with the office files report.
4. De-Dup – File Type Summary - (Professional Run Required) This report is very similar to the file type summary – BUT with shows the counts and sizes based on a global deduplication of the documents. It will show you all of the file types seen in this ESI (including email attachments), a description of the file type, the DE Duplicated count of how many there are, the aggregate size of the DE Duplicated files of this type, and the date range (last modified) of these files. The report is sorted by the count with the largest (highest #) first. This report also includes totals at the report of the report – Count and Size.
5. File Date Summary – This can be a very long report in that it shows by date and time the number of documents along with the aggregate size of the documents for that date time. This report is useful when you are looking for a specific document based on a narrow date range.
6. File Author Summary – This report relies on the metadata in the documents and will summarize by document author how many documents they authored, the aggregate size and the date range. For this report to be useful the applications installed / configured on the machine which generated the documents must have been set up with the users name and not some generic information. If you use this report make sure you validate it, as many machines do not have this information configured and hence it is not in the document metadata.
7. Duplicates (Custodian) – (Professional Run Required) This report provides a summary by custodian of the number of duplicate emails and duplicate files. If both MD5 and SHA1 were generated it will show the counts by hash type as well. In addition this report will summarize across all of the custodians the total number of duplicate emails and duplicate files. For more information on how we determine duplicate emails please refer to this document on our website: https://www.earlycase.com/resources/earlyCASE%20Detecting%20Duplicate%20Emails.pdf
8. Duplicates (Global) - (Professional Run Required) This report provides a summary across all of the custodians of the number of duplicate emails and duplicate files. If both MD5 and SHA1 were generated it will show the counts by hash type as well. In addition this report lists the duplicate documents with there MD5 hash and the count of how many duplicates there are. This section is sorted by the count descending (largest # first) and is useful to spot check the duplicates as well as see what documents were duplicated the most. For more information on how we determine duplicate emails please refer to this document on our website: https://www.earlycase.com/resources/earlyCASE%20Detecting%20Duplicate%20Emails.pdf
9. Image Summary –This report shows the types of images (pictures, etc) along with the aggregate size and date range. Depending on the matter pictures (images) may not be useful to process, OCR, and review. This report identifies the types and counts to assist in the budget and planning process of dealing with them if needed.
10. Warnings & Errors – Any warnings of errors encountered in the processing of the ESI by earlyCASE will be reported here. It is a good idea to check this report. Password protected files, corrupt files, unsupported file compression, all are reported here.
11. Budgets & Timelines - This report contains the summary of the amounts of data, applies the assumption to this data and calculates the budgets and time required to handle the sample analyzed, but also extrapolates what the larger population of data will look like based on the sample. This report is an excellent report to use when changing assumptions and effect that those changed assumption have on the project budget. This report should be used in conjunction with the sizes and counts expanded report.
12. 26(f) report – This report is a Rollup of 6 of the other reports in earlyCASE. The intent of this report is to provide a snapshot of the data in a matter, the assumptions, etc. – all without disclosing anything that would be privileged or confidential. As such this report is careful to only include summary type information useful in communicating and negotiating with the other side in a matter about the ESI / eDiscovery.
13. Email “To:” summary – This report shows you who the custodians have been sending email “TO”, the counts and the date ranges of those emails. It can be useful in identifying additional custodians as well in being able to justify eliminating a custodian from what is processed and reviewed.
14. Email “From:” summary - This report shows you who the custodians have been receiving email “FROM”, the counts and the date ranges of those emails. It can be useful in identifying additional custodians as well in being able to justify eliminating a custodian from what is processed and reviewed.
15. Email Dates summary – Summaries by Year and Month the number of emails and their aggregate size. It is useful in determining date cutoffs based on the ESI you have against the request. Allows you to quickly isolate blocks of emails by data that are clearly outside the date range at issue.
16. Conversation Summary – This report summarizes the number of emails (including responses and forwards) in an email thread as well as the date range of the messages in that thread. This report should generally NOT be provided to the other side in a matter as the subject lines may contain privileged or confidential information.
17. Duplicate emails – (Professional Run Required) This report can be fairly long in a large population of emails. It shows the MD5 hash of the email message, the sent date, the subject and provides totals at the end of the report. This report should generally NOT be provided to the other side in a matter as the subject lines may contain privileged or confidential information. For more information on how we determine duplicate emails please refer to this document on our website: https://www.earlycase.com/resources/earlyCASE%20Detecting%20Duplicate%20Emails.pdf
18. PST & NSF Analyzed - (Professional Run Required for NSF) This report will show you a summary of the types of email containers processed, there aggregate size, how many emails where processed from those containers and the aggregate size of the messages. This report also shows by custodian and container how many messages and the aggregate size of the messages. Lotus Notes (NSF) processing requires that a Professional earlyCASE run was done.
19. Other EMail Containers – This report identifies less common email container files that are not processed by earlyCASE and that may require additional work to extract and review the messages. The objective of this report is make you aware that you have some uncommon / less common email mailboxes in the population of data.
20. Summary of Container Files - (Professional Run Required) This report summarizes by custodian the containers observer well as summarizing the container files (by type), provides a description of that container type along with the number of that type of container, the aggregate size of the container, and the date (last modified date) ranges of those containers. Totals of all the containers are also on this report.
21. Container File Details - (Professional Run Required) This report summarizes by custodian the containers observer well as providing details on what was extracted from each container.
22. Encase, DD, AFF Image Info - (Professional Run Required) This report details the drive images seen in the data that was processed by earlyCASE, the size of the image, the date of the image and the number of files that the image contains. It also provides the details about the image - when it was created, by who, the machine information, and any notes entered by the examiner when the image was created.
23. Data Collection Summary – This report is show the details on drive images that were created using the new earlyCASE “Collect” feature. This includes the Hash values of the images, the machine information, who and when the image was created, etc.
24. Sizes and Counts Expanded – This report provides a summary view of how the data expanded both in size and counts. It breaks this down by run and by piece of media. This report also identifies the number and percentage of what was on that media that was a duplicates of things already processed. Included on this report is a summary of the containers, what was extracted from the containers and a anticipated size of what you would pay to process and review based on the actual data you analyzed.
25. Top 25 File Types – Generally the most common file types seen in a population of documents make up the majority of the count as well as size. This report identifies the most frequent file types, the count, the aggregate size along with totals for what the top 25 file type represent in count, size and percentage of the total document population. It is not uncommon to see the top 25 file types represent over 85% of the total population and it a good starting place to understand both the loose files as well as the files types that were attached to emails.
26. Top 25 File Dates – The top 25 file dates report summarizes the population of documents by year and month and order the list based on the year / month with the most documents. This is useful in understanding the distribution of the documents by date. It be useful in seeing how the data you have relates to the dates of interest in the matter.
27. Office File Types – This report looks at just the “Common” office types of documents against the larger population of documents and provides counts, and aggregate sizes by file type for the normal Office Document types. This report should be used in conjunction with the Top 25 file types report to form a picture of what files are predominant and meaningful.
28. Other Costs and Expenses – This report takes the cost assumptions and applies it to the data that was analyzed to estimate the costs in this area for both the analyzed sample as well as the larger (extrapolated) set of data. These costs include items like data collection, litigation support, project management, hosting costs, etc.
29. Backup Tape Costs – Backup tape related costs are reported here and are NOT rolled into the budget for a project as generally backup tape is treated as inaccessible information. This report identifies these costs based on the assumptions provided. If backup tapes are to be handled in a matter these costs need to be manually added into your budget.
30. Extrapolated Costs – This report extrapolates the size and counts of the larger data set size based on the sample that was analyzed. It applies the filter cull assumptions provided and extrapolates the anticipated costs pre and post filtering for the larger set of data anticipated.
31. Attorney Review Cost –This report shows the costs anticipated on reviewing the extrapolated set of data. This report uses the assumption provided to arrive at cost of attorney review as well as the number of full time attorneys that would be required to complete the review in the time frame specified.
32. Generally Included File Types – This report compares the file types observed in the data analyzed against a database (Filetypes) to summarize what the sizes and counts of the generally reviewed file types would look like. This differs from the top 25 file types and office file types in that you can customize the file types which show on this report
33. Generally Excluded File Types - This report compares the file types observed in the data analyzed against a database (Filetypes) to summarize what the sizes and counts of file types which are generally NOT processed or reviewed. This differs from the top 25 file types and office file types in that you can customize the file types which show on this report
34. Unknown File Types – This report summarizes the file types, counts and sizes for any file type which is unknown / not defined in the FileTypes table. This report should be checked to see if there are file types which the generally included and generally excluded file types reports did not pickup.
35. Compare Run 1 and Run 2 –This report compares the files and emails analyzed in the first and second run of a matter to summarize what the differences are in the 2 runs. This is useful when you have 2 images of the same hard drive and you need to understand what (if anything) has changed. It also is helpful to validate a drive image against the source drive to insure that the image has everything.
36. Folders > 100 Files – This report shows the folders (and path) of any folder which has over 100 files in it. This is useful in checking for temporary locations, system storage which may be in the collection of data but really does not need to be processed and reviewed. This report should be used with the Folder Inventory ALL report to form a complete picture of where documents were stored and folders / paths that can be filtered out.
37. Folder Inventory ALL - This report shows the counts by folders (and path) and is ordered by the folder with the most files to the folders with the least files. This is useful in understanding where the bulk of the files came from and potentially identifying paths which really do not need to be processed and reviewed. This report can be pretty long, and you may want to focus on just the first 3 or 4 pages of this report.
38. File Inventory with MD5 – This report is useful when you intend to turn over a population of native files to another party and you want to provide a complete inventory of what you are giving them which includes the hash values of every document. This report will be very long!
39. Charts and Graphs – earlyCASE provides charts and graphs of the file types, containers and other key pieces of information displayed in chart and graph form.
40. File Type Pivot – Pivot tables allow you to select / deselect filetypes which are then charted or visible in table form. This is useful in seeing the impact size and count of filtering file types from the population. This is an advanced Microsoft Excel Function and is well worth learning more about.
41. File Date Pivot - Pivot tables allow you to select / deselect date ranges which are then charted or visible in table form. This is useful in seeing the impact size and count of filtering date ranges from the population. This is an advanced Microsoft Excel Function and is well worth learning more about.
42. Export Email and File tables to CSV – Use this feature to export the data in the access database to standard CSV files. The Email Details, File Details, and Folder Details are exported with this option.
43. Microsoft Excel Export - Use this feature to export the data in the access database to standard Microsoft Excel Spreadsheets. The Email Details, File Details, and Folder Details are exported with this option. Be aware that MS Excel has a limit if 64,000 rows, so larger matters will overflow the limits of Excel and you should use the Export to CSV report.
44. XML Export of Data – Users who have installed a full copy of Microsoft Access 2007 have the option of exporting the data into XML (instead of CSV). Because XML is very verbose you should only use this if the CSV option will not work for you.
Alphabetical List of the currently available reports:
26(f) report
Assumptions
Attorney Review Cost
Backup Tape Costs
Budgets & Timelines
Charts and Graphs
Compare Run 1 and Run 2
Container File Details
Conversation Summary
Data Collection Summary
De-Dup – File Type Summary
Duplicate emails
Duplicates (Custodian)
Duplicates (Global)
Email “From:” summary
Email “To:” summary
Email Dates summary
Encase, DD, AFF Image Info
Export Email and File tables to CSV
Extrapolated Costs
File Author Summary
File Date Pivot
File Date Summary
File Inventory with MD5
File Type Pivot
File Type Summary
Folder Inventory ALL
Folders > 100 Files
Generally Excluded File Types
Generally Included File Types
Image Summary
Matter
Microsoft Excel Export
Office File Types
Other Costs and Expenses
Other EMail Containers
PST & NSF Analyzed
Sizes and Counts Expanded
Summary of Container Files
Top 25 File Dates
Top 25 File Types
Unknown File Types
Warnings & Errors
XML Export of Data