Enron Dataset Research: E-mail Relevance Classification
dc.contributor.author | VanBuren, Victoria | |
dc.contributor.author | Villarreal, David | |
dc.contributor.author | McMillen, Thomas A. | |
dc.contributor.author | Minnicks, Andrew L. | |
dc.date.accessioned | 2009-10-14T10:03:29Z | |
dc.date.available | 2012-02-24T10:03:31Z | |
dc.date.issued | 2009-09-25 | |
dc.description.abstract | This paper discusses a probabilistic approach to address the problem of searching through large amount of data to find case-relevant documents. Using a valuable collection of data, e-mail communications from Enron, an actual corporation, we train a Bayes-based text classifier algorithm to identify e-mails known to be case-relevant and those known to be case-irrelevant. | |
dc.description.department | Computer Science | |
dc.format | Text | |
dc.format.extent | 16 pages | |
dc.format.medium | 1 file (.pdf) | |
dc.identifier.citation | VanBuren, V., Villarreal, D., McMillen, T. A., & Minnick, A. L. (2009). Enron dataset research: E-mail relevance classification (Report No. TXSTATE-CS-TR-2009-12). Texas State University-San Marcos, Department of Computer Science. | |
dc.identifier.uri | https://hdl.handle.net/10877/2583 | |
dc.language.iso | en | |
dc.subject | enron dataset | |
dc.subject | e-mail Relevance | |
dc.subject | e-mail classification | |
dc.subject | Bayes classifier | |
dc.subject | electronic discovery | |
dc.subject | forensics | |
dc.subject | Computer Science | |
dc.title | Enron Dataset Research: E-mail Relevance Classification | |
dc.type | Technical Report |
Files
Original bundle
1 - 1 of 1