Text and data mining are associated methods for identifying patterns within large bodies of text, in the case of text mining, or data, in the case of data mining. There are a number of different techniques associated with this method.
Marti Hearst defines Text Mining as "the discovery by computer of new, previously unknown information, by automatically extracting information from different written resources" and later distinguishes text mining from data mining, noting that "in text mining the patterns are extracted from natural language text rather than from structured databases of facts" ("What is Text Mining").
This guide is intended to help you find textual data for a TDM project, point to platforms, tools, and learning resources, and answer questions about copyright and licensing associated with TDM.
If you have any questions, contact us at mdl@library.utoronto.ca.
University of Toronto Libraries
130 St. George St.,Toronto, ON, M5S 1A5
libraryhelp@utoronto.ca
416-978-8450
Map
About web accessibility. Tell us about a web accessibility problem.
About online privacy and data collection.
© University of Toronto. All rights reserved. Terms and conditions.