Platform giving users the ability to create custom content sets containing as many as 10,000 documents. Users can search across the several primary source databases of Gale Cengage (see under 'More') and seamlessly select documents to be added to their custom content set. To access this resource off-campus, you need to connect through EduVPN. To start working with GDSL, you need to login with either Google or Microsoft.
These can then be analyzed and interrogated with various text analysis and visualization tools. Digital humanities analysis methods such as Named Entity Recognition, Topic Modelling, Parts of Speech, and others are included in the tool. For an explanation of the various methods, see this support page. Important information on the process of cleaning unstructured text (aka how to deal with OCR errors), can be found here.
TDM Studio provides a platform for text and data mining and data visualization of EUR subscribed Proquest content. TDM studio comprises a data visualization platform - requiring no further knowledge of programming languages - and a TDM workbench , accessible with R and Python.
New users can self-sign-up for TDM Studio accounts by: 1) Navigating to tdmstudio.proquest.com and 2) Creating an account using their university email address.
The following Library subscribed newspaper archives can be text mined with TDM Studio: