DataLab - Getting Started with Textual Data in Python (3-part series) - Part 1 of 3
Monday, February 14, 2022, 12 – 2pm
Location: Via Zoom
Event Type: Workshops and Training
Audience Type: Students: Graduate and Professional

This three-part workshop series covers the basics of text mining with Python. We will focus primarily on unstructured text data, discussing how to format and clean text to enable the discovery of significant patterns in collections of documents. Sessions will introduce participants to core terminology in text mining/natural language processing and will walk through different methods of ranking terms and documents. We will conclude by using these methods to classify texts and to build models of "topics." Basic familiarity with Python is required.

We welcome students, postdocs, faculty, and staff from a variety of research domains, ranging from health informatics to the humanities. This workshop occurs during UC Love Data Week, and all members of the University of California system are welcome to register.

Workshop dates for this series are: February 14, February 16, and February 18, 2022.

Prerequisites: Instructors will distribute a zipped directory of notebooks and files the week prior to the workshop. Participants are required to load this data into their Google Drive account before our first session. We also ask that participants read the first two sections of the workshop reader in advance to prepare for the series.

Registration: DataLab - Getting Started with Textual Data in Python