Text Data: Basics of Text Processing and Regular Expressions
Join workshop in Webex: https://gsumeetings.webex.com/meet/jwalker184
RDS@GSU Data Certification: https://lib.gsu.edu/data-certified
The volume of textual data available to academic researchers is immense. Consequently, the skills to process, transform, and analyze text data with computational tools are increasingly necessary for some types of research.
This workshop will introduce the fundamentals of working with and manipulating text data using scripting languages (e.g. Python, R). This includes loading, processing, and preparing text data for use with quantitative models. Although advanced natural language processing (NLP) models are not included in this workshop, some possible applications may be demonstrated if time permits.
No special background knowledge or skills are required to attend. All are welcome.
Workshop Topics
-- Common text and string operations
-- Tokenization, transformations, and processing
-- Regular Expressions
-- N-grams and term frequencies
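As a rough illustration of the topics listed above, the following is a minimal Python sketch. It is not taken from the workshop materials; the sample sentence, the variable names, and the simple letters-only tokenization pattern are all assumptions chosen for brevity.

```python
import re
from collections import Counter

# Hypothetical sample text, chosen only for illustration.
text = "The cat sat on the mat. The cat slept."

# Common string operation: normalize case so "The" and "the" match.
lowered = text.lower()

# Regular expression tokenization: keep runs of letters and apostrophes.
# This is a deliberately simple pattern, not a full tokenizer.
tokens = re.findall(r"[a-z']+", lowered)

# Term frequencies: count how often each token occurs.
term_freq = Counter(tokens)

# N-grams (here n = 2): pair each token with the one that follows it.
bigrams = list(zip(tokens, tokens[1:]))
bigram_freq = Counter(bigrams)

print(term_freq.most_common(2))    # [('the', 3), ('cat', 2)]
print(bigram_freq.most_common(1))  # [(('the', 'cat'), 2)]
```

The same steps can be expressed in R or another scripting language; Python's standard library is used here only because it needs no installation beyond the interpreter.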
Prerequisites: Basic familiarity with Python, R, or any scripting language preferred.
Software Requirements:
-- Participants will need an RStudio Cloud account.
-- No software installation is required.
NOTE: Please read our Workshops ~ Etiquette & Policies page for information pertinent to your workshop attendance.
Presenter: Jeremy Walker, Data Services Librarian and member of the Library's Research Data Services Team
Related LibGuide: *RESEARCH DATA SERVICES (RDS) @ Georgia State University Library by Mandy Swygart-Hobaugh
Date: Tuesday, October 26, 2021
Time: 11:00am - 1:30pm
Time Zone: Eastern Time - US & Canada
Campus: All Campuses
Online: This is an online event.
Event URL: https://gsumeetings.webex.com/meet/jwalker184
Categories: Data Services Workshops, Online workshops