Web Scraping (ONLINE)
Web Scraping (ONLINE) Online
Join workshop in Webex: https://gsumeetings.webex.com/meet/jwalker184
Can't make it? Watch recorded workshop: https://lib.gsu.edu/rds-recordings
GSU Data Ready! Badges Micro-Credentialing: https://lib.gsu.edu/data-ready
This workshop will introduce participants to fundamental concepts and tools for programmatically scraping data from public websites.
The internet contains immense amount of data, both structured and unstructured, that is not readily available to researchers through traditional commercial and academic databases. Blogs, market data, movie reviews, social media, press releases, and newspaper headlines are just a few examples of the type of rich content available on the web that may be best collected using web scraping tools and methods.
The workshop will be taught exclusively using Python code (i.e. no point-and-click tools). However, everyone who is interested in web scraping, regardless of coding experience, are still welcome to join the workshop, participate in chat, and engage with the content as best they can. All are welcome.
-- Introduction to concepts, context, and use-cases related to web scraping
-- Brief discussion of legal and ethical considerations
-- Using Selenium WebDriver to access and navigate websites
-- Extracting, parsing, and processing data scraped from webpages
Prerequisites: Basic familiarity with Python.
Software Requirements for Hands-on Participation:
For participants wishing to follow along with the “hands-on” portion of the workshop will need to have specific software installed prior to the start of the workshop. Software requirements and installation details can be found on the workshop webpage: https://research.library.gsu.edu/webdata/workshop
NOTE: Please read our Workshops ~ Etiquette & Policies page for pertinent information to your workshop attendance.
- Friday, February 11, 2022
- 1:00pm - 3:30pm
- Time Zone:
- Eastern Time - US & Canada (change)
- All Campuses
- This is an online event. Event URL will be sent via registration email.