Event box

Web Scraping (ONLINE)

Web Scraping (ONLINE) Online

Join workshop in Webex: https://gsumeetings.webex.com/meet/jwalker184

Can't make it? Watch recorded workshop: https://lib.gsu.edu/rds-recordings

GSU Data Ready! Badges Micro-Credentialing: https://lib.gsu.edu/data-ready


This workshop will introduce participants to fundamental concepts and tools for programmatically scraping data from public websites.

The internet contains immense amount of data, both structured and unstructured, that is not readily available to researchers through traditional commercial and academic databases. Blogs, market data, movie reviews, social media, press releases, and newspaper headlines are just a few examples of the type of rich content available on the web that may be best collected using web scraping tools and methods.

The workshop will be taught exclusively using Python code (i.e. no point-and-click tools). However, everyone who is interested in web scraping, regardless of coding experience, are still welcome to join the workshop, participate in chat, and engage with the content as best they can. All are welcome.

Workshop Topics:

-- Introduction to concepts, context, and use-cases related to web scraping

-- Brief discussion of legal and ethical considerations

-- Using Selenium WebDriver to access and navigate websites

-- Extracting, parsing, and processing data scraped from webpages

Prerequisites: Basic familiarity with Python.

Software Requirements for Hands-on Participation:

For participants wishing to follow along with the “hands-on” portion of the workshop will need to have specific software installed prior to the start of the workshop. Software requirements and installation details can be found on the workshop webpage: https://research.library.gsu.edu/webdata/workshop

NOTE: Please read our Workshops ~ Etiquette & Policies page for pertinent information to your workshop attendance.

Presenter: Jeremy Walker, Data Services Librarian and member of the Library's Research Data Services Team

Related LibGuide: *Research Data Services @ Georgia State University Library by Mandy Swygart-Hobaugh

Friday, February 11, 2022
1:00pm - 3:30pm
Time Zone:
Eastern Time - US & Canada (change)
All Campuses
This is an online event. Event URL will be sent via registration email.

Registration is required. There are 98 seats available.

Event Organizer

Profile photo of Jeremy Walker
Jeremy Walker